Alleviating class imbalance using data sampling: Examining the effects on classification algorithms

File
Publisher
Florida Atlantic University
Date Issued
2006
Description
Imbalanced class distributions typically cause poor classifier performance on the minority class, which also tends to be the class with the highest cost of mis-classification. Data sampling is a common solution to this problem, and numerous sampling techniques have been proposed to address it. Prior research examining the performance of these techniques has been narrow and limited. This work uses thorough empirical experimentation to compare the performance of seven existing data sampling techniques using five different classifiers and four different datasets. The work addresses which sampling techniques produce the best performance in the presence of class unbalance, which classifiers are most robust to the problem, as well as which sampling techniques perform better or worse with each classifier. Extensive statistical analysis of these results is provided, in addition to an examination of the qualitative effects of the sampling techniques on the types of predictions made by the C4.5 classifier.
Note

College of Engineering and Computer Science

Language
Type
Extent
100 p.
Identifier
9780542931291
ISBN
9780542931291
Additional Information
College of Engineering and Computer Science
FAU Electronic Theses and Dissertations Collection
Thesis (M.S.)--Florida Atlantic University, 2006.
Date Backup
2006
Date Text
2006
Date Issued (EDTF)
2006
Extension


FAU
FAU
admin_unit="FAU01", ingest_id="ing1508", creator="staff:fcllz", creation_date="2007-07-18 22:53:16", modified_by="staff:fcllz", modification_date="2011-01-06 13:08:48"

IID
FADT13413
Issuance
monographic
Person Preferred Name

Napolitano, Amri E.
Graduate College
Physical Description

100 p.
application/pdf
Title Plain
Alleviating class imbalance using data sampling: Examining the effects on classification algorithms
Use and Reproduction
Copyright © is held by the author, with permission granted to Florida Atlantic University to digitize, archive and distribute this item for non-profit research and educational purposes. Any reuse of this item in excess of fair use or other copyright exemptions requires permission of the copyright holder.
http://rightsstatements.org/vocab/InC/1.0/
Origin Information

2006
monographic

Boca Raton, Fla.

Florida Atlantic University
Physical Location
Florida Atlantic University Libraries
Place

Boca Raton, Fla.
Sub Location
Digital Library
Title
Alleviating class imbalance using data sampling: Examining the effects on classification algorithms
Other Title Info

Alleviating class imbalance using data sampling: Examining the effects on classification algorithms