Evolutionary Methods for Mining Data with Class Imbalance

File
Publisher
Florida Atlantic University
Date Issued
2007
EDTF Date Created
2007
Description
Class imbalance tends to cause inferior performance in data mining learners,
particularly with regard to predicting the minority class, which generally imposes
a higher misclassification cost. This work explores the benefits of using genetic
algorithms (GA) to develop classification models which are better able to deal with
the problems encountered when mining datasets which suffer from class imbalance.
Using GA we evolve configuration parameters suited for skewed datasets for three
different learners: artificial neural networks, 0 4.5 decision trees, and RIPPER. We
also propose a novel technique called evolutionary sampling which works to remove
noisy and unnecessary duplicate instances so that the sampled training data will
produce a superior classifier for the imbalanced dataset. Our GA fitness function
uses metrics appropriate for dealing with class imbalance, in particular the area
under the ROC curve. We perform extensive empirical testing on these techniques
and compare the results with seven exist ing sampling methods.
Note

College of Engineering and Computer Science

Language
Type
Extent
172 p.
Identifier
FA00012515
Additional Information
College of Engineering and Computer Science
Thesis (M.S.)--Florida Atlantic University, 2007.
FAU Electronic Theses and Dissertations Collection
Date Backup
2007
Date Created Backup
2007
Date Text
2007
Date Created (EDTF)
2007
Date Issued (EDTF)
2007
Extension


FAU

IID
FA00012515
Organizations
Person Preferred Name

Drown, Dennis J.
Graduate College
Physical Description

application/pdf
172 p.
Title Plain
Evolutionary Methods for Mining Data with Class Imbalance
Use and Reproduction
Copyright © is held by the author with permission granted to Florida Atlantic University to digitize, archive and distribute this item for non-profit research and educational purposes. Any reuse of this item in excess of fair use or other copyright exemptions requires permission of the copyright holder.
http://rightsstatements.org/vocab/InC/1.0/
Origin Information

2007
2007
Florida Atlantic University

Boca Raton, Fla.

Physical Location
Florida Atlantic University Libraries
Place

Boca Raton, Fla.
Sub Location
Digital Library
Title
Evolutionary Methods for Mining Data with Class Imbalance
Other Title Info

Evolutionary Methods for Mining Data with Class Imbalance