Gene selection for sample sets with biased distribution

File
Publisher
Florida Atlantic University
Date Issued
2009
Description
Microarray expression data which contains the expression levels of a large number of simultaneously observed genes have been used in many scientific research and clinical studies. Due to its high dimensionalities, selecting a small number of genes has shown to be beneficial for many tasks such as building prediction models from the microarray expression data or gene regulatory network discovery. Traditional gene selection methods, however, fail to take the class distribution into the selection process. In biomedical science, it is very common to have microarray expression data which is severely biased with one class of examples (e.g., diseased samples) significantly less than other classes (e.g., normal samples). These sample sets with biased distributions require special attention from researchers for identification of genes responsible for a particular disease. In this thesis, we propose three filtering techniques, Higher Weight ReliefF, ReliefF with Differential Minority Repeat and ReliefF with Balanced Minority Repeat to identify genes responsible for fatal diseases from biased microarray expression data. Our solutions are evaluated on five well-known microarray datasets, Colon, Central Nervous System, DLBCL Tumor, Lymphoma and ECML Pancreas. Experimental comparisons with the traditional ReliefF filtering method demonstrate the effectiveness of the proposed methods in selecting informative genes from microarray expression data with biased sample distributions.
Note

by Abu Hena Mustafa Kamal.

Language
Type
Form
Extent
x, 98 p. : ill. (some col.).
Identifier
318327331
OCLC Number
318327331
Additional Information
by Abu Hena Mustafa Kamal.
Thesis (M.S.C.S.)--Florida Atlantic University, 2009.
Includes bibliography.
Electronic reproduction. Boca Raton, Fla., 2009. Mode of access: World Wide Web.
Date Backup
2009
Date Text
2009
Date Issued (EDTF)
2009
Extension


FAU
FAU
admin_unit="FAU01", ingest_id="ing3635", creator="creator:SPATEL", creation_date="2009-04-13 14:17:09", modified_by="super:SPATEL", modification_date="2011-04-18 08:57:56"

IID
FADT186330
Person Preferred Name

Kamal, Abu Hena Mustafa.
Graduate College
Physical Description

electronic
x, 98 p. : ill. (some col.).
Title Plain
Gene selection for sample sets with biased distribution
Use and Reproduction
http://rightsstatements.org/vocab/InC/1.0/
Origin Information


Boca Raton, Fla.

Florida Atlantic University
2009
Place

Boca Raton, Fla.
Title
Gene selection for sample sets with biased distribution
Other Title Info

Gene selection for sample sets with biased distribution