
A hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system

Abstract

Finding the minimum number of appropriate biomarkers for a specific target, such as lung cancer, is a challenging problem in bioinformatics. We propose a hierarchical two-phase framework for biomarker selection. It first extracts candidate biomarkers from cancer microarray datasets, and then selects the minimum number of appropriate biomarkers from those candidates using a neuro-fuzzy algorithm called a neural network with weighted fuzzy membership functions (NEWFM). In the first phase, the framework extracts candidate biomarkers using the Bhattacharyya distance, which measures the similarity of two discrete probability distributions. The proposed framework can reduce the cost of finding biomarkers without requiring medical supplements, and can improve the accuracy of the biomarkers on specific cancer target datasets.
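The first phase can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the per-class distributions are estimated, so the equal-width histograms, the bin count, the function names (`histogram`, `bhattacharyya_distance`, `rank_genes`) and the toy data are all assumptions made for illustration. For two discrete distributions p and q, the Bhattacharyya distance is -ln(sum_i sqrt(p_i * q_i)); a gene whose expression histograms differ strongly between the two classes receives a large distance and becomes a candidate biomarker.

```python
import math
from typing import List, Sequence


def histogram(values: Sequence[float], lo: float, hi: float, bins: int) -> List[float]:
    """Normalized equal-width histogram, i.e. a discrete probability distribution."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        # Clamp the index so that v == hi falls into the last bin;
        # if all values are identical (width == 0), use the first bin.
        idx = min(int((v - lo) / width), bins - 1) if width > 0 else 0
        counts[idx] += 1
    total = len(values)
    return [c / total for c in counts]


def bhattacharyya_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """Bhattacharyya distance -ln(BC), with BC = sum_i sqrt(p_i * q_i)."""
    bc = sum(math.sqrt(a * b) for a, b in zip(p, q))
    if bc <= 0.0:
        return math.inf  # the two distributions do not overlap at all
    # max() guards against a tiny negative result caused by rounding.
    return max(0.0, -math.log(bc))


def rank_genes(samples: Sequence[Sequence[float]], labels: Sequence[int],
               bins: int = 5, top_k: int = 2) -> List[int]:
    """Return indices of the top_k genes with the largest class separation.

    samples: one row per sample, one column per gene (assumed layout).
    labels:  0 or 1 per sample (assumed two-class problem).
    """
    scores = []
    for g in range(len(samples[0])):
        a = [row[g] for row, y in zip(samples, labels) if y == 0]
        b = [row[g] for row, y in zip(samples, labels) if y == 1]
        # Both classes share one bin range so the histograms are comparable.
        lo, hi = min(a + b), max(a + b)
        d = bhattacharyya_distance(histogram(a, lo, hi, bins),
                                   histogram(b, lo, hi, bins))
        scores.append((d, g))
    # Largest distance first; ties are broken by gene index.
    scores.sort(key=lambda t: (-t[0], t[1]))
    return [g for _, g in scores[:top_k]]


# Hypothetical toy data: 6 samples, 3 genes; only gene 0 separates the classes cleanly.
samples = [
    [1.0, 5.0, 0.2],
    [1.2, 5.1, 0.9],
    [0.9, 4.9, 0.4],
    [3.0, 5.0, 0.3],
    [3.2, 5.2, 0.8],
    [2.9, 4.8, 0.5],
]
labels = [0, 0, 0, 1, 1, 1]
print(rank_genes(samples, labels, bins=5, top_k=1))  # gene 0 ranks first
```

With so few samples the histogram estimate is coarse, so the ranking of weakly separated genes depends on the chosen bin count. The second phase (NEWFM) is not sketched here because the abstract gives no algorithmic detail for it.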
