https://scholars.lib.ntu.edu.tw/handle/123456789/438495
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.author | Liu H.-C. | en_US |
dc.contributor.author | Peng P.-C. | en_US |
dc.contributor.author | Hsieh T.-C. | en_US |
dc.contributor.author | Yeh T.-C. | en_US |
dc.contributor.author | Lin C.-J. | en_US |
dc.contributor.author | Chen C.-Y. | en_US |
dc.contributor.author | Hou J.-Y. | en_US |
dc.contributor.author | Shih L.-Y. | en_US |
dc.contributor.author | CHIH-JEN LIN | en_US |
dc.contributor.author | CHIEN-YU CHEN | en_US |
dc.creator | Liu H.-C.;Peng P.-C.;Hsieh T.-C.;Yeh T.-C.;Lin C.-J.;Chen C.-Y.;Hou J.-Y.;Shih L.-Y.;Liang D.-C. | - |
dc.date.accessioned | 2019-12-24T09:05:49Z | - |
dc.date.available | 2019-12-24T09:05:49Z | - |
dc.date.issued | 2013 | - |
dc.identifier.issn | 15455963 | - |
dc.identifier.uri | https://scholars.lib.ntu.edu.tw/handle/123456789/438495 | - |
dc.description.abstract | The amount of gene expression data of microarray has grown exponentially. To apply them for extensive studies, integrated analysis of cross-laboratory (cross-lab) data becomes a trend, and thus, choosing an appropriate feature selection method is an essential issue. This paper focuses on feature selection for Affymetrix (Affy) microarray studies across different labs. We investigate four feature selection methods: (t)-test, significance analysis of microarrays (SAM), rank products (RP), and random forest (RF). The four methods are applied to acute lymphoblastic leukemia, acute myeloid leukemia, breast cancer, and lung cancer Affy data which consist of three cross-lab data sets each. We utilize a rank-based normalization method to reduce the bias from cross-lab data sets. Training on one data set or two combined data sets to test the remaining data set(s) are both considered. Balanced accuracy is used for prediction evaluation. This study provides comprehensive comparisons of the four feature selection methods in cross-lab microarray analysis. Results show that SAM has the best classification performance. RF also gets high classification accuracy, but it is not as stable as SAM. The most naive method is (t)-test, but its performance is the worst among the four methods. In this study, we further discuss the influence from the number of training samples, the number of selected genes, and the issue of unbalanced data sets. ? 2013 IEEE. | - |
dc.relation.ispartof | IEEE/ACM Transactions on Computational Biology and Bioinformatics | - |
dc.subject.classification | [SDGs]SDG3 | - |
dc.subject.other | Decision trees; Diseases; Gene expression; Laboratories; Microarrays; Statistical tests; Acute lymphoblastic leukemia; cancer; Classification performance; Comprehensive comparisons; Feature selection methods; Laboratory experiments; Microarray data analysis; Significance Analysis of Microarrays; Feature extraction; DNA microarray; factual database; gene expression profiling; gene expression regulation; genetics; human; metabolism; neoplasm; procedures; statistical model; Databases, Factual; Gene Expression Profiling; Gene Expression Regulation, Neoplastic; Humans; Models, Statistical; Neoplasms; Oligonucleotide Array Sequence Analysis | - |
dc.title | Comparison of Feature Selection Methods for Cross-Laboratory Microarray Analysis | en_US |
dc.type | journal article | en |
dc.identifier.doi | 10.1109/TCBB.2013.70 | - |
dc.identifier.scopus | 2-s2.0-84908347124 | - |
dc.identifier.url | https://www2.scopus.com/inward/record.uri?eid=2-s2.0-84908347124&doi=10.1109%2fTCBB.2013.70&partnerID=40&md5=30468b11e1df0b5de51998175c03b1a3 | - |
dc.relation.pages | 593-604 | - |
dc.relation.journalvolume | 10 | - |
dc.relation.journalissue | 3 | - |
item.cerifentitytype | Publications | - |
item.fulltext | no fulltext | - |
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | - |
item.openairetype | journal article | - |
item.grantfulltext | none | - |
crisitem.author.dept | Networking and Multimedia | - |
crisitem.author.dept | Computer Science and Information Engineering | - |
crisitem.author.dept | Industrial Engineering | - |
crisitem.author.dept | Center for Artificial Intelligence and Advanced Robotics | - |
crisitem.author.dept | Biomechatronics Engineering | - |
crisitem.author.dept | Center for Biotechnology | - |
crisitem.author.dept | Genome and Systems Biology Degree Program | - |
crisitem.author.orcid | 0000-0003-4684-8747 | - |
crisitem.author.orcid | 0000-0002-6940-6389 | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
crisitem.author.parentorg | College of Engineering | - |
crisitem.author.parentorg | Others: University-Level Research Centers | - |
crisitem.author.parentorg | College of Bioresources and Agriculture | - |
crisitem.author.parentorg | Others: University-Level Research Centers | - |
crisitem.author.parentorg | College of Life Science | - |
顯示於: | 生物機電工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。