The feature selection bias problem in relation to high-dimensional gene data

•We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure.•We examine its importance by an empirical study of four feature selection methods.•For evaluating feature selection performance we use double cross-validation.•By the way, we examine the stability of t...

Full description

Saved in:

Bibliographic Details
Published in	Artificial intelligence in medicine Vol. 66; pp. 63 - 71
Main Authors	Krawczuk, Jerzy, Łukaszuk, Tomasz
Format	Journal Article
Language	English
Published	Netherlands Elsevier B.V 01.01.2016
Subjects	Accuracy Algorithms Bias Biomarkers, Tumor - genetics Cancer Classification Computational Biology - methods Convex and piecewise linear classifier Data mining Data Mining - methods Databases, Genetic Decision Support Techniques Feature selection bias Gene Expression Profiling - methods Gene Expression Regulation, Neoplastic Gene selection Genes Humans Internal Medicine Learning Linear Models Mathematical models Microarray data Oligonucleotide Array Sequence Analysis Other Pattern Recognition, Automated Reproducibility of Results Support Vector Machine Convex and piecewise linear classifier Support vector machine Gene selection Microarray data Feature selection bias
Online Access	Get full text
ISSN	0933-3657 1873-2860
DOI	10.1016/j.artmed.2015.11.001

Cover

Abstract	•We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure.•We examine its importance by an empirical study of four feature selection methods.•For evaluating feature selection performance we use double cross-validation.•By the way, we examine the stability of the feature selection methods.•We recommend cross-validation for feature selection in order to reduce the selection bias. Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection. We address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature—colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability. Our main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation). This work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias.
AbstractList	Objective Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection. Methods and materials We address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature-colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability. Results Our main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation). Conclusions This work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias. •We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure.•We examine its importance by an empirical study of four feature selection methods.•For evaluating feature selection performance we use double cross-validation.•By the way, we examine the stability of the feature selection methods.•We recommend cross-validation for feature selection in order to reduce the selection bias. Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection. We address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature—colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability. Our main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation). This work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias. Highlights • We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure. • We examine its importance by an empirical study of four feature selection methods. • For evaluating feature selection performance we use double cross-validation. • By the way, we examine the stability of the feature selection methods. • We recommend cross-validation for feature selection in order to reduce the selection bias. OBJECTIVEFeature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection.METHODS AND MATERIALSWe address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature-colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability.RESULTSOur main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation).CONCLUSIONSThis work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias. Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection. We address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature-colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability. Our main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation). This work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias.
Author	Krawczuk, Jerzy Łukaszuk, Tomasz
Author_xml	– sequence: 1 givenname: Jerzy surname: Krawczuk fullname: Krawczuk, Jerzy – sequence: 2 givenname: Tomasz surname: Łukaszuk fullname: Łukaszuk, Tomasz email: t.lukaszuk@pb.edu.pl
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/26674595$$D View this record in MEDLINE/PubMed
BookMark	eNqVkk9v1DAQxS1URLeFb4BQjlwSPHZiOwghoYp_0kocKGdr4ky6Xpyk2AlSvz3ebuGAhFpOluzfex69N2fsZJonYuw58Ao4qFf7CuMyUl8JDk0FUHEOj9gGjJalMIqfsA1vpSylavQpO0tpzznXNagn7FQopeumbTZse7mjYiBc1khFokBu8fNUdB5TcR3nLtBY-KmIFPD2YZmLnb_alb0faUr5BkNxRRMVPS74lD0eMCR6dnees28f3l9efCq3Xz5-vni3LZ3iainBOKVaB0rm0U0HSANyDYLEgKLrB4FNfkYNREaiQD4oox30YhC8ltrIc_by6Jsn_LFSWuzok6MQcKJ5TRZ0K4WqjagfgCrTaGG0eAiquW5B84y-uEPXLldgr6MfMd7Y37lmoD4CLs4pRRr-IMDtoT67t8f67KE-C2BzfVn2-i-Z88tt8EtEH-4Tvz2KKUf_01O0yXmaHPU-5lptP_v_NXDBT95h-E43lPbzGnPdOQibhOX262G9DtsFTVbXQmWDN_82uP__X9uS32Y
CitedBy_id	crossref_primary_10_3389_fgene_2021_618277 crossref_primary_10_1016_j_neucom_2018_02_100 crossref_primary_10_1016_j_cmpb_2018_03_017 crossref_primary_10_1038_s41389_019_0157_8 crossref_primary_10_7717_peerj_18405 crossref_primary_10_3389_fnhum_2021_734501 crossref_primary_10_1109_JIOT_2023_3237032 crossref_primary_10_23736_S1824_4785_19_03213_8 crossref_primary_10_1016_j_cmpb_2023_107987 crossref_primary_10_1016_j_engappai_2021_104628 crossref_primary_10_1016_j_ygeno_2019_01_006 crossref_primary_10_3390_ijms241311133 crossref_primary_10_1016_j_asoc_2025_112784 crossref_primary_10_1042_CS20180745 crossref_primary_10_1016_j_compbiomed_2022_105349 crossref_primary_10_1093_bioinformatics_btx298 crossref_primary_10_1093_bib_bbz061 crossref_primary_10_3934_mbe_2023366 crossref_primary_10_1016_j_artmed_2021_102228 crossref_primary_10_1016_j_tifs_2024_104853 crossref_primary_10_1016_j_jprot_2024_105298 crossref_primary_10_1007_s42235_023_00400_7 crossref_primary_10_1109_JBHI_2019_2908773 crossref_primary_10_1007_s11517_020_02301_x crossref_primary_10_1177_20552076231189331 crossref_primary_10_1007_s41062_024_01419_3 crossref_primary_10_1093_bioinformatics_bty710 crossref_primary_10_1038_s41598_021_92287_9 crossref_primary_10_1109_TCYB_2020_3015756 crossref_primary_10_1016_j_ygeno_2019_07_002 crossref_primary_10_1109_TVCG_2021_3137174 crossref_primary_10_1038_s41598_021_91614_4 crossref_primary_10_1016_j_cmpb_2023_107934 crossref_primary_10_1038_s41598_022_27132_8 crossref_primary_10_1016_j_jacr_2023_06_025 crossref_primary_10_1049_el_2017_4550 crossref_primary_10_1109_TFUZZ_2022_3146969 crossref_primary_10_1038_s41538_021_00100_8 crossref_primary_10_1093_nargab_lqab065 crossref_primary_10_1016_j_artmed_2018_04_002 crossref_primary_10_1016_j_ccr_2025_216583 crossref_primary_10_1016_j_gce_2025_01_003 crossref_primary_10_1016_j_ijcip_2021_100436 crossref_primary_10_1109_TAI_2024_3436664 crossref_primary_10_1002_cyto_a_24901 crossref_primary_10_1016_j_rse_2019_111273 crossref_primary_10_1016_j_neucom_2024_128099 crossref_primary_10_1016_j_jneumeth_2021_109339
Cites_doi	10.1016/0031-3203(84)90059-1 10.1126/science.270.5235.467 10.1126/science.286.5439.531 10.1073/pnas.111153698 10.1038/ng1296-457 10.1093/bioinformatics/btm117 10.1016/S1535-6108(02)00030-2 10.1056/NEJMoa021967 10.1093/bioinformatics/18.10.1332 10.1038/nm0102-68 10.1073/pnas.96.12.6745 10.1073/pnas.102102699 10.1111/1468-0262.00152 10.1016/j.ins.2014.01.008 10.1111/j.2517-6161.1974.tb00994.x 10.1038/415530a 10.1038/415436a 10.1073/pnas.96.16.9212 10.1142/S0219720005001004 10.1023/A:1012487302797 10.1016/0031-3203(91)90005-P
ContentType	Journal Article
Copyright	2015 Elsevier B.V. Elsevier B.V. Copyright © 2015 Elsevier B.V. All rights reserved.
Copyright_xml	– notice: 2015 Elsevier B.V. – notice: Elsevier B.V. – notice: Copyright © 2015 Elsevier B.V. All rights reserved.
DBID	AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8 7QO 8FD FR3 P64 7SC JQ2 L7M L~C L~D
DOI	10.1016/j.artmed.2015.11.001
DatabaseName	CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic Biotechnology Research Abstracts Technology Research Database Engineering Research Database Biotechnology and BioEngineering Abstracts Computer and Information Systems Abstracts ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional
DatabaseTitle	CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic Engineering Research Database Biotechnology Research Abstracts Technology Research Database Biotechnology and BioEngineering Abstracts Computer and Information Systems Abstracts Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional
DatabaseTitleList	Computer and Information Systems Abstracts Engineering Research Database MEDLINE - Academic MEDLINE
Database_xml	– sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine Computer Science
EISSN	1873-2860
EndPage	71
ExternalDocumentID	26674595 10_1016_j_artmed_2015_11_001 S0933365715001426 1_s2_0_S0933365715001426
Genre	Validation Studies Comparative Study Research Support, Non-U.S. Gov't Journal Article
GroupedDBID	--- --K --M .1- .DC .FO .~1 0R~ 1B1 1P~ 1RT 1~. 1~5 23N 4.4 457 4G. 53G 5GY 5VS 7-5 71M 77I 77K 8P~ 9JM 9JN AAEDT AAEDW AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AATTM AAWTL AAXKI AAXUO AAYFN AAYWO ABBOA ABBQC ABFNM ABIVO ABJNI ABMAC ABMZM ABWVN ABXDB ACDAQ ACGFS ACIEU ACIUM ACLOT ACNNM ACRLP ACRPL ACVFH ACZNC ADBBV ADCNI ADEZE ADJOM ADMUD ADNMO AEBSH AEIPS AEKER AENEX AEUPX AEVXI AFJKZ AFPUW AFRHN AFTJW AFXIZ AGHFR AGQPQ AGUBO AGYEJ AHHHB AHZHX AIALX AIEXJ AIGII AIIUN AIKHN AITUG AJRQY AJUYK AKBMS AKRWK AKYEP ALMA_UNASSIGNED_HOLDINGS AMRAJ ANKPU ANZVX AOUOD APXCP ASPBG AVWKF AXJTR AZFZN BKOJK BLXMC BNPGV CS3 EBS EFJIC EFKBS EFLBG EJD EO8 EO9 EP2 EP3 F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-2 G-Q GBLVA GBOLZ HEA HMK HMO HVGLF HZ~ IHE J1W KOM LZ2 M29 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. Q38 R2- ROL RPZ SAE SDF SDG SDP SEL SES SEW SPC SPCBC SSH SSV SSZ T5K UHS WH7 WUQ Z5R ~G- ~HD AACTN AFCTW AFKWA AJOXV AMFUW RIG AAIAV ABLVK ABYKQ AJBFU LCYCR AAYXX CITATION AGCQF AGRNS CGR CUY CVF ECM EIF NPM 7X8 7QO 8FD FR3 P64 7SC JQ2 L7M L~C L~D
ID	FETCH-LOGICAL-c606t-18c669c1632018b1aefa0712e2fa2bdf2a569ca71ee83a2a0f687c1d2f2043783
IEDL.DBID	.~1
ISSN	0933-3657
IngestDate	Sun Sep 28 10:41:13 EDT 2025 Tue Oct 07 09:30:43 EDT 2025 Sun Sep 28 02:50:55 EDT 2025 Mon Jul 21 06:04:22 EDT 2025 Wed Oct 01 00:45:53 EDT 2025 Thu Apr 24 22:52:26 EDT 2025 Fri Feb 23 02:25:15 EST 2024 Sun Feb 23 10:19:32 EST 2025 Tue Oct 14 19:30:12 EDT 2025
IsPeerReviewed	true
IsScholarly	true
Keywords	Convex and piecewise linear classifier Support vector machine Gene selection Microarray data Feature selection bias
Language	English
License	Copyright © 2015 Elsevier B.V. All rights reserved.
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c606t-18c669c1632018b1aefa0712e2fa2bdf2a569ca71ee83a2a0f687c1d2f2043783
Notes	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Undefined-3 ObjectType-Article-1 ObjectType-Feature-2
PMID	26674595
PQID	1767079170
PQPubID	23479
PageCount	9
ParticipantIDs	proquest_miscellaneous_1793264824 proquest_miscellaneous_1768572872 proquest_miscellaneous_1767079170 pubmed_primary_26674595 crossref_primary_10_1016_j_artmed_2015_11_001 crossref_citationtrail_10_1016_j_artmed_2015_11_001 elsevier_sciencedirect_doi_10_1016_j_artmed_2015_11_001 elsevier_clinicalkeyesjournals_1_s2_0_S0933365715001426 elsevier_clinicalkey_doi_10_1016_j_artmed_2015_11_001
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2016-01-01
PublicationDateYYYYMMDD	2016-01-01
PublicationDate_xml	– month: 01 year: 2016 text: 2016-01-01 day: 01
PublicationDecade	2010
PublicationPlace	Netherlands
PublicationPlace_xml	– name: Netherlands
PublicationTitle	Artificial intelligence in medicine
PublicationTitleAlternate	Artif Intell Med
PublicationYear	2016
Publisher	Elsevier B.V
Publisher_xml	– name: Elsevier B.V
References	Dan, Tsunoda, Kitahara, Yanagawa, Zembutsu, Katagiri (bib0015) 2002; 62 Guyon, Elisseeff (bib0045) 2003; 3 Ambroise, McLachlan (bib0055) 2002; 99 Singhi, Liu (bib0080) 2006 Bobrowski, Łukaszuk (bib0125) 2009; 29 Zhang, Yu, Singer, Xiong (bib0070) 2001; 98 Bobrowski L. Feature subsets selection based on linear separbilty, Lecture notes of the VII-th ICB seminar: statistics and clinical practice. Golub, Slonim, Tamayo, Huard, Gaasenbeek, Mesirov (bib0020) 1999; 286 van ‘t Veer, Dai, van de Vijver, He, Hart, Mao (bib0030) 2002; 415 Singh, Febbo, Ross, Jackson, Manola, Ladd (bib0190) 2002; 1 Krishnapuram, Carin, Hartemink (bib0110) 2004 Lustgarten, Gopalakrishnan, Visweswaran (bib0095) 2009 Kononenko (bib0160) 1994; vol. 784 Peralta, Soto (bib0120) 2014; 269 Gordon, Jensen, Hsiao, Gullans (bib0175) 2002; 62 Shipp, Ross, Tamayo, Weng, Kutok, Aguiar (bib0185) 2002; 8 Ding, Peng (bib0170) 2005; 3 Stone (bib0200) 1974 Pomeroy, Tamayo, Gaasenbeek, Sturla, Angelo, McLaughlin (bib0180) 2002; 415 Alon, Barkai, Gish, Ybara, Mack (bib0065) 1999; 96 Bobrowski, Łukaszuk (bib0130) 2011 Kira, Rendell (bib0165) 1992 Liu, Motoda (bib0050) 2007 White (bib0060) 2000; 68 Bobrowski, Łukaszuk (bib0205) 2004; vol. 3070 Bobrowski, Niemiro (bib0150) 1984; 17 Bellman (bib0040) 1961 Perou, Jeffrey, Van De Rijn, Rees, Eisen, Ross (bib0025) 1999; 96 Guyon, Weston, Barnhill, Vapnik (bib0075) 2002; 46 Bobrowski (bib0135) 2005 Tuv, Borisov, Runger, Torkkola (bib0115) 2009; 10 Bobrowski (bib0140) 1991; 24 Yu, Liu (bib0195) 2004 Perkins, Lacker, Theiler (bib0100) 2003; 3 Van De Vijver, He, van’t Veer, Dai, Hart, Voskuil (bib0035) 2002; 347 Kuncheva (bib0090) 2007 Zhu, Rosset, Hastie, Tibshirani (bib0145) 2004; 16 Wood, Visscher, Mengersen (bib0085) 2007; 23 Schena, Shalon, Davis, Brown (bib0005) 1995; 270 Li, Campbell, Tipping (bib0105) 2002; 18 DeRisi, Penland, Brown, Bittner, Meltzer, Ray (bib0010) 1996; 14 Zhang (10.1016/j.artmed.2015.11.001_bib0070) 2001; 98 Kononenko (10.1016/j.artmed.2015.11.001_bib0160) 1994; vol. 784 Stone (10.1016/j.artmed.2015.11.001_bib0200) 1974 Alon (10.1016/j.artmed.2015.11.001_bib0065) 1999; 96 Guyon (10.1016/j.artmed.2015.11.001_bib0075) 2002; 46 Golub (10.1016/j.artmed.2015.11.001_bib0020) 1999; 286 Bobrowski (10.1016/j.artmed.2015.11.001_bib0135) 2005 Krishnapuram (10.1016/j.artmed.2015.11.001_bib0110) 2004 Bobrowski (10.1016/j.artmed.2015.11.001_bib0125) 2009; 29 Bobrowski (10.1016/j.artmed.2015.11.001_bib0130) 2011 White (10.1016/j.artmed.2015.11.001_bib0060) 2000; 68 Shipp (10.1016/j.artmed.2015.11.001_bib0185) 2002; 8 Tuv (10.1016/j.artmed.2015.11.001_bib0115) 2009; 10 Liu (10.1016/j.artmed.2015.11.001_bib0050) 2007 Perkins (10.1016/j.artmed.2015.11.001_bib0100) 2003; 3 10.1016/j.artmed.2015.11.001_bib0155 Bellman (10.1016/j.artmed.2015.11.001_bib0040) 1961 Li (10.1016/j.artmed.2015.11.001_bib0105) 2002; 18 Lustgarten (10.1016/j.artmed.2015.11.001_bib0095) 2009 Zhu (10.1016/j.artmed.2015.11.001_bib0145) 2004; 16 Van De Vijver (10.1016/j.artmed.2015.11.001_bib0035) 2002; 347 Wood (10.1016/j.artmed.2015.11.001_bib0085) 2007; 23 Bobrowski (10.1016/j.artmed.2015.11.001_bib0150) 1984; 17 Peralta (10.1016/j.artmed.2015.11.001_bib0120) 2014; 269 DeRisi (10.1016/j.artmed.2015.11.001_bib0010) 1996; 14 Gordon (10.1016/j.artmed.2015.11.001_bib0175) 2002; 62 Yu (10.1016/j.artmed.2015.11.001_bib0195) 2004 van ‘t Veer (10.1016/j.artmed.2015.11.001_bib0030) 2002; 415 Ding (10.1016/j.artmed.2015.11.001_bib0170) 2005; 3 Dan (10.1016/j.artmed.2015.11.001_bib0015) 2002; 62 Guyon (10.1016/j.artmed.2015.11.001_bib0045) 2003; 3 Kira (10.1016/j.artmed.2015.11.001_bib0165) 1992 Singh (10.1016/j.artmed.2015.11.001_bib0190) 2002; 1 Bobrowski (10.1016/j.artmed.2015.11.001_bib0140) 1991; 24 Schena (10.1016/j.artmed.2015.11.001_bib0005) 1995; 270 Ambroise (10.1016/j.artmed.2015.11.001_bib0055) 2002; 99 Kuncheva (10.1016/j.artmed.2015.11.001_bib0090) 2007 Pomeroy (10.1016/j.artmed.2015.11.001_bib0180) 2002; 415 Bobrowski (10.1016/j.artmed.2015.11.001_bib0205) 2004; vol. 3070 Perou (10.1016/j.artmed.2015.11.001_bib0025) 1999; 96 Singhi (10.1016/j.artmed.2015.11.001_bib0080) 2006
References_xml	– year: 1961 ident: bib0040 article-title: Adaptive control processes: a guided tour – start-page: 406 year: 2009 end-page: 410 ident: bib0095 article-title: Measuring stability of feature selection in biomedical datasets publication-title: AMIA annual symposium proceedings, vol. 2009 – volume: 270 start-page: 467 year: 1995 end-page: 470 ident: bib0005 article-title: Quantitative monitoring of gene expression patterns with a complementary DNA microarray publication-title: Science – volume: 24 start-page: 863 year: 1991 end-page: 870 ident: bib0140 article-title: Design of piecewise linear classifiers from formal neurons by some basis exchange technique publication-title: Pattern Recognit – start-page: 111 year: 1974 end-page: 147 ident: bib0200 article-title: Cross-validatory choice and assessment of statistical predictions publication-title: J Royal Stat Soc Ser B (Methodol) – volume: 68 start-page: 1097 year: 2000 end-page: 1126 ident: bib0060 article-title: A reality check for data snooping publication-title: Econometrica – volume: 3 start-page: 185 year: 2005 end-page: 205 ident: bib0170 article-title: Minimum redundancy feature selection from microarray gene expression data publication-title: J Bioinform Comput Biol – start-page: 737 year: 2004 end-page: 742 ident: bib0195 article-title: Redundancy based feature selection for microarray data publication-title: KDD ‘04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining – volume: 347 start-page: 1999 year: 2002 end-page: 2009 ident: bib0035 article-title: A gene-expression signature as a predictor of survival in breast cancer publication-title: N E J Med – volume: 269 start-page: 176 year: 2014 end-page: 187 ident: bib0120 article-title: Embedded local feature selection within mixture of experts publication-title: Inf Sci – volume: vol. 784 start-page: 171 year: 1994 end-page: 182 ident: bib0160 article-title: Estimating attributes: analysis and extensions of Relief publication-title: Machine learning, ECML-94 – year: 2005 ident: bib0135 article-title: Data mining based on convex and piecewise linear (CPL) criterion functions (in Polish) – volume: 96 start-page: 9212 year: 1999 end-page: 9217 ident: bib0025 article-title: Distinctive gene expression patterns in human mammary epithelial cells and breast cancers publication-title: Proc Natl Acad Sci – volume: 98 start-page: 6730 year: 2001 end-page: 6735 ident: bib0070 article-title: Recursive partitioning for tumor classification with gene expression microarray data publication-title: Proc Natl Acad Sci – start-page: 249 year: 1992 end-page: 256 ident: bib0165 article-title: A practical approach to feature selection publication-title: Proceedings of the ninth international workshop on machine learning – volume: 415 start-page: 436 year: 2002 end-page: 442 ident: bib0180 article-title: Prediction of central nervous system embryonal tumour outcome based on gene expression publication-title: Nature – volume: 96 start-page: 6745 year: 1999 end-page: 6750 ident: bib0065 article-title: Broad patterns of gene expressions revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays publication-title: PNAS – volume: 8 start-page: 68 year: 2002 end-page: 74 ident: bib0185 article-title: Diffuse large b-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning publication-title: Nat Med – volume: vol. 3070 start-page: 544 year: 2004 end-page: 549 ident: bib0205 article-title: Selection of the linearly separable feature subsets publication-title: Artificial intelligence and soft computing: ICAISC‘2004 – volume: 29 start-page: 43 year: 2009 end-page: 59 ident: bib0125 article-title: Feature selection based on relaxed linear separability publication-title: Biocybern Biomed Eng – volume: 46 start-page: 389 year: 2002 end-page: 422 ident: bib0075 article-title: Gene selection for cancer classification using support vector machines publication-title: Mach Learn – volume: 62 start-page: 4963 year: 2002 end-page: 4967 ident: bib0175 article-title: Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesotheliomar publication-title: Cancer Res – volume: 415 start-page: 530 year: 2002 end-page: 536 ident: bib0030 article-title: Gene expression profiling predicts clinical outcome of breast cancer publication-title: Nature – volume: 23 start-page: 1363 year: 2007 end-page: 1370 ident: bib0085 article-title: Classification based upon gene expression data: bias and precision of error rates publication-title: Bioinformatics – start-page: 849 year: 2006 end-page: 856 ident: bib0080 article-title: Feature subset selection bias for classification learning publication-title: Proceedings of the 23rd international conference on machine learning, ICML ‘06 – start-page: 421 year: 2007 end-page: 427 ident: bib0090 article-title: A stability index for feature selection publication-title: Artificial intelligence and applications – volume: 3 start-page: 1333 year: 2003 end-page: 1356 ident: bib0100 article-title: Grafting: Fast, incremental feature selection by gradient descent in function space publication-title: J Mach Learn Res – volume: 3 start-page: 1157 year: 2003 end-page: 1182 ident: bib0045 article-title: An introduction to variable and feature selection publication-title: J Mach Learn Res – year: 2011 ident: bib0130 article-title: Relaxed linear separability (RLS) approach to feature (gene) subset selection publication-title: Selected works in bioinformatics – volume: 1 start-page: 203 year: 2002 end-page: 209 ident: bib0190 article-title: Gene expression correlates of clinical prostate cancer behavior publication-title: Cancer cell – volume: 14 start-page: 457 year: 1996 end-page: 460 ident: bib0010 article-title: Use of a CDNA microarray to analyse gene expression patterns in human cancer publication-title: Nat Genet – reference: Bobrowski L. Feature subsets selection based on linear separbilty, Lecture notes of the VII-th ICB seminar: statistics and clinical practice. – volume: 62 start-page: 1139 year: 2002 end-page: 1147 ident: bib0015 article-title: An integrated database of chemosensitivity to 55 anticancer drugs and gene expression profiles of 39 human cancer cell lines publication-title: Cancer Res – volume: 99 start-page: 6562 year: 2002 end-page: 6566 ident: bib0055 article-title: Selection bias in gene extraction on the basis of microarray gene-expression data publication-title: Proc Natl Acad Sci – volume: 16 start-page: 49 year: 2004 end-page: 56 ident: bib0145 article-title: 1-norm support vector machines publication-title: Adv Neural Inf Process Syst – start-page: 299 year: 2004 end-page: 317 ident: bib0110 article-title: Gene expression analysis: joint feature selection and classifier design publication-title: Kernel Methods Comput Biol – volume: 18 start-page: 1332 year: 2002 end-page: 1339 ident: bib0105 article-title: Bayesian automatic relevance determination algorithms for classifying gene expression data publication-title: Bioinformatics – volume: 10 start-page: 1341 year: 2009 end-page: 1366 ident: bib0115 article-title: Feature selection with ensembles, artificial variables, and redundancy elimination publication-title: J Mach Learn Res – volume: 286 start-page: 531 year: 1999 end-page: 537 ident: bib0020 article-title: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring publication-title: Sciences – year: 2007 ident: bib0050 article-title: Computational methods of feature selection – volume: 17 start-page: 205 year: 1984 end-page: 210 ident: bib0150 article-title: A method of synthesis of linear discriminant function in the case of nonseparability publication-title: Pattern Recognit – year: 1961 ident: 10.1016/j.artmed.2015.11.001_bib0040 – volume: 3 start-page: 1157 year: 2003 ident: 10.1016/j.artmed.2015.11.001_bib0045 article-title: An introduction to variable and feature selection publication-title: J Mach Learn Res – volume: 17 start-page: 205 issue: 2 year: 1984 ident: 10.1016/j.artmed.2015.11.001_bib0150 article-title: A method of synthesis of linear discriminant function in the case of nonseparability publication-title: Pattern Recognit doi: 10.1016/0031-3203(84)90059-1 – volume: 270 start-page: 467 issue: 5235 year: 1995 ident: 10.1016/j.artmed.2015.11.001_bib0005 article-title: Quantitative monitoring of gene expression patterns with a complementary DNA microarray publication-title: Science doi: 10.1126/science.270.5235.467 – volume: 286 start-page: 531 year: 1999 ident: 10.1016/j.artmed.2015.11.001_bib0020 article-title: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring publication-title: Sciences doi: 10.1126/science.286.5439.531 – volume: 98 start-page: 6730 issue: 12 year: 2001 ident: 10.1016/j.artmed.2015.11.001_bib0070 article-title: Recursive partitioning for tumor classification with gene expression microarray data publication-title: Proc Natl Acad Sci doi: 10.1073/pnas.111153698 – start-page: 421 year: 2007 ident: 10.1016/j.artmed.2015.11.001_bib0090 article-title: A stability index for feature selection – start-page: 737 year: 2004 ident: 10.1016/j.artmed.2015.11.001_bib0195 article-title: Redundancy based feature selection for microarray data – volume: 16 start-page: 49 issue: 1 year: 2004 ident: 10.1016/j.artmed.2015.11.001_bib0145 article-title: 1-norm support vector machines publication-title: Adv Neural Inf Process Syst – volume: 14 start-page: 457 issue: 4 year: 1996 ident: 10.1016/j.artmed.2015.11.001_bib0010 article-title: Use of a CDNA microarray to analyse gene expression patterns in human cancer publication-title: Nat Genet doi: 10.1038/ng1296-457 – volume: vol. 3070 start-page: 544 year: 2004 ident: 10.1016/j.artmed.2015.11.001_bib0205 article-title: Selection of the linearly separable feature subsets – volume: 23 start-page: 1363 issue: 11 year: 2007 ident: 10.1016/j.artmed.2015.11.001_bib0085 article-title: Classification based upon gene expression data: bias and precision of error rates publication-title: Bioinformatics doi: 10.1093/bioinformatics/btm117 – volume: 10 start-page: 1341 year: 2009 ident: 10.1016/j.artmed.2015.11.001_bib0115 article-title: Feature selection with ensembles, artificial variables, and redundancy elimination publication-title: J Mach Learn Res – volume: 1 start-page: 203 issue: 2 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0190 article-title: Gene expression correlates of clinical prostate cancer behavior publication-title: Cancer cell doi: 10.1016/S1535-6108(02)00030-2 – volume: 347 start-page: 1999 issue: 25 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0035 article-title: A gene-expression signature as a predictor of survival in breast cancer publication-title: N E J Med doi: 10.1056/NEJMoa021967 – year: 2011 ident: 10.1016/j.artmed.2015.11.001_bib0130 article-title: Relaxed linear separability (RLS) approach to feature (gene) subset selection – volume: 18 start-page: 1332 issue: 10 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0105 article-title: Bayesian automatic relevance determination algorithms for classifying gene expression data publication-title: Bioinformatics doi: 10.1093/bioinformatics/18.10.1332 – start-page: 249 year: 1992 ident: 10.1016/j.artmed.2015.11.001_bib0165 article-title: A practical approach to feature selection – volume: 8 start-page: 68 issue: 1 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0185 article-title: Diffuse large b-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning publication-title: Nat Med doi: 10.1038/nm0102-68 – volume: 96 start-page: 6745 year: 1999 ident: 10.1016/j.artmed.2015.11.001_bib0065 article-title: Broad patterns of gene expressions revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays publication-title: PNAS doi: 10.1073/pnas.96.12.6745 – ident: 10.1016/j.artmed.2015.11.001_bib0155 – volume: 62 start-page: 1139 issue: 4 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0015 article-title: An integrated database of chemosensitivity to 55 anticancer drugs and gene expression profiles of 39 human cancer cell lines publication-title: Cancer Res – volume: 99 start-page: 6562 issue: 10 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0055 article-title: Selection bias in gene extraction on the basis of microarray gene-expression data publication-title: Proc Natl Acad Sci doi: 10.1073/pnas.102102699 – volume: 68 start-page: 1097 issue: 5 year: 2000 ident: 10.1016/j.artmed.2015.11.001_bib0060 article-title: A reality check for data snooping publication-title: Econometrica doi: 10.1111/1468-0262.00152 – start-page: 406 year: 2009 ident: 10.1016/j.artmed.2015.11.001_bib0095 article-title: Measuring stability of feature selection in biomedical datasets – year: 2007 ident: 10.1016/j.artmed.2015.11.001_bib0050 – volume: 269 start-page: 176 year: 2014 ident: 10.1016/j.artmed.2015.11.001_bib0120 article-title: Embedded local feature selection within mixture of experts publication-title: Inf Sci doi: 10.1016/j.ins.2014.01.008 – volume: 29 start-page: 43 issue: 2 year: 2009 ident: 10.1016/j.artmed.2015.11.001_bib0125 article-title: Feature selection based on relaxed linear separability publication-title: Biocybern Biomed Eng – volume: 62 start-page: 4963 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0175 article-title: Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesotheliomar publication-title: Cancer Res – start-page: 111 year: 1974 ident: 10.1016/j.artmed.2015.11.001_bib0200 article-title: Cross-validatory choice and assessment of statistical predictions publication-title: J Royal Stat Soc Ser B (Methodol) doi: 10.1111/j.2517-6161.1974.tb00994.x – volume: 415 start-page: 530 issue: 6871 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0030 article-title: Gene expression profiling predicts clinical outcome of breast cancer publication-title: Nature doi: 10.1038/415530a – volume: 3 start-page: 1333 year: 2003 ident: 10.1016/j.artmed.2015.11.001_bib0100 article-title: Grafting: Fast, incremental feature selection by gradient descent in function space publication-title: J Mach Learn Res – volume: 415 start-page: 436 issue: 6870 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0180 article-title: Prediction of central nervous system embryonal tumour outcome based on gene expression publication-title: Nature doi: 10.1038/415436a – volume: 96 start-page: 9212 issue: 16 year: 1999 ident: 10.1016/j.artmed.2015.11.001_bib0025 article-title: Distinctive gene expression patterns in human mammary epithelial cells and breast cancers publication-title: Proc Natl Acad Sci doi: 10.1073/pnas.96.16.9212 – start-page: 299 year: 2004 ident: 10.1016/j.artmed.2015.11.001_bib0110 article-title: Gene expression analysis: joint feature selection and classifier design publication-title: Kernel Methods Comput Biol – start-page: 849 year: 2006 ident: 10.1016/j.artmed.2015.11.001_bib0080 article-title: Feature subset selection bias for classification learning – volume: vol. 784 start-page: 171 year: 1994 ident: 10.1016/j.artmed.2015.11.001_bib0160 article-title: Estimating attributes: analysis and extensions of Relief – volume: 3 start-page: 185 issue: 2 year: 2005 ident: 10.1016/j.artmed.2015.11.001_bib0170 article-title: Minimum redundancy feature selection from microarray gene expression data publication-title: J Bioinform Comput Biol doi: 10.1142/S0219720005001004 – volume: 46 start-page: 389 issue: 1–3 year: 2002 ident: 10.1016/j.artmed.2015.11.001_bib0075 article-title: Gene selection for cancer classification using support vector machines publication-title: Mach Learn doi: 10.1023/A:1012487302797 – year: 2005 ident: 10.1016/j.artmed.2015.11.001_bib0135 – volume: 24 start-page: 863 issue: 9 year: 1991 ident: 10.1016/j.artmed.2015.11.001_bib0140 article-title: Design of piecewise linear classifiers from formal neurons by some basis exchange technique publication-title: Pattern Recognit doi: 10.1016/0031-3203(91)90005-P
SSID	ssj0007416
Score	2.3776596
Snippet	•We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure.•We examine its importance by an empirical study of four... Highlights • We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure. • We examine its importance by an empirical... Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this... OBJECTIVEFeature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being... Objective Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being...
SourceID	proquest pubmed crossref elsevier
SourceType	Aggregation Database Index Database Enrichment Source Publisher
StartPage	63
SubjectTerms	Accuracy Algorithms Bias Biomarkers, Tumor - genetics Cancer Classification Computational Biology - methods Convex and piecewise linear classifier Data mining Data Mining - methods Databases, Genetic Decision Support Techniques Feature selection bias Gene Expression Profiling - methods Gene Expression Regulation, Neoplastic Gene selection Genes Humans Internal Medicine Learning Linear Models Mathematical models Microarray data Oligonucleotide Array Sequence Analysis Other Pattern Recognition, Automated Reproducibility of Results Support Vector Machine
Title	The feature selection bias problem in relation to high-dimensional gene data
URI	https://www.clinicalkey.com/#!/content/1-s2.0-S0933365715001426 https://www.clinicalkey.es/playcontent/1-s2.0-S0933365715001426 https://dx.doi.org/10.1016/j.artmed.2015.11.001 https://www.ncbi.nlm.nih.gov/pubmed/26674595 https://www.proquest.com/docview/1767079170 https://www.proquest.com/docview/1768572872 https://www.proquest.com/docview/1793264824
Volume	66
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVESC databaseName: Baden-Württemberg Complete Freedom Collection (Elsevier) customDbUrl: eissn: 1873-2860 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0007416 issn: 0933-3657 databaseCode: GBLVA dateStart: 20110101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: Elsevier ScienceDirect customDbUrl: eissn: 1873-2860 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0007416 issn: 0933-3657 databaseCode: ACRLP dateStart: 19950201 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals customDbUrl: eissn: 1873-2860 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0007416 issn: 0933-3657 databaseCode: AIKHN dateStart: 19950201 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: Science Direct customDbUrl: eissn: 1873-2860 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0007416 issn: 0933-3657 databaseCode: .~1 dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVLSH databaseName: Elsevier Journals customDbUrl: mediaType: online eissn: 1873-2860 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0007416 issn: 0933-3657 databaseCode: AKRWK dateStart: 19890101 isFulltext: true providerName: Library Specific Holdings
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LT9wwELYQSBUXWmh5lSIjcTW78TtHhEBLH1wKEjfLdhxpEcoislz725lJHAoqBcRtd2NvksnY8038-RtC9m3pS17ryMZeCQYRX7KgPX4qIXlOSRShI8ie6cmF_H6pLhfI0bAXBmmVee7v5_Ruts6_jLI1RzfT6eg35uJCKwOQBnA-R9ltKQ1WMTj485fmgYij09sTgmHrYftcx_GC_4OYgwQvdYBanrk0zDPh6X_wswtDJ5_ISsaP9LC_xFWykJo18nGozUDzUF0jH37lRfPP5Ce4Aq1Tp-BJ267uDTwMGqa-pbmeDJ029DbT4uh8RlHEmFUo_N-LdlBws0SRTfqFXJwcnx9NWC6iwCLkJnNW2Kh1GQF2wV3aUPhUe4AVPPHa81DV3Cs47E2RkhWe-3GtrYlFxWvcNWusWCeLzaxJm4SW3FQClxlDFaWMpRdBVzr6UARfBiu3iBhs52JWGMdCF9duoJJdud7iDi0OyQcy6rYIe-h10ytsvNJeDY_FDbtHYb5zEAJe6Wee65faPGhbV7iWu7H7x7Ee93zim284597gNw6GLa7F-CbN7uBcRqM2YWHGL7axykBKy19qgwBcWg7W3-gd88GKgL2MVKXafvf1fyXL8C2_ctohi_Pbu_QNQNg87HajbJcsHZ7-mJzdA27YLwY
linkProvider	Elsevier
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LT9wwEB5RKrW9lJa-gD5cqVezGzt-5Fihom27cClI3CzbcaRFVRaR5cpvZyZxoFUpVL1FGzvJTsaeb-LP3wB8spWvRKMjn3olOUb8kgft6ajC5DklWYSeIHuoZ8fltxN1sgZ7414YolXmuX-Y0_vZOv8yydacnC0Wkx-Ui0utDEIaxPlCP4CHpRKGMrDdyxueB0GOXnBPSk7Nx_1zPckLL4hBhxheapfEPHNtmFvi09_wZx-H9p_B0wwg2efhGZ_DWmo3YWMszsDyWN2ERwd51fwFzNEXWJN6CU_W9YVv8G2wsPAdywVl2KJl55kXx1ZLRirGvCbl_0G1g6GfJUZ00pdwvP_laG_GcxUFHjE5WfHCRq2riLgL_6UNhU-NR1whkmi8CHUjvMLT3hQpWemFnzbamljUoqFts8bKV7DeLtv0BlglTC1pnTHUsSxj5WXQtY4-FMFXwZZbIEfbuZglxqnSxU83cslO3WBxRxbH7IModVvAr3udDRIb97RX42tx4_ZRnPAcxoB7-pnb-qUuj9rOFa4Tbur-8Kxfe_7mnP9wz4-j3zgct7QY49u0vMB7GU3ihIWZ3tnGKoM5rbirDSHw0gq0_uvBMa-tiODLlKpS2__9_B_g8ezoYO7mXw-_78ATPJO_P72F9dX5RXqHiGwV3vcj7gop_DCb
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+feature+selection+bias+problem+in+relation+to+high-dimensional+gene+data&rft.jtitle=Artificial+intelligence+in+medicine&rft.au=Krawczuk%2C+Jerzy&rft.au=ukaszuk%2C+Tomasz&rft.date=2016-01-01&rft.issn=0933-3657&rft.volume=66&rft.spage=63&rft.epage=71&rft_id=info:doi/10.1016%2Fj.artmed.2015.11.001&rft.externalDBID=NO_FULL_TEXT
thumbnail_m	http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Fcdn.clinicalkey.com%2Fck-thumbnails%2F09333657%2FS0933365716X00024%2Fcov150h.gif