The feature selection bias problem in relation to high-dimensional gene data

•We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure.•We examine its importance by an empirical study of four feature selection methods.•For evaluating feature selection performance we use double cross-validation.•By the way, we examine the stability of t...

Full description

Saved in:
Bibliographic Details
Published inArtificial intelligence in medicine Vol. 66; pp. 63 - 71
Main Authors Krawczuk, Jerzy, Łukaszuk, Tomasz
Format Journal Article
LanguageEnglish
Published Netherlands Elsevier B.V 01.01.2016
Subjects
Online AccessGet full text
ISSN0933-3657
1873-2860
DOI10.1016/j.artmed.2015.11.001

Cover

Abstract •We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure.•We examine its importance by an empirical study of four feature selection methods.•For evaluating feature selection performance we use double cross-validation.•By the way, we examine the stability of the feature selection methods.•We recommend cross-validation for feature selection in order to reduce the selection bias. Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection. We address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature—colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability. Our main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation). This work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias.
AbstractList Objective Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection. Methods and materials We address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature-colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability. Results Our main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation). Conclusions This work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias.
•We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure.•We examine its importance by an empirical study of four feature selection methods.•For evaluating feature selection performance we use double cross-validation.•By the way, we examine the stability of the feature selection methods.•We recommend cross-validation for feature selection in order to reduce the selection bias. Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection. We address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature—colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability. Our main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation). This work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias.
Highlights • We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure. • We examine its importance by an empirical study of four feature selection methods. • For evaluating feature selection performance we use double cross-validation. • By the way, we examine the stability of the feature selection methods. • We recommend cross-validation for feature selection in order to reduce the selection bias.
OBJECTIVEFeature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection.METHODS AND MATERIALSWe address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature-colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability.RESULTSOur main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation).CONCLUSIONSThis work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias.
Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this paper, we consider feature selection for the classification of gene datasets. Gene data is usually composed of just a few dozen objects described by thousands of features. For this kind of data, it is easy to find a model that fits the learning data. However, it is not easy to find one that will simultaneously evaluate new data equally well as learning data. This overfitting issue is well known as regards classification and regression, but it also applies to feature selection. We address this problem and investigate its importance in an empirical study of four feature selection methods applied to seven high-dimensional gene datasets. We chose datasets that are well studied in the literature-colon cancer, leukemia and breast cancer. All the datasets are characterized by a significant number of features and the presence of exactly two decision classes. The feature selection methods used are ReliefF, minimum redundancy maximum relevance, support vector machine-recursive feature elimination and relaxed linear separability. Our main result reveals the existence of positive feature selection bias in all 28 experiments (7 datasets and 4 feature selection methods). Bias was calculated as the difference between validation and test accuracies and ranges from 2.6% to as much as 41.67%. The validation accuracy (biased accuracy) was calculated on the same dataset on which the feature selection was performed. The test accuracy was calculated for data that was not used for feature selection (by so called external cross-validation). This work provides evidence that using the same dataset for feature selection and learning is not appropriate. We recommend using cross-validation for feature selection in order to reduce selection bias.
Author Krawczuk, Jerzy
Łukaszuk, Tomasz
Author_xml – sequence: 1
  givenname: Jerzy
  surname: Krawczuk
  fullname: Krawczuk, Jerzy
– sequence: 2
  givenname: Tomasz
  surname: Łukaszuk
  fullname: Łukaszuk, Tomasz
  email: t.lukaszuk@pb.edu.pl
BackLink https://www.ncbi.nlm.nih.gov/pubmed/26674595$$D View this record in MEDLINE/PubMed
BookMark eNqVkk9v1DAQxS1URLeFb4BQjlwSPHZiOwghoYp_0kocKGdr4ky6Xpyk2AlSvz3ebuGAhFpOluzfex69N2fsZJonYuw58Ao4qFf7CuMyUl8JDk0FUHEOj9gGjJalMIqfsA1vpSylavQpO0tpzznXNagn7FQopeumbTZse7mjYiBc1khFokBu8fNUdB5TcR3nLtBY-KmIFPD2YZmLnb_alb0faUr5BkNxRRMVPS74lD0eMCR6dnees28f3l9efCq3Xz5-vni3LZ3iainBOKVaB0rm0U0HSANyDYLEgKLrB4FNfkYNREaiQD4oox30YhC8ltrIc_by6Jsn_LFSWuzok6MQcKJ5TRZ0K4WqjagfgCrTaGG0eAiquW5B84y-uEPXLldgr6MfMd7Y37lmoD4CLs4pRRr-IMDtoT67t8f67KE-C2BzfVn2-i-Z88tt8EtEH-4Tvz2KKUf_01O0yXmaHPU-5lptP_v_NXDBT95h-E43lPbzGnPdOQibhOX262G9DtsFTVbXQmWDN_82uP__X9uS32Y
CitedBy_id crossref_primary_10_3389_fgene_2021_618277
crossref_primary_10_1016_j_neucom_2018_02_100
crossref_primary_10_1016_j_cmpb_2018_03_017
crossref_primary_10_1038_s41389_019_0157_8
crossref_primary_10_7717_peerj_18405
crossref_primary_10_3389_fnhum_2021_734501
crossref_primary_10_1109_JIOT_2023_3237032
crossref_primary_10_23736_S1824_4785_19_03213_8
crossref_primary_10_1016_j_cmpb_2023_107987
crossref_primary_10_1016_j_engappai_2021_104628
crossref_primary_10_1016_j_ygeno_2019_01_006
crossref_primary_10_3390_ijms241311133
crossref_primary_10_1016_j_asoc_2025_112784
crossref_primary_10_1042_CS20180745
crossref_primary_10_1016_j_compbiomed_2022_105349
crossref_primary_10_1093_bioinformatics_btx298
crossref_primary_10_1093_bib_bbz061
crossref_primary_10_3934_mbe_2023366
crossref_primary_10_1016_j_artmed_2021_102228
crossref_primary_10_1016_j_tifs_2024_104853
crossref_primary_10_1016_j_jprot_2024_105298
crossref_primary_10_1007_s42235_023_00400_7
crossref_primary_10_1109_JBHI_2019_2908773
crossref_primary_10_1007_s11517_020_02301_x
crossref_primary_10_1177_20552076231189331
crossref_primary_10_1007_s41062_024_01419_3
crossref_primary_10_1093_bioinformatics_bty710
crossref_primary_10_1038_s41598_021_92287_9
crossref_primary_10_1109_TCYB_2020_3015756
crossref_primary_10_1016_j_ygeno_2019_07_002
crossref_primary_10_1109_TVCG_2021_3137174
crossref_primary_10_1038_s41598_021_91614_4
crossref_primary_10_1016_j_cmpb_2023_107934
crossref_primary_10_1038_s41598_022_27132_8
crossref_primary_10_1016_j_jacr_2023_06_025
crossref_primary_10_1049_el_2017_4550
crossref_primary_10_1109_TFUZZ_2022_3146969
crossref_primary_10_1038_s41538_021_00100_8
crossref_primary_10_1093_nargab_lqab065
crossref_primary_10_1016_j_artmed_2018_04_002
crossref_primary_10_1016_j_ccr_2025_216583
crossref_primary_10_1016_j_gce_2025_01_003
crossref_primary_10_1016_j_ijcip_2021_100436
crossref_primary_10_1109_TAI_2024_3436664
crossref_primary_10_1002_cyto_a_24901
crossref_primary_10_1016_j_rse_2019_111273
crossref_primary_10_1016_j_neucom_2024_128099
crossref_primary_10_1016_j_jneumeth_2021_109339
Cites_doi 10.1016/0031-3203(84)90059-1
10.1126/science.270.5235.467
10.1126/science.286.5439.531
10.1073/pnas.111153698
10.1038/ng1296-457
10.1093/bioinformatics/btm117
10.1016/S1535-6108(02)00030-2
10.1056/NEJMoa021967
10.1093/bioinformatics/18.10.1332
10.1038/nm0102-68
10.1073/pnas.96.12.6745
10.1073/pnas.102102699
10.1111/1468-0262.00152
10.1016/j.ins.2014.01.008
10.1111/j.2517-6161.1974.tb00994.x
10.1038/415530a
10.1038/415436a
10.1073/pnas.96.16.9212
10.1142/S0219720005001004
10.1023/A:1012487302797
10.1016/0031-3203(91)90005-P
ContentType Journal Article
Copyright 2015 Elsevier B.V.
Elsevier B.V.
Copyright © 2015 Elsevier B.V. All rights reserved.
Copyright_xml – notice: 2015 Elsevier B.V.
– notice: Elsevier B.V.
– notice: Copyright © 2015 Elsevier B.V. All rights reserved.
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
7QO
8FD
FR3
P64
7SC
JQ2
L7M
L~C
L~D
DOI 10.1016/j.artmed.2015.11.001
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
Biotechnology Research Abstracts
Technology Research Database
Engineering Research Database
Biotechnology and BioEngineering Abstracts
Computer and Information Systems Abstracts
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
Engineering Research Database
Biotechnology Research Abstracts
Technology Research Database
Biotechnology and BioEngineering Abstracts
Computer and Information Systems Abstracts
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList Computer and Information Systems Abstracts

Engineering Research Database

MEDLINE - Academic
MEDLINE

Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Computer Science
EISSN 1873-2860
EndPage 71
ExternalDocumentID 26674595
10_1016_j_artmed_2015_11_001
S0933365715001426
1_s2_0_S0933365715001426
Genre Validation Studies
Comparative Study
Research Support, Non-U.S. Gov't
Journal Article
GroupedDBID ---
--K
--M
.1-
.DC
.FO
.~1
0R~
1B1
1P~
1RT
1~.
1~5
23N
4.4
457
4G.
53G
5GY
5VS
7-5
71M
77I
77K
8P~
9JM
9JN
AAEDT
AAEDW
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AATTM
AAWTL
AAXKI
AAXUO
AAYFN
AAYWO
ABBOA
ABBQC
ABFNM
ABIVO
ABJNI
ABMAC
ABMZM
ABWVN
ABXDB
ACDAQ
ACGFS
ACIEU
ACIUM
ACLOT
ACNNM
ACRLP
ACRPL
ACVFH
ACZNC
ADBBV
ADCNI
ADEZE
ADJOM
ADMUD
ADNMO
AEBSH
AEIPS
AEKER
AENEX
AEUPX
AEVXI
AFJKZ
AFPUW
AFRHN
AFTJW
AFXIZ
AGHFR
AGQPQ
AGUBO
AGYEJ
AHHHB
AHZHX
AIALX
AIEXJ
AIGII
AIIUN
AIKHN
AITUG
AJRQY
AJUYK
AKBMS
AKRWK
AKYEP
ALMA_UNASSIGNED_HOLDINGS
AMRAJ
ANKPU
ANZVX
AOUOD
APXCP
ASPBG
AVWKF
AXJTR
AZFZN
BKOJK
BLXMC
BNPGV
CS3
EBS
EFJIC
EFKBS
EFLBG
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-2
G-Q
GBLVA
GBOLZ
HEA
HMK
HMO
HVGLF
HZ~
IHE
J1W
KOM
LZ2
M29
M41
MO0
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
Q38
R2-
ROL
RPZ
SAE
SDF
SDG
SDP
SEL
SES
SEW
SPC
SPCBC
SSH
SSV
SSZ
T5K
UHS
WH7
WUQ
Z5R
~G-
~HD
AACTN
AFCTW
AFKWA
AJOXV
AMFUW
RIG
AAIAV
ABLVK
ABYKQ
AJBFU
LCYCR
AAYXX
CITATION
AGCQF
AGRNS
CGR
CUY
CVF
ECM
EIF
NPM
7X8
7QO
8FD
FR3
P64
7SC
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c606t-18c669c1632018b1aefa0712e2fa2bdf2a569ca71ee83a2a0f687c1d2f2043783
IEDL.DBID .~1
ISSN 0933-3657
IngestDate Sun Sep 28 10:41:13 EDT 2025
Tue Oct 07 09:30:43 EDT 2025
Sun Sep 28 02:50:55 EDT 2025
Mon Jul 21 06:04:22 EDT 2025
Wed Oct 01 00:45:53 EDT 2025
Thu Apr 24 22:52:26 EDT 2025
Fri Feb 23 02:25:15 EST 2024
Sun Feb 23 10:19:32 EST 2025
Tue Oct 14 19:30:12 EDT 2025
IsPeerReviewed true
IsScholarly true
Keywords Convex and piecewise linear classifier
Support vector machine
Gene selection
Microarray data
Feature selection bias
Language English
License Copyright © 2015 Elsevier B.V. All rights reserved.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c606t-18c669c1632018b1aefa0712e2fa2bdf2a569ca71ee83a2a0f687c1d2f2043783
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ObjectType-Undefined-3
ObjectType-Article-1
ObjectType-Feature-2
PMID 26674595
PQID 1767079170
PQPubID 23479
PageCount 9
ParticipantIDs proquest_miscellaneous_1793264824
proquest_miscellaneous_1768572872
proquest_miscellaneous_1767079170
pubmed_primary_26674595
crossref_primary_10_1016_j_artmed_2015_11_001
crossref_citationtrail_10_1016_j_artmed_2015_11_001
elsevier_sciencedirect_doi_10_1016_j_artmed_2015_11_001
elsevier_clinicalkeyesjournals_1_s2_0_S0933365715001426
elsevier_clinicalkey_doi_10_1016_j_artmed_2015_11_001
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2016-01-01
PublicationDateYYYYMMDD 2016-01-01
PublicationDate_xml – month: 01
  year: 2016
  text: 2016-01-01
  day: 01
PublicationDecade 2010
PublicationPlace Netherlands
PublicationPlace_xml – name: Netherlands
PublicationTitle Artificial intelligence in medicine
PublicationTitleAlternate Artif Intell Med
PublicationYear 2016
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
References Dan, Tsunoda, Kitahara, Yanagawa, Zembutsu, Katagiri (bib0015) 2002; 62
Guyon, Elisseeff (bib0045) 2003; 3
Ambroise, McLachlan (bib0055) 2002; 99
Singhi, Liu (bib0080) 2006
Bobrowski, Łukaszuk (bib0125) 2009; 29
Zhang, Yu, Singer, Xiong (bib0070) 2001; 98
Bobrowski L. Feature subsets selection based on linear separbilty, Lecture notes of the VII-th ICB seminar: statistics and clinical practice.
Golub, Slonim, Tamayo, Huard, Gaasenbeek, Mesirov (bib0020) 1999; 286
van ‘t Veer, Dai, van de Vijver, He, Hart, Mao (bib0030) 2002; 415
Singh, Febbo, Ross, Jackson, Manola, Ladd (bib0190) 2002; 1
Krishnapuram, Carin, Hartemink (bib0110) 2004
Lustgarten, Gopalakrishnan, Visweswaran (bib0095) 2009
Kononenko (bib0160) 1994; vol. 784
Peralta, Soto (bib0120) 2014; 269
Gordon, Jensen, Hsiao, Gullans (bib0175) 2002; 62
Shipp, Ross, Tamayo, Weng, Kutok, Aguiar (bib0185) 2002; 8
Ding, Peng (bib0170) 2005; 3
Stone (bib0200) 1974
Pomeroy, Tamayo, Gaasenbeek, Sturla, Angelo, McLaughlin (bib0180) 2002; 415
Alon, Barkai, Gish, Ybara, Mack (bib0065) 1999; 96
Bobrowski, Łukaszuk (bib0130) 2011
Kira, Rendell (bib0165) 1992
Liu, Motoda (bib0050) 2007
White (bib0060) 2000; 68
Bobrowski, Łukaszuk (bib0205) 2004; vol. 3070
Bobrowski, Niemiro (bib0150) 1984; 17
Bellman (bib0040) 1961
Perou, Jeffrey, Van De Rijn, Rees, Eisen, Ross (bib0025) 1999; 96
Guyon, Weston, Barnhill, Vapnik (bib0075) 2002; 46
Bobrowski (bib0135) 2005
Tuv, Borisov, Runger, Torkkola (bib0115) 2009; 10
Bobrowski (bib0140) 1991; 24
Yu, Liu (bib0195) 2004
Perkins, Lacker, Theiler (bib0100) 2003; 3
Van De Vijver, He, van’t Veer, Dai, Hart, Voskuil (bib0035) 2002; 347
Kuncheva (bib0090) 2007
Zhu, Rosset, Hastie, Tibshirani (bib0145) 2004; 16
Wood, Visscher, Mengersen (bib0085) 2007; 23
Schena, Shalon, Davis, Brown (bib0005) 1995; 270
Li, Campbell, Tipping (bib0105) 2002; 18
DeRisi, Penland, Brown, Bittner, Meltzer, Ray (bib0010) 1996; 14
Zhang (10.1016/j.artmed.2015.11.001_bib0070) 2001; 98
Kononenko (10.1016/j.artmed.2015.11.001_bib0160) 1994; vol. 784
Stone (10.1016/j.artmed.2015.11.001_bib0200) 1974
Alon (10.1016/j.artmed.2015.11.001_bib0065) 1999; 96
Guyon (10.1016/j.artmed.2015.11.001_bib0075) 2002; 46
Golub (10.1016/j.artmed.2015.11.001_bib0020) 1999; 286
Bobrowski (10.1016/j.artmed.2015.11.001_bib0135) 2005
Krishnapuram (10.1016/j.artmed.2015.11.001_bib0110) 2004
Bobrowski (10.1016/j.artmed.2015.11.001_bib0125) 2009; 29
Bobrowski (10.1016/j.artmed.2015.11.001_bib0130) 2011
White (10.1016/j.artmed.2015.11.001_bib0060) 2000; 68
Shipp (10.1016/j.artmed.2015.11.001_bib0185) 2002; 8
Tuv (10.1016/j.artmed.2015.11.001_bib0115) 2009; 10
Liu (10.1016/j.artmed.2015.11.001_bib0050) 2007
Perkins (10.1016/j.artmed.2015.11.001_bib0100) 2003; 3
10.1016/j.artmed.2015.11.001_bib0155
Bellman (10.1016/j.artmed.2015.11.001_bib0040) 1961
Li (10.1016/j.artmed.2015.11.001_bib0105) 2002; 18
Lustgarten (10.1016/j.artmed.2015.11.001_bib0095) 2009
Zhu (10.1016/j.artmed.2015.11.001_bib0145) 2004; 16
Van De Vijver (10.1016/j.artmed.2015.11.001_bib0035) 2002; 347
Wood (10.1016/j.artmed.2015.11.001_bib0085) 2007; 23
Bobrowski (10.1016/j.artmed.2015.11.001_bib0150) 1984; 17
Peralta (10.1016/j.artmed.2015.11.001_bib0120) 2014; 269
DeRisi (10.1016/j.artmed.2015.11.001_bib0010) 1996; 14
Gordon (10.1016/j.artmed.2015.11.001_bib0175) 2002; 62
Yu (10.1016/j.artmed.2015.11.001_bib0195) 2004
van ‘t Veer (10.1016/j.artmed.2015.11.001_bib0030) 2002; 415
Ding (10.1016/j.artmed.2015.11.001_bib0170) 2005; 3
Dan (10.1016/j.artmed.2015.11.001_bib0015) 2002; 62
Guyon (10.1016/j.artmed.2015.11.001_bib0045) 2003; 3
Kira (10.1016/j.artmed.2015.11.001_bib0165) 1992
Singh (10.1016/j.artmed.2015.11.001_bib0190) 2002; 1
Bobrowski (10.1016/j.artmed.2015.11.001_bib0140) 1991; 24
Schena (10.1016/j.artmed.2015.11.001_bib0005) 1995; 270
Ambroise (10.1016/j.artmed.2015.11.001_bib0055) 2002; 99
Kuncheva (10.1016/j.artmed.2015.11.001_bib0090) 2007
Pomeroy (10.1016/j.artmed.2015.11.001_bib0180) 2002; 415
Bobrowski (10.1016/j.artmed.2015.11.001_bib0205) 2004; vol. 3070
Perou (10.1016/j.artmed.2015.11.001_bib0025) 1999; 96
Singhi (10.1016/j.artmed.2015.11.001_bib0080) 2006
References_xml – year: 1961
  ident: bib0040
  article-title: Adaptive control processes: a guided tour
– start-page: 406
  year: 2009
  end-page: 410
  ident: bib0095
  article-title: Measuring stability of feature selection in biomedical datasets
  publication-title: AMIA annual symposium proceedings, vol. 2009
– volume: 270
  start-page: 467
  year: 1995
  end-page: 470
  ident: bib0005
  article-title: Quantitative monitoring of gene expression patterns with a complementary DNA microarray
  publication-title: Science
– volume: 24
  start-page: 863
  year: 1991
  end-page: 870
  ident: bib0140
  article-title: Design of piecewise linear classifiers from formal neurons by some basis exchange technique
  publication-title: Pattern Recognit
– start-page: 111
  year: 1974
  end-page: 147
  ident: bib0200
  article-title: Cross-validatory choice and assessment of statistical predictions
  publication-title: J Royal Stat Soc Ser B (Methodol)
– volume: 68
  start-page: 1097
  year: 2000
  end-page: 1126
  ident: bib0060
  article-title: A reality check for data snooping
  publication-title: Econometrica
– volume: 3
  start-page: 185
  year: 2005
  end-page: 205
  ident: bib0170
  article-title: Minimum redundancy feature selection from microarray gene expression data
  publication-title: J Bioinform Comput Biol
– start-page: 737
  year: 2004
  end-page: 742
  ident: bib0195
  article-title: Redundancy based feature selection for microarray data
  publication-title: KDD ‘04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
– volume: 347
  start-page: 1999
  year: 2002
  end-page: 2009
  ident: bib0035
  article-title: A gene-expression signature as a predictor of survival in breast cancer
  publication-title: N E J Med
– volume: 269
  start-page: 176
  year: 2014
  end-page: 187
  ident: bib0120
  article-title: Embedded local feature selection within mixture of experts
  publication-title: Inf Sci
– volume: vol. 784
  start-page: 171
  year: 1994
  end-page: 182
  ident: bib0160
  article-title: Estimating attributes: analysis and extensions of Relief
  publication-title: Machine learning, ECML-94
– year: 2005
  ident: bib0135
  article-title: Data mining based on convex and piecewise linear (CPL) criterion functions (in Polish)
– volume: 96
  start-page: 9212
  year: 1999
  end-page: 9217
  ident: bib0025
  article-title: Distinctive gene expression patterns in human mammary epithelial cells and breast cancers
  publication-title: Proc Natl Acad Sci
– volume: 98
  start-page: 6730
  year: 2001
  end-page: 6735
  ident: bib0070
  article-title: Recursive partitioning for tumor classification with gene expression microarray data
  publication-title: Proc Natl Acad Sci
– start-page: 249
  year: 1992
  end-page: 256
  ident: bib0165
  article-title: A practical approach to feature selection
  publication-title: Proceedings of the ninth international workshop on machine learning
– volume: 415
  start-page: 436
  year: 2002
  end-page: 442
  ident: bib0180
  article-title: Prediction of central nervous system embryonal tumour outcome based on gene expression
  publication-title: Nature
– volume: 96
  start-page: 6745
  year: 1999
  end-page: 6750
  ident: bib0065
  article-title: Broad patterns of gene expressions revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
  publication-title: PNAS
– volume: 8
  start-page: 68
  year: 2002
  end-page: 74
  ident: bib0185
  article-title: Diffuse large b-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning
  publication-title: Nat Med
– volume: vol. 3070
  start-page: 544
  year: 2004
  end-page: 549
  ident: bib0205
  article-title: Selection of the linearly separable feature subsets
  publication-title: Artificial intelligence and soft computing: ICAISC‘2004
– volume: 29
  start-page: 43
  year: 2009
  end-page: 59
  ident: bib0125
  article-title: Feature selection based on relaxed linear separability
  publication-title: Biocybern Biomed Eng
– volume: 46
  start-page: 389
  year: 2002
  end-page: 422
  ident: bib0075
  article-title: Gene selection for cancer classification using support vector machines
  publication-title: Mach Learn
– volume: 62
  start-page: 4963
  year: 2002
  end-page: 4967
  ident: bib0175
  article-title: Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesotheliomar
  publication-title: Cancer Res
– volume: 415
  start-page: 530
  year: 2002
  end-page: 536
  ident: bib0030
  article-title: Gene expression profiling predicts clinical outcome of breast cancer
  publication-title: Nature
– volume: 23
  start-page: 1363
  year: 2007
  end-page: 1370
  ident: bib0085
  article-title: Classification based upon gene expression data: bias and precision of error rates
  publication-title: Bioinformatics
– start-page: 849
  year: 2006
  end-page: 856
  ident: bib0080
  article-title: Feature subset selection bias for classification learning
  publication-title: Proceedings of the 23rd international conference on machine learning, ICML ‘06
– start-page: 421
  year: 2007
  end-page: 427
  ident: bib0090
  article-title: A stability index for feature selection
  publication-title: Artificial intelligence and applications
– volume: 3
  start-page: 1333
  year: 2003
  end-page: 1356
  ident: bib0100
  article-title: Grafting: Fast, incremental feature selection by gradient descent in function space
  publication-title: J Mach Learn Res
– volume: 3
  start-page: 1157
  year: 2003
  end-page: 1182
  ident: bib0045
  article-title: An introduction to variable and feature selection
  publication-title: J Mach Learn Res
– year: 2011
  ident: bib0130
  article-title: Relaxed linear separability (RLS) approach to feature (gene) subset selection
  publication-title: Selected works in bioinformatics
– volume: 1
  start-page: 203
  year: 2002
  end-page: 209
  ident: bib0190
  article-title: Gene expression correlates of clinical prostate cancer behavior
  publication-title: Cancer cell
– volume: 14
  start-page: 457
  year: 1996
  end-page: 460
  ident: bib0010
  article-title: Use of a CDNA microarray to analyse gene expression patterns in human cancer
  publication-title: Nat Genet
– reference: Bobrowski L. Feature subsets selection based on linear separbilty, Lecture notes of the VII-th ICB seminar: statistics and clinical practice.
– volume: 62
  start-page: 1139
  year: 2002
  end-page: 1147
  ident: bib0015
  article-title: An integrated database of chemosensitivity to 55 anticancer drugs and gene expression profiles of 39 human cancer cell lines
  publication-title: Cancer Res
– volume: 99
  start-page: 6562
  year: 2002
  end-page: 6566
  ident: bib0055
  article-title: Selection bias in gene extraction on the basis of microarray gene-expression data
  publication-title: Proc Natl Acad Sci
– volume: 16
  start-page: 49
  year: 2004
  end-page: 56
  ident: bib0145
  article-title: 1-norm support vector machines
  publication-title: Adv Neural Inf Process Syst
– start-page: 299
  year: 2004
  end-page: 317
  ident: bib0110
  article-title: Gene expression analysis: joint feature selection and classifier design
  publication-title: Kernel Methods Comput Biol
– volume: 18
  start-page: 1332
  year: 2002
  end-page: 1339
  ident: bib0105
  article-title: Bayesian automatic relevance determination algorithms for classifying gene expression data
  publication-title: Bioinformatics
– volume: 10
  start-page: 1341
  year: 2009
  end-page: 1366
  ident: bib0115
  article-title: Feature selection with ensembles, artificial variables, and redundancy elimination
  publication-title: J Mach Learn Res
– volume: 286
  start-page: 531
  year: 1999
  end-page: 537
  ident: bib0020
  article-title: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring
  publication-title: Sciences
– year: 2007
  ident: bib0050
  article-title: Computational methods of feature selection
– volume: 17
  start-page: 205
  year: 1984
  end-page: 210
  ident: bib0150
  article-title: A method of synthesis of linear discriminant function in the case of nonseparability
  publication-title: Pattern Recognit
– year: 1961
  ident: 10.1016/j.artmed.2015.11.001_bib0040
– volume: 3
  start-page: 1157
  year: 2003
  ident: 10.1016/j.artmed.2015.11.001_bib0045
  article-title: An introduction to variable and feature selection
  publication-title: J Mach Learn Res
– volume: 17
  start-page: 205
  issue: 2
  year: 1984
  ident: 10.1016/j.artmed.2015.11.001_bib0150
  article-title: A method of synthesis of linear discriminant function in the case of nonseparability
  publication-title: Pattern Recognit
  doi: 10.1016/0031-3203(84)90059-1
– volume: 270
  start-page: 467
  issue: 5235
  year: 1995
  ident: 10.1016/j.artmed.2015.11.001_bib0005
  article-title: Quantitative monitoring of gene expression patterns with a complementary DNA microarray
  publication-title: Science
  doi: 10.1126/science.270.5235.467
– volume: 286
  start-page: 531
  year: 1999
  ident: 10.1016/j.artmed.2015.11.001_bib0020
  article-title: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring
  publication-title: Sciences
  doi: 10.1126/science.286.5439.531
– volume: 98
  start-page: 6730
  issue: 12
  year: 2001
  ident: 10.1016/j.artmed.2015.11.001_bib0070
  article-title: Recursive partitioning for tumor classification with gene expression microarray data
  publication-title: Proc Natl Acad Sci
  doi: 10.1073/pnas.111153698
– start-page: 421
  year: 2007
  ident: 10.1016/j.artmed.2015.11.001_bib0090
  article-title: A stability index for feature selection
– start-page: 737
  year: 2004
  ident: 10.1016/j.artmed.2015.11.001_bib0195
  article-title: Redundancy based feature selection for microarray data
– volume: 16
  start-page: 49
  issue: 1
  year: 2004
  ident: 10.1016/j.artmed.2015.11.001_bib0145
  article-title: 1-norm support vector machines
  publication-title: Adv Neural Inf Process Syst
– volume: 14
  start-page: 457
  issue: 4
  year: 1996
  ident: 10.1016/j.artmed.2015.11.001_bib0010
  article-title: Use of a CDNA microarray to analyse gene expression patterns in human cancer
  publication-title: Nat Genet
  doi: 10.1038/ng1296-457
– volume: vol. 3070
  start-page: 544
  year: 2004
  ident: 10.1016/j.artmed.2015.11.001_bib0205
  article-title: Selection of the linearly separable feature subsets
– volume: 23
  start-page: 1363
  issue: 11
  year: 2007
  ident: 10.1016/j.artmed.2015.11.001_bib0085
  article-title: Classification based upon gene expression data: bias and precision of error rates
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btm117
– volume: 10
  start-page: 1341
  year: 2009
  ident: 10.1016/j.artmed.2015.11.001_bib0115
  article-title: Feature selection with ensembles, artificial variables, and redundancy elimination
  publication-title: J Mach Learn Res
– volume: 1
  start-page: 203
  issue: 2
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0190
  article-title: Gene expression correlates of clinical prostate cancer behavior
  publication-title: Cancer cell
  doi: 10.1016/S1535-6108(02)00030-2
– volume: 347
  start-page: 1999
  issue: 25
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0035
  article-title: A gene-expression signature as a predictor of survival in breast cancer
  publication-title: N E J Med
  doi: 10.1056/NEJMoa021967
– year: 2011
  ident: 10.1016/j.artmed.2015.11.001_bib0130
  article-title: Relaxed linear separability (RLS) approach to feature (gene) subset selection
– volume: 18
  start-page: 1332
  issue: 10
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0105
  article-title: Bayesian automatic relevance determination algorithms for classifying gene expression data
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/18.10.1332
– start-page: 249
  year: 1992
  ident: 10.1016/j.artmed.2015.11.001_bib0165
  article-title: A practical approach to feature selection
– volume: 8
  start-page: 68
  issue: 1
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0185
  article-title: Diffuse large b-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning
  publication-title: Nat Med
  doi: 10.1038/nm0102-68
– volume: 96
  start-page: 6745
  year: 1999
  ident: 10.1016/j.artmed.2015.11.001_bib0065
  article-title: Broad patterns of gene expressions revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
  publication-title: PNAS
  doi: 10.1073/pnas.96.12.6745
– ident: 10.1016/j.artmed.2015.11.001_bib0155
– volume: 62
  start-page: 1139
  issue: 4
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0015
  article-title: An integrated database of chemosensitivity to 55 anticancer drugs and gene expression profiles of 39 human cancer cell lines
  publication-title: Cancer Res
– volume: 99
  start-page: 6562
  issue: 10
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0055
  article-title: Selection bias in gene extraction on the basis of microarray gene-expression data
  publication-title: Proc Natl Acad Sci
  doi: 10.1073/pnas.102102699
– volume: 68
  start-page: 1097
  issue: 5
  year: 2000
  ident: 10.1016/j.artmed.2015.11.001_bib0060
  article-title: A reality check for data snooping
  publication-title: Econometrica
  doi: 10.1111/1468-0262.00152
– start-page: 406
  year: 2009
  ident: 10.1016/j.artmed.2015.11.001_bib0095
  article-title: Measuring stability of feature selection in biomedical datasets
– year: 2007
  ident: 10.1016/j.artmed.2015.11.001_bib0050
– volume: 269
  start-page: 176
  year: 2014
  ident: 10.1016/j.artmed.2015.11.001_bib0120
  article-title: Embedded local feature selection within mixture of experts
  publication-title: Inf Sci
  doi: 10.1016/j.ins.2014.01.008
– volume: 29
  start-page: 43
  issue: 2
  year: 2009
  ident: 10.1016/j.artmed.2015.11.001_bib0125
  article-title: Feature selection based on relaxed linear separability
  publication-title: Biocybern Biomed Eng
– volume: 62
  start-page: 4963
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0175
  article-title: Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesotheliomar
  publication-title: Cancer Res
– start-page: 111
  year: 1974
  ident: 10.1016/j.artmed.2015.11.001_bib0200
  article-title: Cross-validatory choice and assessment of statistical predictions
  publication-title: J Royal Stat Soc Ser B (Methodol)
  doi: 10.1111/j.2517-6161.1974.tb00994.x
– volume: 415
  start-page: 530
  issue: 6871
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0030
  article-title: Gene expression profiling predicts clinical outcome of breast cancer
  publication-title: Nature
  doi: 10.1038/415530a
– volume: 3
  start-page: 1333
  year: 2003
  ident: 10.1016/j.artmed.2015.11.001_bib0100
  article-title: Grafting: Fast, incremental feature selection by gradient descent in function space
  publication-title: J Mach Learn Res
– volume: 415
  start-page: 436
  issue: 6870
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0180
  article-title: Prediction of central nervous system embryonal tumour outcome based on gene expression
  publication-title: Nature
  doi: 10.1038/415436a
– volume: 96
  start-page: 9212
  issue: 16
  year: 1999
  ident: 10.1016/j.artmed.2015.11.001_bib0025
  article-title: Distinctive gene expression patterns in human mammary epithelial cells and breast cancers
  publication-title: Proc Natl Acad Sci
  doi: 10.1073/pnas.96.16.9212
– start-page: 299
  year: 2004
  ident: 10.1016/j.artmed.2015.11.001_bib0110
  article-title: Gene expression analysis: joint feature selection and classifier design
  publication-title: Kernel Methods Comput Biol
– start-page: 849
  year: 2006
  ident: 10.1016/j.artmed.2015.11.001_bib0080
  article-title: Feature subset selection bias for classification learning
– volume: vol. 784
  start-page: 171
  year: 1994
  ident: 10.1016/j.artmed.2015.11.001_bib0160
  article-title: Estimating attributes: analysis and extensions of Relief
– volume: 3
  start-page: 185
  issue: 2
  year: 2005
  ident: 10.1016/j.artmed.2015.11.001_bib0170
  article-title: Minimum redundancy feature selection from microarray gene expression data
  publication-title: J Bioinform Comput Biol
  doi: 10.1142/S0219720005001004
– volume: 46
  start-page: 389
  issue: 1–3
  year: 2002
  ident: 10.1016/j.artmed.2015.11.001_bib0075
  article-title: Gene selection for cancer classification using support vector machines
  publication-title: Mach Learn
  doi: 10.1023/A:1012487302797
– year: 2005
  ident: 10.1016/j.artmed.2015.11.001_bib0135
– volume: 24
  start-page: 863
  issue: 9
  year: 1991
  ident: 10.1016/j.artmed.2015.11.001_bib0140
  article-title: Design of piecewise linear classifiers from formal neurons by some basis exchange technique
  publication-title: Pattern Recognit
  doi: 10.1016/0031-3203(91)90005-P
SSID ssj0007416
Score 2.3776596
Snippet •We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure.•We examine its importance by an empirical study of four...
Highlights • We analyze seven gene datasets to show the feature selection bias effect on the accuracy measure. • We examine its importance by an empirical...
Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being considered. In this...
OBJECTIVEFeature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being...
Objective Feature selection is a technique widely used in data mining. The aim is to select the best subset of features relevant to the problem being...
SourceID proquest
pubmed
crossref
elsevier
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 63
SubjectTerms Accuracy
Algorithms
Bias
Biomarkers, Tumor - genetics
Cancer
Classification
Computational Biology - methods
Convex and piecewise linear classifier
Data mining
Data Mining - methods
Databases, Genetic
Decision Support Techniques
Feature selection bias
Gene Expression Profiling - methods
Gene Expression Regulation, Neoplastic
Gene selection
Genes
Humans
Internal Medicine
Learning
Linear Models
Mathematical models
Microarray data
Oligonucleotide Array Sequence Analysis
Other
Pattern Recognition, Automated
Reproducibility of Results
Support Vector Machine
Title The feature selection bias problem in relation to high-dimensional gene data
URI https://www.clinicalkey.com/#!/content/1-s2.0-S0933365715001426
https://www.clinicalkey.es/playcontent/1-s2.0-S0933365715001426
https://dx.doi.org/10.1016/j.artmed.2015.11.001
https://www.ncbi.nlm.nih.gov/pubmed/26674595
https://www.proquest.com/docview/1767079170
https://www.proquest.com/docview/1768572872
https://www.proquest.com/docview/1793264824
Volume 66
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Baden-Württemberg Complete Freedom Collection (Elsevier)
  customDbUrl:
  eissn: 1873-2860
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0007416
  issn: 0933-3657
  databaseCode: GBLVA
  dateStart: 20110101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: Elsevier ScienceDirect
  customDbUrl:
  eissn: 1873-2860
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0007416
  issn: 0933-3657
  databaseCode: ACRLP
  dateStart: 19950201
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals
  customDbUrl:
  eissn: 1873-2860
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0007416
  issn: 0933-3657
  databaseCode: AIKHN
  dateStart: 19950201
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: Science Direct
  customDbUrl:
  eissn: 1873-2860
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0007416
  issn: 0933-3657
  databaseCode: .~1
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVLSH
  databaseName: Elsevier Journals
  customDbUrl:
  mediaType: online
  eissn: 1873-2860
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0007416
  issn: 0933-3657
  databaseCode: AKRWK
  dateStart: 19890101
  isFulltext: true
  providerName: Library Specific Holdings
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LT9wwELYQSBUXWmh5lSIjcTW78TtHhEBLH1wKEjfLdhxpEcoislz725lJHAoqBcRtd2NvksnY8038-RtC9m3pS17ryMZeCQYRX7KgPX4qIXlOSRShI8ie6cmF_H6pLhfI0bAXBmmVee7v5_Ruts6_jLI1RzfT6eg35uJCKwOQBnA-R9ltKQ1WMTj485fmgYij09sTgmHrYftcx_GC_4OYgwQvdYBanrk0zDPh6X_wswtDJ5_ISsaP9LC_xFWykJo18nGozUDzUF0jH37lRfPP5Ce4Aq1Tp-BJ267uDTwMGqa-pbmeDJ029DbT4uh8RlHEmFUo_N-LdlBws0SRTfqFXJwcnx9NWC6iwCLkJnNW2Kh1GQF2wV3aUPhUe4AVPPHa81DV3Cs47E2RkhWe-3GtrYlFxWvcNWusWCeLzaxJm4SW3FQClxlDFaWMpRdBVzr6UARfBiu3iBhs52JWGMdCF9duoJJdud7iDi0OyQcy6rYIe-h10ytsvNJeDY_FDbtHYb5zEAJe6Wee65faPGhbV7iWu7H7x7Ee93zim284597gNw6GLa7F-CbN7uBcRqM2YWHGL7axykBKy19qgwBcWg7W3-gd88GKgL2MVKXafvf1fyXL8C2_ctohi_Pbu_QNQNg87HajbJcsHZ7-mJzdA27YLwY
linkProvider Elsevier
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LT9wwEB5RKrW9lJa-gD5cqVezGzt-5Fihom27cClI3CzbcaRFVRaR5cpvZyZxoFUpVL1FGzvJTsaeb-LP3wB8spWvRKMjn3olOUb8kgft6ajC5DklWYSeIHuoZ8fltxN1sgZ7414YolXmuX-Y0_vZOv8yydacnC0Wkx-Ui0utDEIaxPlCP4CHpRKGMrDdyxueB0GOXnBPSk7Nx_1zPckLL4hBhxheapfEPHNtmFvi09_wZx-H9p_B0wwg2efhGZ_DWmo3YWMszsDyWN2ERwd51fwFzNEXWJN6CU_W9YVv8G2wsPAdywVl2KJl55kXx1ZLRirGvCbl_0G1g6GfJUZ00pdwvP_laG_GcxUFHjE5WfHCRq2riLgL_6UNhU-NR1whkmi8CHUjvMLT3hQpWemFnzbamljUoqFts8bKV7DeLtv0BlglTC1pnTHUsSxj5WXQtY4-FMFXwZZbIEfbuZglxqnSxU83cslO3WBxRxbH7IModVvAr3udDRIb97RX42tx4_ZRnPAcxoB7-pnb-qUuj9rOFa4Tbur-8Kxfe_7mnP9wz4-j3zgct7QY49u0vMB7GU3ihIWZ3tnGKoM5rbirDSHw0gq0_uvBMa-tiODLlKpS2__9_B_g8ezoYO7mXw-_78ATPJO_P72F9dX5RXqHiGwV3vcj7gop_DCb
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+feature+selection+bias+problem+in+relation+to+high-dimensional+gene+data&rft.jtitle=Artificial+intelligence+in+medicine&rft.au=Krawczuk%2C+Jerzy&rft.au=ukaszuk%2C+Tomasz&rft.date=2016-01-01&rft.issn=0933-3657&rft.volume=66&rft.spage=63&rft.epage=71&rft_id=info:doi/10.1016%2Fj.artmed.2015.11.001&rft.externalDBID=NO_FULL_TEXT
thumbnail_m http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Fcdn.clinicalkey.com%2Fck-thumbnails%2F09333657%2FS0933365716X00024%2Fcov150h.gif