Biased binomial assessment of cross-validated estimation of classification accuracies illustrated in diagnosis predictions
Multivariate classification is used in neuroimaging studies to infer brain activation or in medical applications to infer diagnosis. Their results are often assessed through either a binomial or a permutation test. Here, we simulated classification results of generated random data to assess the infl...
Saved in:
Published in | NeuroImage clinical Vol. 4; no. C; pp. 687 - 694 |
---|---|
Main Authors | , , , , , , , , |
Format | Journal Article Web Resource |
Language | English |
Published |
Netherlands
Elsevier Inc
01.01.2014
Elsevier |
Subjects | |
Online Access | Get full text |
ISSN | 2213-1582 2213-1582 |
DOI | 10.1016/j.nicl.2014.04.004 |
Cover
Summary: | Multivariate classification is used in neuroimaging studies to infer brain activation or in medical applications to infer diagnosis. Their results are often assessed through either a binomial or a permutation test. Here, we simulated classification results of generated random data to assess the influence of the cross-validation scheme on the significance of results. Distributions built from classification of random data with cross-validation did not follow the binomial distribution. The binomial test is therefore not adapted. On the contrary, the permutation test was unaffected by the cross-validation scheme. The influence of the cross-validation was further illustrated on real-data from a brain–computer interface experiment in patients with disorders of consciousness and from an fMRI study on patients with Parkinson disease. Three out of 16 patients with disorders of consciousness had significant accuracy on binomial testing, but only one showed significant accuracy using permutation testing. In the fMRI experiment, the mental imagery of gait could discriminate significantly between idiopathic Parkinson's disease patients and healthy subjects according to the permutation test but not according to the binomial test. Hence, binomial testing could lead to biased estimation of significance and false positive or negative results. In our view, permutation testing is thus recommended for clinical application of classification with cross-validation.
•We assess the influence of cross-validation on significance of classification results.•Classification of random data did not follow binomial distribution.•The permutation test was unaffected by the cross-validation scheme.•Results are illustrated on real-data from BCI and fMRI studies. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 scopus-id:2-s2.0-84899428996 Both authors contributed equally. |
ISSN: | 2213-1582 2213-1582 |
DOI: | 10.1016/j.nicl.2014.04.004 |