self-adaptive genetic algorithm-artificial neural network algorithm with leave-one-out cross validation for descriptor selection in QSAR study
Based on the quantitative structure-activity relationships (QSARs) models developed by artificial neural networks (ANNs), genetic algorithm (GA) was used in the variable-selection approach with molecule descriptors and helped to improve the back-propagation training algorithm as well. The cross vali...
        Saved in:
      
    
          | Published in | Journal of computational chemistry Vol. 31; no. 10; pp. 1956 - 1968 | 
|---|---|
| Main Authors | , , , , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        Hoboken
          Wiley Subscription Services, Inc., A Wiley Company
    
        30.07.2010
     Wiley Subscription Services, Inc  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 0192-8651 1096-987X 1096-987X  | 
| DOI | 10.1002/jcc.21471 | 
Cover
| Summary: | Based on the quantitative structure-activity relationships (QSARs) models developed by artificial neural networks (ANNs), genetic algorithm (GA) was used in the variable-selection approach with molecule descriptors and helped to improve the back-propagation training algorithm as well. The cross validation techniques of leave-one-out investigated the validity of the generated ANN model and preferable variable combinations derived in the GAs. A self-adaptive GA-ANN model was successfully established by using a new estimate function for avoiding over-fitting phenomenon in ANN training. Compared with the variables selected in two recent QSAR studies that were based on stepwise multiple linear regression (MLR) models, the variables selected in self-adaptive GA-ANN model are superior in constructing ANN model, as they revealed a higher cross validation (CV) coefficient (Q²) and a lower root mean square deviation both in the established model and biological activity prediction. The introduced methods for validation, including leave-multiple-out, Y-randomization, and external validation, proved the superiority of the established GA-ANN models over MLR models in both stability and predictive power. Self-adaptive GA-ANN showed us a prospect of improving QSAR model. | 
|---|---|
| Bibliography: | http://dx.doi.org/10.1002/jcc.21471 istex:1D70A598CA5443AEF2FA57F5BD1DDF8253C13FD3 ArticleID:JCC21471 Optic Vector Computing Workstation at State Key Laboratory of Optoelectronic Materials of Sun Yat-sen University National Natural Science Foundation of the People's Republic of China - No. 90608012 High Performance Computing Center (HPCC) at Sun Yat-sen University ark:/67375/WNG-GR93QV2Z-1 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23  | 
| ISSN: | 0192-8651 1096-987X 1096-987X  | 
| DOI: | 10.1002/jcc.21471 |