TargetATPsite: A template-free method for ATP-binding sites prediction with residue evolution image sparse representation and classifier ensemble
Understanding the interactions between proteins and ligands is critical for protein function annotations and drug discovery. We report a new sequence‐based template‐free predictor (TargetATPsite) to identify the Adenosine‐5′‐triphosphate (ATP) binding sites with machine‐learning approaches. Two step...
Saved in:
| Published in | Journal of computational chemistry Vol. 34; no. 11; pp. 974 - 985 |
|---|---|
| Main Authors | , , , , , , |
| Format | Journal Article |
| Language | English |
| Published |
Hoboken
Wiley Subscription Services, Inc., A Wiley Company
30.04.2013
Wiley Subscription Services, Inc |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0192-8651 1096-987X 1096-987X |
| DOI | 10.1002/jcc.23219 |
Cover
| Summary: | Understanding the interactions between proteins and ligands is critical for protein function annotations and drug discovery. We report a new sequence‐based template‐free predictor (TargetATPsite) to identify the Adenosine‐5′‐triphosphate (ATP) binding sites with machine‐learning approaches. Two steps are implemented in TargetATPsite: binding residues and pockets predictions, respectively. To predict the binding residues, a novel image sparse representation technique is proposed to encode residue evolution information treated as the input features. An ensemble classifier constructed based on support vector machines (SVM) from multiple random under‐samplings is used as the prediction model, which is effective for dealing with imbalance phenomenon between the positive and negative training samples. Compared with the existing ATP‐specific sequence‐based predictors, TargetATPsite is featured by the second step of possessing the capability of further identifying the binding pockets from the predicted binding residues through a spatial clustering algorithm. Experimental results on three benchmark datasets demonstrate the efficacy of TargetATPsite. © 2013 Wiley Periodicals, Inc.
Accurately localizing the protein‐ATP binding sites is important for protein function analysis and drug design. A template‐free machine learning‐based software named TargetATPsite is developed to predict protein‐ ATP binding residues and pockets from amino acid sequences. The binding residues are predicted by an ensemble classifier formed by support vector machines, and the input features are sparse representations of residue evolution images. Binding pockets are identified through spatially clustering predicted binding residues. |
|---|---|
| Bibliography: | ArticleID:JCC23219 Jiangsu Postdoctoral Science Foundation - No. 1201027C Industry-Academia Cooperation Innovation Fund Projects of Jiangsu Province - No. BY2012022 Natural Science Foundation of Jiangsu - No. BK2011371 Shanghai Science and Technology Commission - No. 11JC1404800 National Natural Science Foundation of China - No. 91130033; No. 61175024; No. 61233011; No. 61222306; No. 61006091 istex:C53F81CBE24B017D566A900C0B5FACB295BC7929 ark:/67375/WNG-2T008RJ1-6 Foundation for the Author of National Excellent Doctoral Dissertation of PR China - No. 201048 Fax: (+86) 21 34204022 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0192-8651 1096-987X 1096-987X |
| DOI: | 10.1002/jcc.23219 |