Generation of Tautomers Using Micro-pKa's
Solutions of organic molecules containing one or more heterocycles with conjugated bonds may exist as a mixture of tautomers, but typically only a few of them are significantly populated even though the potential number grows combinatorially with the number of protonation and deprotonation sites. Ge...
        Saved in:
      
    
          | Published in | Journal of chemical information and modeling Vol. 59; no. 6; p. 2672 | 
|---|---|
| Main Authors | , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        Washington
          American Chemical Society
    
        24.06.2019
     | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1549-9596 1549-960X  | 
| DOI | 10.1021/acs.jcim.8b00955 | 
Cover
| Abstract | Solutions of organic molecules containing one or more heterocycles with conjugated bonds may exist as a mixture of tautomers, but typically only a few of them are significantly populated even though the potential number grows combinatorially with the number of protonation and deprotonation sites. Generating the most stable tautomers from a given input structure is an important and challenging task, and numerous algorithms to tackle it have been proposed in the literature. This work describes a novel approach for tautomer prediction that involves the combined use of molecular mechanics, semiempirical quantum chemistry, and density functional theory. The key idea in our method is to identify the protonation and deprotonation sites using estimated micro-pKa's for every atom in the molecule as well as in its nearest protonated and deprotonated forms. To generate tautomers in a systematic way with minimal bias, we then consider the full set of tautomers that arise from the combinatorial distribution of all such mobile protons among all protonatable sites, with efficient postprocessing to screen away high-energy species. To estimate the micro-pKa's, we present a new method designed for the current task, but we emphasize that any alternative method can be used in conjunction with our basic algorithm. Our approach is therefore grounded in the computational prediction of physical properties in aqueous solution, in contrast to other approaches that may rely on the use of hard-coded rules of proton distribution, previously observed tautomerization patterns from a known chemical space, or human input. We present examples of the application of our algorithm to organic and drug-like molecules, with a focus on novel structures where traditional methods are expected to perform worse. | 
    
|---|---|
| AbstractList | Solutions of organic molecules containing one or more heterocycles with conjugated bonds may exist as a mixture of tautomers, but typically only a few of them are significantly populated even though the potential number grows combinatorially with the number of protonation and deprotonation sites. Generating the most stable tautomers from a given input structure is an important and challenging task, and numerous algorithms to tackle it have been proposed in the literature. This work describes a novel approach for tautomer prediction that involves the combined use of molecular mechanics, semiempirical quantum chemistry, and density functional theory. The key idea in our method is to identify the protonation and deprotonation sites using estimated micro-pKa's for every atom in the molecule as well as in its nearest protonated and deprotonated forms. To generate tautomers in a systematic way with minimal bias, we then consider the full set of tautomers that arise from the combinatorial distribution of all such mobile protons among all protonatable sites, with efficient postprocessing to screen away high-energy species. To estimate the micro-pKa's, we present a new method designed for the current task, but we emphasize that any alternative method can be used in conjunction with our basic algorithm. Our approach is therefore grounded in the computational prediction of physical properties in aqueous solution, in contrast to other approaches that may rely on the use of hard-coded rules of proton distribution, previously observed tautomerization patterns from a known chemical space, or human input. We present examples of the application of our algorithm to organic and drug-like molecules, with a focus on novel structures where traditional methods are expected to perform worse. | 
    
| Author | Yu, Haoyu S Watson, Mark A Bochevarov, Art D  | 
    
| Author_xml | – sequence: 1 givenname: Mark surname: Watson middlename: A fullname: Watson, Mark A – sequence: 2 givenname: Haoyu surname: Yu middlename: S fullname: Yu, Haoyu S – sequence: 3 givenname: Art surname: Bochevarov middlename: D fullname: Bochevarov, Art D  | 
    
| BookMark | eNo1jTtPwzAYAC1UJNrCzhiJATEkfLb7-TGiihZEEUsrsVV-okTUDnHy_0ECprvpbkFmKadAyDWFhgKj98aVpnPtqVEWQCOekTnFla61gPfZv6MWF2RRSgfAuRZsTu62IYXBjG1OVY7V3kxjPoWhVIfSpo_qtXVDrvsXc1suyXk0nyVc_XFJDpvH_fqp3r1tn9cPu7qnlI-1d0ZqpEJi5BCEUDZ6q0BCVCvrJaKLhgP1jjvLRFRRe6mt8YjRKW2BL8nNb7cf8tcUynjs8jSkn-WRMeSMo5aKfwN63EZc | 
    
| ContentType | Journal Article | 
    
| Copyright | Copyright American Chemical Society Jun 24, 2019 | 
    
| Copyright_xml | – notice: Copyright American Chemical Society Jun 24, 2019 | 
    
| DBID | 7SC 7SR 7U5 8BQ 8FD JG9 JQ2 L7M L~C L~D  | 
    
| DOI | 10.1021/acs.jcim.8b00955 | 
    
| DatabaseName | Computer and Information Systems Abstracts Engineered Materials Abstracts Solid State and Superconductivity Abstracts METADEX Technology Research Database Materials Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts  Academic Computer and Information Systems Abstracts Professional  | 
    
| DatabaseTitle | Materials Research Database Engineered Materials Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest Computer Science Collection Computer and Information Systems Abstracts Solid State and Superconductivity Abstracts Advanced Technologies Database with Aerospace METADEX Computer and Information Systems Abstracts Professional  | 
    
| DatabaseTitleList | Materials Research Database | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Chemistry | 
    
| EISSN | 1549-960X | 
    
| GroupedDBID | --- -~X 4.4 55A 5GY 5VS 7SC 7SR 7U5 7~N 8BQ 8FD AABXI ABBLG ABJNI ABLBI ABMVS ABQRX ABUCX ACGFS ACIWK ACNCT ACS ADHLV AEESW AENEX AFEFF AHGAQ ALMA_UNASSIGNED_HOLDINGS AQSVZ CUPRZ D0L DU5 EBS ED~ EJD F5P GGK GNL IH9 JG9 JG~ JQ2 L7M L~C L~D P2P PQQKQ RNS ROL UI2 VF5 VG9 W1F  | 
    
| ID | FETCH-LOGICAL-p113t-dca7951675f30e668bfdb8070f84bd755cfa301dc3cb26f8f9d79bad55fc89b03 | 
    
| ISSN | 1549-9596 | 
    
| IngestDate | Mon Jun 30 10:55:50 EDT 2025 | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 6 | 
    
| Language | English | 
    
| LinkModel | OpenURL | 
    
| MergedId | FETCHMERGED-LOGICAL-p113t-dca7951675f30e668bfdb8070f84bd755cfa301dc3cb26f8f9d79bad55fc89b03 | 
    
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14  | 
    
| PQID | 2253235978 | 
    
| PQPubID | 28739 | 
    
| ParticipantIDs | proquest_journals_2253235978 | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 20190624 | 
    
| PublicationDateYYYYMMDD | 2019-06-24 | 
    
| PublicationDate_xml | – month: 06 year: 2019 text: 20190624 day: 24  | 
    
| PublicationDecade | 2010 | 
    
| PublicationPlace | Washington | 
    
| PublicationPlace_xml | – name: Washington | 
    
| PublicationTitle | Journal of chemical information and modeling | 
    
| PublicationYear | 2019 | 
    
| Publisher | American Chemical Society | 
    
| Publisher_xml | – name: American Chemical Society | 
    
| SSID | ssj0033962 | 
    
| Score | 2.338535 | 
    
| Snippet | Solutions of organic molecules containing one or more heterocycles with conjugated bonds may exist as a mixture of tautomers, but typically only a few of them... | 
    
| SourceID | proquest | 
    
| SourceType | Aggregation Database | 
    
| StartPage | 2672 | 
    
| SubjectTerms | Algorithms Aqueous solutions Combinatorial analysis Density functional theory Organic chemistry Physical properties Protonation Quantum chemistry Tautomers  | 
    
| Title | Generation of Tautomers Using Micro-pKa's | 
    
| URI | https://www.proquest.com/docview/2253235978 | 
    
| Volume | 59 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVABC databaseName: American Chemical Society Journals customDbUrl: eissn: 1549-960X dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0033962 issn: 1549-9596 databaseCode: ACS dateStart: 20050101 isFulltext: true titleUrlDefault: https://pubs.acs.org/action/showPublications?display=journals providerName: American Chemical Society  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3fb9MwELZYeYCXCRhosA7lATEh5JLEjhM_dlWrCdryQCqVp8o_EqmTaEqbVIK_nrPrJBUFNPZiRY4UJ7nTd-fz3X0IvWGK-DISBBPqZ5gGWmER-gJnVMSB9FlCE1PvPJmymxn9OI_mbZ6urS4pZU_9_GNdyX2kCnMgV1Ml-x-SbR4KE3AN8oURJAzjnWS87xld-3ypqMrChKHf7_MAJibXDq8_CROT_4sTqup-Aa6BallnJ1uGnNqs2YB76WqzTHVPGwH9WlnjJYofVRtFvTY0XDuxKXYWejalyyt20QVT0MRw2EYXm2Ojpn2BSyY9hEzKMY-4a2h9MMf8-SHOus7fy2PQZHv2niM0B__DmCq17d2q5bdeIm3DvNZy1af108-L0Ww8XqTDefp2_R0bTjFz9u4IVk7QwxAw3xB79AdfajtNCLd0s837u0NsWPTD70seGWrrfaRP0KmTmNff68BT9CBbPUOPBjVb3xl61-qCV-Reowue1QWv0YWr7XM0Gw3TwQ12NBh4HQSkxFqJGPxg2NnlxM8YS2SuZQJQnSdU6jiKVC4AprUiSoYsT3KuYy6Fjkx6Hpc-eYE6q2KVnSNPZ-BeciYYk6ZLqOaBpDmJNTXMPiIgL1G3_s6F0_PtAhCfhAQ2nsmrf9--QI9bFeqiTrmpsktw2Ur52v75X1HzRAU | 
    
| linkProvider | American Chemical Society | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Generation+of+Tautomers+Using+Micro-pKa%27s&rft.jtitle=Journal+of+chemical+information+and+modeling&rft.au=Watson%2C+Mark+A&rft.au=Yu%2C+Haoyu+S&rft.au=Bochevarov%2C+Art+D&rft.date=2019-06-24&rft.pub=American+Chemical+Society&rft.issn=1549-9596&rft.eissn=1549-960X&rft.volume=59&rft.issue=6&rft.spage=2672&rft_id=info:doi/10.1021%2Facs.jcim.8b00955&rft.externalDBID=NO_FULL_TEXT | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1549-9596&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1549-9596&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1549-9596&client=summon |