Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment
The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making i...
        Saved in:
      
    
          | Published in | JMIR medical informatics Vol. 10; no. 4; p. e26353 | 
|---|---|
| Main Authors | , , , , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        Canada
          JMIR Publications
    
        11.04.2022
     | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 2291-9694 2291-9694  | 
| DOI | 10.2196/26353 | 
Cover
| Abstract | The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner.
The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language.
The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping.
The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics.
This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications. | 
    
|---|---|
| AbstractList | Background: The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. Objective: The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. Methods: The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject’s age, the subject’s gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language–based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network–based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. Results: The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. Conclusions: This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications. The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner.BACKGROUNDThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner.The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language.OBJECTIVEThe aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language.The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping.METHODSThe investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping.The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics.RESULTSThe proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics.This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.CONCLUSIONSThis paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications. BackgroundThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. ObjectiveThe aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. MethodsThe investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject’s age, the subject’s gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language–based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network–based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. ResultsThe proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. ConclusionsThis paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications. The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.  | 
    
| Author | Ghosn, Walid Morgand, Claire Falissard, Louis Bounebache, Karim Imbaud, Claire Rey, Grégoire  | 
    
| AuthorAffiliation | 1 Centre for Epidemiology on Medical Causes of Death Inserm Le Kremlin Bicêtre France | 
    
| AuthorAffiliation_xml | – name: 1 Centre for Epidemiology on Medical Causes of Death Inserm Le Kremlin Bicêtre France | 
    
| Author_xml | – sequence: 1 givenname: Louis orcidid: 0000-0001-5461-8330 surname: Falissard fullname: Falissard, Louis – sequence: 2 givenname: Claire orcidid: 0000-0003-2282-1494 surname: Morgand fullname: Morgand, Claire – sequence: 3 givenname: Walid orcidid: 0000-0003-2013-4586 surname: Ghosn fullname: Ghosn, Walid – sequence: 4 givenname: Claire orcidid: 0000-0003-2465-5186 surname: Imbaud fullname: Imbaud, Claire – sequence: 5 givenname: Karim orcidid: 0000-0002-3222-9905 surname: Bounebache fullname: Bounebache, Karim – sequence: 6 givenname: Grégoire orcidid: 0000-0001-7291-9444 surname: Rey fullname: Rey, Grégoire  | 
    
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/35404262$$D View this record in MEDLINE/PubMed | 
    
| BookMark | eNp1km9rFDEQxhep2FrvK0hAFEFO82-zG18Ix7W1B9cqUl8v2exk3SObnMlupZ_BL22615b2wFcJmV-emWdmXmYHzjvIshnBHymR4hMVLGfPsiNKJZlLIfnBo_thNotxgzEmnAghihfZIcs55lTQo-zvJYxBWXQVlItWDZ13SLkGLcbB92qABv0A7VvXTRFv0Gp5MicYXUDT6fTv1A0pBBGdBd-jSzVMamvl2lG18Bld-AYsOoFrsH7bgxsm9e8QjA-9chrQIkaI8Tb0KntulI0wuzuPs59np1fL8_n629fVcrGea1YWQ_KkSqaBF8mAMsZgRkvMjBSUc0oNr2sMppCSqwYIZ5ALI0EbUlBtat1QdpytdrqNV5tqG7pehZvKq66aHnxoKxWGTluoKKE5EbrhxtS85FhRYLxhYFhtCC5F0nq30xrdVt38UdY-CBJc3Q6nmoaTwC87cDvWPTQ6-U2depL9acR1v6rWX1cSM1lSnATe3wkE_3uEOFR9FzVYqxz4MaY0XNKc4pIn9M0euvFjcKmnicqL5CjPSaJeP67ooZT75UjA2x2gg48xgPmvtQ97nO6GaZWSkc7u0f8A3DrW1g | 
    
| CitedBy_id | crossref_primary_10_2196_40965 crossref_primary_10_1016_j_ijmedinf_2024_105462 crossref_primary_10_1016_j_artmed_2023_102622 crossref_primary_10_1016_j_medntd_2024_100319 crossref_primary_10_1002_trc2_70057 crossref_primary_10_1016_j_jbi_2022_104232 crossref_primary_10_1007_s10462_023_10677_z crossref_primary_10_1016_j_csl_2023_101582  | 
    
| Cites_doi | 10.1007/978-3-030-61377-8_39 10.1016/j.ijmedinf.2019.05.015 10.18653/v1/N19-1423 10.3115/1073083.1073135 10.1007/978-3-319-65340-2_12 10.3115/v1/d14-1179 10.18653/v1/2020.acl-demos.33  | 
    
| ContentType | Journal Article | 
    
| Copyright | Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. 2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. 2022  | 
    
| Copyright_xml | – notice: Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. – notice: 2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. 2022  | 
    
| DBID | AAYXX CITATION NPM 3V. 7X7 7XB 88C 8FI 8FJ 8FK ABUWG AFKRA AZQEC BENPR CCPQU DWQXO FYUFA GHDGH K9. M0S M0T PHGZM PHGZT PIMPY PJZUB PKEHL PPXIY PQEST PQQKQ PQUKI PRINS 7X8 5PM ADTOC UNPAY DOA  | 
    
| DOI | 10.2196/26353 | 
    
| DatabaseName | CrossRef PubMed ProQuest Central (Corporate) Health & Medical Collection ProQuest Central (purchase pre-March 2016) Healthcare Administration Database (Alumni) Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials ProQuest Central (New) (NC LIVE) ProQuest One Community College ProQuest Central Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Health & Medical Complete (Alumni) Health & Medical Collection (Alumni Edition) Healthcare Administration Database ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China MEDLINE - Academic PubMed Central (Full Participant titles) Unpaywall for CDI: Periodical Content Unpaywall DOAJ Directory of Open Access Journals  | 
    
| DatabaseTitle | CrossRef PubMed Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Central China ProQuest Central ProQuest Health & Medical Research Collection Health Research Premium Collection Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Health & Medical Research Collection ProQuest Central (New) ProQuest One Academic Eastern Edition ProQuest Health Management ProQuest Hospital Collection Health Research Premium Collection (Alumni) ProQuest Hospital Collection (Alumni) ProQuest Health & Medical Complete ProQuest One Academic UKI Edition ProQuest Health Management (Alumni Edition) ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) MEDLINE - Academic  | 
    
| DatabaseTitleList | Publicly Available Content Database MEDLINE - Academic PubMed  | 
    
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository – sequence: 4 dbid: BENPR name: ProQuest Central url: http://www.proquest.com/pqcentral?accountid=15518 sourceTypes: Aggregation Database  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Medicine | 
    
| EISSN | 2291-9694 | 
    
| ExternalDocumentID | oai_doaj_org_article_212516cd4ffb4840a2e34d3ef3bf1086 10.2196/26353 PMC9039820 35404262 10_2196_26353  | 
    
| Genre | Journal Article | 
    
| GroupedDBID | 53G 5VS 7X7 8FI 8FJ AAFWJ AAYXX ABUWG ADBBV AFKRA AFPKN ALMA_UNASSIGNED_HOLDINGS AOIJS BAWUL BCNDV BENPR CCPQU CITATION DIK EMOBN FYUFA GROUPED_DOAJ HMCUK HYE KQ8 M0T M48 M~E OK1 PGMZT PHGZM PHGZT PIMPY PJZUB PPXIY PUEGO RPM UKHRP ALIPV NPM 3V. 7XB 8FK AZQEC DWQXO K9. PKEHL PQEST PQQKQ PQUKI PRINS 7X8 5PM ADRAZ ADTOC UNPAY  | 
    
| ID | FETCH-LOGICAL-c387t-96a83ce47262afff032803f9624422f4bb0ef7994ade143e56f9ecf172cfbcd23 | 
    
| IEDL.DBID | M48 | 
    
| ISSN | 2291-9694 | 
    
| IngestDate | Tue Oct 14 19:08:41 EDT 2025 Sun Oct 26 03:32:25 EDT 2025 Tue Sep 30 16:53:44 EDT 2025 Thu Oct 02 11:24:59 EDT 2025 Tue Oct 07 06:50:44 EDT 2025 Thu Jan 02 22:54:55 EST 2025 Wed Oct 01 05:10:16 EDT 2025 Thu Apr 24 23:11:24 EDT 2025  | 
    
| IsDoiOpenAccess | true | 
    
| IsOpenAccess | true | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 4 | 
    
| Keywords | machine translation deep learning automated medical entity recognition mortality statistics machine learning ICD-10 coding  | 
    
| Language | English | 
    
| License | Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included. cc-by  | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c387t-96a83ce47262afff032803f9624422f4bb0ef7994ade143e56f9ecf172cfbcd23 | 
    
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23  | 
    
| ORCID | 0000-0003-2013-4586 0000-0003-2282-1494 0000-0003-2465-5186 0000-0001-7291-9444 0000-0001-5461-8330 0000-0002-3222-9905  | 
    
| OpenAccessLink | https://www.proquest.com/docview/2657516551?pq-origsite=%requestingapplication%&accountid=15518 | 
    
| PMID | 35404262 | 
    
| PQID | 2657516551 | 
    
| PQPubID | 4997117 | 
    
| ParticipantIDs | doaj_primary_oai_doaj_org_article_212516cd4ffb4840a2e34d3ef3bf1086 unpaywall_primary_10_2196_26353 pubmedcentral_primary_oai_pubmedcentral_nih_gov_9039820 proquest_miscellaneous_2649252084 proquest_journals_2657516551 pubmed_primary_35404262 crossref_primary_10_2196_26353 crossref_citationtrail_10_2196_26353  | 
    
| ProviderPackageCode | CITATION AAYXX  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 20220411 | 
    
| PublicationDateYYYYMMDD | 2022-04-11 | 
    
| PublicationDate_xml | – month: 4 year: 2022 text: 20220411 day: 11  | 
    
| PublicationDecade | 2020 | 
    
| PublicationPlace | Canada | 
    
| PublicationPlace_xml | – name: Canada – name: Toronto – name: Toronto, Canada  | 
    
| PublicationTitle | JMIR medical informatics | 
    
| PublicationTitleAlternate | JMIR Med Inform | 
    
| PublicationYear | 2022 | 
    
| Publisher | JMIR Publications | 
    
| Publisher_xml | – name: JMIR Publications | 
    
| References | ref13 ref12 ref15 ref14 ref11 ref10 ref2 ref1 ref17 ref16 ref18 ref8 ref7 ref9 ref4 ref3 ref6 ref5  | 
    
| References_xml | – ident: ref12 doi: 10.1007/978-3-030-61377-8_39 – ident: ref1 – ident: ref2 – ident: ref3 – ident: ref5 – ident: ref11 doi: 10.1016/j.ijmedinf.2019.05.015 – ident: ref6 doi: 10.18653/v1/N19-1423 – ident: ref18 doi: 10.3115/1073083.1073135 – ident: ref4 doi: 10.1007/978-3-319-65340-2_12 – ident: ref9 – ident: ref8 – ident: ref16 – ident: ref10 – ident: ref17 – ident: ref7 doi: 10.3115/v1/d14-1179 – ident: ref15 – ident: ref13 doi: 10.18653/v1/2020.acl-demos.33 – ident: ref14  | 
    
| SSID | ssj0001416667 | 
    
| Score | 2.2488456 | 
    
| Snippet | The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the... Background: The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical... BackgroundThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding...  | 
    
| SourceID | doaj unpaywall pubmedcentral proquest pubmed crossref  | 
    
| SourceType | Open Website Open Access Repository Aggregation Database Index Database Enrichment Source  | 
    
| StartPage | e26353 | 
    
| SubjectTerms | Artificial intelligence Automation Cardiovascular disease Classification Codes Datasets Deep learning Epidemiology Language Machine translation Mortality Natural language Neural networks Original Paper  | 
    
| SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwEB6hHgoSQuWdthQj9Ro1jR0n5rZ9rAqiFUJU6i2ynbFAWpKq7KriN_CnmXGyYRcqceGSQzyy_Bh7PnvG3wDsY5EFj4c-tVUlU2WNTytsfFoWLtACtBlW_Br5_EKfXar3V8XVSqovjgnr6YH7gTvI2QJr36gQnKLTiM1RqkZikC5wliDefbPKrBym4u2KYndYuQkPOdaZtOyASVfkmvGJHP13Acu_4yPvL9pr--PWzmYrxme6BY8G1CgmfWsfwz1sn8Dm-eAXfwo_mWODBKLl6aPbhG0bMVnMO4Kk2IhPy0AhKumCeHd8QlujGNw04nRgVhXTm-6buLCRjEN8GO4y3wrOmDYTKwFGsfaPvx8diMnI8PkMLqenn4_P0iHNQuplVc5To20lPaoy17kNITDDXiaD0WT58zwo5zIMpTHKNkjoCgsdDPpAyMcH55tcPoeNtmvxJQhdBG-cp6-3CmWw1mdeaSuRUIpDncD-cvxrP3CQcyqMWU1nEZ6mOk5TAnuj2HVPuvGnwBFP3ljIHNnxB2lOPWhO_S_NSWB3OfX1sHC_U_XsiNKEIxN4MxbTkmM_im2xW7AMMzqS1qkEXvSaMraEr9GY5D-Bck2H1pq6XtJ-_RJpvU0mDeGxBF6P2nZ377f_R-934EHOLzmYtvJwFzbmNwt8Rfhq7vbiUvoFno8nGg priority: 102 providerName: Directory of Open Access Journals – databaseName: ProQuest Central (New) (NC LIVE) dbid: BENPR link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3fb9MwED6NThpICPFrEBjDSHuNltmOkyAh1I1WA7Fqmpi0t8hxzoBUklJaIf4G_ml8rpOuMPGSh_gUObk7-8vd-TuAA0wTa_DIxDrPRSx1YeIcaxNnaWWdA-oEczqNfDZRp5fyw1V6tQWT7iwMlVV2a6JfqOvWUIz8kPsMgXIb_NvZ95i6RlF2tWuhoUNrhfqNpxi7BducmLEGsH08mpxfrKMuktJk2Q7cpRpoZ32HRMYiNjYlz91_E-D8t27y9rKZ6V8_9XR6bVMa34d7AU2y4Ur9D2ALm4ewcxby5Y_gN3FvOAG_I62q3phuajZcLloHVbFmF10BkRtpLXt_8s4tmSykb9goMK6y8bz9xibak3SwjyHG-ZpRJ7Upu1Z45J9-vj6MwIY98-djuByPPp2cxqH9QmxEni3iQulcGJQZV1xba4l5LxG2UA4RcG5lVSVos6KQukaHujBVtkBjHSIytjI1F7swaNoGnwJTqTVFZdzVaInCam0SI5UW6NBLhSqCg-77lyZwk1OLjGnp_lFITaVXUwT7vdhsRcbxt8AxKa8fJO5sf6Odfy6DKzpxh-mUqaW1lXT_t5qjkLVAKypLfaci2OtUXwaH_lGuzS-CV_2wc0XKr-gG2yXJENMjT3IZwZOVpfQzofAakf9HkG3Y0MZUN0ear1883XeRiMLhtAhe9tZ289s_-__En8MdTmc3iKjyaA8Gi_kSXzhEtaj2g5v8AQ7oJR8 priority: 102 providerName: ProQuest – databaseName: Unpaywall dbid: UNPAY link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fb9MwED6NThpIiJ8DAmMYaY-kTWPHjXkr26qBaFUhKo2nyHFsMeiSqbRC8C_wT3PnJmEdk-AlD_HJcuy7-LPv7juAA5tEzti-CXWa8lBoZcLUFiYcJLlDA9SRTSkbeTyRJzPx7jQ53YJXTS4MeZR9Hk73y_nZwrvy8Xge90TPEm0K702PRjdgWyaIvDuwPZtMh5-oflys-qGSSuzAbYpuRr3qefmN7caz8l8HJf-OiLy5Ki_0j-96Pr-03YzuwrgZ6DrK5Gt3tcy75ucVDsf__ZJ7cKfGnWy4VpT7sGXLB7Azrj3rD-EXsXSggN-71vFxTJcFG66WFYJaW7APTagRtlSOvT08wp8rqx097LjmZmWjRXXOJtrTebD39W3oa0Y11-bsUoiS7336J22BDVuO0F2YjY4_Hp6EdaGG0PB0sMSJ1yk3VgxiGWvnHHH0Rdwpidghjp3I88i6gVJCFxbxmU2kU9Y4xE7G5aaI-SPolFVpnwCTiTMqN_g0WljutDaREVJzizgntzKAg2Y9M1OzmFMxjXmGpxla9sxPbgD7rdjFmrbjqsAbUoa2kVi2_Qtcp6w2WhRH9CdNIZzLBZ6EdWy5KLh1PHdUoSqAvUaVstr0v2H35MqSiEQDeNk2o9GSJ0aXtlqRDHFCxlEqAni81rx2JHQRR2UCAhhs6OTGUDdbyrPPnhhcRVwhogvgRau913_9039KPINbpK3kN-v396CzXKzsc4Rfy3y_trrfctExmA priority: 102 providerName: Unpaywall  | 
    
| Title | Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment | 
    
| URI | https://www.ncbi.nlm.nih.gov/pubmed/35404262 https://www.proquest.com/docview/2657516551 https://www.proquest.com/docview/2649252084 https://pubmed.ncbi.nlm.nih.gov/PMC9039820 https://medinform.jmir.org/2022/4/e26353/PDF https://doaj.org/article/212516cd4ffb4840a2e34d3ef3bf1086  | 
    
| UnpaywallVersion | publishedVersion | 
    
| Volume | 10 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: KQ8 dateStart: 20130101 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: DOA dateStart: 20130101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVBFR databaseName: Free Medical Journals customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: DIK dateStart: 20130101 isFulltext: true titleUrlDefault: http://www.freemedicaljournals.com providerName: Flying Publisher – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: M~E dateStart: 20130101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre – providerCode: PRVAQN databaseName: PubMed Central customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: RPM dateStart: 20130101 isFulltext: true titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/ providerName: National Library of Medicine – providerCode: PRVPQU databaseName: Health & Medical Collection customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: 7X7 dateStart: 20130101 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: BENPR dateStart: 20130101 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVFZP databaseName: Scholars Portal Journals: Open Access customDbUrl: eissn: 2291-9694 dateEnd: 20250131 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: M48 dateStart: 20131001 isFulltext: true titleUrlDefault: http://journals.scholarsportal.info providerName: Scholars Portal  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1tb9MwED6xTSqTEOKdwAhG2tdAGjsvRkKoG60GolU1UWl8ihzHBqSQjNIK9hv409ylSWigfMkH-5I49l382Hd-DuDYhL7VZqg9lSTcE0pqLzG59uIws2iAyjcJnUaezqKzhXh3EW5FEzYd-H3n0o7ySS2WxfOf365eo8G_ojBmVKAXxKfC9-AAJydJ2RumDcKvt1kE-cXo0HQQyKEnIykGcKN35yEMaPODqNl7k1PN4b8LeP4bP3l9XV6qqx-qKLYmp8ktuNmgSjbaqMFtuGbKOzCYNn7zu_CLODhQoJ6ZNtFvTJU5G61XFUJWk7PzNpAIayrL3p6-wV8na9w4bNwwr7LJsvrKZqom62Dvm73Ol4wyqhVsKwCpfvr8z6EENuoYQO_BYjL-cHrmNWkYPM2TeIV9phKujYixg5S1lhj4fG5lhMggCKzIMt_YWEqhcoPoy4SRlUZbREbaZjoP-H3YL6vSPAQWhVbLTONVK2G4VUr7WkSKG0QxmYkcOG77P9UNRzmlyihSXKvQiKX1iDngdmKXG1KOvwVOaPC6SuLQrguq5ae0MUkUR2wX6VxYmwlc56rAcJFzY3lmKf-UA0ft0KetXuLjyVEVIc504FlXjSZJfhZVmmpNMsT4GPiJcODBRlO6lrSa5kDc06FeU_s15ZfPNe239LlEvObA007bdn_9o_--9TEcBnR8g7gqh0ewv1quzRMEVavMhb34Inbh4GQ8m5-79daEW5sSli1m89HH3yX8JcM | 
    
| linkProvider | Scholars Portal | 
    
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9tAEB4hkKASqvoEtxS2Ej1aGO_6VQlVARIlJYkQAombu17vtpVSOw2JEL-h_6m_rTPO2iEt6o2LD9lRtPbM7MzO4xuAfR14RulD5co45q6QiXJjnSs3CjKDCig9HVM38mAYdq_E5-vgegV-170wVFZZn4nVQZ2XimLkB36VIQjRwH8a_3RpahRlV-sRGtKOVsiPKogx29hxpu9u8Qp3c9Q7RX5_8P1O-_Kk69opA67icTR1k1DGXGkR-aEvjTEEMOdxk4Ro-HzfiCzztImSRMhco3Ohg9AkWhk0_MpkKifgAzQBa4KLBC9_a8ft4fnFIsojKC0XrcMm1VyjtB8Q-AtfMoLVrICHHNx_6zQ3ZsVY3t3K0eieEew8g6fWe2Wtubg9hxVdvID1gc3Pv4RfhPWBBJUFnFfZMVnkrDWbluga65xd1AVLuFIa1js5xSOa2XQRa1uEV9aZlD_YUFagIKxvY6ofGU1uG7F7hU7Vv58vmh9Yq0EafQVXj8KI17BalIXeBhYGRiWZwqeSQnMjpfKUCCXX6C1lOnRgv_7-qbJY6DSSY5TinYjYlFZscmC3IRvPwT_-Jjgm5jWLhNVd_VBOvqZW9ZEcfchQ5cKYTOB9Wvqai5xrwzNDc64c2KlZn9oD5CZdiLsD75tlVH3K58hClzOiIWRJ34uFA1tzSWl2QuE8GjbgQLQkQ0tbXV4pvn-r4MUTjyfoFzqw10jbw2__5v8b34ON7uWgn_Z7w7O38MSnvhECyTzcgdXpZKbfoTc3zXatyjD48tha-gdJ1WLB | 
    
| linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3daxNBEB9KhSiI-FlPa7tCfTxy3d3chyASm4bGtqGIhbyde3u7KsS7mCaU_g3-R_51zux9pNHiW1_uITuEvZvPnZn9DcCe6QVWm33tqzgWvlSJ9mOTaz_qZRYVUAUmptvIp-Pw6Fx-nPQmG_C7uQtDbZWNTXSGOi815ci73FUIQnTwXVu3RZwNhu9nP32aIEWV1macRiUix-bqEo9vF-9GA-T1G86Hh58Pjvx6woCvRRwt_CRUsdBGRjzkylpL4HKBsEmITo9zK7MsMDZKEqlyg4GF6YU2Mdqi09c20zmBHqD5vxMJkVA7YTSJVvkdSQW5qAP3qdsa5bxLsC9izf25KQE3hbb_dmjeXRYzdXWpptNr7m_4EB7UcSvrV4L2CDZM8Rg6p3Vl_gn8IpQPJHC-r-qvY6rIWX-5KDEoNjn71LQq4Upp2ehggMaZ1YUidlhju7LhvPzBxsrBgbCTOpv6ltHMtim71uLk_v1sde2B9VuM0adwfitseAabRVmY58DCntVJpvGplTTCKqUDLUMlDMZJmQk92Gu-f6prFHQaxjFN8TREbEodmzzYaclmFezH3wQfiHntIqF0ux_K-de0Vnokx-gx1Lm0NpN4klbcCJkLY0VmacKVB9sN69PadFykK0H34HW7jEpPlRxVmHJJNIQpyYNYerBVSUq7E0rk0ZgBD6I1GVrb6vpK8f2bAxZPApFgROjBbittN7_9i_9vfBc6qJvpyWh8_BLucbowQuiY-9uwuZgvzSsM4xbZjtMXBl9uW0H_AKetYFs | 
    
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fb9MwED6NThpIiJ8DAmMYaY-kTWPHjXkr26qBaFUhKo2nyHFsMeiSqbRC8C_wT3PnJmEdk-AlD_HJcuy7-LPv7juAA5tEzti-CXWa8lBoZcLUFiYcJLlDA9SRTSkbeTyRJzPx7jQ53YJXTS4MeZR9Hk73y_nZwrvy8Xge90TPEm0K702PRjdgWyaIvDuwPZtMh5-oflys-qGSSuzAbYpuRr3qefmN7caz8l8HJf-OiLy5Ki_0j-96Pr-03YzuwrgZ6DrK5Gt3tcy75ucVDsf__ZJ7cKfGnWy4VpT7sGXLB7Azrj3rD-EXsXSggN-71vFxTJcFG66WFYJaW7APTagRtlSOvT08wp8rqx097LjmZmWjRXXOJtrTebD39W3oa0Y11-bsUoiS7336J22BDVuO0F2YjY4_Hp6EdaGG0PB0sMSJ1yk3VgxiGWvnHHH0Rdwpidghjp3I88i6gVJCFxbxmU2kU9Y4xE7G5aaI-SPolFVpnwCTiTMqN_g0WljutDaREVJzizgntzKAg2Y9M1OzmFMxjXmGpxla9sxPbgD7rdjFmrbjqsAbUoa2kVi2_Qtcp6w2WhRH9CdNIZzLBZ6EdWy5KLh1PHdUoSqAvUaVstr0v2H35MqSiEQDeNk2o9GSJ0aXtlqRDHFCxlEqAni81rx2JHQRR2UCAhhs6OTGUDdbyrPPnhhcRVwhogvgRau913_9039KPINbpK3kN-v396CzXKzsc4Rfy3y_trrfctExmA | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Neural+Translation+and+Automated+Recognition+of+ICD-10+Medical+Entities+From+Natural+Language%3A+Model+Development+and+Performance+Assessment&rft.jtitle=JMIR+medical+informatics&rft.au=Falissard%2C+Louis&rft.au=Morgand%2C+Claire&rft.au=Ghosn%2C+Walid&rft.au=Imbaud%2C+Claire&rft.date=2022-04-11&rft.issn=2291-9694&rft.eissn=2291-9694&rft.volume=10&rft.issue=4&rft.spage=e26353&rft_id=info:doi/10.2196%2F26353&rft_id=info%3Apmid%2F35404262&rft.externalDocID=35404262 | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2291-9694&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2291-9694&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2291-9694&client=summon |