Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment

The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making i...

Full description

Saved in:

Bibliographic Details
Published in	JMIR medical informatics Vol. 10; no. 4; p. e26353
Main Authors	Falissard, Louis, Morgand, Claire, Ghosn, Walid, Imbaud, Claire, Bounebache, Karim, Rey, Grégoire
Format	Journal Article
Language	English
Published	Canada JMIR Publications 11.04.2022
Subjects	Artificial intelligence Automation Cardiovascular disease Classification Codes Datasets Deep learning Epidemiology Language Machine translation Mortality Natural language Neural networks Original Paper machine translation deep learning automated medical entity recognition mortality statistics machine learning ICD-10 coding
Online Access	Get full text
ISSN	2291-9694 2291-9694
DOI	10.2196/26353

Cover

Abstract	The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.
AbstractList	Background: The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. Objective: The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. Methods: The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject’s age, the subject’s gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language–based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network–based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. Results: The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. Conclusions: This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications. The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner.BACKGROUNDThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner.The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language.OBJECTIVEThe aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language.The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping.METHODSThe investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping.The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics.RESULTSThe proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics.This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.CONCLUSIONSThis paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications. BackgroundThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. ObjectiveThe aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. MethodsThe investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject’s age, the subject’s gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language–based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network–based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. ResultsThe proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. ConclusionsThis paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications. The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.
Author	Ghosn, Walid Morgand, Claire Falissard, Louis Bounebache, Karim Imbaud, Claire Rey, Grégoire
AuthorAffiliation	1 Centre for Epidemiology on Medical Causes of Death Inserm Le Kremlin Bicêtre France
AuthorAffiliation_xml	– name: 1 Centre for Epidemiology on Medical Causes of Death Inserm Le Kremlin Bicêtre France
Author_xml	– sequence: 1 givenname: Louis orcidid: 0000-0001-5461-8330 surname: Falissard fullname: Falissard, Louis – sequence: 2 givenname: Claire orcidid: 0000-0003-2282-1494 surname: Morgand fullname: Morgand, Claire – sequence: 3 givenname: Walid orcidid: 0000-0003-2013-4586 surname: Ghosn fullname: Ghosn, Walid – sequence: 4 givenname: Claire orcidid: 0000-0003-2465-5186 surname: Imbaud fullname: Imbaud, Claire – sequence: 5 givenname: Karim orcidid: 0000-0002-3222-9905 surname: Bounebache fullname: Bounebache, Karim – sequence: 6 givenname: Grégoire orcidid: 0000-0001-7291-9444 surname: Rey fullname: Rey, Grégoire
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/35404262$$D View this record in MEDLINE/PubMed
BookMark	eNp1km9rFDEQxhep2FrvK0hAFEFO82-zG18Ix7W1B9cqUl8v2exk3SObnMlupZ_BL22615b2wFcJmV-emWdmXmYHzjvIshnBHymR4hMVLGfPsiNKJZlLIfnBo_thNotxgzEmnAghihfZIcs55lTQo-zvJYxBWXQVlItWDZ13SLkGLcbB92qABv0A7VvXTRFv0Gp5MicYXUDT6fTv1A0pBBGdBd-jSzVMamvl2lG18Bld-AYsOoFrsH7bgxsm9e8QjA-9chrQIkaI8Tb0KntulI0wuzuPs59np1fL8_n629fVcrGea1YWQ_KkSqaBF8mAMsZgRkvMjBSUc0oNr2sMppCSqwYIZ5ALI0EbUlBtat1QdpytdrqNV5tqG7pehZvKq66aHnxoKxWGTluoKKE5EbrhxtS85FhRYLxhYFhtCC5F0nq30xrdVt38UdY-CBJc3Q6nmoaTwC87cDvWPTQ6-U2depL9acR1v6rWX1cSM1lSnATe3wkE_3uEOFR9FzVYqxz4MaY0XNKc4pIn9M0euvFjcKmnicqL5CjPSaJeP67ooZT75UjA2x2gg48xgPmvtQ97nO6GaZWSkc7u0f8A3DrW1g
CitedBy_id	crossref_primary_10_2196_40965 crossref_primary_10_1016_j_ijmedinf_2024_105462 crossref_primary_10_1016_j_artmed_2023_102622 crossref_primary_10_1016_j_medntd_2024_100319 crossref_primary_10_1002_trc2_70057 crossref_primary_10_1016_j_jbi_2022_104232 crossref_primary_10_1007_s10462_023_10677_z crossref_primary_10_1016_j_csl_2023_101582
Cites_doi	10.1007/978-3-030-61377-8_39 10.1016/j.ijmedinf.2019.05.015 10.18653/v1/N19-1423 10.3115/1073083.1073135 10.1007/978-3-319-65340-2_12 10.3115/v1/d14-1179 10.18653/v1/2020.acl-demos.33
ContentType	Journal Article
Copyright	Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. 2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. 2022
Copyright_xml	– notice: Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. – notice: 2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. 2022
DBID	AAYXX CITATION NPM 3V. 7X7 7XB 88C 8FI 8FJ 8FK ABUWG AFKRA AZQEC BENPR CCPQU DWQXO FYUFA GHDGH K9. M0S M0T PHGZM PHGZT PIMPY PJZUB PKEHL PPXIY PQEST PQQKQ PQUKI PRINS 7X8 5PM ADTOC UNPAY DOA
DOI	10.2196/26353
DatabaseName	CrossRef PubMed ProQuest Central (Corporate) Health & Medical Collection ProQuest Central (purchase pre-March 2016) Healthcare Administration Database (Alumni) Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials ProQuest Central (New) (NC LIVE) ProQuest One Community College ProQuest Central Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Health & Medical Complete (Alumni) Health & Medical Collection (Alumni Edition) Healthcare Administration Database ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China MEDLINE - Academic PubMed Central (Full Participant titles) Unpaywall for CDI: Periodical Content Unpaywall DOAJ Directory of Open Access Journals
DatabaseTitle	CrossRef PubMed Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Central China ProQuest Central ProQuest Health & Medical Research Collection Health Research Premium Collection Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Health & Medical Research Collection ProQuest Central (New) ProQuest One Academic Eastern Edition ProQuest Health Management ProQuest Hospital Collection Health Research Premium Collection (Alumni) ProQuest Hospital Collection (Alumni) ProQuest Health & Medical Complete ProQuest One Academic UKI Edition ProQuest Health Management (Alumni Edition) ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) MEDLINE - Academic
DatabaseTitleList	Publicly Available Content Database MEDLINE - Academic PubMed
Database_xml	– sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository – sequence: 4 dbid: BENPR name: ProQuest Central url: http://www.proquest.com/pqcentral?accountid=15518 sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine
EISSN	2291-9694
ExternalDocumentID	oai_doaj_org_article_212516cd4ffb4840a2e34d3ef3bf1086 10.2196/26353 PMC9039820 35404262 10_2196_26353
Genre	Journal Article
GroupedDBID	53G 5VS 7X7 8FI 8FJ AAFWJ AAYXX ABUWG ADBBV AFKRA AFPKN ALMA_UNASSIGNED_HOLDINGS AOIJS BAWUL BCNDV BENPR CCPQU CITATION DIK EMOBN FYUFA GROUPED_DOAJ HMCUK HYE KQ8 M0T M48 M~E OK1 PGMZT PHGZM PHGZT PIMPY PJZUB PPXIY PUEGO RPM UKHRP ALIPV NPM 3V. 7XB 8FK AZQEC DWQXO K9. PKEHL PQEST PQQKQ PQUKI PRINS 7X8 5PM ADRAZ ADTOC UNPAY
ID	FETCH-LOGICAL-c387t-96a83ce47262afff032803f9624422f4bb0ef7994ade143e56f9ecf172cfbcd23
IEDL.DBID	M48
ISSN	2291-9694
IngestDate	Tue Oct 14 19:08:41 EDT 2025 Sun Oct 26 03:32:25 EDT 2025 Tue Sep 30 16:53:44 EDT 2025 Thu Oct 02 11:24:59 EDT 2025 Tue Oct 07 06:50:44 EDT 2025 Thu Jan 02 22:54:55 EST 2025 Wed Oct 01 05:10:16 EDT 2025 Thu Apr 24 23:11:24 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	4
Keywords	machine translation deep learning automated medical entity recognition mortality statistics machine learning ICD-10 coding
Language	English
License	Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included. cc-by
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c387t-96a83ce47262afff032803f9624422f4bb0ef7994ade143e56f9ecf172cfbcd23
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ORCID	0000-0003-2013-4586 0000-0003-2282-1494 0000-0003-2465-5186 0000-0001-7291-9444 0000-0001-5461-8330 0000-0002-3222-9905
OpenAccessLink	https://www.proquest.com/docview/2657516551?pq-origsite=%requestingapplication%&accountid=15518
PMID	35404262
PQID	2657516551
PQPubID	4997117
ParticipantIDs	doaj_primary_oai_doaj_org_article_212516cd4ffb4840a2e34d3ef3bf1086 unpaywall_primary_10_2196_26353 pubmedcentral_primary_oai_pubmedcentral_nih_gov_9039820 proquest_miscellaneous_2649252084 proquest_journals_2657516551 pubmed_primary_35404262 crossref_primary_10_2196_26353 crossref_citationtrail_10_2196_26353
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	20220411
PublicationDateYYYYMMDD	2022-04-11
PublicationDate_xml	– month: 4 year: 2022 text: 20220411 day: 11
PublicationDecade	2020
PublicationPlace	Canada
PublicationPlace_xml	– name: Canada – name: Toronto – name: Toronto, Canada
PublicationTitle	JMIR medical informatics
PublicationTitleAlternate	JMIR Med Inform
PublicationYear	2022
Publisher	JMIR Publications
Publisher_xml	– name: JMIR Publications
References	ref13 ref12 ref15 ref14 ref11 ref10 ref2 ref1 ref17 ref16 ref18 ref8 ref7 ref9 ref4 ref3 ref6 ref5
References_xml	– ident: ref12 doi: 10.1007/978-3-030-61377-8_39 – ident: ref1 – ident: ref2 – ident: ref3 – ident: ref5 – ident: ref11 doi: 10.1016/j.ijmedinf.2019.05.015 – ident: ref6 doi: 10.18653/v1/N19-1423 – ident: ref18 doi: 10.3115/1073083.1073135 – ident: ref4 doi: 10.1007/978-3-319-65340-2_12 – ident: ref9 – ident: ref8 – ident: ref16 – ident: ref10 – ident: ref17 – ident: ref7 doi: 10.3115/v1/d14-1179 – ident: ref15 – ident: ref13 doi: 10.18653/v1/2020.acl-demos.33 – ident: ref14
SSID	ssj0001416667
Score	2.2488456
Snippet	The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the... Background: The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical... BackgroundThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding...
SourceID	doaj unpaywall pubmedcentral proquest pubmed crossref
SourceType	Open Website Open Access Repository Aggregation Database Index Database Enrichment Source
StartPage	e26353
SubjectTerms	Artificial intelligence Automation Cardiovascular disease Classification Codes Datasets Deep learning Epidemiology Language Machine translation Mortality Natural language Neural networks Original Paper
SummonAdditionalLinks	– databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwEB6hHgoSQuWdthQj9Ro1jR0n5rZ9rAqiFUJU6i2ynbFAWpKq7KriN_CnmXGyYRcqceGSQzyy_Bh7PnvG3wDsY5EFj4c-tVUlU2WNTytsfFoWLtACtBlW_Br5_EKfXar3V8XVSqovjgnr6YH7gTvI2QJr36gQnKLTiM1RqkZikC5wliDefbPKrBym4u2KYndYuQkPOdaZtOyASVfkmvGJHP13Acu_4yPvL9pr--PWzmYrxme6BY8G1CgmfWsfwz1sn8Dm-eAXfwo_mWODBKLl6aPbhG0bMVnMO4Kk2IhPy0AhKumCeHd8QlujGNw04nRgVhXTm-6buLCRjEN8GO4y3wrOmDYTKwFGsfaPvx8diMnI8PkMLqenn4_P0iHNQuplVc5To20lPaoy17kNITDDXiaD0WT58zwo5zIMpTHKNkjoCgsdDPpAyMcH55tcPoeNtmvxJQhdBG-cp6-3CmWw1mdeaSuRUIpDncD-cvxrP3CQcyqMWU1nEZ6mOk5TAnuj2HVPuvGnwBFP3ljIHNnxB2lOPWhO_S_NSWB3OfX1sHC_U_XsiNKEIxN4MxbTkmM_im2xW7AMMzqS1qkEXvSaMraEr9GY5D-Bck2H1pq6XtJ-_RJpvU0mDeGxBF6P2nZ377f_R-934EHOLzmYtvJwFzbmNwt8Rfhq7vbiUvoFno8nGg priority: 102 providerName: Directory of Open Access Journals – databaseName: ProQuest Central (New) (NC LIVE) dbid: BENPR link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3fb9MwED6NThpICPFrEBjDSHuNltmOkyAh1I1WA7Fqmpi0t8hxzoBUklJaIf4G_ml8rpOuMPGSh_gUObk7-8vd-TuAA0wTa_DIxDrPRSx1YeIcaxNnaWWdA-oEczqNfDZRp5fyw1V6tQWT7iwMlVV2a6JfqOvWUIz8kPsMgXIb_NvZ95i6RlF2tWuhoUNrhfqNpxi7BducmLEGsH08mpxfrKMuktJk2Q7cpRpoZ32HRMYiNjYlz91_E-D8t27y9rKZ6V8_9XR6bVMa34d7AU2y4Ur9D2ALm4ewcxby5Y_gN3FvOAG_I62q3phuajZcLloHVbFmF10BkRtpLXt_8s4tmSykb9goMK6y8bz9xibak3SwjyHG-ZpRJ7Upu1Z45J9-vj6MwIY98-djuByPPp2cxqH9QmxEni3iQulcGJQZV1xba4l5LxG2UA4RcG5lVSVos6KQukaHujBVtkBjHSIytjI1F7swaNoGnwJTqTVFZdzVaInCam0SI5UW6NBLhSqCg-77lyZwk1OLjGnp_lFITaVXUwT7vdhsRcbxt8AxKa8fJO5sf6Odfy6DKzpxh-mUqaW1lXT_t5qjkLVAKypLfaci2OtUXwaH_lGuzS-CV_2wc0XKr-gG2yXJENMjT3IZwZOVpfQzofAakf9HkG3Y0MZUN0ear1883XeRiMLhtAhe9tZ289s_-__En8MdTmc3iKjyaA8Gi_kSXzhEtaj2g5v8AQ7oJR8 priority: 102 providerName: ProQuest – databaseName: Unpaywall dbid: UNPAY link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fb9MwED6NThpIiJ8DAmMYaY-kTWPHjXkr26qBaFUhKo2nyHFsMeiSqbRC8C_wT3PnJmEdk-AlD_HJcuy7-LPv7juAA5tEzti-CXWa8lBoZcLUFiYcJLlDA9SRTSkbeTyRJzPx7jQ53YJXTS4MeZR9Hk73y_nZwrvy8Xge90TPEm0K702PRjdgWyaIvDuwPZtMh5-oflys-qGSSuzAbYpuRr3qefmN7caz8l8HJf-OiLy5Ki_0j-96Pr-03YzuwrgZ6DrK5Gt3tcy75ucVDsf__ZJ7cKfGnWy4VpT7sGXLB7Azrj3rD-EXsXSggN-71vFxTJcFG66WFYJaW7APTagRtlSOvT08wp8rqx097LjmZmWjRXXOJtrTebD39W3oa0Y11-bsUoiS7336J22BDVuO0F2YjY4_Hp6EdaGG0PB0sMSJ1yk3VgxiGWvnHHH0Rdwpidghjp3I88i6gVJCFxbxmU2kU9Y4xE7G5aaI-SPolFVpnwCTiTMqN_g0WljutDaREVJzizgntzKAg2Y9M1OzmFMxjXmGpxla9sxPbgD7rdjFmrbjqsAbUoa2kVi2_Qtcp6w2WhRH9CdNIZzLBZ6EdWy5KLh1PHdUoSqAvUaVstr0v2H35MqSiEQDeNk2o9GSJ0aXtlqRDHFCxlEqAni81rx2JHQRR2UCAhhs6OTGUDdbyrPPnhhcRVwhogvgRau913_9039KPINbpK3kN-v396CzXKzsc4Rfy3y_trrfctExmA priority: 102 providerName: Unpaywall
Title	Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment
URI	https://www.ncbi.nlm.nih.gov/pubmed/35404262 https://www.proquest.com/docview/2657516551 https://www.proquest.com/docview/2649252084 https://pubmed.ncbi.nlm.nih.gov/PMC9039820 https://medinform.jmir.org/2022/4/e26353/PDF https://doaj.org/article/212516cd4ffb4840a2e34d3ef3bf1086
UnpaywallVersion	publishedVersion
Volume	10
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: KQ8 dateStart: 20130101 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: DOA dateStart: 20130101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVBFR databaseName: Free Medical Journals customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: DIK dateStart: 20130101 isFulltext: true titleUrlDefault: http://www.freemedicaljournals.com providerName: Flying Publisher – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: M~E dateStart: 20130101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre – providerCode: PRVAQN databaseName: PubMed Central customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: RPM dateStart: 20130101 isFulltext: true titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/ providerName: National Library of Medicine – providerCode: PRVPQU databaseName: Health & Medical Collection customDbUrl: eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: 7X7 dateStart: 20130101 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 2291-9694 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: BENPR dateStart: 20130101 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVFZP databaseName: Scholars Portal Journals: Open Access customDbUrl: eissn: 2291-9694 dateEnd: 20250131 omitProxy: true ssIdentifier: ssj0001416667 issn: 2291-9694 databaseCode: M48 dateStart: 20131001 isFulltext: true titleUrlDefault: http://journals.scholarsportal.info providerName: Scholars Portal
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1tb9MwED6xTSqTEOKdwAhG2tdAGjsvRkKoG60GolU1UWl8ihzHBqSQjNIK9hv409ylSWigfMkH-5I49l382Hd-DuDYhL7VZqg9lSTcE0pqLzG59uIws2iAyjcJnUaezqKzhXh3EW5FEzYd-H3n0o7ySS2WxfOf365eo8G_ojBmVKAXxKfC9-AAJydJ2RumDcKvt1kE-cXo0HQQyKEnIykGcKN35yEMaPODqNl7k1PN4b8LeP4bP3l9XV6qqx-qKLYmp8ktuNmgSjbaqMFtuGbKOzCYNn7zu_CLODhQoJ6ZNtFvTJU5G61XFUJWk7PzNpAIayrL3p6-wV8na9w4bNwwr7LJsvrKZqom62Dvm73Ol4wyqhVsKwCpfvr8z6EENuoYQO_BYjL-cHrmNWkYPM2TeIV9phKujYixg5S1lhj4fG5lhMggCKzIMt_YWEqhcoPoy4SRlUZbREbaZjoP-H3YL6vSPAQWhVbLTONVK2G4VUr7WkSKG0QxmYkcOG77P9UNRzmlyihSXKvQiKX1iDngdmKXG1KOvwVOaPC6SuLQrguq5ae0MUkUR2wX6VxYmwlc56rAcJFzY3lmKf-UA0ft0KetXuLjyVEVIc504FlXjSZJfhZVmmpNMsT4GPiJcODBRlO6lrSa5kDc06FeU_s15ZfPNe239LlEvObA007bdn_9o_--9TEcBnR8g7gqh0ewv1quzRMEVavMhb34Inbh4GQ8m5-79daEW5sSli1m89HH3yX8JcM
linkProvider	Scholars Portal
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9tAEB4hkKASqvoEtxS2Ej1aGO_6VQlVARIlJYkQAombu17vtpVSOw2JEL-h_6m_rTPO2iEt6o2LD9lRtPbM7MzO4xuAfR14RulD5co45q6QiXJjnSs3CjKDCig9HVM38mAYdq_E5-vgegV-170wVFZZn4nVQZ2XimLkB36VIQjRwH8a_3RpahRlV-sRGtKOVsiPKogx29hxpu9u8Qp3c9Q7RX5_8P1O-_Kk69opA67icTR1k1DGXGkR-aEvjTEEMOdxk4Ro-HzfiCzztImSRMhco3Ohg9AkWhk0_MpkKifgAzQBa4KLBC9_a8ft4fnFIsojKC0XrcMm1VyjtB8Q-AtfMoLVrICHHNx_6zQ3ZsVY3t3K0eieEew8g6fWe2Wtubg9hxVdvID1gc3Pv4RfhPWBBJUFnFfZMVnkrDWbluga65xd1AVLuFIa1js5xSOa2XQRa1uEV9aZlD_YUFagIKxvY6ofGU1uG7F7hU7Vv58vmh9Yq0EafQVXj8KI17BalIXeBhYGRiWZwqeSQnMjpfKUCCXX6C1lOnRgv_7-qbJY6DSSY5TinYjYlFZscmC3IRvPwT_-Jjgm5jWLhNVd_VBOvqZW9ZEcfchQ5cKYTOB9Wvqai5xrwzNDc64c2KlZn9oD5CZdiLsD75tlVH3K58hClzOiIWRJ34uFA1tzSWl2QuE8GjbgQLQkQ0tbXV4pvn-r4MUTjyfoFzqw10jbw2__5v8b34ON7uWgn_Z7w7O38MSnvhECyTzcgdXpZKbfoTc3zXatyjD48tha-gdJ1WLB
linkToPdf	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3daxNBEB9KhSiI-FlPa7tCfTxy3d3chyASm4bGtqGIhbyde3u7KsS7mCaU_g3-R_51zux9pNHiW1_uITuEvZvPnZn9DcCe6QVWm33tqzgWvlSJ9mOTaz_qZRYVUAUmptvIp-Pw6Fx-nPQmG_C7uQtDbZWNTXSGOi815ci73FUIQnTwXVu3RZwNhu9nP32aIEWV1macRiUix-bqEo9vF-9GA-T1G86Hh58Pjvx6woCvRRwt_CRUsdBGRjzkylpL4HKBsEmITo9zK7MsMDZKEqlyg4GF6YU2Mdqi09c20zmBHqD5vxMJkVA7YTSJVvkdSQW5qAP3qdsa5bxLsC9izf25KQE3hbb_dmjeXRYzdXWpptNr7m_4EB7UcSvrV4L2CDZM8Rg6p3Vl_gn8IpQPJHC-r-qvY6rIWX-5KDEoNjn71LQq4Upp2ehggMaZ1YUidlhju7LhvPzBxsrBgbCTOpv6ltHMtim71uLk_v1sde2B9VuM0adwfitseAabRVmY58DCntVJpvGplTTCKqUDLUMlDMZJmQk92Gu-f6prFHQaxjFN8TREbEodmzzYaclmFezH3wQfiHntIqF0ux_K-de0Vnokx-gx1Lm0NpN4klbcCJkLY0VmacKVB9sN69PadFykK0H34HW7jEpPlRxVmHJJNIQpyYNYerBVSUq7E0rk0ZgBD6I1GVrb6vpK8f2bAxZPApFgROjBbittN7_9i_9vfBc6qJvpyWh8_BLucbowQuiY-9uwuZgvzSsM4xbZjtMXBl9uW0H_AKetYFs
linkToUnpaywall	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fb9MwED6NThpIiJ8DAmMYaY-kTWPHjXkr26qBaFUhKo2nyHFsMeiSqbRC8C_wT3PnJmEdk-AlD_HJcuy7-LPv7juAA5tEzti-CXWa8lBoZcLUFiYcJLlDA9SRTSkbeTyRJzPx7jQ53YJXTS4MeZR9Hk73y_nZwrvy8Xge90TPEm0K702PRjdgWyaIvDuwPZtMh5-oflys-qGSSuzAbYpuRr3qefmN7caz8l8HJf-OiLy5Ki_0j-96Pr-03YzuwrgZ6DrK5Gt3tcy75ucVDsf__ZJ7cKfGnWy4VpT7sGXLB7Azrj3rD-EXsXSggN-71vFxTJcFG66WFYJaW7APTagRtlSOvT08wp8rqx097LjmZmWjRXXOJtrTebD39W3oa0Y11-bsUoiS7336J22BDVuO0F2YjY4_Hp6EdaGG0PB0sMSJ1yk3VgxiGWvnHHH0Rdwpidghjp3I88i6gVJCFxbxmU2kU9Y4xE7G5aaI-SPolFVpnwCTiTMqN_g0WljutDaREVJzizgntzKAg2Y9M1OzmFMxjXmGpxla9sxPbgD7rdjFmrbjqsAbUoa2kVi2_Qtcp6w2WhRH9CdNIZzLBZ6EdWy5KLh1PHdUoSqAvUaVstr0v2H35MqSiEQDeNk2o9GSJ0aXtlqRDHFCxlEqAni81rx2JHQRR2UCAhhs6OTGUDdbyrPPnhhcRVwhogvgRau913_9039KPINbpK3kN-v396CzXKzsc4Rfy3y_trrfctExmA
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Neural+Translation+and+Automated+Recognition+of+ICD-10+Medical+Entities+From+Natural+Language%3A+Model+Development+and+Performance+Assessment&rft.jtitle=JMIR+medical+informatics&rft.au=Falissard%2C+Louis&rft.au=Morgand%2C+Claire&rft.au=Ghosn%2C+Walid&rft.au=Imbaud%2C+Claire&rft.date=2022-04-11&rft.issn=2291-9694&rft.eissn=2291-9694&rft.volume=10&rft.issue=4&rft.spage=e26353&rft_id=info:doi/10.2196%2F26353&rft_id=info%3Apmid%2F35404262&rft.externalDocID=35404262
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2291-9694&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2291-9694&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2291-9694&client=summon