Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment

The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making i...

Full description

Saved in:
Bibliographic Details
Published inJMIR medical informatics Vol. 10; no. 4; p. e26353
Main Authors Falissard, Louis, Morgand, Claire, Ghosn, Walid, Imbaud, Claire, Bounebache, Karim, Rey, Grégoire
Format Journal Article
LanguageEnglish
Published Canada JMIR Publications 11.04.2022
Subjects
Online AccessGet full text
ISSN2291-9694
2291-9694
DOI10.2196/26353

Cover

Abstract The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.
AbstractList Background: The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. Objective: The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. Methods: The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject’s age, the subject’s gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language–based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network–based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. Results: The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. Conclusions: This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.
The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner.BACKGROUNDThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner.The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language.OBJECTIVEThe aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language.The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping.METHODSThe investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping.The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics.RESULTSThe proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics.This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.CONCLUSIONSThis paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.
BackgroundThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. ObjectiveThe aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. MethodsThe investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject’s age, the subject’s gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language–based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network–based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. ResultsThe proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. ConclusionsThis paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.
The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner. The aim of this paper was to investigate the application of deep neural sequence models to the problem of medical entity recognition from natural language. The investigated data set included every French death certificate from 2011 to 2016. These certificates contain information such as the subject's age, the subject's gender, and the chain of events leading to his or her death, both in French and encoded as International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) medical entities, for a total of around 3 million observations in the data set. The task of automatically recognizing ICD-10 medical entities from the French natural language-based chain of events leading to death was then formulated as a type of predictive modeling problem known as a sequence-to-sequence modeling problem. A deep neural network-based model, known as the Transformer, was then slightly adapted and fit to the data set. Its performance was then assessed on an external data set and compared to the current state-of-the-art approach. CIs for derived measurements were estimated via bootstrapping. The proposed approach resulted in an F-measure value of 0.952 (95% CI 0.946-0.957), which constitutes a significant improvement over the current state-of-the-art approach and its previously reported F-measure value of 0.825 as assessed on a comparable data set. Such an improvement makes possible a whole field of new applications, from nosologist-level automated coding to temporal harmonization of death statistics. This paper shows that a deep artificial neural network can directly learn from voluminous data sets in order to identify complex relationships between natural language and medical entities, without any explicit prior knowledge. Although not entirely free from mistakes, the derived model constitutes a powerful tool for automated coding of medical entities from medical language with promising potential applications.
Author Ghosn, Walid
Morgand, Claire
Falissard, Louis
Bounebache, Karim
Imbaud, Claire
Rey, Grégoire
AuthorAffiliation 1 Centre for Epidemiology on Medical Causes of Death Inserm Le Kremlin Bicêtre France
AuthorAffiliation_xml – name: 1 Centre for Epidemiology on Medical Causes of Death Inserm Le Kremlin Bicêtre France
Author_xml – sequence: 1
  givenname: Louis
  orcidid: 0000-0001-5461-8330
  surname: Falissard
  fullname: Falissard, Louis
– sequence: 2
  givenname: Claire
  orcidid: 0000-0003-2282-1494
  surname: Morgand
  fullname: Morgand, Claire
– sequence: 3
  givenname: Walid
  orcidid: 0000-0003-2013-4586
  surname: Ghosn
  fullname: Ghosn, Walid
– sequence: 4
  givenname: Claire
  orcidid: 0000-0003-2465-5186
  surname: Imbaud
  fullname: Imbaud, Claire
– sequence: 5
  givenname: Karim
  orcidid: 0000-0002-3222-9905
  surname: Bounebache
  fullname: Bounebache, Karim
– sequence: 6
  givenname: Grégoire
  orcidid: 0000-0001-7291-9444
  surname: Rey
  fullname: Rey, Grégoire
BackLink https://www.ncbi.nlm.nih.gov/pubmed/35404262$$D View this record in MEDLINE/PubMed
BookMark eNp1km9rFDEQxhep2FrvK0hAFEFO82-zG18Ix7W1B9cqUl8v2exk3SObnMlupZ_BL22615b2wFcJmV-emWdmXmYHzjvIshnBHymR4hMVLGfPsiNKJZlLIfnBo_thNotxgzEmnAghihfZIcs55lTQo-zvJYxBWXQVlItWDZ13SLkGLcbB92qABv0A7VvXTRFv0Gp5MicYXUDT6fTv1A0pBBGdBd-jSzVMamvl2lG18Bld-AYsOoFrsH7bgxsm9e8QjA-9chrQIkaI8Tb0KntulI0wuzuPs59np1fL8_n629fVcrGea1YWQ_KkSqaBF8mAMsZgRkvMjBSUc0oNr2sMppCSqwYIZ5ALI0EbUlBtat1QdpytdrqNV5tqG7pehZvKq66aHnxoKxWGTluoKKE5EbrhxtS85FhRYLxhYFhtCC5F0nq30xrdVt38UdY-CBJc3Q6nmoaTwC87cDvWPTQ6-U2depL9acR1v6rWX1cSM1lSnATe3wkE_3uEOFR9FzVYqxz4MaY0XNKc4pIn9M0euvFjcKmnicqL5CjPSaJeP67ooZT75UjA2x2gg48xgPmvtQ97nO6GaZWSkc7u0f8A3DrW1g
CitedBy_id crossref_primary_10_2196_40965
crossref_primary_10_1016_j_ijmedinf_2024_105462
crossref_primary_10_1016_j_artmed_2023_102622
crossref_primary_10_1016_j_medntd_2024_100319
crossref_primary_10_1002_trc2_70057
crossref_primary_10_1016_j_jbi_2022_104232
crossref_primary_10_1007_s10462_023_10677_z
crossref_primary_10_1016_j_csl_2023_101582
Cites_doi 10.1007/978-3-030-61377-8_39
10.1016/j.ijmedinf.2019.05.015
10.18653/v1/N19-1423
10.3115/1073083.1073135
10.1007/978-3-319-65340-2_12
10.3115/v1/d14-1179
10.18653/v1/2020.acl-demos.33
ContentType Journal Article
Copyright Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022.
2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. 2022
Copyright_xml – notice: Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022.
– notice: 2022. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. 2022
DBID AAYXX
CITATION
NPM
3V.
7X7
7XB
88C
8FI
8FJ
8FK
ABUWG
AFKRA
AZQEC
BENPR
CCPQU
DWQXO
FYUFA
GHDGH
K9.
M0S
M0T
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQQKQ
PQUKI
PRINS
7X8
5PM
ADTOC
UNPAY
DOA
DOI 10.2196/26353
DatabaseName CrossRef
PubMed
ProQuest Central (Corporate)
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Healthcare Administration Database (Alumni)
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
ProQuest Central Essentials
ProQuest Central (New) (NC LIVE)
ProQuest One Community College
ProQuest Central
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Health & Medical Complete (Alumni)
Health & Medical Collection (Alumni Edition)
Healthcare Administration Database
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
MEDLINE - Academic
PubMed Central (Full Participant titles)
Unpaywall for CDI: Periodical Content
Unpaywall
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
PubMed
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Central China
ProQuest Central
ProQuest Health & Medical Research Collection
Health Research Premium Collection
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Health & Medical Research Collection
ProQuest Central (New)
ProQuest One Academic Eastern Edition
ProQuest Health Management
ProQuest Hospital Collection
Health Research Premium Collection (Alumni)
ProQuest Hospital Collection (Alumni)
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
ProQuest Health Management (Alumni Edition)
ProQuest One Academic
ProQuest One Academic (New)
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList Publicly Available Content Database
MEDLINE - Academic

PubMed
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
– sequence: 4
  dbid: BENPR
  name: ProQuest Central
  url: http://www.proquest.com/pqcentral?accountid=15518
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
EISSN 2291-9694
ExternalDocumentID oai_doaj_org_article_212516cd4ffb4840a2e34d3ef3bf1086
10.2196/26353
PMC9039820
35404262
10_2196_26353
Genre Journal Article
GroupedDBID 53G
5VS
7X7
8FI
8FJ
AAFWJ
AAYXX
ABUWG
ADBBV
AFKRA
AFPKN
ALMA_UNASSIGNED_HOLDINGS
AOIJS
BAWUL
BCNDV
BENPR
CCPQU
CITATION
DIK
EMOBN
FYUFA
GROUPED_DOAJ
HMCUK
HYE
KQ8
M0T
M48
M~E
OK1
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PUEGO
RPM
UKHRP
ALIPV
NPM
3V.
7XB
8FK
AZQEC
DWQXO
K9.
PKEHL
PQEST
PQQKQ
PQUKI
PRINS
7X8
5PM
ADRAZ
ADTOC
UNPAY
ID FETCH-LOGICAL-c387t-96a83ce47262afff032803f9624422f4bb0ef7994ade143e56f9ecf172cfbcd23
IEDL.DBID M48
ISSN 2291-9694
IngestDate Tue Oct 14 19:08:41 EDT 2025
Sun Oct 26 03:32:25 EDT 2025
Tue Sep 30 16:53:44 EDT 2025
Thu Oct 02 11:24:59 EDT 2025
Tue Oct 07 06:50:44 EDT 2025
Thu Jan 02 22:54:55 EST 2025
Wed Oct 01 05:10:16 EDT 2025
Thu Apr 24 23:11:24 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 4
Keywords machine translation
deep learning
automated medical entity recognition
mortality statistics
machine learning
ICD-10 coding
Language English
License Louis Falissard, Claire Morgand, Walid Ghosn, Claire Imbaud, Karim Bounebache, Grégoire Rey. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c387t-96a83ce47262afff032803f9624422f4bb0ef7994ade143e56f9ecf172cfbcd23
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0003-2013-4586
0000-0003-2282-1494
0000-0003-2465-5186
0000-0001-7291-9444
0000-0001-5461-8330
0000-0002-3222-9905
OpenAccessLink https://www.proquest.com/docview/2657516551?pq-origsite=%requestingapplication%&accountid=15518
PMID 35404262
PQID 2657516551
PQPubID 4997117
ParticipantIDs doaj_primary_oai_doaj_org_article_212516cd4ffb4840a2e34d3ef3bf1086
unpaywall_primary_10_2196_26353
pubmedcentral_primary_oai_pubmedcentral_nih_gov_9039820
proquest_miscellaneous_2649252084
proquest_journals_2657516551
pubmed_primary_35404262
crossref_primary_10_2196_26353
crossref_citationtrail_10_2196_26353
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 20220411
PublicationDateYYYYMMDD 2022-04-11
PublicationDate_xml – month: 4
  year: 2022
  text: 20220411
  day: 11
PublicationDecade 2020
PublicationPlace Canada
PublicationPlace_xml – name: Canada
– name: Toronto
– name: Toronto, Canada
PublicationTitle JMIR medical informatics
PublicationTitleAlternate JMIR Med Inform
PublicationYear 2022
Publisher JMIR Publications
Publisher_xml – name: JMIR Publications
References ref13
ref12
ref15
ref14
ref11
ref10
ref2
ref1
ref17
ref16
ref18
ref8
ref7
ref9
ref4
ref3
ref6
ref5
References_xml – ident: ref12
  doi: 10.1007/978-3-030-61377-8_39
– ident: ref1
– ident: ref2
– ident: ref3
– ident: ref5
– ident: ref11
  doi: 10.1016/j.ijmedinf.2019.05.015
– ident: ref6
  doi: 10.18653/v1/N19-1423
– ident: ref18
  doi: 10.3115/1073083.1073135
– ident: ref4
  doi: 10.1007/978-3-319-65340-2_12
– ident: ref9
– ident: ref8
– ident: ref16
– ident: ref10
– ident: ref17
– ident: ref7
  doi: 10.3115/v1/d14-1179
– ident: ref15
– ident: ref13
  doi: 10.18653/v1/2020.acl-demos.33
– ident: ref14
SSID ssj0001416667
Score 2.2488456
Snippet The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the...
Background: The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical...
BackgroundThe recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding...
SourceID doaj
unpaywall
pubmedcentral
proquest
pubmed
crossref
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
StartPage e26353
SubjectTerms Artificial intelligence
Automation
Cardiovascular disease
Classification
Codes
Datasets
Deep learning
Epidemiology
Language
Machine translation
Mortality
Natural language
Neural networks
Original Paper
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwEB6hHgoSQuWdthQj9Ro1jR0n5rZ9rAqiFUJU6i2ynbFAWpKq7KriN_CnmXGyYRcqceGSQzyy_Bh7PnvG3wDsY5EFj4c-tVUlU2WNTytsfFoWLtACtBlW_Br5_EKfXar3V8XVSqovjgnr6YH7gTvI2QJr36gQnKLTiM1RqkZikC5wliDefbPKrBym4u2KYndYuQkPOdaZtOyASVfkmvGJHP13Acu_4yPvL9pr--PWzmYrxme6BY8G1CgmfWsfwz1sn8Dm-eAXfwo_mWODBKLl6aPbhG0bMVnMO4Kk2IhPy0AhKumCeHd8QlujGNw04nRgVhXTm-6buLCRjEN8GO4y3wrOmDYTKwFGsfaPvx8diMnI8PkMLqenn4_P0iHNQuplVc5To20lPaoy17kNITDDXiaD0WT58zwo5zIMpTHKNkjoCgsdDPpAyMcH55tcPoeNtmvxJQhdBG-cp6-3CmWw1mdeaSuRUIpDncD-cvxrP3CQcyqMWU1nEZ6mOk5TAnuj2HVPuvGnwBFP3ljIHNnxB2lOPWhO_S_NSWB3OfX1sHC_U_XsiNKEIxN4MxbTkmM_im2xW7AMMzqS1qkEXvSaMraEr9GY5D-Bck2H1pq6XtJ-_RJpvU0mDeGxBF6P2nZ377f_R-934EHOLzmYtvJwFzbmNwt8Rfhq7vbiUvoFno8nGg
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Central (New) (NC LIVE)
  dbid: BENPR
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3fb9MwED6NThpICPFrEBjDSHuNltmOkyAh1I1WA7Fqmpi0t8hxzoBUklJaIf4G_ml8rpOuMPGSh_gUObk7-8vd-TuAA0wTa_DIxDrPRSx1YeIcaxNnaWWdA-oEczqNfDZRp5fyw1V6tQWT7iwMlVV2a6JfqOvWUIz8kPsMgXIb_NvZ95i6RlF2tWuhoUNrhfqNpxi7BducmLEGsH08mpxfrKMuktJk2Q7cpRpoZ32HRMYiNjYlz91_E-D8t27y9rKZ6V8_9XR6bVMa34d7AU2y4Ur9D2ALm4ewcxby5Y_gN3FvOAG_I62q3phuajZcLloHVbFmF10BkRtpLXt_8s4tmSykb9goMK6y8bz9xibak3SwjyHG-ZpRJ7Upu1Z45J9-vj6MwIY98-djuByPPp2cxqH9QmxEni3iQulcGJQZV1xba4l5LxG2UA4RcG5lVSVos6KQukaHujBVtkBjHSIytjI1F7swaNoGnwJTqTVFZdzVaInCam0SI5UW6NBLhSqCg-77lyZwk1OLjGnp_lFITaVXUwT7vdhsRcbxt8AxKa8fJO5sf6Odfy6DKzpxh-mUqaW1lXT_t5qjkLVAKypLfaci2OtUXwaH_lGuzS-CV_2wc0XKr-gG2yXJENMjT3IZwZOVpfQzofAakf9HkG3Y0MZUN0ear1883XeRiMLhtAhe9tZ289s_-__En8MdTmc3iKjyaA8Gi_kSXzhEtaj2g5v8AQ7oJR8
  priority: 102
  providerName: ProQuest
– databaseName: Unpaywall
  dbid: UNPAY
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fb9MwED6NThpIiJ8DAmMYaY-kTWPHjXkr26qBaFUhKo2nyHFsMeiSqbRC8C_wT3PnJmEdk-AlD_HJcuy7-LPv7juAA5tEzti-CXWa8lBoZcLUFiYcJLlDA9SRTSkbeTyRJzPx7jQ53YJXTS4MeZR9Hk73y_nZwrvy8Xge90TPEm0K702PRjdgWyaIvDuwPZtMh5-oflys-qGSSuzAbYpuRr3qefmN7caz8l8HJf-OiLy5Ki_0j-96Pr-03YzuwrgZ6DrK5Gt3tcy75ucVDsf__ZJ7cKfGnWy4VpT7sGXLB7Azrj3rD-EXsXSggN-71vFxTJcFG66WFYJaW7APTagRtlSOvT08wp8rqx097LjmZmWjRXXOJtrTebD39W3oa0Y11-bsUoiS7336J22BDVuO0F2YjY4_Hp6EdaGG0PB0sMSJ1yk3VgxiGWvnHHH0Rdwpidghjp3I88i6gVJCFxbxmU2kU9Y4xE7G5aaI-SPolFVpnwCTiTMqN_g0WljutDaREVJzizgntzKAg2Y9M1OzmFMxjXmGpxla9sxPbgD7rdjFmrbjqsAbUoa2kVi2_Qtcp6w2WhRH9CdNIZzLBZ6EdWy5KLh1PHdUoSqAvUaVstr0v2H35MqSiEQDeNk2o9GSJ0aXtlqRDHFCxlEqAni81rx2JHQRR2UCAhhs6OTGUDdbyrPPnhhcRVwhogvgRau913_9039KPINbpK3kN-v396CzXKzsc4Rfy3y_trrfctExmA
  priority: 102
  providerName: Unpaywall
Title Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment
URI https://www.ncbi.nlm.nih.gov/pubmed/35404262
https://www.proquest.com/docview/2657516551
https://www.proquest.com/docview/2649252084
https://pubmed.ncbi.nlm.nih.gov/PMC9039820
https://medinform.jmir.org/2022/4/e26353/PDF
https://doaj.org/article/212516cd4ffb4840a2e34d3ef3bf1086
UnpaywallVersion publishedVersion
Volume 10
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 2291-9694
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0001416667
  issn: 2291-9694
  databaseCode: KQ8
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 2291-9694
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0001416667
  issn: 2291-9694
  databaseCode: DOA
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 2291-9694
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0001416667
  issn: 2291-9694
  databaseCode: DIK
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2291-9694
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0001416667
  issn: 2291-9694
  databaseCode: M~E
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVAQN
  databaseName: PubMed Central
  customDbUrl:
  eissn: 2291-9694
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0001416667
  issn: 2291-9694
  databaseCode: RPM
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/
  providerName: National Library of Medicine
– providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 2291-9694
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0001416667
  issn: 2291-9694
  databaseCode: 7X7
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl: http://www.proquest.com/pqcentral?accountid=15518
  eissn: 2291-9694
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0001416667
  issn: 2291-9694
  databaseCode: BENPR
  dateStart: 20130101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVFZP
  databaseName: Scholars Portal Journals: Open Access
  customDbUrl:
  eissn: 2291-9694
  dateEnd: 20250131
  omitProxy: true
  ssIdentifier: ssj0001416667
  issn: 2291-9694
  databaseCode: M48
  dateStart: 20131001
  isFulltext: true
  titleUrlDefault: http://journals.scholarsportal.info
  providerName: Scholars Portal
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1tb9MwED6xTSqTEOKdwAhG2tdAGjsvRkKoG60GolU1UWl8ihzHBqSQjNIK9hv409ylSWigfMkH-5I49l382Hd-DuDYhL7VZqg9lSTcE0pqLzG59uIws2iAyjcJnUaezqKzhXh3EW5FEzYd-H3n0o7ySS2WxfOf365eo8G_ojBmVKAXxKfC9-AAJydJ2RumDcKvt1kE-cXo0HQQyKEnIykGcKN35yEMaPODqNl7k1PN4b8LeP4bP3l9XV6qqx-qKLYmp8ktuNmgSjbaqMFtuGbKOzCYNn7zu_CLODhQoJ6ZNtFvTJU5G61XFUJWk7PzNpAIayrL3p6-wV8na9w4bNwwr7LJsvrKZqom62Dvm73Ol4wyqhVsKwCpfvr8z6EENuoYQO_BYjL-cHrmNWkYPM2TeIV9phKujYixg5S1lhj4fG5lhMggCKzIMt_YWEqhcoPoy4SRlUZbREbaZjoP-H3YL6vSPAQWhVbLTONVK2G4VUr7WkSKG0QxmYkcOG77P9UNRzmlyihSXKvQiKX1iDngdmKXG1KOvwVOaPC6SuLQrguq5ae0MUkUR2wX6VxYmwlc56rAcJFzY3lmKf-UA0ft0KetXuLjyVEVIc504FlXjSZJfhZVmmpNMsT4GPiJcODBRlO6lrSa5kDc06FeU_s15ZfPNe239LlEvObA007bdn_9o_--9TEcBnR8g7gqh0ewv1quzRMEVavMhb34Inbh4GQ8m5-79daEW5sSli1m89HH3yX8JcM
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9tAEB4hkKASqvoEtxS2Ej1aGO_6VQlVARIlJYkQAombu17vtpVSOw2JEL-h_6m_rTPO2iEt6o2LD9lRtPbM7MzO4xuAfR14RulD5co45q6QiXJjnSs3CjKDCig9HVM38mAYdq_E5-vgegV-170wVFZZn4nVQZ2XimLkB36VIQjRwH8a_3RpahRlV-sRGtKOVsiPKogx29hxpu9u8Qp3c9Q7RX5_8P1O-_Kk69opA67icTR1k1DGXGkR-aEvjTEEMOdxk4Ro-HzfiCzztImSRMhco3Ohg9AkWhk0_MpkKifgAzQBa4KLBC9_a8ft4fnFIsojKC0XrcMm1VyjtB8Q-AtfMoLVrICHHNx_6zQ3ZsVY3t3K0eieEew8g6fWe2Wtubg9hxVdvID1gc3Pv4RfhPWBBJUFnFfZMVnkrDWbluga65xd1AVLuFIa1js5xSOa2XQRa1uEV9aZlD_YUFagIKxvY6ofGU1uG7F7hU7Vv58vmh9Yq0EafQVXj8KI17BalIXeBhYGRiWZwqeSQnMjpfKUCCXX6C1lOnRgv_7-qbJY6DSSY5TinYjYlFZscmC3IRvPwT_-Jjgm5jWLhNVd_VBOvqZW9ZEcfchQ5cKYTOB9Wvqai5xrwzNDc64c2KlZn9oD5CZdiLsD75tlVH3K58hClzOiIWRJ34uFA1tzSWl2QuE8GjbgQLQkQ0tbXV4pvn-r4MUTjyfoFzqw10jbw2__5v8b34ON7uWgn_Z7w7O38MSnvhECyTzcgdXpZKbfoTc3zXatyjD48tha-gdJ1WLB
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3daxNBEB9KhSiI-FlPa7tCfTxy3d3chyASm4bGtqGIhbyde3u7KsS7mCaU_g3-R_51zux9pNHiW1_uITuEvZvPnZn9DcCe6QVWm33tqzgWvlSJ9mOTaz_qZRYVUAUmptvIp-Pw6Fx-nPQmG_C7uQtDbZWNTXSGOi815ci73FUIQnTwXVu3RZwNhu9nP32aIEWV1macRiUix-bqEo9vF-9GA-T1G86Hh58Pjvx6woCvRRwt_CRUsdBGRjzkylpL4HKBsEmITo9zK7MsMDZKEqlyg4GF6YU2Mdqi09c20zmBHqD5vxMJkVA7YTSJVvkdSQW5qAP3qdsa5bxLsC9izf25KQE3hbb_dmjeXRYzdXWpptNr7m_4EB7UcSvrV4L2CDZM8Rg6p3Vl_gn8IpQPJHC-r-qvY6rIWX-5KDEoNjn71LQq4Upp2ehggMaZ1YUidlhju7LhvPzBxsrBgbCTOpv6ltHMtim71uLk_v1sde2B9VuM0adwfitseAabRVmY58DCntVJpvGplTTCKqUDLUMlDMZJmQk92Gu-f6prFHQaxjFN8TREbEodmzzYaclmFezH3wQfiHntIqF0ux_K-de0Vnokx-gx1Lm0NpN4klbcCJkLY0VmacKVB9sN69PadFykK0H34HW7jEpPlRxVmHJJNIQpyYNYerBVSUq7E0rk0ZgBD6I1GVrb6vpK8f2bAxZPApFgROjBbittN7_9i_9vfBc6qJvpyWh8_BLucbowQuiY-9uwuZgvzSsM4xbZjtMXBl9uW0H_AKetYFs
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fb9MwED6NThpIiJ8DAmMYaY-kTWPHjXkr26qBaFUhKo2nyHFsMeiSqbRC8C_wT3PnJmEdk-AlD_HJcuy7-LPv7juAA5tEzti-CXWa8lBoZcLUFiYcJLlDA9SRTSkbeTyRJzPx7jQ53YJXTS4MeZR9Hk73y_nZwrvy8Xge90TPEm0K702PRjdgWyaIvDuwPZtMh5-oflys-qGSSuzAbYpuRr3qefmN7caz8l8HJf-OiLy5Ki_0j-96Pr-03YzuwrgZ6DrK5Gt3tcy75ucVDsf__ZJ7cKfGnWy4VpT7sGXLB7Azrj3rD-EXsXSggN-71vFxTJcFG66WFYJaW7APTagRtlSOvT08wp8rqx097LjmZmWjRXXOJtrTebD39W3oa0Y11-bsUoiS7336J22BDVuO0F2YjY4_Hp6EdaGG0PB0sMSJ1yk3VgxiGWvnHHH0Rdwpidghjp3I88i6gVJCFxbxmU2kU9Y4xE7G5aaI-SPolFVpnwCTiTMqN_g0WljutDaREVJzizgntzKAg2Y9M1OzmFMxjXmGpxla9sxPbgD7rdjFmrbjqsAbUoa2kVi2_Qtcp6w2WhRH9CdNIZzLBZ6EdWy5KLh1PHdUoSqAvUaVstr0v2H35MqSiEQDeNk2o9GSJ0aXtlqRDHFCxlEqAni81rx2JHQRR2UCAhhs6OTGUDdbyrPPnhhcRVwhogvgRau913_9039KPINbpK3kN-v396CzXKzsc4Rfy3y_trrfctExmA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Neural+Translation+and+Automated+Recognition+of+ICD-10+Medical+Entities+From+Natural+Language%3A+Model+Development+and+Performance+Assessment&rft.jtitle=JMIR+medical+informatics&rft.au=Falissard%2C+Louis&rft.au=Morgand%2C+Claire&rft.au=Ghosn%2C+Walid&rft.au=Imbaud%2C+Claire&rft.date=2022-04-11&rft.issn=2291-9694&rft.eissn=2291-9694&rft.volume=10&rft.issue=4&rft.spage=e26353&rft_id=info:doi/10.2196%2F26353&rft_id=info%3Apmid%2F35404262&rft.externalDocID=35404262
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2291-9694&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2291-9694&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2291-9694&client=summon