A multi-aspect comparison study of supervised word sense disambiguation

Bibliographic Details
Published in Journal of the American Medical Informatics Association : JAMIA, Vol. 11, no. 4, pp. 320-331
Main Authors Liu, Hongfang; Teller, Virginia; Friedman, Carol
Format Journal Article
Language English
Published England Elsevier Inc 01.07.2004
Oxford University Press
American Medical Informatics Association
ISSN 1067-5027
1527-974X
DOI 10.1197/jamia.M1533


Abstract The aim of this study was to investigate the relations among different aspects of supervised word sense disambiguation (WSD; supervised machine learning for disambiguating the sense of a term in context) and to compare supervised WSD in the biomedical domain with that in the general English domain. The study involved three data sets (a biomedical abbreviation data set, a general biomedical term data set, and a general English data set). The authors implemented three machine-learning algorithms, including (1) naïve Bayes (NBL) and decision lists (TDLL), (2) their adaptation of decision lists (ODLL), and (3) their mixed supervised learning (MSL). There were six feature representations (various combinations of collocations, bag of words, oriented bag of words, etc.) and five window sizes (2, 4, 6, 8, and 10). Supervised WSD is suitable only when there are enough sense-tagged instances, with at least a few dozen instances for each sense. Collocations combined with neighboring words are appropriate choices for the context. For terms with unrelated biomedical senses, a large window size such as the whole paragraph should be used, while for general English words a moderate window size between 4 and 10 should be used. The authors' implementation of decision list classifiers performed better than traditional decision list classifiers for abbreviations. However, the opposite held for the other two sets. Also, the authors' mixed supervised learning was stable and generally better than the other algorithms for all sets. The study found that the different aspects of supervised WSD depend on each other. The experimental method presented in the study can be used to select the best supervised WSD classifier for each ambiguous term.
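To make the setup in the abstract concrete, the following minimal Python sketch trains a naïve Bayes sense classifier on bag-of-words features taken from a fixed window around the ambiguous term. It is an illustration only, not the authors' implementation: the names NaiveBayesWSD and window_features, the add-alpha smoothing, and the toy sentences for the abbreviation "RA" are all assumptions introduced for this example.

```python
import math
from collections import Counter, defaultdict


def window_features(tokens, target_index, window_size):
    """Return lowercased tokens within window_size positions of the target.

    The target token itself is excluded (plain bag of words, no orientation).
    """
    lo = max(0, target_index - window_size)
    hi = min(len(tokens), target_index + window_size + 1)
    return [
        tok.lower()
        for i, tok in enumerate(tokens[lo:hi], start=lo)
        if i != target_index
    ]


class NaiveBayesWSD:
    """Hypothetical naive Bayes sense classifier for one ambiguous term."""

    def __init__(self, window_size=4, alpha=1.0):
        self.window_size = window_size
        self.alpha = alpha                          # smoothing constant (assumed)
        self.sense_counts = Counter()               # tagged instances per sense
        self.feature_counts = defaultdict(Counter)  # sense -> word counts
        self.vocab = set()

    def train(self, examples):
        """examples: iterable of (tokens, target_index, sense) triples."""
        for tokens, idx, sense in examples:
            self.sense_counts[sense] += 1
            for feat in window_features(tokens, idx, self.window_size):
                self.feature_counts[sense][feat] += 1
                self.vocab.add(feat)

    def predict(self, tokens, idx):
        """Return the sense with the highest smoothed log posterior."""
        total = sum(self.sense_counts.values())
        vocab_size = len(self.vocab) + 1  # one extra slot for unseen words
        feats = window_features(tokens, idx, self.window_size)
        best_sense, best_score = None, float("-inf")
        for sense, count in self.sense_counts.items():
            score = math.log(count / total)
            denom = sum(self.feature_counts[sense].values()) + self.alpha * vocab_size
            for feat in feats:
                score += math.log((self.feature_counts[sense][feat] + self.alpha) / denom)
            if score > best_score:
                best_sense, best_score = sense, score
        return best_sense


# Toy sense-tagged instances (invented for illustration, not from the study).
TOY_EXAMPLES = [
    ("patient with RA has joint pain and swelling".split(), 2, "rheumatoid arthritis"),
    ("RA treated with methotrexate for joint stiffness".split(), 0, "rheumatoid arthritis"),
    ("catheter advanced into the RA of the heart".split(), 4, "right atrium"),
    ("echo showed dilated RA and right ventricle".split(), 3, "right atrium"),
]

clf = NaiveBayesWSD(window_size=4)
clf.train(TOY_EXAMPLES)
print(clf.predict("joint swelling in RA improved".split(), 3))
```

Changing window_size in the constructor is the simplest way to reproduce the kind of window-size comparison the abstract reports; a real experiment would need at least a few dozen tagged instances per sense rather than the two used here.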
Author Liu, Hongfang
Teller, Virginia
Friedman, Carol
AuthorAffiliation Affiliations of the authors: Department of Information Systems, University of Maryland at Baltimore County, Baltimore, MD (HL); Department of Computer Science, Hunter College, City University of New York, New York, NY (VT); Department of Biomedical Informatics, Columbia University, New York, NY (CF)
Author_xml – sequence: 1
  givenname: Hongfang
  surname: Liu
  fullname: Liu, Hongfang
  email: hfliu@umbc.edu
– sequence: 2
  givenname: Virginia
  surname: Teller
  fullname: Teller, Virginia
– sequence: 3
  givenname: Carol
  surname: Friedman
  fullname: Friedman, Carol
BackLink https://www.ncbi.nlm.nih.gov/pubmed/15064284 (View this record in MEDLINE/PubMed)
CitedBy_id crossref_primary_10_1136_amiajnl_2012_001244
crossref_primary_10_1016_j_jbi_2014_12_013
crossref_primary_10_1109_ACCESS_2021_3119621
crossref_primary_10_1186_1471_2105_10_28
crossref_primary_10_1186_s12859_019_3079_8
crossref_primary_10_1051_itmconf_20235603001
crossref_primary_10_1186_1471_2105_9_S11_S7
crossref_primary_10_3414_ME12_01_0040
crossref_primary_10_35596_1729_7648_2019_123_5_60_65
crossref_primary_10_4338_ACI_2014_10_RA_0088
crossref_primary_10_1089_cmb_2005_12_554
crossref_primary_10_1016_j_ipm_2011_09_005
crossref_primary_10_1093_jamia_ocw109
crossref_primary_10_1002_asi_20257
crossref_primary_10_4137_BII_S38308
crossref_primary_10_7717_peerj_cs_2541
crossref_primary_10_1016_j_jbi_2012_06_003
crossref_primary_10_1186_1471_2105_7_334
crossref_primary_10_1016_j_jbi_2010_02_005
crossref_primary_10_1136_amiajnl_2012_001350
crossref_primary_10_4258_hir_2015_21_1_35
crossref_primary_10_1186_1471_2105_12_223
crossref_primary_10_1016_j_jbi_2022_104229
crossref_primary_10_1186_1471_2105_6_149
crossref_primary_10_1016_j_cosrev_2022_100511
crossref_primary_10_1186_1471_2105_10_S3_S4
crossref_primary_10_1197_jamia_M2085
crossref_primary_10_1016_j_jbi_2013_09_009
crossref_primary_10_1093_jamia_ocy013
crossref_primary_10_1007_s10257_014_0259_y
crossref_primary_10_1093_bioinformatics_btq555
crossref_primary_10_1136_amiajnl_2011_000415
crossref_primary_10_1016_j_jbi_2010_08_009
crossref_primary_10_1016_j_patcog_2017_10_028
crossref_primary_10_1197_jamia_M2927
ContentType Journal Article
Copyright 2004 American Medical Informatics Association
Copyright Hanley & Belfus, Inc. Jul/Aug 2004
Copyright © 2004, American Medical Informatics Association 2004
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
ProQuest Central (Corporate)
Nursing & Allied Health Database (Proquest)
ProQuest Health & Medical Complete
ProQuest Central (purchase pre-March 2016)
Healthcare Administration Database (Alumni)
Medical Database (Alumni Edition)
Science Database (Alumni Edition)
STEM Database
Computing Database (Alumni Edition)
ProQuest Pharma Collection
ProQuest SciTech Collection
ProQuest Technology Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials - QC
ProQuest Central
ProQuest Technology Collection (LUT)
ProQuest One
ProQuest Central
Proquest Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection (Proquest)
ProQuest Computer Science Collection
Computer Science Database
ProQuest Health & Medical Complete (Alumni)
Nursing & Allied Health Database (Alumni Edition)
Computing Database
ProQuest Health & Medical Collection
Healthcare Administration Database
Medical Database
Science Database
Nursing & Allied Health Premium
Advanced Technologies & Aerospace Collection
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Premium
ProQuest One Academic
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest Central Basic
SIRS Editorial
Biotechnology Research Abstracts
Technology Research Database
Engineering Research Database
Biotechnology and BioEngineering Abstracts
MEDLINE - Academic
PubMed Central (Full Participant titles)
Unpaywall for CDI: Periodical Content
Unpaywall
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Computer Science Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
ProQuest AP Science
SciTech Premium Collection
ProQuest Central China
ProQuest One Applied & Life Sciences
Health Research Premium Collection
Health & Medical Research Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Advanced Technologies & Aerospace Collection
ProQuest Science Journals (Alumni Edition)
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
ProQuest Hospital Collection (Alumni)
Nursing & Allied Health Premium
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
ProQuest Health Management (Alumni Edition)
ProQuest Nursing & Allied Health Source (Alumni)
ProQuest One Academic
ProQuest One Academic (New)
Technology Collection
ProQuest One Academic Middle East (New)
SIRS Editorial
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Pharma Collection
ProQuest Central
ProQuest Health & Medical Research Collection
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
ProQuest Computing
ProQuest Central Basic
ProQuest Science Journals
ProQuest Computing (Alumni Edition)
ProQuest Health Management
ProQuest Nursing & Allied Health Source
ProQuest SciTech Collection
Advanced Technologies & Aerospace Database
ProQuest Medical Library
ProQuest Central (Alumni)
Engineering Research Database
Biotechnology Research Abstracts
Technology Research Database
Biotechnology and BioEngineering Abstracts
MEDLINE - Academic
DatabaseTitleList Engineering Research Database


MEDLINE - Academic
Engineering Research Database
MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 3
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
– sequence: 4
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
EISSN 1527-974X
EndPage 331
ExternalDocumentID oai:pubmedcentral.nih.gov:436083
PMC436083
841262421
15064284
10_1197_jamia_M1533
S1067502704000544
Genre Research Support, U.S. Gov't, Non-P.H.S
Research Support, U.S. Gov't, P.H.S
Comparative Study
Journal Article
General Information
GrantInformation_xml – fundername: NLM NIH HHS
  grantid: LM06274
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 4
Language English
LinkModel DirectLink
Notes Supported in part by NLM grant LM06274 and NSF grant NSF 0312250.
OpenAccessLink https://proxy.k.utb.cz/login?url=http://doi.org/10.1197/jamia.M1533
PMID 15064284
PQID 220781166
PQPubID 23462
PageCount 12
ParticipantIDs unpaywall_primary_10_1197_jamia_m1533
pubmedcentral_primary_oai_pubmedcentral_nih_gov_436083
proquest_miscellaneous_66651396
proquest_miscellaneous_1875844866
proquest_miscellaneous_17675818
proquest_journals_220781166
pubmed_primary_15064284
crossref_primary_10_1197_jamia_M1533
crossref_citationtrail_10_1197_jamia_M1533
elsevier_sciencedirect_doi_10_1197_jamia_M1533
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2004-07-01
PublicationDateYYYYMMDD 2004-07-01
PublicationDate_xml – month: 07
  year: 2004
  text: 2004-07-01
  day: 01
PublicationDecade 2000
PublicationPlace England
PublicationPlace_xml – name: England
– name: Oxford
PublicationTitle Journal of the American Medical Informatics Association : JAMIA
PublicationTitleAlternate J Am Med Inform Assoc
PublicationYear 2004
Publisher Elsevier Inc
Oxford University Press
American Medical Informatics Association
References Veronis J, Ide N. Very large neural networks for natural language processing. Proceedings of the European Conference on Artificial Intelligence. 1990:366–8.
Mooney R. Comparative experiments on disambiguating word senses: an illustration of the role of bias in machine learning. Proceedings of the First Conference on Empirical Methods in Natural Language Processing. 1996:82–91.
Duda, Hart. Pattern Classification and Scene Analysis. 1973.
Leacock C, Towell G, Voorhees EM. Corpus-based statistical sense resolution. Proceedings of the Advanced Research Projects Agency (ARPA) Workshop on Human Language Technology. 1993:260–5.
Liu, Lussier, Friedman. Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method. J Biomed Inform. 2001;34:249–261. PMID 11977807.
Fujii, Inui, Tokunaga, Tanaka. Selective sampling for example-based word sense disambiguation. Computational Linguistics. 1998;24:573–597.
Ng HT, Lee HB. Integrating multiple knowledge sources to disambiguate word sense: an exemplar-based approach. Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics. 1996:40–7.
Ng HT. Getting serious about word-sense disambiguation. Proceedings of the ACL SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What and How? 1997:1–7.
Liu, Johnson, Friedman. Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS. J Am Med Inform Assoc. 2002;9:621–636. PMID 12386113.
Towell, Voorhees. Disambiguating highly ambiguous words. Computational Linguistics. 1998;24:125–146.
Jorgensen. The psychological reality of word senses. J Psycholinguist Res. 1990;19:167–190.
Kilgarriff, Rosenzweig. Framework and results for English SENSEVAL. Comput Humanities. 1999;34:1–2.
Bruce R, Wiebe J. Word-sense disambiguation using decomposable models. Proceedings of the Thirty-Second Annual Meeting of the Association for Computational Linguistics. 1994:139–46.
Marquez L. Machine learning and natural language processing. Technical Report LSI-00-45-R, Departament de Llenguatges i Sistemes Informàtics, Universitat Politècnica de Catalunya, 2000.
Aha, Kibler, Albert. Instance-based learning algorithms. Machine Learning. 1991;7:33–66.
Escudero G, Marquez L, Rigau G. Boosting applied to word sense disambiguation. Proceedings of the European Conference on Machine Learning. 2000:129–141.
Yarowsky D. Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French. Proceedings of the Thirty-Second Annual Meeting of the Association for Computational Linguistics. 1994:88–95.
Escudero G, Marquez L, Rigau G. Naive Bayes and exemplar-based approaches to word sense disambiguation revisited. Proceedings of the 14th European Conference on Artificial Intelligence (ECAI). 2000:421–425.
Mooney. Inductive logic programming for natural language processing. Inductive Logic Programming: Selected Papers from the 6th International Workshop. 1997.
Weeber, Mork, Aronson. Developing a test collection for biomedical word sense disambiguation. Proc AMIA Symp. 2001:746–750. PMID 11825285.
Ide, Veronis. Introduction to the special issue on word sense disambiguation: the state of the art. Computational Linguistics. 1998;24:1–40.
Witten, Bell. The zero-frequency problem: estimating the probabilities of novel events in adaptive text compression. IEEE Trans Inf Theory. 1991;37:1085–1094.
Ng HT. Exemplar-based word sense disambiguation: some recent improvements. Proceedings of the Second Conference on Empirical Methods in Natural Language Processing. 1997:208–213.
Ng, Zelle. Corpus-based approaches to semantic interpretation in natural language processing. AI Magazine. 1997;winter:45–64.
Engelson SP, Dagan I. Minimizing manual annotation cost in supervised training from corpora. Proceedings of the Thirty-Fourth Annual Meeting of the Association for Computational Linguistics. 1996;34:319–26.
Leacock, Chodorow, Miller. Using corpus statistics and WordNet relations for sense identification. Computational Linguistics. 1998;24:147–165.
References_xml – volume: 37
  start-page: 1085
  year: 1991
  end-page: 1094
  ident: BIB24
  article-title: The zero-frequency problem: estimating the probabilities of novel events in adaptive text compression
  publication-title: IEEE Trans Inf Theory
– reference: Veronis J, Ide N. Very large neural networks for natural language processing. Proceedings of the European Conference on Artificial Intelligence. 1990:366–8.
– volume: 34
  start-page: 1
  year: 1999
  end-page: 2
  ident: BIB3
  article-title: Framework and results for English SENSEVAL
  publication-title: Comput Humanities
– volume: 7
  start-page: 33
  year: 1991
  end-page: 66
  ident: BIB18
  article-title: Instance-based learning algorithms
  publication-title: Machine Learning
– volume: 9
  start-page: 621
  year: 2002
  end-page: 636
  ident: BIB9
  article-title: Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS
  publication-title: J Am Med Inform Assoc
– volume: 34
  start-page: 249
  year: 2001
  end-page: 261
  ident: BIB8
  article-title: Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method
  publication-title: J Biomed Inform
– volume: 24
  start-page: 1
  year: 1998
  end-page: 40
  ident: BIB1
  article-title: Introduction to the special issue on word sense disambiguation: the state of the art
  publication-title: Computational Linguistics
– reference: Ng HT. Examplar-based word sense disambiguation: some recent improvements. Proceedings of the Second Conference on Empirical methods in natural Language Processing. 1997:208–213.
– reference: Engelson SP, Dagan I. Minimizing manual annotation cost in supervised training from corpora. Proceedings of the Thirty-Fourth Annual meeting of the Association of computational Linguistics. 1996;34:319–26.
– reference: Marquez L. Machine learning and natural language processing. Technical Report LSI-00-45-R, Departament de Llenguatges i Sistemes Informàtics, Universitat Politècnica de Catalunya, 2000.
– volume: 24
  start-page: 147
  year: 1998
  end-page: 165
  ident: BIB20
  article-title: Using corpus statistics and WordNet relations for sense identification
  publication-title: Computational Linguistics
– reference: Bruce R, Wiebe J. Word-sense disambiguation using decomposable models. Proceedings of the Thirty-Second Annual Meeting of the Association of Computational Linguistics. 1994:139–46.
– reference: Yarowsky D. Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French. Proceedings of the Thirty-Second Annual meeting of the Association of computational Linguistics. 1994:88–95.
– year: 1973
  ident: BIB17
  article-title: Pattern Classification and Scene Analysis
StartPage 320
SubjectTerms Abbreviations as Topic
Algorithms
Artificial Intelligence
Bayes Theorem
Natural Language Processing
Original Investigations
Vocabulary, Controlled
Title A multi-aspect comparison study of supervised word sense disambiguation
URI https://dx.doi.org/10.1197/jamia.M1533
https://www.ncbi.nlm.nih.gov/pubmed/15064284
https://www.proquest.com/docview/220781166
https://www.proquest.com/docview/17675818
https://www.proquest.com/docview/1875844866
https://www.proquest.com/docview/66651396
https://pubmed.ncbi.nlm.nih.gov/PMC436083
http://doi.org/10.1197/jamia.M1533
UnpaywallVersion submittedVersion
Volume 11