A multi-aspect comparison study of supervised word sense disambiguation

The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning for disambiguating the sense of a term in a context) and compare supervised WSD in the biomedical domain with that in the general English domain. The s...

Full description

Saved in:

Bibliographic Details
Published in	Journal of the American Medical Informatics Association : JAMIA Vol. 11; no. 4; pp. 320 - 331
Main Authors	Liu, Hongfang, Teller, Virginia, Friedman, Carol
Format	Journal Article
Language	English
Published	England Elsevier Inc 01.07.2004 Oxford University Press American Medical Informatics Association
Subjects	Abbreviations as Topic Algorithms Artificial Intelligence Bayes Theorem Natural Language Processing Original Investigations Vocabulary, Controlled
Online Access	Get full text
ISSN	1067-5027 1527-974X 1527-974X
DOI	10.1197/jamia.M1533

Cover

Abstract	The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning for disambiguating the sense of a term in a context) and compare supervised WSD in the biomedical domain with that in the general English domain. The study involves three data sets (a biomedical abbreviation data set, a general biomedical term data set, and a general English data set). The authors implemented three machine-learning algorithms, including (1) naı̈ve Bayes (NBL) and decision lists (TDLL), (2) their adaptation of decision lists (ODLL), and (3) their mixed supervised learning (MSL). There were six feature representations (various combinations of collocations, bag of words, oriented bag of words, etc.) and five window sizes (2, 4, 6, 8, and 10). Supervised WSD is suitable only when there are enough sense-tagged instances with at least a few dozens of instances for each sense. Collocations combined with neighboring words are appropriate selections for the context. For terms with unrelated biomedical senses, a large window size such as the whole paragraph should be used, while for general English words a moderate window size between 4 and 10 should be used. The performance of the authors' implementation of decision list classifiers for abbreviations was better than that of traditional decision list classifiers. However, the opposite held for the other two sets. Also, the authors' mixed supervised learning was stable and generally better than others for all sets. From this study, it was found that different aspects of supervised WSD depend on each other. The experiment method presented in the study can be used to select the best supervised WSD classifier for each ambiguous term.
AbstractList	OBJECTIVE: The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning for disambiguating the sense of a term in a context) and compare supervised WSD in the biomedical domain with that in the general English domain. METHODS:The study involves three data sets (a biomedical abbreviation data set, a general biomedical term data set, and a general English data set). The authors implemented three machine-learning algorithms, including (1) naive Bayes (NBL) and decision lists (TDLL), (2) their adaptation of decision lists (ODLL), and (3) their mixed supervised learning (MSL). There were six feature representations (various combinations of collocations, bag of words, oriented bag of words, etc.) and five window sizes (2, 4, 6, 8, and 10). RESULTS: Supervised WSD is suitable only when there are enough sense-tagged instances with at least a few dozens of instances for each sense. Collocations combined with neighboring words are appropriate selections for the context. For terms with unrelated biomedical senses, a large window size such as the whole paragraph should be used, while for general English words a moderate window size between 4 and 10 should be used. The performance of the authors' implementation of decision list classifiers for abbreviations was better than that of traditional decision list classifiers. However, the opposite held for the other two sets. Also, the authors' mixed supervised learning was stable and generally better than others for all sets. CONCLUSION: From this study, it was found that different aspects of supervised WSD depend on each other. The experiment method presented in the study can be used to select the best supervised WSD classifier for each ambiguous term. Objective: The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning for disambiguating the sense of a term in a context) and compare supervised WSD in the biomedical domain with that in the general English domain. Methods:The study involves three data sets (a biomedical abbreviation data set, a general biomedical term data set, and a general English data set). The authors implemented three machine-learning algorithms, including (1) naïve Bayes (NBL) and decision lists (TDLL), (2) their adaptation of decision lists (ODLL), and (3) their mixed supervised learning (MSL). There were six feature representations (various combinations of collocations, bag of words, oriented bag of words, etc.) and five window sizes (2, 4, 6, 8, and 10). Results: Supervised WSD is suitable only when there are enough sense-tagged instances with at least a few dozens of instances for each sense. Collocations combined with neighboring words are appropriate selections for the context. For terms with unrelated biomedical senses, a large window size such as the whole paragraph should be used, while for general English words a moderate window size between 4 and 10 should be used. The performance of the authors' implementation of decision list classifiers for abbreviations was better than that of traditional decision list classifiers. However, the opposite held for the other two sets. Also, the authors' mixed supervised learning was stable and generally better than others for all sets. Conclusion: From this study, it was found that different aspects of supervised WSD depend on each other. The experiment method presented in the study can be used to select the best supervised WSD classifier for each ambiguous term. The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning for disambiguating the sense of a term in a context) and compare supervised WSD in the biomedical domain with that in the general English domain. The study involves three data sets (a biomedical abbreviation data set, a general biomedical term data set, and a general English data set). The authors implemented three machine-learning algorithms, including (1) naı̈ve Bayes (NBL) and decision lists (TDLL), (2) their adaptation of decision lists (ODLL), and (3) their mixed supervised learning (MSL). There were six feature representations (various combinations of collocations, bag of words, oriented bag of words, etc.) and five window sizes (2, 4, 6, 8, and 10). Supervised WSD is suitable only when there are enough sense-tagged instances with at least a few dozens of instances for each sense. Collocations combined with neighboring words are appropriate selections for the context. For terms with unrelated biomedical senses, a large window size such as the whole paragraph should be used, while for general English words a moderate window size between 4 and 10 should be used. The performance of the authors' implementation of decision list classifiers for abbreviations was better than that of traditional decision list classifiers. However, the opposite held for the other two sets. Also, the authors' mixed supervised learning was stable and generally better than others for all sets. From this study, it was found that different aspects of supervised WSD depend on each other. The experiment method presented in the study can be used to select the best supervised WSD classifier for each ambiguous term. OBJECTIVEThe aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning for disambiguating the sense of a term in a context) and compare supervised WSD in the biomedical domain with that in the general English domain.METHODSThe study involves three data sets (a biomedical abbreviation data set, a general biomedical term data set, and a general English data set). The authors implemented three machine-learning algorithms, including (1) naïve Bayes (NBL) and decision lists (TDLL), (2) their adaptation of decision lists (ODLL), and (3) their mixed supervised learning (MSL). There were six feature representations (various combinations of collocations, bag of words, oriented bag of words, etc.) and five window sizes (2, 4, 6, 8, and 10).RESULTSSupervised WSD is suitable only when there are enough sense-tagged instances with at least a few dozens of instances for each sense. Collocations combined with neighboring words are appropriate selections for the context. For terms with unrelated biomedical senses, a large window size such as the whole paragraph should be used, while for general English words a moderate window size between 4 and 10 should be used. The performance of the authors' implementation of decision list classifiers for abbreviations was better than that of traditional decision list classifiers. However, the opposite held for the other two sets. Also, the authors' mixed supervised learning was stable and generally better than others for all sets.CONCLUSIONFrom this study, it was found that different aspects of supervised WSD depend on each other. The experiment method presented in the study can be used to select the best supervised WSD classifier for each ambiguous term. The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning for disambiguating the sense of a term in a context) and compare supervised WSD in the biomedical domain with that in the general English domain. The study involves three data sets (a biomedical abbreviation data set, a general biomedical term data set, and a general English data set). The authors implemented three machine-learning algorithms, including (1) naïve Bayes (NBL) and decision lists (TDLL), (2) their adaptation of decision lists (ODLL), and (3) their mixed supervised learning (MSL). There were six feature representations (various combinations of collocations, bag of words, oriented bag of words, etc.) and five window sizes (2, 4, 6, 8, and 10). Supervised WSD is suitable only when there are enough sense-tagged instances with at least a few dozens of instances for each sense. Collocations combined with neighboring words are appropriate selections for the context. For terms with unrelated biomedical senses, a large window size such as the whole paragraph should be used, while for general English words a moderate window size between 4 and 10 should be used. The performance of the authors' implementation of decision list classifiers for abbreviations was better than that of traditional decision list classifiers. However, the opposite held for the other two sets. Also, the authors' mixed supervised learning was stable and generally better than others for all sets. From this study, it was found that different aspects of supervised WSD depend on each other. The experiment method presented in the study can be used to select the best supervised WSD classifier for each ambiguous term.
Author	Liu, Hongfang Teller, Virginia Friedman, Carol
AuthorAffiliation	Affiliations of the authors: Department of Information Systems, University of Maryland at Baltimore County, Baltimore, MD (HL); Department of Computer Science, Hunter College, City University of New York, New York, NY (VT); Department of Biomedical Informatics, Columbia University, New York, NY (CF)
AuthorAffiliation_xml	– name: Affiliations of the authors: Department of Information Systems, University of Maryland at Baltimore County, Baltimore, MD (HL); Department of Computer Science, Hunter College, City University of New York, New York, NY (VT); Department of Biomedical Informatics, Columbia University, New York, NY (CF)
Author_xml	– sequence: 1 givenname: Hongfang surname: Liu fullname: Liu, Hongfang email: hfliu@umbc.edu – sequence: 2 givenname: Virginia surname: Teller fullname: Teller, Virginia – sequence: 3 givenname: Carol surname: Friedman fullname: Friedman, Carol
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/15064284$$D View this record in MEDLINE/PubMed
BookMark	eNqFkk1rFTEUhoNU7Ieu3MvgQgSdmjOZJDMLF6W0VWhxo-AuZJJzay4zyZjM3HL_vbkfUC1WVwnkeQ_kec8xOfDBIyEvgZ4CtPLDUg9On94AZ-wJOQJeybKV9feDfKdClpxW8pAcp7SkFETF-DNyCJyKumrqI3J1VgxzP7lSpxHNVJgwjDq6FHyRptmui7Ao0jxiXLmEtrgL0RYJfcLCuqSHzt3OenLBPydPF7pP-GJ_npBvlxdfzz-V11-uPp-fXZeGy3YqQRigVLe2aoylgmk0lHJmpbAoAGpa865r21oY0XSyw45CxyrKu5aCZNKwE_J-N3f2o17f6b5XY3SDjmsFVG18qK0PNWx8ZPzjDh_nbkBr0E9R30eCdurPF-9-qNuwUjUTtNnk3-zzMfycMU1qcMlg32uPYU5KCMGBtSKDb_8JQiN5U9eNEP-dCVJkFpoMvn4ALsMcfdarqorKBmA77dXvP7y3sa84A-92gIkhpYiLR4Td7IXBA9q4adtw1uP6RzJ8l8Fc_MphVMk49Aati3mnlA3ur7lfCR3bzg
CitedBy_id	crossref_primary_10_1136_amiajnl_2012_001244 crossref_primary_10_1016_j_jbi_2014_12_013 crossref_primary_10_1109_ACCESS_2021_3119621 crossref_primary_10_1186_1471_2105_10_28 crossref_primary_10_1186_s12859_019_3079_8 crossref_primary_10_1051_itmconf_20235603001 crossref_primary_10_1186_1471_2105_9_S11_S7 crossref_primary_10_3414_ME12_01_0040 crossref_primary_10_35596_1729_7648_2019_123_5_60_65 crossref_primary_10_4338_ACI_2014_10_RA_0088 crossref_primary_10_1089_cmb_2005_12_554 crossref_primary_10_1016_j_ipm_2011_09_005 crossref_primary_10_1093_jamia_ocw109 crossref_primary_10_1002_asi_20257 crossref_primary_10_4137_BII_S38308 crossref_primary_10_7717_peerj_cs_2541 crossref_primary_10_1016_j_jbi_2012_06_003 crossref_primary_10_1186_1471_2105_7_334 crossref_primary_10_1016_j_jbi_2010_02_005 crossref_primary_10_1136_amiajnl_2012_001350 crossref_primary_10_4258_hir_2015_21_1_35 crossref_primary_10_1186_1471_2105_12_223 crossref_primary_10_1016_j_jbi_2022_104229 crossref_primary_10_1186_1471_2105_6_149 crossref_primary_10_1016_j_cosrev_2022_100511 crossref_primary_10_1186_1471_2105_10_S3_S4 crossref_primary_10_1197_jamia_M2085 crossref_primary_10_1016_j_jbi_2013_09_009 crossref_primary_10_1093_jamia_ocy013 crossref_primary_10_1007_s10257_014_0259_y crossref_primary_10_1093_bioinformatics_btq555 crossref_primary_10_1136_amiajnl_2011_000415 crossref_primary_10_1016_j_jbi_2010_08_009 crossref_primary_10_1016_j_patcog_2017_10_028 crossref_primary_10_1197_jamia_M2927
ContentType	Journal Article
Copyright	2004 American Medical Informatics Association Copyright Hanley & Belfus, Inc. Jul/Aug 2004 Copyright © 2004, American Medical Informatics Association 2004
Copyright_xml	– notice: 2004 American Medical Informatics Association – notice: Copyright Hanley & Belfus, Inc. Jul/Aug 2004 – notice: Copyright © 2004, American Medical Informatics Association 2004
DBID	AAYXX CITATION CGR CUY CVF ECM EIF NPM 3V. 7RV 7X7 7XB 88C 88E 88I 8AF 8AL 8AO 8FE 8FG 8FI 8FJ 8FK ABUWG AFKRA ARAPS AZQEC BENPR BGLVJ CCPQU DWQXO FYUFA GHDGH GNUQQ HCIFZ JQ2 K7- K9. KB0 M0N M0S M0T M1P M2P NAPCQ P5Z P62 PHGZM PHGZT PJZUB PKEHL PPXIY PQEST PQGLB PQQKQ PQUKI PRINS Q9U S0X 7QO 8FD FR3 P64 7X8 5PM ADTOC UNPAY
DOI	10.1197/jamia.M1533
DatabaseName	CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed ProQuest Central (Corporate) Nursing & Allied Health Database (Proquest) ProQuest Health & Medical Complete ProQuest Central (purchase pre-March 2016) Healthcare Administration Database (Alumni) Medical Database (Alumni Edition) Science Database (Alumni Edition) STEM Database Computing Database (Alumni Edition) ProQuest Pharma Collection ProQuest SciTech Collection ProQuest Technology Collection Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest Central UK/Ireland Advanced Technologies & Computer Science Collection ProQuest Central Essentials - QC ProQuest Central ProQuest Technology Collection (LUT) ProQuest One ProQuest Central Proquest Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Central Student SciTech Premium Collection (Proquest) ProQuest Computer Science Collection Computer Science Database ProQuest Health & Medical Complete (Alumni) Nursing & Allied Health Database (Alumni Edition) Computing Database ProQuest Health & Medical Collection Healthcare Administration Database Medical Database Science Database Nursing & Allied Health Premium Advanced Technologies & Aerospace Collection ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Premium ProQuest One Academic ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China ProQuest Central Basic SIRS Editorial Biotechnology Research Abstracts Technology Research Database Engineering Research Database Biotechnology and BioEngineering Abstracts MEDLINE - Academic PubMed Central (Full Participant titles) Unpaywall for CDI: Periodical Content Unpaywall
DatabaseTitle	CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Computer Science Database ProQuest Central Student ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection ProQuest AP Science SciTech Premium Collection ProQuest Central China ProQuest One Applied & Life Sciences Health Research Premium Collection Health & Medical Research Collection ProQuest Central (New) ProQuest Medical Library (Alumni) Advanced Technologies & Aerospace Collection ProQuest Science Journals (Alumni Edition) ProQuest One Academic Eastern Edition ProQuest Hospital Collection ProQuest Technology Collection Health Research Premium Collection (Alumni) ProQuest Hospital Collection (Alumni) Nursing & Allied Health Premium ProQuest Health & Medical Complete ProQuest One Academic UKI Edition ProQuest Health Management (Alumni Edition) ProQuest Nursing & Allied Health Source (Alumni) ProQuest One Academic ProQuest One Academic (New) Technology Collection ProQuest One Academic Middle East (New) SIRS Editorial ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Pharma Collection ProQuest Central ProQuest Health & Medical Research Collection Health and Medicine Complete (Alumni Edition) ProQuest Central Korea ProQuest Computing ProQuest Central Basic ProQuest Science Journals ProQuest Computing (Alumni Edition) ProQuest Health Management ProQuest Nursing & Allied Health Source ProQuest SciTech Collection Advanced Technologies & Aerospace Database ProQuest Medical Library ProQuest Central (Alumni) Engineering Research Database Biotechnology Research Abstracts Technology Research Database Biotechnology and BioEngineering Abstracts MEDLINE - Academic
DatabaseTitleList	Engineering Research Database MEDLINE - Academic Engineering Research Database MEDLINE
Database_xml	– sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository – sequence: 4 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine
EISSN	1527-974X
EndPage	331
ExternalDocumentID	oai:pubmedcentral.nih.gov:436083 PMC436083 841262421 15064284 10_1197_jamia_M1533 S1067502704000544
Genre	Research Support, U.S. Gov't, Non-P.H.S Research Support, U.S. Gov't, P.H.S Comparative Study Journal Article General Information
GrantInformation_xml	– fundername: NLM NIH HHS grantid: LM06274
GroupedDBID	--- --K .DC .GJ 0R~ 18M 1B1 1TH 29L 2WC 3V. 4.4 48X 53G 5GY 5RE 5WD 6PF 7RV 7X7 7~T 88E 88I 8AF 8AO 8FE 8FG 8FI 8FJ 8FW 8RD AABJS AABMN AABZA AACZT AAEDT AAESY AAIYJ AAJQQ AALRI AAMVS AAOGV AAPGJ AAPQZ AAPXW AAUQX AAVAP AAWDT AAWTL AAXUO ABEUO ABIXL ABJNI ABNHQ ABOCM ABPTD ABQLI ABSAR ABSMQ ABUWG ABWST ABXVV ACFRR ACGFO ACGFS ACGOD ACHQT ACIMA ACUFI ACUTJ ACYHN ADBBV ADEIU ADGZP ADHKW ADHZD ADIPN ADJOM ADJQC ADMUD ADORX ADQLU ADRIX ADRTK ADVEK ADYVW AEGPL AEJOX AEKSI AEMDU AENEX AENZO AEPUE AETBJ AEWNT AFFZL AFIYH AFKRA AFOFC AFXEN AFYAG AGINJ AGQXC AGSYK AGUTN AHMBA AIKOY AIMBJ AJEEA ALMA_UNASSIGNED_HOLDINGS ALUQC APIBT APJGH AQDSO AQKUS AQUVI ARAPS ASMCH AVNTJ AVWKF AWCFO AXUDD AYCSE AZQEC AZQFJ BAWUL BAYMD BCRHZ BENPR BEYMZ BGLVJ BGYMP BHONS BKEYQ BPHCQ BTRTY BVRKM BVXVI BYORX BZKNY C1A C45 CASEJ CCPQU CDBKE CS3 DAKXR DIK DILTD DPPUQ DU5 DWQXO E3Z EBD EBS EIHJH EJD EMOBN ENERS EO8 EX3 F5P FDB FECEO FLUFQ FOEOM FOTVD FQBLK FYUFA G-Q GAUVT GJXCC GNUQQ GX1 HAR HCIFZ HMCUK IH2 IHE J21 K6V K7- KBUDW KOP KSI KSN LSO M0N M0T M1P M2P M2Q M41 MBLQV MHKGH NAPCQ NOMLY NOYVH NQ- NVLIB O9- OAUYM OAWHX OCZFY ODMLO OJQWA OJZSN OK1 OPAEJ OVD OWPYF P2P P62 PAFKI PCD PEELM PQQKQ PROAC PSQYO Q5Y R53 RIG ROL ROX ROZ RPM RPZ RUSNO RWL RXO S0X SSZ SV3 TAE TEORI TJX TMA UKHRP WOQ WOW YAYTL YHZ YKOAZ YXANX ZGI ~S- 77I AARHZ AAUAY AAYXX ABDFA ABEJV ABGNP ABPQP ABQNK ABVGC ADNBA ADQBN AEMQT AFXAL AHMMS AJBYB AJNCP ALXQX ATGXG CITATION JXSIZ PHGZM PHGZT PJZUB PPXIY PQGLB PUEGO ABWVN ACRPL ACZBC ADNMO AFFQV AGKRT AGMDO ALIPV CGR CUY CVF ECM EIF H13 NPM NU- 7XB 8AL 8FK JQ2 K9. PKEHL PQEST PQUKI PRINS Q9U 7QO 8FD FR3 P64 7X8 5PM ACVCV ADMTO ADTOC AHGBF AJDVS OBFPC UNPAY
ID	FETCH-LOGICAL-c579t-16c100a9d28cd063aec0053d76de6114045bb9946c68b7beb01b3205b901737c3
IEDL.DBID	UNPAY
ISSN	1067-5027 1527-974X
IngestDate	Sun Oct 26 03:25:19 EDT 2025 Tue Sep 30 16:45:06 EDT 2025 Wed Oct 01 09:37:03 EDT 2025 Tue Oct 07 09:40:31 EDT 2025 Mon Oct 06 18:00:46 EDT 2025 Mon Oct 06 17:13:35 EDT 2025 Wed Feb 19 01:37:37 EST 2025 Thu Apr 24 22:55:29 EDT 2025 Wed Oct 01 02:43:26 EDT 2025 Fri Feb 23 02:38:32 EST 2024
IsDoiOpenAccess	false
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	4
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c579t-16c100a9d28cd063aec0053d76de6114045bb9946c68b7beb01b3205b901737c3
Notes	SourceType-Scholarly Journals-1 ObjectType-General Information-1 content type line 14 ObjectType-Article-2 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2 Supported in part by NLM grant LM06274 and NSF grant NSF 0312250.
OpenAccessLink	https://proxy.k.utb.cz/login?url=http://doi.org/10.1197/jamia.M1533
PMID	15064284
PQID	220781166
PQPubID	23462
PageCount	12
ParticipantIDs	unpaywall_primary_10_1197_jamia_m1533 pubmedcentral_primary_oai_pubmedcentral_nih_gov_436083 proquest_miscellaneous_66651396 proquest_miscellaneous_1875844866 proquest_miscellaneous_17675818 proquest_journals_220781166 pubmed_primary_15064284 crossref_primary_10_1197_jamia_M1533 crossref_citationtrail_10_1197_jamia_M1533 elsevier_sciencedirect_doi_10_1197_jamia_M1533
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2004-07-01
PublicationDateYYYYMMDD	2004-07-01
PublicationDate_xml	– month: 07 year: 2004 text: 2004-07-01 day: 01
PublicationDecade	2000
PublicationPlace	England
PublicationPlace_xml	– name: England – name: Oxford
PublicationTitle	Journal of the American Medical Informatics Association : JAMIA
PublicationTitleAlternate	J Am Med Inform Assoc
PublicationYear	2004
Publisher	Elsevier Inc Oxford University Press American Medical Informatics Association
Publisher_xml	– name: Elsevier Inc – name: Oxford University Press – name: American Medical Informatics Association
References	Veronis J, Ide N. Very large neural networks for natural language processing. Proceedings of the European Conference on Artificial Intelligence. 1990:366–8. Mooney R. Comparative experiments on disambiguating word senses: an illustration of the role of bias in machine learning. Proceedings of the First Conference on Empirical Methods in Natural Language Processing. 1996:82–91. Duda, Hart (BIB17) 1973 Leacock C, Towell G, Voorhees EM. Corpus-based statistical sense resolution. Proceedings of the Advanced Research Projects Agency (ARPA) Workshop on Human Language Technology. 1993:260–5. Liu, Lussier, Friedman (BIB8) 2001; 34 Fujii, Inui, Tokunaga, Tanaka (BIB7) 1998; 24 Ng HT, Lee HB. Integrating multiple knowledge sources to disambiguate word sense: an exemplar-based approach. Proceedings of the Thirty-Fourth Annual meeting of the Association of computational Linguistics. 1996:40–7 Ng HT. Getting serious about word-sense disambiguation. Proceedings of the ACL SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What and How? 1997:1–7. Liu, Johnson, Friedman (BIB9) 2002; 9 Towell, Voorhees (BIB11) 1998; 24 Jorgensen (BIB19) 1990; 19 Kilgarriff, Rosenzweig (BIB3) 1999; 34 Bruce R, Wiebe J. Word-sense disambiguation using decomposable models. Proceedings of the Thirty-Second Annual Meeting of the Association of Computational Linguistics. 1994:139–46. Marquez L. Machine learning and natural language processing. Technical Report LSI-00-45-R, Departament de Llenguatges i Sistemes Informàtics, Universitat Politècnica de Catalunya, 2000. Aha, Kibler, Albert (BIB18) 1991; 7 Escudero G, Marquez L, Rigau G. Boosting applied to word sense disambiguation. Proceedings of the European Conference on Machine Learning. 2000:129–141. Yarowsky D. Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French. Proceedings of the Thirty-Second Annual meeting of the Association of computational Linguistics. 1994:88–95. Escudero G, Marquez L, Rigau G. Naive Bayes and exemplar-based approaches to word sense disambiguation revisited. Proceeding of the 14th European Conference on Artificial Intelligence (ECAI). 2000:421–425. Mooney (BIB15) 1997 Weeber, Mork, Aronson (BIB22) 2001 Ide, Veronis (BIB1) 1998; 24 Witten, Bell (BIB24) 1991; 37 Ng HT. Examplar-based word sense disambiguation: some recent improvements. Proceedings of the Second Conference on Empirical methods in natural Language Processing. 1997:208–213. Ng, Zelle (BIB2) 1997; winter Engelson SP, Dagan I. Minimizing manual annotation cost in supervised training from corpora. Proceedings of the Thirty-Fourth Annual meeting of the Association of computational Linguistics. 1996;34:319–26. Leacock, Chodorow, Miller (BIB20) 1998; 24 12386113 - J Am Med Inform Assoc. 2002 Nov-Dec;9(6):621-36 11977807 - J Biomed Inform. 2001 Aug;34(4):249-61 11825285 - Proc AMIA Symp. 2001;:746-50
References_xml	– volume: 37 start-page: 1085 year: 1991 end-page: 1094 ident: BIB24 article-title: The zero-frequency problem: estimating the probabilities of novel events in adaptive text compression publication-title: IEEE Trans Inf Theory – reference: Veronis J, Ide N. Very large neural networks for natural language processing. Proceedings of the European Conference on Artificial Intelligence. 1990:366–8. – volume: 34 start-page: 1 year: 1999 end-page: 2 ident: BIB3 article-title: Framework and results for English SENSEVAL publication-title: Comput Humanities – volume: 7 start-page: 33 year: 1991 end-page: 66 ident: BIB18 article-title: Instance-based learning algorithms publication-title: Machine Learning – volume: 9 start-page: 621 year: 2002 end-page: 636 ident: BIB9 article-title: Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS publication-title: J Am Med Inform Assoc – volume: 34 start-page: 249 year: 2001 end-page: 261 ident: BIB8 article-title: Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method publication-title: J Biomed Inform – volume: 24 start-page: 1 year: 1998 end-page: 40 ident: BIB1 article-title: Introduction to the special issue on word sense disambiguation: the state of the art publication-title: Computational Linguistics – reference: Ng HT. Examplar-based word sense disambiguation: some recent improvements. Proceedings of the Second Conference on Empirical methods in natural Language Processing. 1997:208–213. – reference: Engelson SP, Dagan I. Minimizing manual annotation cost in supervised training from corpora. Proceedings of the Thirty-Fourth Annual meeting of the Association of computational Linguistics. 1996;34:319–26. – reference: Marquez L. Machine learning and natural language processing. Technical Report LSI-00-45-R, Departament de Llenguatges i Sistemes Informàtics, Universitat Politècnica de Catalunya, 2000. – volume: 24 start-page: 147 year: 1998 end-page: 165 ident: BIB20 article-title: Using corpus statistics and WordNet relations for sense identification publication-title: Computational Linguistics – reference: Bruce R, Wiebe J. Word-sense disambiguation using decomposable models. Proceedings of the Thirty-Second Annual Meeting of the Association of Computational Linguistics. 1994:139–46. – reference: Yarowsky D. Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French. Proceedings of the Thirty-Second Annual meeting of the Association of computational Linguistics. 1994:88–95. – year: 1973 ident: BIB17 article-title: Pattern Classification and Scene Analysis – reference: Escudero G, Marquez L, Rigau G. Naive Bayes and exemplar-based approaches to word sense disambiguation revisited. Proceeding of the 14th European Conference on Artificial Intelligence (ECAI). 2000:421–425. – volume: 19 start-page: 167 year: 1990 end-page: 190 ident: BIB19 article-title: The psychological reality of word senses publication-title: J Psycholinguist Res – volume: 24 start-page: 573 year: 1998 end-page: 597 ident: BIB7 article-title: Selective sampling for example-based word sense disambiguation publication-title: Computational Linguistics – reference: Escudero G, Marquez L, Rigau G. Boosting applied to word sense disambiguation. Proceedings of the European Conference on Machine Learning. 2000:129–141. – volume: 24 start-page: 125 year: 1998 end-page: 146 ident: BIB11 article-title: Disambiguating highly ambiguous words publication-title: Computational Linguistics – year: 1997 ident: BIB15 article-title: Inductive logic programming for natural language processing publication-title: Inductive Logic Programming: Selected Papers from the 6th International Workshop – reference: Leacock C, Towell G, Voorhees EM. Corpus-based statistical sense resolution. Proceedings of the Advanced Research Projects Agency (ARPA) Workshop on Human Language Technology. 1993:260–5. – reference: Mooney R. Comparative experiments on disambiguating word senses: an illustration of the role of bias in machine learning. Proceedings of the First Conference on Empirical Methods in Natural Language Processing. 1996:82–91. – reference: Ng HT, Lee HB. Integrating multiple knowledge sources to disambiguate word sense: an exemplar-based approach. Proceedings of the Thirty-Fourth Annual meeting of the Association of computational Linguistics. 1996:40–7 – start-page: 746 year: 2001 end-page: 750 ident: BIB22 article-title: Developing a test collection for biomedical word sense disambiguation publication-title: Proc AMIA Symp – volume: winter start-page: 45 year: 1997 end-page: 64 ident: BIB2 article-title: Corpus-based approaches to semantic interpretation in natural language processing publication-title: AI Magazine – reference: Ng HT. Getting serious about word-sense disambiguation. Proceedings of the ACL SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What and How? 1997:1–7. – reference: 12386113 - J Am Med Inform Assoc. 2002 Nov-Dec;9(6):621-36 – reference: 11825285 - Proc AMIA Symp. 2001;:746-50 – reference: 11977807 - J Biomed Inform. 2001 Aug;34(4):249-61
SSID	ssj0016235
Score	2.0107465
Snippet	The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning for... OBJECTIVE: The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning... OBJECTIVEThe aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning... Objective: The aim of this study was to investigate relations among different aspects in supervised word sense disambiguation (WSD; supervised machine learning...
SourceID	unpaywall pubmedcentral proquest pubmed crossref elsevier
SourceType	Open Access Repository Aggregation Database Index Database Enrichment Source Publisher
StartPage	320
SubjectTerms	Abbreviations as Topic Algorithms Artificial Intelligence Bayes Theorem Natural Language Processing Original Investigations Vocabulary, Controlled
SummonAdditionalLinks	– databaseName: ProQuest Central dbid: BENPR link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3db9MwED-NTgJe0PgO48NI4wUpo05iO3lA0xgbE1InxJjYW2Q7zlapSwtthfbfc-c4gWnrnnOp1PP57n65u98BbNWWpw5xR6xziQClSKtY1whcFbfcGYznPKHZ4dGRPDzJvp6K0zUYdbMw1FbZ-UTvqKuppW_kH5KEaGm4lDuzXzEtjaLiardBQ4fNCtVHzzB2B9YTIsYawPqn_aNv3_uyAsZ64cuf6B0EArIwsMcLRTREY709ovRnVYi6noJe76S8t2xm-vKPnkz-C1MHG_Ag5JdstzWIh7DmmkdwdxQq6I_hyy7zM7ex9jOWbK_fQ8iopfCSTWt2vJyRB5m7iv1EbMqOEeo69nk81xdmfNZygz-Bk4P9H3uHcVimEFuhikXMpeXDoS6qJLcV5iXaWbqAlZKVk5xIdoQxRZFJK3OjDJ4TN2kyFAYTBpUqmz6FQTNt3HNg6BYQVgpXpxmCt9TkqGqn0XUonRW81hG87_RX2sA0TgsvJqVHHIUqvbJLr-wItnrhWUuwcbPYdncQZcgN2phfouu_-YXN7rjKcC3nZW9EEbzpn-J9oiKJbtx0OS85sdtgFhPB21USCPFyRLW3_QqCQoG5NUo8aw3k338THvJlEcgrptMLEN_31SfN-NzzfmepxIQ5gne9ja3Q2AUp4MWtCtiE-20PEjUev4TB4vfSvcL0amFeh0vzFyrKJe8 priority: 102 providerName: ProQuest
Title	A multi-aspect comparison study of supervised word sense disambiguation
URI	https://dx.doi.org/10.1197/jamia.M1533 https://www.ncbi.nlm.nih.gov/pubmed/15064284 https://www.proquest.com/docview/220781166 https://www.proquest.com/docview/17675818 https://www.proquest.com/docview/1875844866 https://www.proquest.com/docview/66651396 https://pubmed.ncbi.nlm.nih.gov/PMC436083 http://doi.org/10.1197/jamia.M1533
UnpaywallVersion	submittedVersion
Volume	11
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVBFR databaseName: Free Medical Journals customDbUrl: eissn: 1527-974X dateEnd: 20241105 omitProxy: true ssIdentifier: ssj0016235 issn: 1527-974X databaseCode: DIK dateStart: 19940101 isFulltext: true titleUrlDefault: http://www.freemedicaljournals.com providerName: Flying Publisher – providerCode: PRVFQY databaseName: GFMER Free Medical Journals customDbUrl: eissn: 1527-974X dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0016235 issn: 1527-974X databaseCode: GX1 dateStart: 19940101 isFulltext: true titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php providerName: Geneva Foundation for Medical Education and Research – providerCode: PRVAQN databaseName: PubMed Central customDbUrl: eissn: 1527-974X dateEnd: 20241105 omitProxy: true ssIdentifier: ssj0016235 issn: 1527-974X databaseCode: RPM dateStart: 19940101 isFulltext: true titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/ providerName: National Library of Medicine – providerCode: PRVPQU databaseName: Health & Medical Collection customDbUrl: eissn: 1527-974X dateEnd: 20060331 omitProxy: true ssIdentifier: ssj0016235 issn: 1527-974X databaseCode: 7X7 dateStart: 20010701 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 1527-974X dateEnd: 20060331 omitProxy: true ssIdentifier: ssj0016235 issn: 1527-974X databaseCode: BENPR dateStart: 20010701 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Technology Collection customDbUrl: eissn: 1527-974X dateEnd: 20060331 omitProxy: true ssIdentifier: ssj0016235 issn: 1527-974X databaseCode: 8FG dateStart: 20010701 isFulltext: true titleUrlDefault: https://search.proquest.com/technologycollection1 providerName: ProQuest
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1bb9MwFD7aWgl44X4Jg2Kk8YKUqs7FTh5Lt26AWk2MivIU2Y7DqnVpRRuh8es5di7b1HXiJVLk40ixfezvk8_5DsB-pqivkXe4ImJIUGI_dUWGxJVTRbXE85x6Jnd4NGbHk-DLNJzuQF117fr1PY25EQeaie7IgJJdaLMQ8XYL2pPxSf-nvcZELw97ti6rKc_qIjieVkl4V70vTO9tx84mrNyMjrxf5Etx-UfM59eOnuEjGNQJPGXEyXm3WMuu-rup53jHXz2GhxXyJP1yqTyBHZ0_hXuj6m79GRz1ic3GdYXNviSDpkIhMcGGl2SRkdNiafaWlU7JD2St5BRJsCYHs5W4kLNfpWr4c5gMD78Pjt2qzIKrQh6vXcoU7fVEnHqRShGxCK2Ma6acpZpRI78TShnHAVMsklziDFLpe71QIpTgPlf-C2jli1y_AoIbBhLOUGd-gLTOlxESSi1wU-EiiGkmHPhYz0KiKg1yUwpjnlguEvPEDk1ih8aB_cZ4WUpv3G7WraczqVBDiQYSHPbbO-zVk55UDrtKPM-oHlHGHHjXtKKnmesTketFsUqo0b1BfOPA-20WSP4i5Lt3fQXpYoioGy1elsvs6t9CSwYDB9iNBdgYGCXwmy357Mwqggc-QyjtwIdmpW4ZMesLr__Tbg8elHFKJjj5DbTWvwv9FiHYWnZgl085PqPhUQfanw7HJ9_w7eDz107lnf8A2dozGQ
linkProvider	Unpaywall
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1LT9tAEB5RkAqXqu-6tGUrwaWSadaPtX1AFeXRUEhUFRDctrvrNY0UnLROhPLj-t86u167RRBunD2xktnZmfkyM98ArBeKhhpxhy9ShgAlC3NfFAhcE6qolhjPaWBmh3t91j2Nvp7H5wvwp5mFMW2VjU-0jjofKfMf-ccgMLQ0lLFP41--WRpliqvNBg3hNivkW5ZhzM11HOrZFSK4autgF497Iwj29052ur5bMuCrOMkmPmWKdjoiy4NU5RivhVbGMPOE5ZpRQz4TS5llEVMslYnE709lGHRiiYE0CRMV4nsfwFIURhliv6XPe_1v39syBuYWsS23ojeKEQC6AUGaJYb2aCA2eybdmhcSb6a8Nzs3l6flWMyuxHD4X1jcfwyPXD5LtmsDfAILunwKD3uuYv8MvmwTO-PrCzvTSXbavYfEtDDOyKggx9Ox8ViVzskZapIcI7TWZHdQiUs5uKi5yJ_D6b3o9QUslqNSvwKCbghhbKyLMEKwGMoUYaoW6KoSEWW0EB58aPTHlWM2Nws2htwinCzhVtncKtuD9VZ4XBN63C622RwEd7lInWNwDDW3f2C1OS7u3EDFW6P1YK19ivfXFGVEqUfTilPDpoNZkwfv50kgpEwRRd_1FgShMebyKPGyNpB_vy22EDPygF0znVbA8Itff1IOflqe8ShkmKB7sNHa2ByNXRoFvL5TAWuw3D3pHfGjg_7hKqzU_U-m6fkNLE5-T_VbTO0m8p27QAR-3Ped_QvOamFb
linkToPdf	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwED-NIQ1eEN9kA2ak7QUpWx0ndvKA0LRSNkYnpDHRN892HFapSwttNfVP47_j7HzAtHVve84lSs7nu98vvg-ArcJQZpF3hCrlSFAyloeqQOIqqKFWYzynkasd7h_zg9P4yyAZrMCfphbGpVU2PtE76nxs3D_y3ShybWko57tFnRXxrdv7OPkVugFS7qC1maZRWciRXVwie5t-OOziUm9HUe_T9_2DsB4wEJpEZLOQckM7HZXlUWpyjNXKGmeUueC55dQ1nkm0zrKYG55qofHdqWZRJ9EYRAUThuFz78F9wVjmsgnFoOV6FFFF4g9a0Q8lSP3q0kCaCdfwaKh2-g5oLQuG18Hu9ZzNB_NyohaXajT6LyD2HsOjGsmSvcr0nsCKLZ_CWr8-q38Gn_eIr-4Nla_mJPvtxEPikhcXZFyQk_nE-aqpzckP1CM5QVJtSXc4VRd6-LPqQv4cTu9Eqy9gtRyX9hUQdEBIYBNbsBhpItMpElSr0EkJFWe0UAG8b_QnTd3T3I3WGEnPbTIhvbKlV3YAW63wpGrlcbPYTrMQskYhFbqQGGRuvmGjWS5ZO4CpbM01gM32Ku5cdxyjSjueTyV1fXQQLwXwbpkEkskU-fNtT0H6mSCKR4mXlYH8-7bEk8s4AH7FdFoB11n86pVyeO47jMeMIzQPYLu1sSUau3AKWL9VAZuwhjtVfj08PtqAh1Xik8t2fg2rs99z-wYx3Uy_9buHwNldb9e_I7Re9Q
linkToUnpaywall	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3da9swED-6FLa97PvDa7dp0L0MHCLbkuzHkK4rg5RBF5Y9CUmW19DUCUvM6P76neSPtqQpe_bJIOlO-v3Q3e8ADgpDY4u8I1QpR4KSxXmoCiSughpqNd7nNHK1w-MTfjxJvk7ZdAfarmvXn-9pJpw40Ez1xw6U3INdzhBv92B3cvJt-NM_Y2KUs4Hvy-ras4YIjqdNEd7V6As3etu1swkrN7MjH1TlUl3-UfP5tavn6DGM2gKeOuPkvF-tdd_83dRzvGNWT-BRgzzJsHaVp7Bjy2dwf9y8rT-HL0Piq3FD5asvyajrUEhcsuElWRTktFq6s2Vlc_IDWSs5RRJsyeFspS707FetGv4CJkefv4-Ow6bNQmiYyNYh5YYOBirLo9TkiFiUNS40c8Fzy6mT32FaZ1nCDU-10LiDVMfRgGmEEiIWJn4JvXJR2tdA8MBAwslsESdI62KdIqG0Cg8VoZKMFiqAT-0uSNNokLtWGHPpuUgmpF8a6ZcmgIPOeFlLb9xu1m-3UzaooUYDEpf99gF77abLJmBXMoqc6hHlPID33VeMNPd8okq7qFaSOt0bxDcBfNhmgeQvRb5711-QLjJE3Wjxqnazq7kxTwaTAPgNB-wMnBL4zS_l7MwrgicxRygdwMfOU7esmI-FN_9ptwcP6zwll5y8D73178q-RQi21u-aGPwH7zEuuQ
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+multi-aspect+comparison+study+of+supervised+word+sense+disambiguation&rft.jtitle=Journal+of+the+American+Medical+Informatics+Association+%3A+JAMIA&rft.au=Liu%2C+Hongfang&rft.au=Teller%2C+Virginia&rft.au=Friedman%2C+Carol&rft.date=2004-07-01&rft.issn=1067-5027&rft.volume=11&rft.issue=4&rft.spage=320&rft.epage=331&rft_id=info:doi/10.1197%2Fjamia.M1533&rft.externalDBID=NO_FULL_TEXT
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1067-5027&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1067-5027&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1067-5027&client=summon