Concept-match medical data scrubbing. How pathology text can be used in research

In the normal course of activity, pathologists create and archive immense data sets of scientifically valuable information. Researchers need pathology-based data sets, annotated with clinical information and linked to archived tissues, to discover and validate new diagnostic tests and therapies. Pat...

Full description

Saved in:

Bibliographic Details
Published in	Archives of pathology & laboratory medicine (1976) Vol. 127; no. 6; p. 680
Main Author	Berman, Jules J
Format	Journal Article
Language	English
Published	United States College of American Pathologists 01.06.2003
Subjects	Algorithms Archives & records Computing Methodologies Database Management Systems - classification Database Management Systems - instrumentation Database Management Systems - supply & distribution Databases, Factual - classification Databases, Factual - supply & distribution Datasets Diagnostic tests Humans Kidney cancer Medical Record Linkage - instrumentation Medical Record Linkage - methods Medical Records Systems, Computerized - classification Medical Records Systems, Computerized - instrumentation Medical Records Systems, Computerized - supply & distribution Medical Records, Problem-Oriented Medical research Pathology Pathology, Clinical - organization & administration Patients Subject Headings Unified Medical Language System - classification Unified Medical Language System - instrumentation Unified Medical Language System - supply & distribution
Online Access	Get full text
ISSN	0003-9985 1543-2165 1543-2165
DOI	10.5858/2003-127-680-CMDS

Cover

Abstract	In the normal course of activity, pathologists create and archive immense data sets of scientifically valuable information. Researchers need pathology-based data sets, annotated with clinical information and linked to archived tissues, to discover and validate new diagnostic tests and therapies. Pathology records can be used for research purposes (without obtaining informed patient consent for each use of each record), provided the data are rendered harmless. Large data sets can be made harmless through 3 computational steps: (1) deidentification, the removal or modification of data fields that can be used to identify a patient (name, social security number, etc); (2) rendering the data ambiguous, ensuring that every data record in a public data set has a nonunique set of characterizing data; and (3) data scrubbing, the removal or transformation of words in free text that can be used to identify persons or that contain information that is incriminating or otherwise private. This article addresses the problem of data scrubbing. To design and implement a general algorithm that scrubs pathology free text, removing all identifying or private information. The Concept-Match algorithm steps through confidential text. When a medical term matching a standard nomenclature term is encountered, the term is replaced by a nomenclature code and a synonym for the original term. When a high-frequency "stop" word, such as a, an, the, or for, is encountered, it is left in place. When any other word is encountered, it is blocked and replaced by asterisks. This produces a scrubbed text. An open-source implementation of the algorithm is freely available. The Concept-Match scrub method transformed pathology free text into scrubbed output that preserved the sense of the original sentences, while it blocked terms that did not match terms found in the Unified Medical Language System (UMLS). The scrubbed product is safe, in the restricted sense that the output retains only standard medical terms. The software implementation scrubbed more than half a million surgical pathology report phrases in less than an hour. Computerized scrubbing can render the textual portion of a pathology report harmless for research purposes. Scrubbing and deidentification methods allow pathologists to create and use large pathology databases to conduct medical research.
AbstractList	In the normal course of activity, pathologists create and archive immense data sets of scientifically valuable information. Researchers need pathology-based data sets, annotated with clinical information and linked to archived tissues, to discover and validate new diagnostic tests and therapies. Pathology records can be used for research purposes (without obtaining informed patient consent for each use of each record), provided the data are rendered harmless. Large data sets can be made harmless through 3 computational steps: (1) deidentification, the removal or modification of data fields that can be used to identify a patient (name, social security number, etc); (2) rendering the data ambiguous, ensuring that every data record in a public data set has a nonunique set of characterizing data; and (3) data scrubbing, the removal or transformation of words in free text that can be used to identify persons or that contain information that is incriminating or otherwise private. This article addresses the problem of data scrubbing.CONTEXTIn the normal course of activity, pathologists create and archive immense data sets of scientifically valuable information. Researchers need pathology-based data sets, annotated with clinical information and linked to archived tissues, to discover and validate new diagnostic tests and therapies. Pathology records can be used for research purposes (without obtaining informed patient consent for each use of each record), provided the data are rendered harmless. Large data sets can be made harmless through 3 computational steps: (1) deidentification, the removal or modification of data fields that can be used to identify a patient (name, social security number, etc); (2) rendering the data ambiguous, ensuring that every data record in a public data set has a nonunique set of characterizing data; and (3) data scrubbing, the removal or transformation of words in free text that can be used to identify persons or that contain information that is incriminating or otherwise private. This article addresses the problem of data scrubbing.To design and implement a general algorithm that scrubs pathology free text, removing all identifying or private information.OBJECTIVETo design and implement a general algorithm that scrubs pathology free text, removing all identifying or private information.The Concept-Match algorithm steps through confidential text. When a medical term matching a standard nomenclature term is encountered, the term is replaced by a nomenclature code and a synonym for the original term. When a high-frequency "stop" word, such as a, an, the, or for, is encountered, it is left in place. When any other word is encountered, it is blocked and replaced by asterisks. This produces a scrubbed text. An open-source implementation of the algorithm is freely available.METHODSThe Concept-Match algorithm steps through confidential text. When a medical term matching a standard nomenclature term is encountered, the term is replaced by a nomenclature code and a synonym for the original term. When a high-frequency "stop" word, such as a, an, the, or for, is encountered, it is left in place. When any other word is encountered, it is blocked and replaced by asterisks. This produces a scrubbed text. An open-source implementation of the algorithm is freely available.The Concept-Match scrub method transformed pathology free text into scrubbed output that preserved the sense of the original sentences, while it blocked terms that did not match terms found in the Unified Medical Language System (UMLS). The scrubbed product is safe, in the restricted sense that the output retains only standard medical terms. The software implementation scrubbed more than half a million surgical pathology report phrases in less than an hour.RESULTSThe Concept-Match scrub method transformed pathology free text into scrubbed output that preserved the sense of the original sentences, while it blocked terms that did not match terms found in the Unified Medical Language System (UMLS). The scrubbed product is safe, in the restricted sense that the output retains only standard medical terms. The software implementation scrubbed more than half a million surgical pathology report phrases in less than an hour.Computerized scrubbing can render the textual portion of a pathology report harmless for research purposes. Scrubbing and deidentification methods allow pathologists to create and use large pathology databases to conduct medical research.CONCLUSIONSComputerized scrubbing can render the textual portion of a pathology report harmless for research purposes. Scrubbing and deidentification methods allow pathologists to create and use large pathology databases to conduct medical research. In the normal course of activity, pathologists create and archive immense data sets of scientifically valuable information. Researchers need pathology-based data sets, annotated with clinical information and linked to archived tissues, to discover and validate new diagnostic tests and therapies. Pathology records can be used for research purposes (without obtaining informed patient consent for each use of each record), provided the data are rendered harmless. Large data sets can be made harmless through 3 computational steps: (1) deidentification, the removal or modification of data fields that can be used to identify a patient (name, social security number, etc); (2) rendering the data ambiguous, ensuring that every data record in a public data set has a nonunique set of characterizing data; and (3) data scrubbing, the removal or transformation of words in free text that can be used to identify persons or that contain information that is incriminating or otherwise private. This article addresses the problem of data scrubbing. To design and implement a general algorithm that scrubs pathology free text, removing all identifying or private information. The Concept-Match algorithm steps through confidential text. When a medical term matching a standard nomenclature term is encountered, the term is replaced by a nomenclature code and a synonym for the original term. When a high-frequency "stop" word, such as a, an, the, or for, is encountered, it is left in place. When any other word is encountered, it is blocked and replaced by asterisks. This produces a scrubbed text. An open-source implementation of the algorithm is freely available. The Concept-Match scrub method transformed pathology free text into scrubbed output that preserved the sense of the original sentences, while it blocked terms that did not match terms found in the Unified Medical Language System (UMLS). The scrubbed product is safe, in the restricted sense that the output retains only standard medical terms. The software implementation scrubbed more than half a million surgical pathology report phrases in less than an hour. Computerized scrubbing can render the textual portion of a pathology report harmless for research purposes. Scrubbing and deidentification methods allow pathologists to create and use large pathology databases to conduct medical research. In the normal course of activity, pathologists create and archive immense data sets of scientifically valuable information. Researchers need pathology-based data sets, annotated with clinical information and linked to archived tissues, to discover and validate new diagnostic tests and therapies. Pathology records can be used for research purposes (without obtaining informed patient consent for each use of each record), provided the data are rendered harmless. Large data sets can be made harmless through 3 computational steps: (1) deidentification, the removal or modification of data fields that can be used to identify a patient (name, social security number, etc); (2) rendering the data ambiguous, ensuring that every data record in a public data set has a nonunique set of characterizing data; and (3) data scrubbing, the removal or transformation of words in free text that can be used to identify persons or that contain information that is incriminating or otherwise private. This article addresses the problem of data scrubbing. To design and implement a general algorithm that scrubs pathology free text, removing all identifying or private information. The Concept-Match algorithm steps through confidential text. When a medical term matching a standard nomenclature term is encountered, the term is replaced by a nomenclature code and a synonym for the original term. When a high-frequency "stop" word, such as a, an, the, or for, is encountered, it is left in place. When any other word is encountered, it is blocked and replaced by asterisks. This produces a scrubbed text. An open-source implementation of the algorithm is freely available. The Concept-Match scrub method transformed pathology free text into scrubbed output that preserved the sense of the original sentences, while it blocked terms that did not match terms found in the Unified Medical Language System (UMLS). The scrubbed product is safe, in the restricted sense that the output retains only standard medical terms. The software implementation scrubbed more than half a million surgical pathology report phrases in less than an hour. Computerized scrubbing can render the textual portion of a pathology report harmless for research purposes. Scrubbing and deidentification methods allow pathologists to create and use large pathology databases to conduct medical research.
Author	Berman, Jules J
Author_xml	– sequence: 1 givenname: Jules J surname: Berman fullname: Berman, Jules J email: bermanj@mail.nih.gov organization: Pathology Informatics, Cancer Diagnosis Program, National Cancer Institute, National Institutes of Health, Rockville, Md 20892, USA. bermanj@mail.nih.gov
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/12741890$$D View this record in MEDLINE/PubMed
BookMark	eNpdkEtPwzAQhC1URB_wA7ggiwM3F7_jHFF5FKkIJOAc2Y5pUyV2iB1B_z2pKBcuu1rNt6PRTMHIB-8AOCd4LpRQ1xRjhgjNkFQYLZ5uX4_AhAjOECVSjMAE7_U8V2IMpjFuhzOnlJyA8fDDicrxBLwsgreuTajRyW5g48rK6hqWOmkYbdcbU_n1HC7DF2x12oQ6rHcwue8ErfbQONhHV8LKw85Fpzu7OQXHH7qO7uywZ-D9_u5tsUSr54fHxc0KtZTxhKRUipVY2WEQyyQVnBjDBcs45yUVhBpuJeMcmwEkmSwJ1c7kVufGEILZDFz9-rZd-OxdTEVTRevqWnsX-lhkjEpC2B68_AduQ9_5IVtBCcVSZWIPXRyg3gwdFG1XNbrbFX9FsR984Gnw
CODEN	APLMAS
ContentType	Journal Article
Copyright	Copyright College of American Pathologists Jun 2003
Copyright_xml	– notice: Copyright College of American Pathologists Jun 2003
DBID	CGR CUY CVF ECM EIF NPM 3V. 4T- 4U- 7RV 7X7 7XB 88E 88I 8AF 8AO 8C1 8FE 8FH 8FI 8FJ 8FK ABUWG AFKRA AZQEC BBNVY BENPR BHPHI CCPQU DWQXO FYUFA GHDGH GNUQQ HCIFZ K9. KB0 LK8 M0S M1P M2P M7P NAPCQ PHGZM PHGZT PJZUB PKEHL PPXIY PQEST PQGLB PQQKQ PQUKI PRINS Q9U 7X8
DOI	10.5858/2003-127-680-CMDS
DatabaseName	Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed ProQuest Central (Corporate) Docstoc University Readers Nursing & Allied Health Database Health & Medical Collection ProQuest Central (purchase pre-March 2016) Medical Database (Alumni Edition) Science Database (Alumni Edition) STEM Database ProQuest Pharma Collection Public Health Database ProQuest SciTech Collection ProQuest Natural Science Journals ProQuest Hospital Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest Central ProQuest Central Essentials Biological Science Collection ProQuest Central Natural Science Collection ProQuest One Community College ProQuest Central Korea Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Central Student SciTech Premium Collection ProQuest Health & Medical Complete (Alumni) Nursing & Allied Health Database (Alumni Edition) ProQuest Biological Science Collection Health & Medical Collection (Alumni Edition) Medical Database Science Database Biological Science Database Nursing & Allied Health Premium ProQuest Central Premium ProQuest One Academic ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China ProQuest Central Basic MEDLINE - Academic
DatabaseTitle	MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) University Readers ProQuest Central Student ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest Health & Medical Complete (Alumni) ProQuest AP Science ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest One Health & Nursing ProQuest Natural Science Collection ProQuest Pharma Collection ProQuest Central China ProQuest Central ProQuest One Applied & Life Sciences ProQuest Health & Medical Research Collection Health Research Premium Collection Health and Medicine Complete (Alumni Edition) Natural Science Collection ProQuest Central Korea Health & Medical Research Collection Biological Science Collection ProQuest Central (New) ProQuest Medical Library (Alumni) ProQuest Public Health ProQuest Science Journals (Alumni Edition) ProQuest Biological Science Collection ProQuest Central Basic ProQuest Science Journals ProQuest One Academic Eastern Edition ProQuest Nursing & Allied Health Source ProQuest Hospital Collection Health Research Premium Collection (Alumni) Biological Science Database ProQuest SciTech Collection ProQuest Hospital Collection (Alumni) Nursing & Allied Health Premium ProQuest Health & Medical Complete ProQuest Medical Library ProQuest One Academic UKI Edition Docstoc ProQuest Nursing & Allied Health Source (Alumni) ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) MEDLINE - Academic
DatabaseTitleList	MEDLINE - Academic MEDLINE University Readers
Database_xml	– sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: BENPR name: ProQuest Central - New (Subscription) url: http://www.proquest.com/pqcentral?accountid=15518 sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine
EISSN	1543-2165
ExternalDocumentID	344291071 12741890
Genre	Journal Article
GroupedDBID	--- -~X .55 .GJ 04C 1CY 23N 2WC 36B 3O- 3V. 53G 5GY 5RE 6PF 7RV 7X7 88E 88I 8AF 8AO 8C1 8FE 8FH 8FI 8FJ 8R4 8R5 AAQOH AAQQT AAWTL ABCQX ABDBF ABOCM ABUWG ACGFO ACGOD ACPRK ACUHS ADBBV ADOJX AENEX AFFNX AFKRA AHMBA ALIPV ALMA_UNASSIGNED_HOLDINGS AZQEC B0M BAWUL BBNVY BENPR BHPHI BKEYQ BKOMP BMSDO BPHCQ BVXVI C1A CCPQU CGR CUY CVF DIK DWQXO E3Z EAP EAS EBC EBD EBS ECF ECM ECT ECV EHN EIF EIHBH EJD EMB EMK EMOBN ENC EPL EPT ESX EX3 F5P FAC FAL FJD FJW FRP FYUFA GNUQQ GX1 HCIFZ HMCUK IAO IEA IHR IHW INH INR IOF ITC J5H L7B LK8 M1P M2P M2Q M7P NAPCQ NPM OK1 P2P PCD PKN PQQKQ PROAC PSQYO PV9 Q2X Q~Q RWL RXW RZL SV3 TAE TAF TR2 TUS TWZ UDS UKHRP W2D WH7 WOW WQ9 X6Y X7M Y3D YQJ ZGI ZXP ~8M 4T- 4U- 7XB 8FK K9. PHGZM PHGZT PJZUB PKEHL PPXIY PQEST PQGLB PQUKI PRINS Q9U 7X8 PUEGO
ID	FETCH-LOGICAL-p234t-66883d08c3d01c362541bb4537444d2512b4c63440b883176d12aeb9ca9bb1103
IEDL.DBID	BENPR
ISSN	0003-9985 1543-2165
IngestDate	Thu Sep 04 19:16:08 EDT 2025 Mon Oct 06 18:33:51 EDT 2025 Wed Feb 19 01:30:55 EST 2025
IsPeerReviewed	true
IsScholarly	true
Issue	6
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-p234t-66883d08c3d01c362541bb4537444d2512b4c63440b883176d12aeb9ca9bb1103
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
PMID	12741890
PQID	212068750
PQPubID	42082
ParticipantIDs	proquest_miscellaneous_73261130 proquest_journals_212068750 pubmed_primary_12741890
PublicationCentury	2000
PublicationDate	2003-Jun 20030601
PublicationDateYYYYMMDD	2003-06-01
PublicationDate_xml	– month: 06 year: 2003 text: 2003-Jun
PublicationDecade	2000
PublicationPlace	United States
PublicationPlace_xml	– name: United States – name: Northfield
PublicationTitle	Archives of pathology & laboratory medicine (1976)
PublicationTitleAlternate	Arch Pathol Lab Med
PublicationYear	2003
Publisher	College of American Pathologists
Publisher_xml	– name: College of American Pathologists
SSID	ssj0009221
Score	1.9287131
Snippet	In the normal course of activity, pathologists create and archive immense data sets of scientifically valuable information. Researchers need pathology-based...
SourceID	proquest pubmed
SourceType	Aggregation Database Index Database
StartPage	680
SubjectTerms	Algorithms Archives & records Computing Methodologies Database Management Systems - classification Database Management Systems - instrumentation Database Management Systems - supply & distribution Databases, Factual - classification Databases, Factual - supply & distribution Datasets Diagnostic tests Humans Kidney cancer Medical Record Linkage - instrumentation Medical Record Linkage - methods Medical Records Systems, Computerized - classification Medical Records Systems, Computerized - instrumentation Medical Records Systems, Computerized - supply & distribution Medical Records, Problem-Oriented Medical research Pathology Pathology, Clinical - organization & administration Patients Subject Headings Unified Medical Language System - classification Unified Medical Language System - instrumentation Unified Medical Language System - supply & distribution
Title	Concept-match medical data scrubbing. How pathology text can be used in research
URI	https://www.ncbi.nlm.nih.gov/pubmed/12741890 https://www.proquest.com/docview/212068750 https://www.proquest.com/docview/73261130
Volume	127
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVEBS databaseName: EBSCOhost Academic Search Ultimate customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn eissn: 1543-2165 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0009221 issn: 0003-9985 databaseCode: ABDBF dateStart: 20030101 isFulltext: true titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn providerName: EBSCOhost – providerCode: PRVBFR databaseName: Free Medical Journals customDbUrl: eissn: 1543-2165 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0009221 issn: 0003-9985 databaseCode: DIK dateStart: 19990101 isFulltext: true titleUrlDefault: http://www.freemedicaljournals.com providerName: Flying Publisher – providerCode: PRVFQY databaseName: GFMER Free Medical Journals customDbUrl: eissn: 1543-2165 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0009221 issn: 0003-9985 databaseCode: GX1 dateStart: 0 isFulltext: true titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php providerName: Geneva Foundation for Medical Education and Research – providerCode: PRVPQU databaseName: Health & Medical Collection customDbUrl: eissn: 1543-2165 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0009221 issn: 0003-9985 databaseCode: 7X7 dateStart: 19960301 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central - New (Subscription) customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 1543-2165 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0009221 issn: 0003-9985 databaseCode: BENPR dateStart: 19960301 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: Public Health Database customDbUrl: eissn: 1543-2165 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0009221 issn: 0003-9985 databaseCode: 8C1 dateStart: 19960301 isFulltext: true titleUrlDefault: https://search.proquest.com/publichealth providerName: ProQuest
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LS8NAEB76APEivq3VugevS_eVlyCitaUILUUt9BayycZ6MKm2xb_vbJLWk172ksdhZpn5dr-Z-QCuU6aiVPgp5SbVVPE4pYElHI3UiK49BNzGMrqjsTucqqeZM6vBaNMLY8sqNzGxCNRJHts78i6GWOYiuGZ3i09qRaMsubpR0IgqZYXktpgwVoemsIOxGtB86I8nz79TeIXYSujhOcMpaU5EzH63LNISHnV9Rnujx5e_IWeRegb7sFdhRnJfOvkAaiY7hJ1RxYofwaRX9h5SRJ_xnHyU3AuxxZ9kaSVo8PT7dkOG-TexAsTFRTqxFR8E7Uq0IeulSch7RqrJP_NjmA76r70hrZQS6EJItaKu6_syYX6MC48xJzmKa60c6SmlEgthtIpdqRTT-CL33ISLyOggjgKtEQDIE2hkeWbOgCCMlTLG-Gn7RjAeRDJxtImkYgl-KoIWtDdmCavtvgy3zmnB1fYp7lNLPkSZydfL0EOcyDFhtuC0tGW4KMdphLyYoBOw83__3IZdUSofUsYvoLH6WptLxAMr3YG6N_Nw9Xu8U3n8B9vJswo
linkProvider	ProQuest
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV25TsNAEB0FkIAGcRPCsQWUK_byhYQQJKAASYQ4JDrjtddAgRNwIsTH8W_M2k6ooKNxY3uLPWbe7JuZB7CXMhWlwk8pN6mmiscpDSzhaKRGdO0h4DaW0e323Pa9unxwHmrwNa6FsWmVY5tYGOqkH9s78gM0scxFcM2OB2_UikZZcnWsoBFVygrJUdFhrKrruDKfHxjB5UcXLVzufSHOz-6abVqJDNCBkGpIXdf3ZcL8GB88RnPuKK61cqSnlEqs99cqdqVSTOOH3HMTLiKjgzgKtEbfKXHcKZhRUgUY-82cnvWub366_goxkezDuMYpaVVE6P5BmRQmPOr6jDa7rdvfIW7h6s4XYaHCqOSk3FRLUDPZMsx2KxZ-Ba6bZa0jRbQbP5PXkushNtmU5FbyBqPtp0PS7n8QK3hcXNwTm2FCcB2JNmSUm4S8ZKTqNPS8Cvf_MmlrMJ31M7MBBGGzlDHaa1ungvYnkomjTSQVS_BXEdShMZ6WsDpeeTjZDHXYnbzFc2HJjigz_VEeeohLOTroOqyXcxkOyvYdIS869gRs88-Rd2GufdfthJ2L3lUD5kWpukgZ34Lp4fvIbCMWGeqdasUJPP73JvsGGivrrA
linkToPdf	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3JTsMwEB2xSIgLYqesPsDRqrdsSAihlqosRUiA1FuIEwc4kBbSCvFp_B3jOC0nuHHJJYkP4_HMs994HsBhzlSSizCn3OSaKp7mNLKEo5Ea0XWAgNtYRrd343cf1GXf68_A1-QujC2rnMTEKlBng9SekTcxxDIfwTVr5nVVxG27czp8o1ZAyhKtEzUN5yFX5vMDd2_lyUUbp_pIiM75fatLa4EBOhRSjajvh6HMWJjig6cYyj3FtVaeDJRSmc38WqW-VIpp_JAHfsZFYnSUJpHWmDcljjsL84GUka0mDPrBT79fIaZifbij8Ryhitg8bLpyMBFQP2S01Wvf_Q5uqyTXWYalGp2SM-dOKzBjilVY6NX8-xrcttwtR4o4N30mr47lIbbMlJRW7Ab32U_HpDv4IFbquDqyJ9aKBGeQaEPGpcnIS0HqHkPP6_DwLybbgLliUJgtIAiYpUwxUtsbKhh5Epl52iRSsQx_FVEDdiZmieuFVcZTN2jAwfQtrghLcySFGYzLOEBEyjE1N2DT2TIeusYdMa969URs-8-RD2ABXSu-vri52oFF4eQWKeO7MDd6H5s9BCEjvV9NN4HH__avb4736UY
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Concept-match+medical+data+scrubbing.+How+pathology+text+can+be+used+in+research&rft.jtitle=Archives+of+pathology+%26+laboratory+medicine+%281976%29&rft.au=Berman%2C+Jules+J&rft.date=2003-06-01&rft.issn=1543-2165&rft.eissn=1543-2165&rft.volume=127&rft.issue=6&rft.spage=680&rft_id=info:doi/10.5858%2F2003-127-680-CMDS&rft.externalDBID=NO_FULL_TEXT
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0003-9985&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0003-9985&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0003-9985&client=summon