Invited Commentary: The Use of Imperfect Data—Compromise or Compromising?

Automated databases are appealing resources because they contain detailed data that are relatively accessible, but there are also critical gaps in the data available. Researchers may compromise by trying to fill those gaps with proxy variables, but how appropriate these surrogates are is rarely know...

Full description

Saved in:
Bibliographic Details
Published inAmerican journal of epidemiology Vol. 167; no. 6; pp. 641 - 643
Main Author Howards, Penelope P.
Format Journal Article
LanguageEnglish
Published United States Oxford University Press 15.03.2008
Oxford Publishing Limited (England)
Subjects
Online AccessGet full text
ISSN0002-9262
1476-6256
1476-6256
DOI10.1093/aje/kwm368

Cover

Abstract Automated databases are appealing resources because they contain detailed data that are relatively accessible, but there are also critical gaps in the data available. Researchers may compromise by trying to fill those gaps with proxy variables, but how appropriate these surrogates are is rarely known. In this issue (Am J Epidemiol 2008;167:630–640), Toh et al. consider the effect of using two algorithms to estimate the timing of medication use during pregnancy in the absence of gestational-age data. Although the delivery-date algorithm has promising sensitivity and specificity, it is true under very specific conditions that seem unlikely to hold generally. Furthermore, it seems difficult to know a priori when those conditions do hold. There are times when using imperfect data is acceptable, but, at other times, the data are too imperfect to be helpful. Automated databases are certainly valuable, but they should be used with caution and where possible should be linked to databases that can fill the critical gaps.
AbstractList Automated databases are appealing resources because they contain detailed data that are relatively accessible, but there are also critical gaps in the data available. Researchers may compromise by trying to fill those gaps with proxy variables, but how appropriate these surrogates are is rarely known. In this issue (Am J Epidemiol 2008;167:630-640), Toh et al. consider the effect of using two algorithms to estimate the timing of medication use during pregnancy in the absence of gestational-age data. Although the delivery-date algorithm has promising sensitivity and specificity, it is true under very specific conditions that seem unlikely to hold generally. Furthermore, it seems difficult to know a priori when those conditions do hold. There are times when using imperfect data is acceptable, but, at other times, the data are too imperfect to be helpful. Automated databases are certainly valuable, but they should be used with caution and where possible should be linked to databases that can fill the critical gaps.
Automated databases are appealing resources because they contain detailed data that are relatively accessible, but there are also critical gaps in the data available. Researchers may compromise by trying to fill those gaps with proxy variables, but how appropriate these surrogates are is rarely known. In this issue (Am J Epidemiol 2008;167:630-640), Toh et al. consider the effect of using two algorithms to estimate the timing of medication use during pregnancy in the absence of gestational-age data. Although the delivery-date algorithm has promising sensitivity and specificity, it is true under very specific conditions that seem unlikely to hold generally. Furthermore, it seems difficult to know a priori when those conditions do hold. There are times when using imperfect data is acceptable, but, at other times, the data are too imperfect to be helpful. Automated databases are certainly valuable, but they should be used with caution and where possible should be linked to databases that can fill the critical gaps. [PUBLICATION ABSTRACT]
Automated databases are appealing resources because they contain detailed data that are relatively accessible, but there are also critical gaps in the data available. Researchers may compromise by trying to fill those gaps with proxy variables, but how appropriate these surrogates are is rarely known. In this issue (Am J Epidemiol 2008;167:630-640), Toh et al. consider the effect of using two algorithms to estimate the timing of medication use during pregnancy in the absence of gestational-age data. Although the delivery-date algorithm has promising sensitivity and specificity, it is true under very specific conditions that seem unlikely to hold generally. Furthermore, it seems difficult to know a priori when those conditions do hold. There are times when using imperfect data is acceptable, but, at other times, the data are too imperfect to be helpful. Automated databases are certainly valuable, but they should be used with caution and where possible should be linked to databases that can fill the critical gaps.Automated databases are appealing resources because they contain detailed data that are relatively accessible, but there are also critical gaps in the data available. Researchers may compromise by trying to fill those gaps with proxy variables, but how appropriate these surrogates are is rarely known. In this issue (Am J Epidemiol 2008;167:630-640), Toh et al. consider the effect of using two algorithms to estimate the timing of medication use during pregnancy in the absence of gestational-age data. Although the delivery-date algorithm has promising sensitivity and specificity, it is true under very specific conditions that seem unlikely to hold generally. Furthermore, it seems difficult to know a priori when those conditions do hold. There are times when using imperfect data is acceptable, but, at other times, the data are too imperfect to be helpful. Automated databases are certainly valuable, but they should be used with caution and where possible should be linked to databases that can fill the critical gaps.
Author Howards, Penelope P.
Author_xml – sequence: 1
  givenname: Penelope P.
  surname: Howards
  fullname: Howards, Penelope P.
  email: penelope.howards@emory.edu
BackLink https://www.ncbi.nlm.nih.gov/pubmed/18194998$$D View this record in MEDLINE/PubMed
BookMark eNp90ctKxDAUBuAgio6XjQ8gRdCFUD25tE3ciNTLDIoiKIqbkGkT7Thtx6T1svMhfEKfxAz1Ai5chUM-zjmcfxHNVnWlEVrFsI1B0B010jsPzyWN-QzqYZbEYUyieBb1AICEgsRkAS06NwLAWEQwjxYwx4IJwXvoZFA9FY3Og7QuS101yr7uBpf3OrhyOqhNMCgn2hqdNcGBatTH27t3E1uXxfTbBj9VUd3tLaM5o8ZOr3y9S-jq6PAy7Yen58eDdP80zCgVTSgixQlLSC4IYUPFcjCckIyqRFHCzFBwhbkwGeUxy7UywigFMOR5IjjDEdAltNn19aMfW-0a6RfI9HisKl23TiZABeNAPFz_A0d1ayu_myQ0EpQAUI_WvlA7LHUuJ7Yo_RXk94082OpAZmvnrDa_BOQ0AOkDkF0AHm90uG4n_7uwc4Vr9MuPVPZBxglNItm_uZXiOuVnKQN5QT8BGiOT_g
ContentType Journal Article
Copyright American Journal of Epidemiology © The Author 2008. Published by the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org. 2008
American Journal of Epidemiology © The Author 2008. Published by the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org.
Copyright_xml – notice: American Journal of Epidemiology © The Author 2008. Published by the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org. 2008
– notice: American Journal of Epidemiology © The Author 2008. Published by the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org.
DBID BSCLL
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7QP
7T2
7TK
7U7
7U9
C1K
H94
K9.
NAPCQ
7X8
DOI 10.1093/aje/kwm368
DatabaseName Istex
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Calcium & Calcified Tissue Abstracts
Health and Safety Science Abstracts (Full archive)
Neurosciences Abstracts
Toxicology Abstracts
Virology and AIDS Abstracts
Environmental Sciences and Pollution Management
AIDS and Cancer Research Abstracts
ProQuest Health & Medical Complete (Alumni)
Nursing & Allied Health Premium
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Nursing & Allied Health Premium
Virology and AIDS Abstracts
Toxicology Abstracts
AIDS and Cancer Research Abstracts
ProQuest Health & Medical Complete (Alumni)
Health & Safety Science Abstracts
Calcium & Calcified Tissue Abstracts
Neurosciences Abstracts
Environmental Sciences and Pollution Management
MEDLINE - Academic
DatabaseTitleList MEDLINE


Nursing & Allied Health Premium
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Public Health
EISSN 1476-6256
EndPage 643
ExternalDocumentID 1450216651
18194998
10_1093_aje_kwm368
10.1093/aje/kwm368
ark_67375_HXZ_9WC8NC40_Q
Genre Comment
Journal Article
Commentary
GroupedDBID ---
-DZ
-E4
-~X
..I
.2P
.I3
.XZ
.ZR
0R~
186
1TH
23M
2WC
354
4.4
482
48X
53G
5GY
5RE
5VS
5WA
5WD
6J9
70D
85S
8F7
AABZA
AACZT
AAILS
AAJKP
AAJQQ
AAMVS
AAOGV
AAPNW
AAPQZ
AAPXW
AARHZ
AAUAY
AAUQX
AAVAP
AAWTL
ABDFA
ABEJV
ABEUO
ABGNP
ABIXL
ABJNI
ABKDP
ABLJU
ABNGD
ABNHQ
ABNKS
ABOCM
ABPTD
ABQLI
ABVGC
ABXVV
ABZBJ
ACGFO
ACGFS
ACGOD
ACPRK
ACUFI
ACUKT
ACUTJ
ACUTO
ADBBV
ADCFL
ADEYI
ADEZT
ADGHP
ADGZP
ADHKW
ADHZD
ADIPN
ADMHG
ADNBA
ADOCK
ADQBN
ADRTK
ADVEK
ADYVW
ADZXQ
AEGPL
AEHKS
AEJOX
AEKSI
AEMDU
AENEX
AENZO
AEPUE
AETBJ
AEWNT
AFFNX
AFFZL
AFIYH
AFOFC
AFRAH
AGINJ
AGKEF
AGQPQ
AGSYK
AHMBA
AHMMS
AHXPO
AI.
AIAGR
AIJHB
AJBYB
AJEEA
AJNCP
ALMA_UNASSIGNED_HOLDINGS
ALUQC
ALXQX
APIBT
APWMN
ASPBG
ATGXG
AVWKF
AXUDD
AZFZN
BAWUL
BAYMD
BCRHZ
BEYMZ
BHONS
BSCLL
BTRTY
BVRKM
C1A
C45
CAG
CDBKE
COF
CS3
CZ4
DAKXR
DIK
DILTD
D~K
E3Z
EBS
EE~
EJD
EMOBN
F5P
F9B
FEDTE
FLUFQ
FOEOM
FOTVD
FQBLK
GAUVT
GJXCC
GX1
H13
H5~
HAR
HVGLF
HW0
HZ~
IH2
IOX
J21
JXSIZ
KAQDR
KBUDW
KOP
KQ8
KSI
KSN
L7B
M-Z
ML0
N9A
NEJ
NGC
NOMLY
NOYVH
NTWIH
NVLIB
O0~
O9-
OAWHX
OCZFY
ODMLO
OHH
OHT
OJQWA
OJZSN
OK1
OPAEJ
OVD
OWPYF
P2P
P6G
PAFKI
PB-
PEELM
PQQKQ
Q1.
Q5Y
R44
RD5
ROL
ROX
ROZ
RUSNO
RW1
RXO
TCURE
TEORI
TJX
TR2
UHB
UPT
VH1
W8F
WOQ
X7H
YAYTL
YF5
YKOAZ
YOC
YROCO
YSK
YXANX
Z0Y
ZKX
~91
AASNB
ABQTQ
ADJQC
ADRIX
AFXEN
M49
NU-
RHF
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7QP
7T2
7TK
7U7
7U9
C1K
H94
K9.
NAPCQ
7X8
ID FETCH-LOGICAL-c339t-95a82472d9224ba4d0f822c3a7a324fb98a189fc3864deaf9faa00b8d79841503
ISSN 0002-9262
1476-6256
IngestDate Sun Sep 28 00:06:17 EDT 2025
Mon Oct 06 18:03:19 EDT 2025
Thu Apr 03 06:56:55 EDT 2025
Wed Oct 01 04:14:20 EDT 2025
Wed Sep 11 04:51:00 EDT 2024
Sat Sep 20 11:02:03 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 6
Keywords medical records systems, computerized
bias (epidemiology)
pregnancy outcome
gestational age
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c339t-95a82472d9224ba4d0f822c3a7a324fb98a189fc3864deaf9faa00b8d79841503
Notes ark:/67375/HXZ-9WC8NC40-Q
istex:1F61178753803D803D6175F8F12EF73127CDA985
SourceType-Scholarly Journals-1
ObjectType-Commentary-1
content type line 14
ObjectType-Article-2
ObjectType-Feature-3
content type line 23
PMID 18194998
PQID 235932003
PQPubID 41038
PageCount 3
ParticipantIDs proquest_miscellaneous_70394802
proquest_journals_235932003
pubmed_primary_18194998
crossref_primary_10_1093_aje_kwm368
oup_primary_10_1093_aje_kwm368
istex_primary_ark_67375_HXZ_9WC8NC40_Q
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2008-03-15
PublicationDateYYYYMMDD 2008-03-15
PublicationDate_xml – month: 03
  year: 2008
  text: 2008-03-15
  day: 15
PublicationDecade 2000
PublicationPlace United States
PublicationPlace_xml – name: United States
– name: Oxford
PublicationTitle American journal of epidemiology
PublicationTitleAlternate Am J Epidemiol
PublicationYear 2008
Publisher Oxford University Press
Oxford Publishing Limited (England)
Publisher_xml – name: Oxford University Press
– name: Oxford Publishing Limited (England)
References 18194999 - Am J Epidemiol. 2008 Mar 15;167(6):633-40
References_xml – reference: 18194999 - Am J Epidemiol. 2008 Mar 15;167(6):633-40
SSID ssj0011950
Score 1.8768816
Snippet Automated databases are appealing resources because they contain detailed data that are relatively accessible, but there are also critical gaps in the data...
SourceID proquest
pubmed
crossref
oup
istex
SourceType Aggregation Database
Index Database
Publisher
StartPage 641
SubjectTerms Algorithms
Automation
Bias
bias (epidemiology)
Clinical outcomes
computerized
Data Collection
Drug-Related Side Effects and Adverse Reactions
Epidemiology
Female
Gestational Age
Humans
Medical records
medical records systems
Online data bases
Perinatal Care
Pregnancy
pregnancy outcome
Time Factors
Title Invited Commentary: The Use of Imperfect Data—Compromise or Compromising?
URI https://api.istex.fr/ark:/67375/HXZ-9WC8NC40-Q/fulltext.pdf
https://www.ncbi.nlm.nih.gov/pubmed/18194998
https://www.proquest.com/docview/235932003
https://www.proquest.com/docview/70394802
Volume 167
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAFT
  databaseName: Colorado Digital library
  customDbUrl:
  eissn: 1476-6256
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0011950
  issn: 0002-9262
  databaseCode: KQ8
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 1476-6256
  dateEnd: 20241101
  omitProxy: true
  ssIdentifier: ssj0011950
  issn: 0002-9262
  databaseCode: DIK
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVFQY
  databaseName: GFMER Free Medical Journals
  customDbUrl:
  eissn: 1476-6256
  dateEnd: 20241101
  omitProxy: true
  ssIdentifier: ssj0011950
  issn: 0002-9262
  databaseCode: GX1
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php
  providerName: Geneva Foundation for Medical Education and Research
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1baxQxFA7agggiWm9jtQYUXyTtzCQzk_gisihroQWhxcWXkMwkUGtnlumut1_vSTKX3VZFfRl2J7sJnC9z8iXznXMQepawqmRVVhFjUkqY0ZyokubEVklitHBtLhr54DCfHrP9WTYbq7356JKF3i1__DKu5H9QhXuAq4uS_Qdkh07hBnwGfOEKCMP1rzB-V3_xjBH69hrw9nsv01iGI_oTIMWtT0_slKCEOAF52wC0rpzTi-GbD3peE_gNL3JWMkuYsZjscA4fVLch2T94zc_N3HQRY_1Jgg-tC7GUuyZ4P1bkBDZE-Zp7DOUyunmw6uzykLLqkhMOCarUJwPX069nNNTNWcFjfuYBAXLhUuPwcSkaBIJ901W0mYK79jU5ZoN2x9eu7XPNCroHQ-2FgVwO2O6va0Rj0z0z3y4EMV7aS3hOcXQL3ew2A_h1QPY2umLqLXTtoJM7bKEb4VAVh1ixO2i_AxyPgL_EADcGuHFj8QA3vgg3blq8Cveru-j47ZujyZR0xTBISalYEJEpnrIirQSQLq1YFVvgdiVVhQJObLXgKuHClpTnrDLKCqtUHGteFYIDSYvpPbRRN7V5gHCcaGNKa0WZUVaUuTA2TizVjnrrTNkIPe0tJ-ch54kMWgUqwdQymDpCz71Rh5-o9tSpBItMTmcfpfgw4YcTFsv3EdoBq_-xp-0eENnN6nOZ0gy2FrDmROjJ0Aomcu-zVG2a5bmExUowHqcRuh9QHMfo5sDD37Zso-vjI_AIbSzapXkM9HKhd_xU-wmHTXxS
linkProvider Geneva Foundation for Medical Education and Research
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Invited+commentary%3A+the+use+of+imperfect+data--compromise+or+compromising%3F&rft.jtitle=American+journal+of+epidemiology&rft.au=Howards%2C+Penelope+P&rft.date=2008-03-15&rft.eissn=1476-6256&rft.volume=167&rft.issue=6&rft.spage=641&rft_id=info:doi/10.1093%2Faje%2Fkwm368&rft_id=info%3Apmid%2F18194998&rft.externalDocID=18194998
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0002-9262&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0002-9262&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0002-9262&client=summon