The repeated adjustment of measurement protocols method for developing high-validity text classifiers

The development and evaluation of text classifiers in psychology depends on rigorous manual coding. Yet, the evaluation of manual coding and computational algorithms is usually considered separately. This is problematic because developing high-validity classifiers is a repeated process of identifyin...

Full description

Saved in:
Bibliographic Details
Published inPsychological methods
Main Authors Goddard, Alex, Gillespie, Alex
Format Journal Article
LanguageEnglish
Published United States 06.10.2025
Online AccessGet full text
ISSN1939-1463
1082-989X
1939-1463
DOI10.1037/met0000787

Cover

Abstract The development and evaluation of text classifiers in psychology depends on rigorous manual coding. Yet, the evaluation of manual coding and computational algorithms is usually considered separately. This is problematic because developing high-validity classifiers is a repeated process of identifying, explaining, and addressing conceptual and measurement issues during both the manual coding and classifier development stages. To address this problem, we introduce the Repeated Adjustment of Measurement Protocols (RAMP) method for developing high-validity text classifiers in psychology. The RAMP method has three stages: manual coding, classifier development, and integrative evaluation. These stages integrate the best practices of content analysis (manual coding), data science (classifier development), and psychology (integrative evaluation). Central to this integration is the concept of an inference loop, defined as the process of maximizing validity through repeated adjustments to concepts and constructs, guided by push-back from the empirical data. Inference loops operate both within each stage of the method and across related studies. We illustrate RAMP through a case study, where we manually coded 21,815 sentences for misunderstanding (Krippendorff's α = .79), and developed a rule-based classifier (Matthews correlation coefficient [MCC] = 0.22), a supervised machine learning classifier (Bidirectional Encoder Representations From Transformers; MCC = 0.69) and a large language model classifier (GPT-4o; MCC = 0.47). By integrating manual coding and classifier development stages, we were able to identify and address a concept validity problem with misunderstandings. RAMP advances existing methods by operationalizing validity as an ongoing dynamic process, where concepts and constructs are repeatedly adjusted toward increasingly widespread intersubjective agreement on their utility. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
AbstractList The development and evaluation of text classifiers in psychology depends on rigorous manual coding. Yet, the evaluation of manual coding and computational algorithms is usually considered separately. This is problematic because developing high-validity classifiers is a repeated process of identifying, explaining, and addressing conceptual and measurement issues during both the manual coding and classifier development stages. To address this problem, we introduce the Repeated Adjustment of Measurement Protocols (RAMP) method for developing high-validity text classifiers in psychology. The RAMP method has three stages: manual coding, classifier development, and integrative evaluation. These stages integrate the best practices of content analysis (manual coding), data science (classifier development), and psychology (integrative evaluation). Central to this integration is the concept of an inference loop, defined as the process of maximizing validity through repeated adjustments to concepts and constructs, guided by push-back from the empirical data. Inference loops operate both within each stage of the method and across related studies. We illustrate RAMP through a case study, where we manually coded 21,815 sentences for misunderstanding (Krippendorff's α = .79), and developed a rule-based classifier (Matthews correlation coefficient [MCC] = 0.22), a supervised machine learning classifier (Bidirectional Encoder Representations From Transformers; MCC = 0.69) and a large language model classifier (GPT-4o; MCC = 0.47). By integrating manual coding and classifier development stages, we were able to identify and address a concept validity problem with misunderstandings. RAMP advances existing methods by operationalizing validity as an ongoing dynamic process, where concepts and constructs are repeatedly adjusted toward increasingly widespread intersubjective agreement on their utility. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
Author Gillespie, Alex
Goddard, Alex
Author_xml – sequence: 1
  givenname: Alex
  orcidid: 0000-0003-1382-2700
  surname: Goddard
  fullname: Goddard, Alex
  organization: Department of Psychological and Behavioural Science, London School of Economics and Political Science
– sequence: 2
  givenname: Alex
  orcidid: 0000-0002-0162-1269
  surname: Gillespie
  fullname: Gillespie, Alex
  organization: Department of Psychological and Behavioural Science, London School of Economics and Political Science
BackLink https://www.ncbi.nlm.nih.gov/pubmed/41051829$$D View this record in MEDLINE/PubMed
BookMark eNpNkMtOwzAQRS1URB-w4QOQ9yhg104cL1HFS6rEpqwj1540rpw4sp1C_55Cec3mzoyO7uJM0ajzHSB0SckNJUzctpDIYUQpTtCESiYzygs2-reP0TTGLSGUs5KfoTGnJKflXE4QrBrAAXpQCQxWZjvE1EKXsK9xCyoOAb7OPvjktXfx8E2NN7j2ARvYgfO97Ta4sZsm2ylnjU17nOA9Ye1UjLa2EOI5Oq2Vi3DxnTP0-nC_Wjxly5fH58XdMtNzTlImC8N5XtJirXPNSyMEo7pU3DDJqdGEalWUZi2KmsvaSEK0NKbQOQhaAJkLNkPXx96h69X-TTlX9cG2KuwrSqpPWdWfrAN9daT7Yd2C-UV_7LAP84dpIw
ContentType Journal Article
DBID NPM
ADTOC
UNPAY
DOI 10.1037/met0000787
DatabaseName PubMed
Unpaywall for CDI: Periodical Content
Unpaywall
DatabaseTitle PubMed
DatabaseTitleList PubMed
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Psychology
EISSN 1939-1463
ExternalDocumentID 10.1037/met0000787
41051829
Genre Journal Article
GroupedDBID ---
--Z
-~X
0R~
123
29P
354
3KI
5VS
7RZ
ABIVO
ABNCP
ABVOZ
ACHQT
ACPQG
AEHFB
ALMA_UNASSIGNED_HOLDINGS
AWKKM
AZXWR
CS3
EPA
F5P
FTD
HVGLF
HZ~
ISO
LW5
NPM
O9-
OPA
OVD
P2P
PHGZT
ROL
SES
SPA
TEORI
TN5
YNT
ZPI
.-4
07C
53G
ADTOC
AETEA
CGNQK
OHT
UHS
UNPAY
XJT
ID FETCH-LOGICAL-c240t-96d445816bc5c48d7731c8a4d3941dc01ca68db76f49fd900c9dd6c5e716e0273
IEDL.DBID UNPAY
ISSN 1939-1463
1082-989X
IngestDate Thu Oct 09 05:44:01 EDT 2025
Fri Oct 10 01:53:24 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Language English
License cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c240t-96d445816bc5c48d7731c8a4d3941dc01ca68db76f49fd900c9dd6c5e716e0273
ORCID 0000-0002-0162-1269
0000-0003-1382-2700
OpenAccessLink https://proxy.k.utb.cz/login?url=https://doi.org/10.1037/met0000787
PMID 41051829
ParticipantIDs unpaywall_primary_10_1037_met0000787
pubmed_primary_41051829
PublicationCentury 2000
PublicationDate 2025-Oct-06
PublicationDateYYYYMMDD 2025-10-06
PublicationDate_xml – month: 10
  year: 2025
  text: 2025-Oct-06
  day: 06
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Psychological methods
PublicationTitleAlternate Psychol Methods
PublicationYear 2025
SSID ssj0014384
Score 2.4693372
SecondaryResourceType online_first
Snippet The development and evaluation of text classifiers in psychology depends on rigorous manual coding. Yet, the evaluation of manual coding and computational...
SourceID unpaywall
pubmed
SourceType Open Access Repository
Index Database
Title The repeated adjustment of measurement protocols method for developing high-validity text classifiers
URI https://www.ncbi.nlm.nih.gov/pubmed/41051829
https://doi.org/10.1037/met0000787
UnpaywallVersion publishedVersion
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LawIxEB6sHtpL3w9LK6F4jey62WxyFKlIQfFQwZ4kmwfYWhXdpdhf31nXZw-l5yQQkknmm8yXbwCq1ufOhhGjLLZ1yqLAUMGdofVAuli7gPMViabT5e0-exmEgwI8bf7CHOTvAwzLbbLKtonoCEo8RLxdhFK_22u85cz5OpVCDvLUsaR47IONBunB4D3_cpxOZmr5pcbjPUfSOoPmZgo5f-SjliZxTX__Umf8e47ncLrGkaSRb_wFFOzkEk6219nyCixaAJnbGV621hBl3tPFilFOpo587l4GSabUMEVzWJC8mjRBGEt2f6lIJmhM0R5HBgE7yYgiRGeQe-SyKtrX0G89vzbbdF1UgWp03gmV3DAWCp_HOtRMmCgKfC0UM4FkvtGerxUXJo64Y9IZ6XlaGsN1aDGwspn4zQ0UJ9OJvQOCSMDiQOms1Bg4RUoz5TGltIgRVghRhtt80YezXDljmHFKMaCRZahud2HbuEqHB9Fwt5z3_-v2AMVkntpHRAdJXIGjbq9TWRvJD0b3uwI
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA61PejF96OiEqTXlN0mm8exFEsRLB4s1FPJ5gHVui3tLlJ_vbO7felBPCeBkEwy32S-fINQw4Xcu0gwwmLXIkxQSyT3lrSo8rHxlPOCRPPU570BexxGwwq6X_-F-ZG_pxCWu7TItkmxh2o8ArxdRbVB_7n9WjLnW0RJNSxTx4rAsadrDdIfg3f8y36WzPTyU08mO46ke4Q66ymU_JH3ZpbGTfP1S53x7zkeo8MVjsTtcuNPUMUlp-hgc50tz5ADC8BzN4PL1lms7Vu2KBjleOrxx_ZlEOdKDVMwhwUuq0ljgLF4-5cK54LGBOxxbAGw45wogk0Oucc-r6J9jgbdh5dOj6yKKhADzjslilvGIhny2ESGSSsEDY3UzFLFQmuC0GgubSy4Z8pbFQRGWctN5CCwcrn4zQWqJtPEXSEMSMDBQOWdMhA4CW2YDpjWRsYAK6Sso8ty0UezUjljlHNKIaBRddTY7MKmsUiHUzHaLuf1_7rdoGo6z9wtoIM0vluZxzfqEbn2
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+repeated+adjustment+of+measurement+protocols+method+for+developing+high-validity+text+classifiers&rft.jtitle=Psychological+methods&rft.issn=1939-1463&rft_id=info:doi/10.1037%2Fmet0000787&rft.externalDocID=10.1037%2Fmet0000787
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1939-1463&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1939-1463&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1939-1463&client=summon