The repeated adjustment of measurement protocols method for developing high-validity text classifiers

The development and evaluation of text classifiers in psychology depends on rigorous manual coding. Yet, the evaluation of manual coding and computational algorithms is usually considered separately. This is problematic because developing high-validity classifiers is a repeated process of identifyin...

Full description

Saved in:

Bibliographic Details
Published in	Psychological methods
Main Authors	Goddard, Alex, Gillespie, Alex
Format	Journal Article
Language	English
Published	United States 06.10.2025
Online Access	Get full text
ISSN	1939-1463 1082-989X 1939-1463
DOI	10.1037/met0000787

Cover

Abstract	The development and evaluation of text classifiers in psychology depends on rigorous manual coding. Yet, the evaluation of manual coding and computational algorithms is usually considered separately. This is problematic because developing high-validity classifiers is a repeated process of identifying, explaining, and addressing conceptual and measurement issues during both the manual coding and classifier development stages. To address this problem, we introduce the Repeated Adjustment of Measurement Protocols (RAMP) method for developing high-validity text classifiers in psychology. The RAMP method has three stages: manual coding, classifier development, and integrative evaluation. These stages integrate the best practices of content analysis (manual coding), data science (classifier development), and psychology (integrative evaluation). Central to this integration is the concept of an inference loop, defined as the process of maximizing validity through repeated adjustments to concepts and constructs, guided by push-back from the empirical data. Inference loops operate both within each stage of the method and across related studies. We illustrate RAMP through a case study, where we manually coded 21,815 sentences for misunderstanding (Krippendorff's α = .79), and developed a rule-based classifier (Matthews correlation coefficient [MCC] = 0.22), a supervised machine learning classifier (Bidirectional Encoder Representations From Transformers; MCC = 0.69) and a large language model classifier (GPT-4o; MCC = 0.47). By integrating manual coding and classifier development stages, we were able to identify and address a concept validity problem with misunderstandings. RAMP advances existing methods by operationalizing validity as an ongoing dynamic process, where concepts and constructs are repeatedly adjusted toward increasingly widespread intersubjective agreement on their utility. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
AbstractList	The development and evaluation of text classifiers in psychology depends on rigorous manual coding. Yet, the evaluation of manual coding and computational algorithms is usually considered separately. This is problematic because developing high-validity classifiers is a repeated process of identifying, explaining, and addressing conceptual and measurement issues during both the manual coding and classifier development stages. To address this problem, we introduce the Repeated Adjustment of Measurement Protocols (RAMP) method for developing high-validity text classifiers in psychology. The RAMP method has three stages: manual coding, classifier development, and integrative evaluation. These stages integrate the best practices of content analysis (manual coding), data science (classifier development), and psychology (integrative evaluation). Central to this integration is the concept of an inference loop, defined as the process of maximizing validity through repeated adjustments to concepts and constructs, guided by push-back from the empirical data. Inference loops operate both within each stage of the method and across related studies. We illustrate RAMP through a case study, where we manually coded 21,815 sentences for misunderstanding (Krippendorff's α = .79), and developed a rule-based classifier (Matthews correlation coefficient [MCC] = 0.22), a supervised machine learning classifier (Bidirectional Encoder Representations From Transformers; MCC = 0.69) and a large language model classifier (GPT-4o; MCC = 0.47). By integrating manual coding and classifier development stages, we were able to identify and address a concept validity problem with misunderstandings. RAMP advances existing methods by operationalizing validity as an ongoing dynamic process, where concepts and constructs are repeatedly adjusted toward increasingly widespread intersubjective agreement on their utility. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
Author	Gillespie, Alex Goddard, Alex
Author_xml	– sequence: 1 givenname: Alex orcidid: 0000-0003-1382-2700 surname: Goddard fullname: Goddard, Alex organization: Department of Psychological and Behavioural Science, London School of Economics and Political Science – sequence: 2 givenname: Alex orcidid: 0000-0002-0162-1269 surname: Gillespie fullname: Gillespie, Alex organization: Department of Psychological and Behavioural Science, London School of Economics and Political Science
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/41051829$$D View this record in MEDLINE/PubMed
BookMark	eNpNkMtOwzAQRS1URB-w4QOQ9yhg104cL1HFS6rEpqwj1540rpw4sp1C_55Cec3mzoyO7uJM0ajzHSB0SckNJUzctpDIYUQpTtCESiYzygs2-reP0TTGLSGUs5KfoTGnJKflXE4QrBrAAXpQCQxWZjvE1EKXsK9xCyoOAb7OPvjktXfx8E2NN7j2ARvYgfO97Ta4sZsm2ylnjU17nOA9Ye1UjLa2EOI5Oq2Vi3DxnTP0-nC_Wjxly5fH58XdMtNzTlImC8N5XtJirXPNSyMEo7pU3DDJqdGEalWUZi2KmsvaSEK0NKbQOQhaAJkLNkPXx96h69X-TTlX9cG2KuwrSqpPWdWfrAN9daT7Yd2C-UV_7LAP84dpIw
ContentType	Journal Article
DBID	NPM ADTOC UNPAY
DOI	10.1037/met0000787
DatabaseName	PubMed Unpaywall for CDI: Periodical Content Unpaywall
DatabaseTitle	PubMed
DatabaseTitleList	PubMed
Database_xml	– sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
Discipline	Psychology
EISSN	1939-1463
ExternalDocumentID	10.1037/met0000787 41051829
Genre	Journal Article
GroupedDBID	--- --Z -~X 0R~ 123 29P 354 3KI 5VS 7RZ ABIVO ABNCP ABVOZ ACHQT ACPQG AEHFB ALMA_UNASSIGNED_HOLDINGS AWKKM AZXWR CS3 EPA F5P FTD HVGLF HZ~ ISO LW5 NPM O9- OPA OVD P2P PHGZT ROL SES SPA TEORI TN5 YNT ZPI .-4 07C 53G ADTOC AETEA CGNQK OHT UHS UNPAY XJT
ID	FETCH-LOGICAL-c240t-96d445816bc5c48d7731c8a4d3941dc01ca68db76f49fd900c9dd6c5e716e0273
IEDL.DBID	UNPAY
ISSN	1939-1463 1082-989X
IngestDate	Thu Oct 09 05:44:01 EDT 2025 Fri Oct 10 01:53:24 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Language	English
License	cc-by
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c240t-96d445816bc5c48d7731c8a4d3941dc01ca68db76f49fd900c9dd6c5e716e0273
ORCID	0000-0002-0162-1269 0000-0003-1382-2700
OpenAccessLink	https://proxy.k.utb.cz/login?url=https://doi.org/10.1037/met0000787
PMID	41051829
ParticipantIDs	unpaywall_primary_10_1037_met0000787 pubmed_primary_41051829
PublicationCentury	2000
PublicationDate	2025-Oct-06
PublicationDateYYYYMMDD	2025-10-06
PublicationDate_xml	– month: 10 year: 2025 text: 2025-Oct-06 day: 06
PublicationDecade	2020
PublicationPlace	United States
PublicationPlace_xml	– name: United States
PublicationTitle	Psychological methods
PublicationTitleAlternate	Psychol Methods
PublicationYear	2025
SSID	ssj0014384
Score	2.4693372
SecondaryResourceType	online_first
Snippet	The development and evaluation of text classifiers in psychology depends on rigorous manual coding. Yet, the evaluation of manual coding and computational...
SourceID	unpaywall pubmed
SourceType	Open Access Repository Index Database
Title	The repeated adjustment of measurement protocols method for developing high-validity text classifiers
URI	https://www.ncbi.nlm.nih.gov/pubmed/41051829 https://doi.org/10.1037/met0000787
UnpaywallVersion	publishedVersion
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LawIxEB6sHtpL3w9LK6F4jey62WxyFKlIQfFQwZ4kmwfYWhXdpdhf31nXZw-l5yQQkknmm8yXbwCq1ufOhhGjLLZ1yqLAUMGdofVAuli7gPMViabT5e0-exmEgwI8bf7CHOTvAwzLbbLKtonoCEo8RLxdhFK_22u85cz5OpVCDvLUsaR47IONBunB4D3_cpxOZmr5pcbjPUfSOoPmZgo5f-SjliZxTX__Umf8e47ncLrGkaSRb_wFFOzkEk6219nyCixaAJnbGV621hBl3tPFilFOpo587l4GSabUMEVzWJC8mjRBGEt2f6lIJmhM0R5HBgE7yYgiRGeQe-SyKtrX0G89vzbbdF1UgWp03gmV3DAWCp_HOtRMmCgKfC0UM4FkvtGerxUXJo64Y9IZ6XlaGsN1aDGwspn4zQ0UJ9OJvQOCSMDiQOms1Bg4RUoz5TGltIgRVghRhtt80YezXDljmHFKMaCRZahud2HbuEqHB9Fwt5z3_-v2AMVkntpHRAdJXIGjbq9TWRvJD0b3uwI
linkProvider	Unpaywall
linkToUnpaywall	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA61PejF96OiEqTXlN0mm8exFEsRLB4s1FPJ5gHVui3tLlJ_vbO7felBPCeBkEwy32S-fINQw4Xcu0gwwmLXIkxQSyT3lrSo8rHxlPOCRPPU570BexxGwwq6X_-F-ZG_pxCWu7TItkmxh2o8ArxdRbVB_7n9WjLnW0RJNSxTx4rAsadrDdIfg3f8y36WzPTyU08mO46ke4Q66ymU_JH3ZpbGTfP1S53x7zkeo8MVjsTtcuNPUMUlp-hgc50tz5ADC8BzN4PL1lms7Vu2KBjleOrxx_ZlEOdKDVMwhwUuq0ljgLF4-5cK54LGBOxxbAGw45wogk0Oucc-r6J9jgbdh5dOj6yKKhADzjslilvGIhny2ESGSSsEDY3UzFLFQmuC0GgubSy4Z8pbFQRGWctN5CCwcrn4zQWqJtPEXSEMSMDBQOWdMhA4CW2YDpjWRsYAK6Sso8ty0UezUjljlHNKIaBRddTY7MKmsUiHUzHaLuf1_7rdoGo6z9wtoIM0vluZxzfqEbn2
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+repeated+adjustment+of+measurement+protocols+method+for+developing+high-validity+text+classifiers&rft.jtitle=Psychological+methods&rft.issn=1939-1463&rft_id=info:doi/10.1037%2Fmet0000787&rft.externalDocID=10.1037%2Fmet0000787
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1939-1463&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1939-1463&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1939-1463&client=summon