Using a data-driven approach for the development and evaluation of phenotype algorithms for systemic lupus erythematosus

Systemic lupus erythematosus (SLE) is a chronic autoimmune disease of unknown origin. The objective of this research was to develop phenotype algorithms for SLE suitable for use in epidemiological studies using empirical evidence from observational databases. We used a process for empirically determ...

Full description

Saved in:
Bibliographic Details
Published inPloS one Vol. 18; no. 2; p. e0281929
Main Authors Swerdel, Joel N., Ramcharran, Darmendra, Hardin, Jill
Format Journal Article
LanguageEnglish
Published United States Public Library of Science 16.02.2023
Public Library of Science (PLoS)
Subjects
Online AccessGet full text
ISSN1932-6203
1932-6203
DOI10.1371/journal.pone.0281929

Cover

More Information
Summary:Systemic lupus erythematosus (SLE) is a chronic autoimmune disease of unknown origin. The objective of this research was to develop phenotype algorithms for SLE suitable for use in epidemiological studies using empirical evidence from observational databases. We used a process for empirically determining and evaluating phenotype algorithms for health conditions to be analyzed in observational research. The process started with a literature search to discover prior algorithms used for SLE. We then used a set of Observational Health Data Sciences and Informatics (OHDSI) open-source tools to refine and validate the algorithms. These included tools to discover codes for SLE that may have been missed in prior studies and to determine possible low specificity and index date misclassification in algorithms for correction. We developed four algorithms using our process: two algorithms for prevalent SLE and two for incident SLE. The algorithms for both incident and prevalent cases are comprised of a more specific version and a more sensitive version. Each of the algorithms corrects for possible index date misclassification. After validation, we found the highest positive predictive value estimate for the prevalent, specific algorithm (89%). The highest sensitivity estimate was found for the sensitive, prevalent algorithm (77%). We developed phenotype algorithms for SLE using a data-driven approach. The four final algorithms may be used directly in observational studies. The validation of these algorithms provides researchers an added measure of confidence that the algorithms are selecting subjects correctly and allows for the application of quantitative bias analysis.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
Competing Interests: Authors JS and JH are employees of Janssen Research and Development and shareholders of Johnson & Johnson. DR was an employee of Janssen Research and Development and shareholder of Johnson & Johnson and is currently affiliated with GSK.
Current address: Safety and Quantitative Innovation, GSK, Waltham, Massachusetts, United States of America
ISSN:1932-6203
1932-6203
DOI:10.1371/journal.pone.0281929