Toward automating GRADE classification: a proof-of-concept evaluation of an artificial intelligence-based tool for semiautomated evidence quality rating in systematic reviews

BackgroundEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and...

Full description

Saved in:
Bibliographic Details
Published inBMJ evidence-based medicine p. bmjebm-2024-113123
Main Authors Oliveira dos Santos, Alisson, Belo, Vinícius Silva, Mota Machado, Tales, Silva, Eduardo Sérgio da
Format Journal Article
LanguageEnglish
Published England BMJ Publishing Group Ltd 07.04.2025
BMJ Publishing Group LTD
Subjects
Online AccessGet full text
ISSN2515-446X
2515-4478
2515-4478
DOI10.1136/bmjebm-2024-113123

Cover

Abstract BackgroundEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.DesignAnalytical experimental study.ObjectiveThe objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.MethodsThe URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.ResultsThe open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen’s kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I2 values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.ConclusionThe results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future.
AbstractList Evaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers. Analytical experimental study. The objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials. The URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators. The open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen's kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively. The results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future.
BackgroundEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.DesignAnalytical experimental study.ObjectiveThe objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.MethodsThe URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.ResultsThe open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen’s kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I2 values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.ConclusionThe results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future.
Evaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.BACKGROUNDEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.Analytical experimental study.DESIGNAnalytical experimental study.The objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.OBJECTIVEThe objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.The URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.METHODSThe URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.The open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen's kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I2 values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.RESULTSThe open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen's kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I2 values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.The results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future.CONCLUSIONThe results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future.
Author Mota Machado, Tales
Silva, Eduardo Sérgio da
Oliveira dos Santos, Alisson
Belo, Vinícius Silva
Author_xml – sequence: 1
  givenname: Alisson
  orcidid: 0000-0002-4648-9951
  surname: Oliveira dos Santos
  fullname: Oliveira dos Santos, Alisson
  organization: Campus Tres Lagoas, Universidade Federal de Mato Grosso do Sul, Três Lagoas, Brazil
– sequence: 2
  givenname: Vinícius Silva
  orcidid: 0000-0003-0183-1175
  surname: Belo
  fullname: Belo, Vinícius Silva
  email: viniciusbelo@ufsj.edu.br
  organization: Universidade Federal de Sao Joao del-Rei—Campus Centro-Oeste Dona Lindu, Divinopolis, Brazil
– sequence: 3
  givenname: Tales
  orcidid: 0000-0003-0603-823X
  surname: Mota Machado
  fullname: Mota Machado, Tales
  organization: Universidade Federal de Ouro Preto, Ouro Preto, Brazil
– sequence: 4
  givenname: Eduardo Sérgio da
  orcidid: 0000-0001-7409-9216
  surname: Silva
  fullname: Silva, Eduardo Sérgio da
  organization: Oswaldo Cruz Foundation, Rio de Janeiro, Brazil
BackLink https://www.ncbi.nlm.nih.gov/pubmed/40194821$$D View this record in MEDLINE/PubMed
BookMark eNp9kc1q3TAQhUVJaX6aF-iiCLrpxqlH8o_cXUjStBAolBS6M2N5HHSxpRtJTrgv1WesHN-k0EVBIDH6juaMzjE7sM4SY-8gPwOQ1adu2lA3ZSIXRZYKIOQrdiRKKLOiqNXBy7n6dchOQ9jkeS4AmlKpN-ywyKEplIAj9vvWPaLvOc7RTRiNvePXP84vr7geMQQzGJ2Kzn7myLfeuSFLSzuraRs5PeA4P11zN3C0HH1cFAZHbmykcTR3lNCsw0A9j86NfHCeB5rMvl8q04PpF4rfzziauON-tWEsD7sQaXGluU8YPYa37PWAY6DT_X7Cfn65ur34mt18v_52cX6TdWnsmMmm6lHUlQRUsqlRFLrRfV93taixEYWsSyokEAjsO8AuF40SPQEs39k1lTxhH9d309D3M4XYTiboNBFacnNoJai6FJUqm4R--AfduNnb5O6JAiVqBYl6v6fmbqK-3Xozod-1z0kkQKyA9i4ET8MLAnm7JN6uibeLxXZNPInOVlG6-9v2P4I_LDqwIA
Cites_doi 10.1136/bmj.39489.470347.AD
10.1136/bmj.f7383
10.2307/2529310
10.1016/j.jclinepi.2015.11.019
10.18653/v1/P17-4002
10.1371/journal.pone.0034697
10.48550/arXiv.1810.04805
10.1016/j.jclinepi.2015.11.018
10.1186/1471-2288-7-10
10.1002/jrsm.1230
10.1016/j.jclinepi.2015.08.013
10.1038/s41433-024-02958-w
10.1136/bmj.d5928
10.1016/j.jclinepi.2017.12.015
10.1136/bmj.b4012
10.1093/jamia/ocv044
10.4103/2249-4863.109934
10.1016/j.jbi.2023.104389
10.1186/2046-4053-3-82
10.1111/jnu.12628
10.1186/s12911-019-0814-z
10.1097/PRS.0b013e318219c171
10.1136/bmjopen-2016-012545
10.1136/amiajnl-2013-002411
10.5281/ZENODO.13916887
10.1016/j.jclinepi.2010.04.026
10.1002/14651858.CD001914.pub2
10.1016/j.jclinepi.2012.07.005
10.23919/MIPRO.2019.8757088
ContentType Journal Article
Copyright Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group.
2025 Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group.
Copyright_xml – notice: Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group.
– notice: 2025 Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group.
DBID AAYXX
CITATION
NPM
3V.
7X7
7XB
88E
8FI
8FJ
8FK
ABUWG
AFKRA
BENPR
BTHHO
CCPQU
FYUFA
GHDGH
K9.
M0S
M1P
PHGZM
PHGZT
PJZUB
PKEHL
PPXIY
PQEST
PQQKQ
PQUKI
PRINS
S0X
7X8
DOI 10.1136/bmjebm-2024-113123
DatabaseName CrossRef
PubMed
ProQuest Central (Corporate)
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest Central
ProQuest Central
BMJ Journals
ProQuest One Community College
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Health & Medical Complete (Alumni)
Health & Medical Collection (Alumni Edition)
Medical Database
ProQuest Central Premium
ProQuest One Academic
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
SIRS Editorial
MEDLINE - Academic
DatabaseTitle CrossRef
PubMed
ProQuest One Academic Middle East (New)
SIRS Editorial
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Central China
ProQuest Central
ProQuest Health & Medical Research Collection
Health Research Premium Collection
Health and Medicine Complete (Alumni Edition)
Health & Medical Research Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
Health Research Premium Collection (Alumni)
ProQuest Hospital Collection (Alumni)
ProQuest Health & Medical Complete
ProQuest Medical Library
ProQuest One Academic UKI Edition
BMJ Journals
ProQuest One Academic
ProQuest One Academic (New)
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList PubMed
ProQuest One Academic Middle East (New)

MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: BENPR
  name: ProQuest Central
  url: http://www.proquest.com/pqcentral?accountid=15518
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
EISSN 2515-4478
ExternalDocumentID 40194821
10_1136_bmjebm_2024_113123
ebmed
Genre Journal Article
GrantInformation_xml – fundername: Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
  grantid: 001
  funderid: http://dx.doi.org/10.13039/501100002322
– fundername: Universidade Federal de São João del-Rei
  funderid: http://dx.doi.org/10.13039/501100020987
– fundername: Universidade Federal de Mato Grosso do Sul
  funderid: http://dx.doi.org/10.13039/501100016182
– fundername: Conselho Nacional de Desenvolvimento Científico e Tecnológico
  grantid: 305665/2023-5; 311309/2023-2
  funderid: http://dx.doi.org/10.13039/501100003593
GroupedDBID 53G
AAHLL
AAOJX
ABTFR
ACGFS
ACHTP
ADBBV
ADUGQ
AFWFF
AJYBZ
ALMA_UNASSIGNED_HOLDINGS
BENPR
BLJBA
BTHHO
CXRWF
EBS
EJD
HAJ
OVD
RMJ
TEORI
UYXKK
7X7
AAYXX
ACQHZ
AERUA
BPHCQ
BVXVI
CITATION
PQQKQ
PROAC
S0X
NPM
3V.
7XB
88E
8FI
8FJ
8FK
ABUWG
AFKRA
CCPQU
FYUFA
HMCUK
K9.
M1P
PHGZM
PHGZT
PJZUB
PKEHL
PPXIY
PQEST
PQUKI
PRINS
PSQYO
UKHRP
7X8
ID FETCH-LOGICAL-b251t-396da27631a8397a24c9cdd7b727a924375e431e12adb1ab02982de112024b963
IEDL.DBID BENPR
ISSN 2515-446X
2515-4478
IngestDate Fri Jul 11 18:47:05 EDT 2025
Tue Oct 07 07:07:23 EDT 2025
Mon Jul 21 05:59:36 EDT 2025
Wed Oct 01 06:34:12 EDT 2025
Tue Apr 08 14:21:35 EDT 2025
IsPeerReviewed true
IsScholarly true
Keywords Evidence-Based Practice
Systematic Reviews as Topic
Language English
License Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-b251t-396da27631a8397a24c9cdd7b727a924375e431e12adb1ab02982de112024b963
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
content type line 14
ObjectType-Feature-3
ObjectType-Evidence Based Healthcare-1
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0003-0183-1175
0000-0002-4648-9951
0000-0001-7409-9216
0000-0003-0603-823X
PMID 40194821
PQID 3187182781
PQPubID 2041037
ParticipantIDs proquest_miscellaneous_3187526859
proquest_journals_3187182781
pubmed_primary_40194821
crossref_primary_10_1136_bmjebm_2024_113123
bmj_journals_10_1136_bmjebm_2024_113123
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2025-04-07
PublicationDateYYYYMMDD 2025-04-07
PublicationDate_xml – month: 04
  year: 2025
  text: 2025-04-07
  day: 07
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
– name: London
PublicationTitle BMJ evidence-based medicine
PublicationTitleAbbrev BMJ EBM
PublicationTitleAlternate BMJ Evid Based Med
PublicationYear 2025
Publisher BMJ Publishing Group Ltd
BMJ Publishing Group LTD
Publisher_xml – name: BMJ Publishing Group Ltd
– name: BMJ Publishing Group LTD
References Devlin, Chang, Lee (R13) 2019
Meader, King, Llewellyn (R6) 2014; 3
Soboczenski, Trikalinos, Kuiper (R25) 2019; 19
Murad, Mustafa, Morgan (R38) 2016; 74
Marshall, Kuiper, Banner (R11) 2017; 2017
Shea, Grimshaw, Wells (R10) 2007; 7
Hartling, Fernandes, Seida (R22) 2012; 7
Hartling, Ospina, Liang (R27) 2009; 339
Guyatt, Oxman, Akl (R33) 2011; 64
Burns, Rohrich, Chung (R4) 2011; 128
Hartling, Hamm, Milne (R26) 2013; 66
Hirt, Meichlinger, Schumacher (R24) 2021; 53
Smith (R15) 2013; 347
Bui, Zeng-Treitler (R29) 2014; 21
Higgins, Altman, Gøtzsche (R9) 2011; 343
Borenstein, Higgins, Hedges (R35) 2017; 8
Marshall, Kuiper, Wallace (R30) 2016; 23
Masalkhi, Ong, Waisberg (R39) 2024; 38
Francis, Smith, Saljuqi (R23) 2015; 2015
Santos, Machado (R20) 2024
Gates, Vandermeer, Hartling (R28) 2018; 96
Landis, Koch (R16) 1977; 33
Pollock, Farmer, Brady (R7) 2016; 70
Gionfriddo (R37) 2016; 74
Guyatt, Oxman, Vist (R5) 2008; 336
Santos, da Silva, Couto (R8) 2023; 142
Borah, Brown, Capers (R3) 2017; 7
Maratkar, Adkar (R31) 2021; 4
Gopalakrishnan, Ganeshkumar (R2) 2013; 2
2025040710200943000_bmjebm-2024-113123v1.18
2025040710200943000_bmjebm-2024-113123v1.5
2025040710200943000_bmjebm-2024-113123v1.19
2025040710200943000_bmjebm-2024-113123v1.4
2025040710200943000_bmjebm-2024-113123v1.38
2025040710200943000_bmjebm-2024-113123v1.17
2025040710200943000_bmjebm-2024-113123v1.39
Meader (2025040710200943000_bmjebm-2024-113123v1.6) 2014; 3
2025040710200943000_bmjebm-2024-113123v1.8
2025040710200943000_bmjebm-2024-113123v1.7
2025040710200943000_bmjebm-2024-113123v1.32
2025040710200943000_bmjebm-2024-113123v1.33
2025040710200943000_bmjebm-2024-113123v1.30
2025040710200943000_bmjebm-2024-113123v1.14
2025040710200943000_bmjebm-2024-113123v1.36
2025040710200943000_bmjebm-2024-113123v1.37
2025040710200943000_bmjebm-2024-113123v1.12
2025040710200943000_bmjebm-2024-113123v1.34
2025040710200943000_bmjebm-2024-113123v1.13
2025040710200943000_bmjebm-2024-113123v1.35
Higgins (2025040710200943000_bmjebm-2024-113123v1.9) 2011; 343
Soboczenski (2025040710200943000_bmjebm-2024-113123v1.25) 2019; 19
2025040710200943000_bmjebm-2024-113123v1.2
Borah (2025040710200943000_bmjebm-2024-113123v1.3) 2017; 7
2025040710200943000_bmjebm-2024-113123v1.1
Hartling (2025040710200943000_bmjebm-2024-113123v1.27) 2009; 339
Smith (2025040710200943000_bmjebm-2024-113123v1.15) 2013; 347
2025040710200943000_bmjebm-2024-113123v1.29
Landis (2025040710200943000_bmjebm-2024-113123v1.16) 1977; 33
2025040710200943000_bmjebm-2024-113123v1.28
2025040710200943000_bmjebm-2024-113123v1.21
2025040710200943000_bmjebm-2024-113123v1.20
2025040710200943000_bmjebm-2024-113123v1.26
2025040710200943000_bmjebm-2024-113123v1.24
Hartling (2025040710200943000_bmjebm-2024-113123v1.22) 2012; 7
Maratkar (2025040710200943000_bmjebm-2024-113123v1.31) 2021; 4
Francis (2025040710200943000_bmjebm-2024-113123v1.23) 2015; 2015
Shea (2025040710200943000_bmjebm-2024-113123v1.10) 2007; 7
Marshall (2025040710200943000_bmjebm-2024-113123v1.11) 2017; 2017
References_xml – volume: 336
  start-page: 924
  year: 2008
  ident: R5
  article-title: GRADE: an emerging consensus on rating quality of evidence and strength of recommendations
  publication-title: BMJ
  doi: 10.1136/bmj.39489.470347.AD
– volume: 347
  year: 2013
  ident: R15
  article-title: The Cochrane collaboration at 20
  publication-title: BMJ
  doi: 10.1136/bmj.f7383
– volume: 33
  year: 1977
  ident: R16
  article-title: The Measurement of Observer Agreement for Categorical Data
  publication-title: Biometrics
  doi: 10.2307/2529310
– volume: 74
  start-page: 237
  year: 2016
  ident: R37
  article-title: Subjectivity is a strength: a comment on “an algorithm was developed to assign GRADE levels of evidence to comparisons within systematic reviews”
  publication-title: J Clin Epidemiol
  doi: 10.1016/j.jclinepi.2015.11.019
– volume: 2017
  start-page: 7
  year: 2017
  ident: R11
  article-title: Automating Biomedical Evidence Synthesis: RobotReviewer
  publication-title: Proc Conf Assoc Comput Linguist Meet
  doi: 10.18653/v1/P17-4002
– volume: 7
  year: 2012
  ident: R22
  article-title: From the trenches: a cross-sectional study applying the GRADE tool in systematic reviews of healthcare interventions
  publication-title: PLoS ONE
  doi: 10.1371/journal.pone.0034697
– year: 2019
  ident: R13
  article-title: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  publication-title: arXiv
  doi: 10.48550/arXiv.1810.04805
– volume: 74
  start-page: 237
  year: 2016
  ident: R38
  article-title: Rating the quality of evidence is by necessity a matter of judgment
  publication-title: J Clin Epidemiol
  doi: 10.1016/j.jclinepi.2015.11.018
– volume: 7
  year: 2007
  ident: R10
  article-title: Development of AMSTAR: a measurement tool to assess the methodological quality of systematic reviews
  publication-title: BMC Med Res Methodol
  doi: 10.1186/1471-2288-7-10
– volume: 8
  start-page: 5
  year: 2017
  ident: R35
  article-title: Basics of meta-analysis: I2 is not an absolute measure of heterogeneity
  publication-title: Res Synth Methods
  doi: 10.1002/jrsm.1230
– volume: 70
  start-page: 106
  year: 2016
  ident: R7
  article-title: An algorithm was developed to assign GRADE levels of evidence to comparisons within systematic reviews
  publication-title: J Clin Epidemiol
  doi: 10.1016/j.jclinepi.2015.08.013
– volume: 38
  start-page: 1412
  year: 2024
  ident: R39
  article-title: Google DeepMind’s gemini AI versus ChatGPT: a comparative analysis in ophthalmology
  publication-title: Eye (Lond)
  doi: 10.1038/s41433-024-02958-w
– volume: 343
  year: 2011
  ident: R9
  article-title: The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials
  publication-title: BMJ
  doi: 10.1136/bmj.d5928
– volume: 96
  start-page: 54
  year: 2018
  ident: R28
  article-title: Technology-assisted risk of bias assessment in systematic reviews: a prospective cross-sectional evaluation of the RobotReviewer machine learning tool
  publication-title: J Clin Epidemiol
  doi: 10.1016/j.jclinepi.2017.12.015
– volume: 339
  year: 2009
  ident: R27
  article-title: Risk of bias versus quality assessment of randomised controlled trials: cross sectional study
  publication-title: BMJ
  doi: 10.1136/bmj.b4012
– volume: 23
  start-page: 193
  year: 2016
  ident: R30
  article-title: RobotReviewer: evaluation of a system for automatically assessing bias in clinical trials
  publication-title: J Am Med Inform Assoc
  doi: 10.1093/jamia/ocv044
– volume: 2
  start-page: 9
  year: 2013
  ident: R2
  article-title: Systematic Reviews and Meta-analysis: Understanding the Best Evidence in Primary Healthcare
  publication-title: J Family Med Prim Care
  doi: 10.4103/2249-4863.109934
– volume: 142
  start-page: 104389
  year: 2023
  ident: R8
  article-title: The use of artificial intelligence for automating or semi-automating biomedical literature analyses: A scoping review
  publication-title: J Biomed Inform
  doi: 10.1016/j.jbi.2023.104389
– volume: 3
  year: 2014
  ident: R6
  article-title: A checklist designed to aid consistency and reproducibility of GRADE assessments: development and pilot validation
  publication-title: Syst Rev
  doi: 10.1186/2046-4053-3-82
– volume: 53
  start-page: 246
  year: 2021
  ident: R24
  article-title: Agreement in Risk of Bias Assessment Between RobotReviewer and Human Reviewers: An Evaluation Study on Randomised Controlled Trials in Nursing-Related Cochrane Reviews
  publication-title: J Nurs Scholarsh
  doi: 10.1111/jnu.12628
– volume: 19
  year: 2019
  ident: R25
  article-title: Machine learning to help researchers evaluate biases in clinical trials: a prospective, randomized user study
  publication-title: BMC Med Inform Decis Mak
  doi: 10.1186/s12911-019-0814-z
– volume: 128
  start-page: 305
  year: 2011
  ident: R4
  article-title: The levels of evidence and their role in evidence-based medicine
  publication-title: Plast Reconstr Surg
  doi: 10.1097/PRS.0b013e318219c171
– volume: 7
  year: 2017
  ident: R3
  article-title: Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry
  publication-title: BMJ Open
  doi: 10.1136/bmjopen-2016-012545
– volume: 21
  start-page: 850
  year: 2014
  ident: R29
  article-title: Learning regular expressions for clinical text classification
  publication-title: J Am Med Inform Assoc
  doi: 10.1136/amiajnl-2013-002411
– year: 2024
  ident: R20
  article-title: URSE-Automated GRADE System
  publication-title: Zenodo
  doi: 10.5281/ZENODO.13916887
– volume: 64
  start-page: 383
  year: 2011
  ident: R33
  article-title: GRADE guidelines: 1. Introduction-GRADE evidence profiles and summary of findings tables
  publication-title: J Clin Epidemiol
  doi: 10.1016/j.jclinepi.2010.04.026
– volume: 2015
  year: 2015
  ident: R23
  article-title: Oral protein calorie supplementation for children with chronic disease
  publication-title: Cochrane Database Syst Rev
  doi: 10.1002/14651858.CD001914.pub2
– volume: 4
  start-page: 99
  year: 2021
  ident: R31
  article-title: An emerging frontend JavaScript library
  publication-title: IRE Journals
– volume: 66
  start-page: 973
  year: 2013
  ident: R26
  article-title: Testing the risk of bias tool showed low reliability between individual reviewers and across consensus assessments of reviewer pairs
  publication-title: J Clin Epidemiol
  doi: 10.1016/j.jclinepi.2012.07.005
– ident: 2025040710200943000_bmjebm-2024-113123v1.32
– volume: 19
  year: 2019
  ident: 2025040710200943000_bmjebm-2024-113123v1.25
  article-title: Machine learning to help researchers evaluate biases in clinical trials: a prospective, randomized user study
  publication-title: BMC Med Inform Decis Mak
  doi: 10.1186/s12911-019-0814-z
– volume: 3
  year: 2014
  ident: 2025040710200943000_bmjebm-2024-113123v1.6
  article-title: A checklist designed to aid consistency and reproducibility of GRADE assessments: development and pilot validation
  publication-title: Syst Rev
  doi: 10.1186/2046-4053-3-82
– ident: 2025040710200943000_bmjebm-2024-113123v1.30
  doi: 10.1093/jamia/ocv044
– ident: 2025040710200943000_bmjebm-2024-113123v1.34
– ident: 2025040710200943000_bmjebm-2024-113123v1.13
– ident: 2025040710200943000_bmjebm-2024-113123v1.33
  doi: 10.1016/j.jclinepi.2010.04.026
– ident: 2025040710200943000_bmjebm-2024-113123v1.38
  doi: 10.1016/j.jclinepi.2015.11.018
– volume: 7
  year: 2012
  ident: 2025040710200943000_bmjebm-2024-113123v1.22
  article-title: From the trenches: a cross-sectional study applying the GRADE tool in systematic reviews of healthcare interventions
  publication-title: PLoS ONE
  doi: 10.1371/journal.pone.0034697
– ident: 2025040710200943000_bmjebm-2024-113123v1.4
  doi: 10.1097/PRS.0b013e318219c171
– ident: 2025040710200943000_bmjebm-2024-113123v1.39
  doi: 10.1038/s41433-024-02958-w
– ident: 2025040710200943000_bmjebm-2024-113123v1.20
– ident: 2025040710200943000_bmjebm-2024-113123v1.26
  doi: 10.1016/j.jclinepi.2012.07.005
– ident: 2025040710200943000_bmjebm-2024-113123v1.17
– ident: 2025040710200943000_bmjebm-2024-113123v1.19
  doi: 10.23919/MIPRO.2019.8757088
– ident: 2025040710200943000_bmjebm-2024-113123v1.5
  doi: 10.1136/bmj.39489.470347.AD
– ident: 2025040710200943000_bmjebm-2024-113123v1.36
– ident: 2025040710200943000_bmjebm-2024-113123v1.2
  doi: 10.4103/2249-4863.109934
– ident: 2025040710200943000_bmjebm-2024-113123v1.8
  doi: 10.1016/j.jbi.2023.104389
– volume: 339
  year: 2009
  ident: 2025040710200943000_bmjebm-2024-113123v1.27
  article-title: Risk of bias versus quality assessment of randomised controlled trials: cross sectional study
  publication-title: BMJ
  doi: 10.1136/bmj.b4012
– volume: 4
  start-page: 99
  year: 2021
  ident: 2025040710200943000_bmjebm-2024-113123v1.31
  article-title: An emerging frontend JavaScript library
  publication-title: IRE Journals
– volume: 343
  year: 2011
  ident: 2025040710200943000_bmjebm-2024-113123v1.9
  article-title: The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials
  publication-title: BMJ
  doi: 10.1136/bmj.d5928
– volume: 33
  year: 1977
  ident: 2025040710200943000_bmjebm-2024-113123v1.16
  article-title: The Measurement of Observer Agreement for Categorical Data
  publication-title: Biometrics
  doi: 10.2307/2529310
– ident: 2025040710200943000_bmjebm-2024-113123v1.29
  doi: 10.1136/amiajnl-2013-002411
– ident: 2025040710200943000_bmjebm-2024-113123v1.35
  doi: 10.1002/jrsm.1230
– ident: 2025040710200943000_bmjebm-2024-113123v1.12
– ident: 2025040710200943000_bmjebm-2024-113123v1.28
  doi: 10.1016/j.jclinepi.2017.12.015
– volume: 7
  year: 2017
  ident: 2025040710200943000_bmjebm-2024-113123v1.3
  article-title: Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry
  publication-title: BMJ Open
  doi: 10.1136/bmjopen-2016-012545
– ident: 2025040710200943000_bmjebm-2024-113123v1.37
  doi: 10.1016/j.jclinepi.2015.11.018
– volume: 2015
  year: 2015
  ident: 2025040710200943000_bmjebm-2024-113123v1.23
  article-title: Oral protein calorie supplementation for children with chronic disease
  publication-title: Cochrane Database Syst Rev
– ident: 2025040710200943000_bmjebm-2024-113123v1.1
– ident: 2025040710200943000_bmjebm-2024-113123v1.24
  doi: 10.1111/jnu.12628
– ident: 2025040710200943000_bmjebm-2024-113123v1.18
– volume: 7
  year: 2007
  ident: 2025040710200943000_bmjebm-2024-113123v1.10
  article-title: Development of AMSTAR: a measurement tool to assess the methodological quality of systematic reviews
  publication-title: BMC Med Res Methodol
  doi: 10.1186/1471-2288-7-10
– volume: 2017
  start-page: 7
  year: 2017
  ident: 2025040710200943000_bmjebm-2024-113123v1.11
  article-title: Automating Biomedical Evidence Synthesis: RobotReviewer
  publication-title: Proc Conf Assoc Comput Linguist Meet
– ident: 2025040710200943000_bmjebm-2024-113123v1.21
– ident: 2025040710200943000_bmjebm-2024-113123v1.14
– volume: 347
  year: 2013
  ident: 2025040710200943000_bmjebm-2024-113123v1.15
  article-title: The Cochrane collaboration at 20
  publication-title: BMJ
  doi: 10.1136/bmj.f7383
– ident: 2025040710200943000_bmjebm-2024-113123v1.7
  doi: 10.1016/j.jclinepi.2015.08.013
SSID ssj0002119588
Score 2.29032
Snippet BackgroundEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations...
Evaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment,...
SourceID proquest
pubmed
crossref
bmj
SourceType Aggregation Database
Index Database
Publisher
StartPage bmjebm-2024-113123
SubjectTerms Adaptation
Artificial intelligence
Automation
Bias
Classification
Clinical trials
Decision making
Evidence-Based Practice
Original research
Systematic Reviews as Topic
Title Toward automating GRADE classification: a proof-of-concept evaluation of an artificial intelligence-based tool for semiautomated evidence quality rating in systematic reviews
URI https://ebm.bmj.com/content/early/2025/04/07/bmjebm-2024-113123.full
https://www.ncbi.nlm.nih.gov/pubmed/40194821
https://www.proquest.com/docview/3187182781
https://www.proquest.com/docview/3187526859
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 2515-4478
  dateEnd: 20250502
  omitProxy: true
  ssIdentifier: ssj0002119588
  issn: 2515-446X
  databaseCode: 7X7
  dateStart: 20180101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl: http://www.proquest.com/pqcentral?accountid=15518
  eissn: 2515-4478
  dateEnd: 20250502
  omitProxy: true
  ssIdentifier: ssj0002119588
  issn: 2515-446X
  databaseCode: BENPR
  dateStart: 19951201
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1dSxwxFL3o-uJLUWzrtiq3UPBBgk4281UQsaKVgouIwr4NySRTVrozVmcf_FP9jd6bzOzWhwrzlAlJuPcmOfk6B-CrdIrC5qgSiS0zoZSNRKY0HzhWmdEJLbe9NuDVOLm8Uz8n8WQFxv1bGL5W2Y-JfqC2Tcl75IcUezSMyjSLTh7-CFaN4tPVXkJDd9IK9thTjK3CmmRmrAGsfT8fX98sdl2Yzyz2YpQ0r8fUtmTSv6QZJYdmdu_MjAJHKkEJkdcwosTXc9Z_gKifkC424F2HJPE0uH4TVly9BX9v_TVY1PO2YSxa_8IfN2RsLBkl87Ug74lvqJEqaCpBXxleLuKS-BubCnWNHFWBYAKn_zB3Cp75LLZN8xsJ8uKTm027-ijZdTKlGJ5rPuNjaMa0xiVrNHYkqO_h7uL89uxSdJIMwpDBWjHKE6sljUmRJmSVaqnKvLQ2NQSDdM7khjE5P3KR1NZE2jDBu7SOQB3Z1FBn_wCDuqndNqBmmqCsMnEZVyrWtLSyhrCmKkcsk2WSIeyT6YuuSz0VfrUySorgpIILLIKThnDQu6d4CBwdb-be6T24LHwZXUP4svhNPY2PT3TtmnnIw-Q4cT6Ej8Hzi-polZqrTEaf3i78M6xL1g_mmz_pDgzax7nbJVDTmj1YTSfpXhevL7IK9ps
linkProvider ProQuest
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3NbtQwELZKe4ALKuJvaYFBAnFAVonjZBOkquKnZUvbFaq20t5SO3bQom7SdrNCfSkeoc_GjO3swoHeKuXkROMkM_bM2J7vY-y1sBLN5n3FU1NmXEoT8Uwq2nCsMq1STLcdN-DRMB2cyG_jZLzCrrtaGDpW2c2JbqI2TUlr5FtoeziNin4W7ZxfcGKNot3VjkJDBWoFs-0gxkJhx4G9-oUp3Gx7_wvq-40Qe7ujzwMeWAa4Rt_e8jhPjRI4zCKFwUJfCVnmpTF9jZ5d5YTXl-D3RDYSyuhIacIsF8ZinILuTaP9otw7bE3GMsfkb-3T7vD78WKVh_DTEkd-iX0l-C_ScVe5E6dbevrT6iknQRwbIseZhI3_-sj_BL7OAe6ts_shcoWP3tQesBVbP2S_R-7YLah521DsW_-Ar8eoXCgpKqdjSE7zH0ABdtBUHK_SV0rCEmgcmgpUDWTFHtACJn8hhXLytAbapjkDDLFhZqeT0B8220CLCr489Aou_WtMaliiVEMAXX3ETm5FOY_Zat3U9ikDRbBEWaWTMqlkojCVMxpjW1nGRMul0x57i7--CEN4VrjsKE4Lr6SCBBZeST32rlNPce4xQW58erPT4FL40pp77NXiNo5s2q5RtW3m_hkC40nyHnviNb_oDrPiXGYienaz8Jfs7mB0dFgc7g8PNtg9QdzFdOqov8lW28u5fY4BVatfBKsFdnrbA-UP9RQw1w
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1fb9QwDLfGkBAvCMS_YwOMBOIBRUfT9B8SQohxbAwmhDbp3krSpNMhrt12PaF9KT4Anw47ae_ggb1N6lNaOW1tx3Zi_wzwVDpFYvOyFqmtcqGUjUSuNB841rnRKYXbvjfg54N090h9nCbTDfg91MJwWuWwJvqF2rYV75GPSfZoGZVZHo3rPi3iy87kzcmp4A5SfNI6tNMIIrLvzn9S-LZ4vbdDvH4m5eT94btd0XcYEIbseifiIrVakopFmhyFTEtVFZW1mSGrrgvG6kvoWyIXSW1NpA3jlUvryEch02ZIdonuFbiaxXHB6YTZNFvt7zByWuLbXtJMCf2FdDrU7MTp2My_OzMXTEbQQOS7JdHgv9bxPy6vN32Tm3Cj91nxbRCyW7Dhmtvw69An3KJedi17vc0xfvhKbMWK_XFOQPI8f4UaaYK2FnRVoUYS1xDj2NaoG2T5DVAWOPsLI1SwjbXYte0PJOcaF24-6-ejYdc3RMVQGHqOZ-E1Zg2u8amxh1u9A0eXwpq7sNm0jbsPqBmQKK9NUiW1SjQFcdaQV6uqmBtymXQEz-nXl73yLkofF8VpGZhUMsEyMGkELwb2lCcBDeTCp7cHDq6Jr-V4BE9Wt0mn-aBGN65dhmcYhicpRnAvcH41HcXDhcpl9OBi4o_hGqlH-WnvYH8Lrkuu2fAZiNuw2Z0t3UPypDrzyIsswrfL1pE_kWAuZQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Toward+automating+GRADE+classification%3A+a+proof-of-concept+evaluation+of+an+artificial+intelligence-based+tool+for+semiautomated+evidence+quality+rating+in+systematic+reviews&rft.jtitle=BMJ+evidence-based+medicine&rft.au=Oliveira+Dos+Santos%2C+Alisson&rft.au=Belo%2C+Vin%C3%ADcius+Silva&rft.au=Mota+Machado%2C+Tales&rft.au=Silva%2C+Eduardo+S%C3%A9rgio+da&rft.date=2025-04-07&rft.eissn=2515-4478&rft_id=info:doi/10.1136%2Fbmjebm-2024-113123&rft_id=info%3Apmid%2F40194821&rft.externalDocID=40194821
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2515-446X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2515-446X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2515-446X&client=summon