Toward automating GRADE classification: a proof-of-concept evaluation of an artificial intelligence-based tool for semiautomated evidence quality rating in systematic reviews
BackgroundEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and...
Saved in:
| Published in | BMJ evidence-based medicine p. bmjebm-2024-113123 |
|---|---|
| Main Authors | , , , |
| Format | Journal Article |
| Language | English |
| Published |
England
BMJ Publishing Group Ltd
07.04.2025
BMJ Publishing Group LTD |
| Subjects | |
| Online Access | Get full text |
| ISSN | 2515-446X 2515-4478 2515-4478 |
| DOI | 10.1136/bmjebm-2024-113123 |
Cover
| Abstract | BackgroundEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.DesignAnalytical experimental study.ObjectiveThe objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.MethodsThe URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.ResultsThe open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen’s kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I2 values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.ConclusionThe results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future. |
|---|---|
| AbstractList | Evaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.
Analytical experimental study.
The objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.
The URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.
The open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen's kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I
values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.
The results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future. BackgroundEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.DesignAnalytical experimental study.ObjectiveThe objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.MethodsThe URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.ResultsThe open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen’s kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I2 values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.ConclusionThe results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future. Evaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.BACKGROUNDEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment, Development and Evaluation (GRADE) affords a consolidated approach for rating the level of evidence, its application is complex and time-consuming. Artificial intelligence (AI) can be used to overcome these barriers.Analytical experimental study.DESIGNAnalytical experimental study.The objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.OBJECTIVEThe objective is to develop and appraise a proof-of-concept AI-powered tool for the semiautomation of an adaptation of the GRADE classification system to determine levels of evidence in SRs with meta-analyses compiled from randomised clinical trials.The URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.METHODSThe URSE-automated system was based on an algorithm created to enhance the objectivity of the GRADE classification. It was developed using the Python language and the React library to create user-friendly interfaces. Evaluation of the URSE-automated system was performed by analysing 115 SRs from the Cochrane Library and comparing the predicted levels of evidence with those generated by human evaluators.The open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen's kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I2 values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.RESULTSThe open-source URSE code is available on GitHub (http://www.github.com/alisson-mfc/urse). The agreement between the URSE-automated GRADE system and human evaluators regarding the quality of evidence was 63.2% with a Cohen's kappa coefficient of 0.44. The metrics of the GRADE domains evaluated included accuracy and F1-scores, which were 0.97 and 0.94 for imprecision (number of participants), 0.73 and 0.7 for risk of bias, 0.9 and 0.9 for I2 values (heterogeneity) and 0.98 and 0.99 for quality of methodology (A Measurement Tool to Assess Systematic Reviews), respectively.The results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future.CONCLUSIONThe results demonstrate the potential use of AI in assessing the quality of evidence. However, in consideration of the emphasis of the GRADE approach on subjectivity and understanding the context of evidence production, full automation of the classification process is not opportune. Nevertheless, the combination of the URSE-automated system with human evaluation or the integration of this tool into other platforms represents interesting directions for the future. |
| Author | Mota Machado, Tales Silva, Eduardo Sérgio da Oliveira dos Santos, Alisson Belo, Vinícius Silva |
| Author_xml | – sequence: 1 givenname: Alisson orcidid: 0000-0002-4648-9951 surname: Oliveira dos Santos fullname: Oliveira dos Santos, Alisson organization: Campus Tres Lagoas, Universidade Federal de Mato Grosso do Sul, Três Lagoas, Brazil – sequence: 2 givenname: Vinícius Silva orcidid: 0000-0003-0183-1175 surname: Belo fullname: Belo, Vinícius Silva email: viniciusbelo@ufsj.edu.br organization: Universidade Federal de Sao Joao del-Rei—Campus Centro-Oeste Dona Lindu, Divinopolis, Brazil – sequence: 3 givenname: Tales orcidid: 0000-0003-0603-823X surname: Mota Machado fullname: Mota Machado, Tales organization: Universidade Federal de Ouro Preto, Ouro Preto, Brazil – sequence: 4 givenname: Eduardo Sérgio da orcidid: 0000-0001-7409-9216 surname: Silva fullname: Silva, Eduardo Sérgio da organization: Oswaldo Cruz Foundation, Rio de Janeiro, Brazil |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/40194821$$D View this record in MEDLINE/PubMed |
| BookMark | eNp9kc1q3TAQhUVJaX6aF-iiCLrpxqlH8o_cXUjStBAolBS6M2N5HHSxpRtJTrgv1WesHN-k0EVBIDH6juaMzjE7sM4SY-8gPwOQ1adu2lA3ZSIXRZYKIOQrdiRKKLOiqNXBy7n6dchOQ9jkeS4AmlKpN-ywyKEplIAj9vvWPaLvOc7RTRiNvePXP84vr7geMQQzGJ2Kzn7myLfeuSFLSzuraRs5PeA4P11zN3C0HH1cFAZHbmykcTR3lNCsw0A9j86NfHCeB5rMvl8q04PpF4rfzziauON-tWEsD7sQaXGluU8YPYa37PWAY6DT_X7Cfn65ur34mt18v_52cX6TdWnsmMmm6lHUlQRUsqlRFLrRfV93taixEYWsSyokEAjsO8AuF40SPQEs39k1lTxhH9d309D3M4XYTiboNBFacnNoJai6FJUqm4R--AfduNnb5O6JAiVqBYl6v6fmbqK-3Xozod-1z0kkQKyA9i4ET8MLAnm7JN6uibeLxXZNPInOVlG6-9v2P4I_LDqwIA |
| Cites_doi | 10.1136/bmj.39489.470347.AD 10.1136/bmj.f7383 10.2307/2529310 10.1016/j.jclinepi.2015.11.019 10.18653/v1/P17-4002 10.1371/journal.pone.0034697 10.48550/arXiv.1810.04805 10.1016/j.jclinepi.2015.11.018 10.1186/1471-2288-7-10 10.1002/jrsm.1230 10.1016/j.jclinepi.2015.08.013 10.1038/s41433-024-02958-w 10.1136/bmj.d5928 10.1016/j.jclinepi.2017.12.015 10.1136/bmj.b4012 10.1093/jamia/ocv044 10.4103/2249-4863.109934 10.1016/j.jbi.2023.104389 10.1186/2046-4053-3-82 10.1111/jnu.12628 10.1186/s12911-019-0814-z 10.1097/PRS.0b013e318219c171 10.1136/bmjopen-2016-012545 10.1136/amiajnl-2013-002411 10.5281/ZENODO.13916887 10.1016/j.jclinepi.2010.04.026 10.1002/14651858.CD001914.pub2 10.1016/j.jclinepi.2012.07.005 10.23919/MIPRO.2019.8757088 |
| ContentType | Journal Article |
| Copyright | Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group. 2025 Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group. |
| Copyright_xml | – notice: Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group. – notice: 2025 Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group. |
| DBID | AAYXX CITATION NPM 3V. 7X7 7XB 88E 8FI 8FJ 8FK ABUWG AFKRA BENPR BTHHO CCPQU FYUFA GHDGH K9. M0S M1P PHGZM PHGZT PJZUB PKEHL PPXIY PQEST PQQKQ PQUKI PRINS S0X 7X8 |
| DOI | 10.1136/bmjebm-2024-113123 |
| DatabaseName | CrossRef PubMed ProQuest Central (Corporate) Health & Medical Collection ProQuest Central (purchase pre-March 2016) Medical Database (Alumni Edition) Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest Central ProQuest Central BMJ Journals ProQuest One Community College Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Health & Medical Complete (Alumni) Health & Medical Collection (Alumni Edition) Medical Database ProQuest Central Premium ProQuest One Academic ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China SIRS Editorial MEDLINE - Academic |
| DatabaseTitle | CrossRef PubMed ProQuest One Academic Middle East (New) SIRS Editorial ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Central China ProQuest Central ProQuest Health & Medical Research Collection Health Research Premium Collection Health and Medicine Complete (Alumni Edition) Health & Medical Research Collection ProQuest Central (New) ProQuest Medical Library (Alumni) ProQuest One Academic Eastern Edition ProQuest Hospital Collection Health Research Premium Collection (Alumni) ProQuest Hospital Collection (Alumni) ProQuest Health & Medical Complete ProQuest Medical Library ProQuest One Academic UKI Edition BMJ Journals ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) MEDLINE - Academic |
| DatabaseTitleList | PubMed ProQuest One Academic Middle East (New) MEDLINE - Academic |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: BENPR name: ProQuest Central url: http://www.proquest.com/pqcentral?accountid=15518 sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| EISSN | 2515-4478 |
| ExternalDocumentID | 40194821 10_1136_bmjebm_2024_113123 ebmed |
| Genre | Journal Article |
| GrantInformation_xml | – fundername: Coordenação de Aperfeiçoamento de Pessoal de Nível Superior grantid: 001 funderid: http://dx.doi.org/10.13039/501100002322 – fundername: Universidade Federal de São João del-Rei funderid: http://dx.doi.org/10.13039/501100020987 – fundername: Universidade Federal de Mato Grosso do Sul funderid: http://dx.doi.org/10.13039/501100016182 – fundername: Conselho Nacional de Desenvolvimento Científico e Tecnológico grantid: 305665/2023-5; 311309/2023-2 funderid: http://dx.doi.org/10.13039/501100003593 |
| GroupedDBID | 53G AAHLL AAOJX ABTFR ACGFS ACHTP ADBBV ADUGQ AFWFF AJYBZ ALMA_UNASSIGNED_HOLDINGS BENPR BLJBA BTHHO CXRWF EBS EJD HAJ OVD RMJ TEORI UYXKK 7X7 AAYXX ACQHZ AERUA BPHCQ BVXVI CITATION PQQKQ PROAC S0X NPM 3V. 7XB 88E 8FI 8FJ 8FK ABUWG AFKRA CCPQU FYUFA HMCUK K9. M1P PHGZM PHGZT PJZUB PKEHL PPXIY PQEST PQUKI PRINS PSQYO UKHRP 7X8 |
| ID | FETCH-LOGICAL-b251t-396da27631a8397a24c9cdd7b727a924375e431e12adb1ab02982de112024b963 |
| IEDL.DBID | BENPR |
| ISSN | 2515-446X 2515-4478 |
| IngestDate | Fri Jul 11 18:47:05 EDT 2025 Tue Oct 07 07:07:23 EDT 2025 Mon Jul 21 05:59:36 EDT 2025 Wed Oct 01 06:34:12 EDT 2025 Tue Apr 08 14:21:35 EDT 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Keywords | Evidence-Based Practice Systematic Reviews as Topic |
| Language | English |
| License | Author(s) (or their employer(s)) 2025. No commercial re-use. See rights and permissions. Published by BMJ Group. |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-b251t-396da27631a8397a24c9cdd7b727a924375e431e12adb1ab02982de112024b963 |
| Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 content type line 14 ObjectType-Feature-3 ObjectType-Evidence Based Healthcare-1 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
| ORCID | 0000-0003-0183-1175 0000-0002-4648-9951 0000-0001-7409-9216 0000-0003-0603-823X |
| PMID | 40194821 |
| PQID | 3187182781 |
| PQPubID | 2041037 |
| ParticipantIDs | proquest_miscellaneous_3187526859 proquest_journals_3187182781 pubmed_primary_40194821 crossref_primary_10_1136_bmjebm_2024_113123 bmj_journals_10_1136_bmjebm_2024_113123 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2025-04-07 |
| PublicationDateYYYYMMDD | 2025-04-07 |
| PublicationDate_xml | – month: 04 year: 2025 text: 2025-04-07 day: 07 |
| PublicationDecade | 2020 |
| PublicationPlace | England |
| PublicationPlace_xml | – name: England – name: London |
| PublicationTitle | BMJ evidence-based medicine |
| PublicationTitleAbbrev | BMJ EBM |
| PublicationTitleAlternate | BMJ Evid Based Med |
| PublicationYear | 2025 |
| Publisher | BMJ Publishing Group Ltd BMJ Publishing Group LTD |
| Publisher_xml | – name: BMJ Publishing Group Ltd – name: BMJ Publishing Group LTD |
| References | Devlin, Chang, Lee (R13) 2019 Meader, King, Llewellyn (R6) 2014; 3 Soboczenski, Trikalinos, Kuiper (R25) 2019; 19 Murad, Mustafa, Morgan (R38) 2016; 74 Marshall, Kuiper, Banner (R11) 2017; 2017 Shea, Grimshaw, Wells (R10) 2007; 7 Hartling, Fernandes, Seida (R22) 2012; 7 Hartling, Ospina, Liang (R27) 2009; 339 Guyatt, Oxman, Akl (R33) 2011; 64 Burns, Rohrich, Chung (R4) 2011; 128 Hartling, Hamm, Milne (R26) 2013; 66 Hirt, Meichlinger, Schumacher (R24) 2021; 53 Smith (R15) 2013; 347 Bui, Zeng-Treitler (R29) 2014; 21 Higgins, Altman, Gøtzsche (R9) 2011; 343 Borenstein, Higgins, Hedges (R35) 2017; 8 Marshall, Kuiper, Wallace (R30) 2016; 23 Masalkhi, Ong, Waisberg (R39) 2024; 38 Francis, Smith, Saljuqi (R23) 2015; 2015 Santos, Machado (R20) 2024 Gates, Vandermeer, Hartling (R28) 2018; 96 Landis, Koch (R16) 1977; 33 Pollock, Farmer, Brady (R7) 2016; 70 Gionfriddo (R37) 2016; 74 Guyatt, Oxman, Vist (R5) 2008; 336 Santos, da Silva, Couto (R8) 2023; 142 Borah, Brown, Capers (R3) 2017; 7 Maratkar, Adkar (R31) 2021; 4 Gopalakrishnan, Ganeshkumar (R2) 2013; 2 2025040710200943000_bmjebm-2024-113123v1.18 2025040710200943000_bmjebm-2024-113123v1.5 2025040710200943000_bmjebm-2024-113123v1.19 2025040710200943000_bmjebm-2024-113123v1.4 2025040710200943000_bmjebm-2024-113123v1.38 2025040710200943000_bmjebm-2024-113123v1.17 2025040710200943000_bmjebm-2024-113123v1.39 Meader (2025040710200943000_bmjebm-2024-113123v1.6) 2014; 3 2025040710200943000_bmjebm-2024-113123v1.8 2025040710200943000_bmjebm-2024-113123v1.7 2025040710200943000_bmjebm-2024-113123v1.32 2025040710200943000_bmjebm-2024-113123v1.33 2025040710200943000_bmjebm-2024-113123v1.30 2025040710200943000_bmjebm-2024-113123v1.14 2025040710200943000_bmjebm-2024-113123v1.36 2025040710200943000_bmjebm-2024-113123v1.37 2025040710200943000_bmjebm-2024-113123v1.12 2025040710200943000_bmjebm-2024-113123v1.34 2025040710200943000_bmjebm-2024-113123v1.13 2025040710200943000_bmjebm-2024-113123v1.35 Higgins (2025040710200943000_bmjebm-2024-113123v1.9) 2011; 343 Soboczenski (2025040710200943000_bmjebm-2024-113123v1.25) 2019; 19 2025040710200943000_bmjebm-2024-113123v1.2 Borah (2025040710200943000_bmjebm-2024-113123v1.3) 2017; 7 2025040710200943000_bmjebm-2024-113123v1.1 Hartling (2025040710200943000_bmjebm-2024-113123v1.27) 2009; 339 Smith (2025040710200943000_bmjebm-2024-113123v1.15) 2013; 347 2025040710200943000_bmjebm-2024-113123v1.29 Landis (2025040710200943000_bmjebm-2024-113123v1.16) 1977; 33 2025040710200943000_bmjebm-2024-113123v1.28 2025040710200943000_bmjebm-2024-113123v1.21 2025040710200943000_bmjebm-2024-113123v1.20 2025040710200943000_bmjebm-2024-113123v1.26 2025040710200943000_bmjebm-2024-113123v1.24 Hartling (2025040710200943000_bmjebm-2024-113123v1.22) 2012; 7 Maratkar (2025040710200943000_bmjebm-2024-113123v1.31) 2021; 4 Francis (2025040710200943000_bmjebm-2024-113123v1.23) 2015; 2015 Shea (2025040710200943000_bmjebm-2024-113123v1.10) 2007; 7 Marshall (2025040710200943000_bmjebm-2024-113123v1.11) 2017; 2017 |
| References_xml | – volume: 336 start-page: 924 year: 2008 ident: R5 article-title: GRADE: an emerging consensus on rating quality of evidence and strength of recommendations publication-title: BMJ doi: 10.1136/bmj.39489.470347.AD – volume: 347 year: 2013 ident: R15 article-title: The Cochrane collaboration at 20 publication-title: BMJ doi: 10.1136/bmj.f7383 – volume: 33 year: 1977 ident: R16 article-title: The Measurement of Observer Agreement for Categorical Data publication-title: Biometrics doi: 10.2307/2529310 – volume: 74 start-page: 237 year: 2016 ident: R37 article-title: Subjectivity is a strength: a comment on “an algorithm was developed to assign GRADE levels of evidence to comparisons within systematic reviews” publication-title: J Clin Epidemiol doi: 10.1016/j.jclinepi.2015.11.019 – volume: 2017 start-page: 7 year: 2017 ident: R11 article-title: Automating Biomedical Evidence Synthesis: RobotReviewer publication-title: Proc Conf Assoc Comput Linguist Meet doi: 10.18653/v1/P17-4002 – volume: 7 year: 2012 ident: R22 article-title: From the trenches: a cross-sectional study applying the GRADE tool in systematic reviews of healthcare interventions publication-title: PLoS ONE doi: 10.1371/journal.pone.0034697 – year: 2019 ident: R13 article-title: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding publication-title: arXiv doi: 10.48550/arXiv.1810.04805 – volume: 74 start-page: 237 year: 2016 ident: R38 article-title: Rating the quality of evidence is by necessity a matter of judgment publication-title: J Clin Epidemiol doi: 10.1016/j.jclinepi.2015.11.018 – volume: 7 year: 2007 ident: R10 article-title: Development of AMSTAR: a measurement tool to assess the methodological quality of systematic reviews publication-title: BMC Med Res Methodol doi: 10.1186/1471-2288-7-10 – volume: 8 start-page: 5 year: 2017 ident: R35 article-title: Basics of meta-analysis: I2 is not an absolute measure of heterogeneity publication-title: Res Synth Methods doi: 10.1002/jrsm.1230 – volume: 70 start-page: 106 year: 2016 ident: R7 article-title: An algorithm was developed to assign GRADE levels of evidence to comparisons within systematic reviews publication-title: J Clin Epidemiol doi: 10.1016/j.jclinepi.2015.08.013 – volume: 38 start-page: 1412 year: 2024 ident: R39 article-title: Google DeepMind’s gemini AI versus ChatGPT: a comparative analysis in ophthalmology publication-title: Eye (Lond) doi: 10.1038/s41433-024-02958-w – volume: 343 year: 2011 ident: R9 article-title: The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials publication-title: BMJ doi: 10.1136/bmj.d5928 – volume: 96 start-page: 54 year: 2018 ident: R28 article-title: Technology-assisted risk of bias assessment in systematic reviews: a prospective cross-sectional evaluation of the RobotReviewer machine learning tool publication-title: J Clin Epidemiol doi: 10.1016/j.jclinepi.2017.12.015 – volume: 339 year: 2009 ident: R27 article-title: Risk of bias versus quality assessment of randomised controlled trials: cross sectional study publication-title: BMJ doi: 10.1136/bmj.b4012 – volume: 23 start-page: 193 year: 2016 ident: R30 article-title: RobotReviewer: evaluation of a system for automatically assessing bias in clinical trials publication-title: J Am Med Inform Assoc doi: 10.1093/jamia/ocv044 – volume: 2 start-page: 9 year: 2013 ident: R2 article-title: Systematic Reviews and Meta-analysis: Understanding the Best Evidence in Primary Healthcare publication-title: J Family Med Prim Care doi: 10.4103/2249-4863.109934 – volume: 142 start-page: 104389 year: 2023 ident: R8 article-title: The use of artificial intelligence for automating or semi-automating biomedical literature analyses: A scoping review publication-title: J Biomed Inform doi: 10.1016/j.jbi.2023.104389 – volume: 3 year: 2014 ident: R6 article-title: A checklist designed to aid consistency and reproducibility of GRADE assessments: development and pilot validation publication-title: Syst Rev doi: 10.1186/2046-4053-3-82 – volume: 53 start-page: 246 year: 2021 ident: R24 article-title: Agreement in Risk of Bias Assessment Between RobotReviewer and Human Reviewers: An Evaluation Study on Randomised Controlled Trials in Nursing-Related Cochrane Reviews publication-title: J Nurs Scholarsh doi: 10.1111/jnu.12628 – volume: 19 year: 2019 ident: R25 article-title: Machine learning to help researchers evaluate biases in clinical trials: a prospective, randomized user study publication-title: BMC Med Inform Decis Mak doi: 10.1186/s12911-019-0814-z – volume: 128 start-page: 305 year: 2011 ident: R4 article-title: The levels of evidence and their role in evidence-based medicine publication-title: Plast Reconstr Surg doi: 10.1097/PRS.0b013e318219c171 – volume: 7 year: 2017 ident: R3 article-title: Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry publication-title: BMJ Open doi: 10.1136/bmjopen-2016-012545 – volume: 21 start-page: 850 year: 2014 ident: R29 article-title: Learning regular expressions for clinical text classification publication-title: J Am Med Inform Assoc doi: 10.1136/amiajnl-2013-002411 – year: 2024 ident: R20 article-title: URSE-Automated GRADE System publication-title: Zenodo doi: 10.5281/ZENODO.13916887 – volume: 64 start-page: 383 year: 2011 ident: R33 article-title: GRADE guidelines: 1. Introduction-GRADE evidence profiles and summary of findings tables publication-title: J Clin Epidemiol doi: 10.1016/j.jclinepi.2010.04.026 – volume: 2015 year: 2015 ident: R23 article-title: Oral protein calorie supplementation for children with chronic disease publication-title: Cochrane Database Syst Rev doi: 10.1002/14651858.CD001914.pub2 – volume: 4 start-page: 99 year: 2021 ident: R31 article-title: An emerging frontend JavaScript library publication-title: IRE Journals – volume: 66 start-page: 973 year: 2013 ident: R26 article-title: Testing the risk of bias tool showed low reliability between individual reviewers and across consensus assessments of reviewer pairs publication-title: J Clin Epidemiol doi: 10.1016/j.jclinepi.2012.07.005 – ident: 2025040710200943000_bmjebm-2024-113123v1.32 – volume: 19 year: 2019 ident: 2025040710200943000_bmjebm-2024-113123v1.25 article-title: Machine learning to help researchers evaluate biases in clinical trials: a prospective, randomized user study publication-title: BMC Med Inform Decis Mak doi: 10.1186/s12911-019-0814-z – volume: 3 year: 2014 ident: 2025040710200943000_bmjebm-2024-113123v1.6 article-title: A checklist designed to aid consistency and reproducibility of GRADE assessments: development and pilot validation publication-title: Syst Rev doi: 10.1186/2046-4053-3-82 – ident: 2025040710200943000_bmjebm-2024-113123v1.30 doi: 10.1093/jamia/ocv044 – ident: 2025040710200943000_bmjebm-2024-113123v1.34 – ident: 2025040710200943000_bmjebm-2024-113123v1.13 – ident: 2025040710200943000_bmjebm-2024-113123v1.33 doi: 10.1016/j.jclinepi.2010.04.026 – ident: 2025040710200943000_bmjebm-2024-113123v1.38 doi: 10.1016/j.jclinepi.2015.11.018 – volume: 7 year: 2012 ident: 2025040710200943000_bmjebm-2024-113123v1.22 article-title: From the trenches: a cross-sectional study applying the GRADE tool in systematic reviews of healthcare interventions publication-title: PLoS ONE doi: 10.1371/journal.pone.0034697 – ident: 2025040710200943000_bmjebm-2024-113123v1.4 doi: 10.1097/PRS.0b013e318219c171 – ident: 2025040710200943000_bmjebm-2024-113123v1.39 doi: 10.1038/s41433-024-02958-w – ident: 2025040710200943000_bmjebm-2024-113123v1.20 – ident: 2025040710200943000_bmjebm-2024-113123v1.26 doi: 10.1016/j.jclinepi.2012.07.005 – ident: 2025040710200943000_bmjebm-2024-113123v1.17 – ident: 2025040710200943000_bmjebm-2024-113123v1.19 doi: 10.23919/MIPRO.2019.8757088 – ident: 2025040710200943000_bmjebm-2024-113123v1.5 doi: 10.1136/bmj.39489.470347.AD – ident: 2025040710200943000_bmjebm-2024-113123v1.36 – ident: 2025040710200943000_bmjebm-2024-113123v1.2 doi: 10.4103/2249-4863.109934 – ident: 2025040710200943000_bmjebm-2024-113123v1.8 doi: 10.1016/j.jbi.2023.104389 – volume: 339 year: 2009 ident: 2025040710200943000_bmjebm-2024-113123v1.27 article-title: Risk of bias versus quality assessment of randomised controlled trials: cross sectional study publication-title: BMJ doi: 10.1136/bmj.b4012 – volume: 4 start-page: 99 year: 2021 ident: 2025040710200943000_bmjebm-2024-113123v1.31 article-title: An emerging frontend JavaScript library publication-title: IRE Journals – volume: 343 year: 2011 ident: 2025040710200943000_bmjebm-2024-113123v1.9 article-title: The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials publication-title: BMJ doi: 10.1136/bmj.d5928 – volume: 33 year: 1977 ident: 2025040710200943000_bmjebm-2024-113123v1.16 article-title: The Measurement of Observer Agreement for Categorical Data publication-title: Biometrics doi: 10.2307/2529310 – ident: 2025040710200943000_bmjebm-2024-113123v1.29 doi: 10.1136/amiajnl-2013-002411 – ident: 2025040710200943000_bmjebm-2024-113123v1.35 doi: 10.1002/jrsm.1230 – ident: 2025040710200943000_bmjebm-2024-113123v1.12 – ident: 2025040710200943000_bmjebm-2024-113123v1.28 doi: 10.1016/j.jclinepi.2017.12.015 – volume: 7 year: 2017 ident: 2025040710200943000_bmjebm-2024-113123v1.3 article-title: Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry publication-title: BMJ Open doi: 10.1136/bmjopen-2016-012545 – ident: 2025040710200943000_bmjebm-2024-113123v1.37 doi: 10.1016/j.jclinepi.2015.11.018 – volume: 2015 year: 2015 ident: 2025040710200943000_bmjebm-2024-113123v1.23 article-title: Oral protein calorie supplementation for children with chronic disease publication-title: Cochrane Database Syst Rev – ident: 2025040710200943000_bmjebm-2024-113123v1.1 – ident: 2025040710200943000_bmjebm-2024-113123v1.24 doi: 10.1111/jnu.12628 – ident: 2025040710200943000_bmjebm-2024-113123v1.18 – volume: 7 year: 2007 ident: 2025040710200943000_bmjebm-2024-113123v1.10 article-title: Development of AMSTAR: a measurement tool to assess the methodological quality of systematic reviews publication-title: BMC Med Res Methodol doi: 10.1186/1471-2288-7-10 – volume: 2017 start-page: 7 year: 2017 ident: 2025040710200943000_bmjebm-2024-113123v1.11 article-title: Automating Biomedical Evidence Synthesis: RobotReviewer publication-title: Proc Conf Assoc Comput Linguist Meet – ident: 2025040710200943000_bmjebm-2024-113123v1.21 – ident: 2025040710200943000_bmjebm-2024-113123v1.14 – volume: 347 year: 2013 ident: 2025040710200943000_bmjebm-2024-113123v1.15 article-title: The Cochrane collaboration at 20 publication-title: BMJ doi: 10.1136/bmj.f7383 – ident: 2025040710200943000_bmjebm-2024-113123v1.7 doi: 10.1016/j.jclinepi.2015.08.013 |
| SSID | ssj0002119588 |
| Score | 2.29032 |
| Snippet | BackgroundEvaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations... Evaluation of the quality of evidence in systematic reviews (SRs) is essential for assertive decision-making. Although Grading of Recommendations Assessment,... |
| SourceID | proquest pubmed crossref bmj |
| SourceType | Aggregation Database Index Database Publisher |
| StartPage | bmjebm-2024-113123 |
| SubjectTerms | Adaptation Artificial intelligence Automation Bias Classification Clinical trials Decision making Evidence-Based Practice Original research Systematic Reviews as Topic |
| Title | Toward automating GRADE classification: a proof-of-concept evaluation of an artificial intelligence-based tool for semiautomated evidence quality rating in systematic reviews |
| URI | https://ebm.bmj.com/content/early/2025/04/07/bmjebm-2024-113123.full https://www.ncbi.nlm.nih.gov/pubmed/40194821 https://www.proquest.com/docview/3187182781 https://www.proquest.com/docview/3187526859 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVPQU databaseName: Health & Medical Collection customDbUrl: eissn: 2515-4478 dateEnd: 20250502 omitProxy: true ssIdentifier: ssj0002119588 issn: 2515-446X databaseCode: 7X7 dateStart: 20180101 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: http://www.proquest.com/pqcentral?accountid=15518 eissn: 2515-4478 dateEnd: 20250502 omitProxy: true ssIdentifier: ssj0002119588 issn: 2515-446X databaseCode: BENPR dateStart: 19951201 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1dSxwxFL3o-uJLUWzrtiq3UPBBgk4281UQsaKVgouIwr4NySRTVrozVmcf_FP9jd6bzOzWhwrzlAlJuPcmOfk6B-CrdIrC5qgSiS0zoZSNRKY0HzhWmdEJLbe9NuDVOLm8Uz8n8WQFxv1bGL5W2Y-JfqC2Tcl75IcUezSMyjSLTh7-CFaN4tPVXkJDd9IK9thTjK3CmmRmrAGsfT8fX98sdl2Yzyz2YpQ0r8fUtmTSv6QZJYdmdu_MjAJHKkEJkdcwosTXc9Z_gKifkC424F2HJPE0uH4TVly9BX9v_TVY1PO2YSxa_8IfN2RsLBkl87Ug74lvqJEqaCpBXxleLuKS-BubCnWNHFWBYAKn_zB3Cp75LLZN8xsJ8uKTm027-ijZdTKlGJ5rPuNjaMa0xiVrNHYkqO_h7uL89uxSdJIMwpDBWjHKE6sljUmRJmSVaqnKvLQ2NQSDdM7khjE5P3KR1NZE2jDBu7SOQB3Z1FBn_wCDuqndNqBmmqCsMnEZVyrWtLSyhrCmKkcsk2WSIeyT6YuuSz0VfrUySorgpIILLIKThnDQu6d4CBwdb-be6T24LHwZXUP4svhNPY2PT3TtmnnIw-Q4cT6Ej8Hzi-polZqrTEaf3i78M6xL1g_mmz_pDgzax7nbJVDTmj1YTSfpXhevL7IK9ps |
| linkProvider | ProQuest |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3NbtQwELZKe4ALKuJvaYFBAnFAVonjZBOkquKnZUvbFaq20t5SO3bQom7SdrNCfSkeoc_GjO3swoHeKuXkROMkM_bM2J7vY-y1sBLN5n3FU1NmXEoT8Uwq2nCsMq1STLcdN-DRMB2cyG_jZLzCrrtaGDpW2c2JbqI2TUlr5FtoeziNin4W7ZxfcGKNot3VjkJDBWoFs-0gxkJhx4G9-oUp3Gx7_wvq-40Qe7ujzwMeWAa4Rt_e8jhPjRI4zCKFwUJfCVnmpTF9jZ5d5YTXl-D3RDYSyuhIacIsF8ZinILuTaP9otw7bE3GMsfkb-3T7vD78WKVh_DTEkd-iX0l-C_ScVe5E6dbevrT6iknQRwbIseZhI3_-sj_BL7OAe6ts_shcoWP3tQesBVbP2S_R-7YLah521DsW_-Ar8eoXCgpKqdjSE7zH0ABdtBUHK_SV0rCEmgcmgpUDWTFHtACJn8hhXLytAbapjkDDLFhZqeT0B8220CLCr489Aou_WtMaliiVEMAXX3ETm5FOY_Zat3U9ikDRbBEWaWTMqlkojCVMxpjW1nGRMul0x57i7--CEN4VrjsKE4Lr6SCBBZeST32rlNPce4xQW58erPT4FL40pp77NXiNo5s2q5RtW3m_hkC40nyHnviNb_oDrPiXGYienaz8Jfs7mB0dFgc7g8PNtg9QdzFdOqov8lW28u5fY4BVatfBKsFdnrbA-UP9RQw1w |
| linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1fb9QwDLfGkBAvCMS_YwOMBOIBRUfT9B8SQohxbAwmhDbp3krSpNMhrt12PaF9KT4Anw47ae_ggb1N6lNaOW1tx3Zi_wzwVDpFYvOyFqmtcqGUjUSuNB841rnRKYXbvjfg54N090h9nCbTDfg91MJwWuWwJvqF2rYV75GPSfZoGZVZHo3rPi3iy87kzcmp4A5SfNI6tNMIIrLvzn9S-LZ4vbdDvH4m5eT94btd0XcYEIbseifiIrVakopFmhyFTEtVFZW1mSGrrgvG6kvoWyIXSW1NpA3jlUvryEch02ZIdonuFbiaxXHB6YTZNFvt7zByWuLbXtJMCf2FdDrU7MTp2My_OzMXTEbQQOS7JdHgv9bxPy6vN32Tm3Cj91nxbRCyW7Dhmtvw69An3KJedi17vc0xfvhKbMWK_XFOQPI8f4UaaYK2FnRVoUYS1xDj2NaoG2T5DVAWOPsLI1SwjbXYte0PJOcaF24-6-ejYdc3RMVQGHqOZ-E1Zg2u8amxh1u9A0eXwpq7sNm0jbsPqBmQKK9NUiW1SjQFcdaQV6uqmBtymXQEz-nXl73yLkofF8VpGZhUMsEyMGkELwb2lCcBDeTCp7cHDq6Jr-V4BE9Wt0mn-aBGN65dhmcYhicpRnAvcH41HcXDhcpl9OBi4o_hGqlH-WnvYH8Lrkuu2fAZiNuw2Z0t3UPypDrzyIsswrfL1pE_kWAuZQ |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Toward+automating+GRADE+classification%3A+a+proof-of-concept+evaluation+of+an+artificial+intelligence-based+tool+for+semiautomated+evidence+quality+rating+in+systematic+reviews&rft.jtitle=BMJ+evidence-based+medicine&rft.au=Oliveira+Dos+Santos%2C+Alisson&rft.au=Belo%2C+Vin%C3%ADcius+Silva&rft.au=Mota+Machado%2C+Tales&rft.au=Silva%2C+Eduardo+S%C3%A9rgio+da&rft.date=2025-04-07&rft.eissn=2515-4478&rft_id=info:doi/10.1136%2Fbmjebm-2024-113123&rft_id=info%3Apmid%2F40194821&rft.externalDocID=40194821 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2515-446X&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2515-446X&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2515-446X&client=summon |