Objectives: Grading of Recommendations Assessment, Development, and Evaluation (GRADE) and Confidence in Network Meta-Analysis (CINeMA) are available to assess the confidence in network meta-analysis (NMA) results. They share common aspects, but their operationalization differs. We evaluated interrater reliability (IRR) among assessors, the approaches' concordance, and application time. Methods: Two dichotomous (“seizure response”, “ischemic stroke”) and two continuous (“change in attention deficit hyperactivity disorder (ADHD) symptoms”, “weight loss”) outcomes with networks of different sizes, structures and complexities were chosen from four NMAs. Thirteen assessors were randomly assigned to four groups to apply both GRADE and CINeMA on one continuous and one dichotomous outcome. We measured IRR and concordance using Gwet's AC on the overall evaluation and on each tool's domains. We calculated the time spent evaluating each network with each tool, including time to consult the guidance papers. Results: IRR ranged from 0.49 (“seizure response”) to 0.70 (“ischemic stroke”) with GRADE, and from 0.02 (“ischemic stroke”) to 0.73 (“change in ADHD symptoms”) with CINeMA. Overall concordance was 1, 0.90, 0.68, and 0.42 for “seizure response”, “ADHD symptoms”, “weight loss”, and “ischemic stroke”, respectively. The median time spent to assess each network ranged from 160 to 481 minutes with GRADE, and from 150 to 330 minutes with CINeMA. Conclusion: IRR was moderate to substantial with both approaches, except for CINeMA when applied to the largest and most complex network (“ischemic stroke”). Concordance was good for small networks with no or few indirect comparisons (“seizure response”) and decreased as the number of comparisons and indirect evidence increased. Application time was long with both approaches, particularly with GRADE for large networks.
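As an illustration of the agreement statistic named in the Methods, the sketch below computes an unweighted Gwet's AC1 for two raters. This is only an assumption about the form of the coefficient: the abstract says "Gwet's AC" without specifying AC1 or the weighted AC2, and it does not describe how ratings from more than two assessors were combined. The function name `gwet_ac1`, the category labels, and the example ratings are hypothetical and are not taken from the study.

```python
from collections import Counter


def gwet_ac1(ratings_a, ratings_b, categories):
    """Unweighted Gwet's AC1 for two raters (illustrative sketch only).

    ratings_a, ratings_b: equal-length sequences of category labels,
    one label per rated item (e.g. one per network comparison).
    categories: every label the rating scale allows.
    """
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("both raters must rate the same non-empty set of items")
    q = len(categories)
    if q < 2:
        raise ValueError("at least two categories are required")
    n = len(ratings_a)

    # Observed agreement: share of items given the same label by both raters.
    p_a = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Average prevalence of each category across the two raters.
    counts = Counter(ratings_a) + Counter(ratings_b)
    prevalence = [counts[k] / (2 * n) for k in categories]

    # Chance agreement as defined for AC1.
    p_e = sum(p * (1 - p) for p in prevalence) / (q - 1)

    return (p_a - p_e) / (1 - p_e)


# Hypothetical ratings on a four-level certainty scale (not study data).
levels = ["high", "moderate", "low", "very low"]
rater_1 = ["high", "moderate", "low", "low", "very low"]
rater_2 = ["high", "moderate", "moderate", "low", "very low"]
print(round(gwet_ac1(rater_1, rater_2, levels), 3))
```

Because certainty ratings are ordinal, a weighted variant that gives partial credit to adjacent levels may be closer to what a study of this kind would use; the unweighted form is shown here only because it is the simplest to follow.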

Grading of Recommendations Assessment, Development, and Evaluation and Confidence in Network Meta-Analysis showed moderate to substantial concordance in the evaluation of certainty of the evidence / Minozzi, S.; Cinquini, M.; Arienti, C.; Battain, P. C.; Brigadoi, G.; Del Vicario, M.; Di Domenico, G.; Farma, T.; Federico, S.; Innocenti, T.; Maria La Rosa, G. R.; Orlandi, E.; Piersanti, A.; Selvanetti, A.; Zanetta, L.; Del Giovane, C.. - In: JOURNAL OF CLINICAL EPIDEMIOLOGY. - ISSN 0895-4356. - 184:(2025), pp. 1-10. [10.1016/j.jclinepi.2025.111811]

Grading of Recommendations Assessment, Development, and Evaluation and Confidence in Network Meta-Analysis showed moderate to substantial concordance in the evaluation of certainty of the evidence

Cinquini M.; Del Giovane C.
2025


Use this identifier to cite or link to this document: https://hdl.handle.net/11380/1385190