GRADE guidelines: 13. Preparing Summary of Findings tables and evidence profiles—continuous outcomes

doi:10.1016/j.jclinepi.2012.08.001

Journal of Clinical Epidemiology

Volume 66, Issue 2, February 2013, Pages 173-183

Abstract

Presenting continuous outcomes in Summary of Findings tables poses particular challenges to interpretation. When each study uses the same outcome measure, and the units of that measure are intuitively interpretable (e.g., duration of hospitalization, duration of symptoms), presenting differences in means is usually desirable. When the natural units of the outcome measure are not easily interpretable, choosing a threshold to create a binary outcome and presenting relative and absolute effects become a more attractive alternative.

When studies use different measures of the same construct, calculating summary measures requires converting to the same units of measurement for each study. The longest-standing and most widely used approach is to divide the difference in means in each study by its standard deviation and present pooled results in standard deviation units (standardized mean difference). Disadvantages of this approach include vulnerability to varying degrees of heterogeneity in the underlying populations and difficulties in interpretation. Alternatives include presenting results in the units of the most popular or interpretable measure, converting to dichotomous measures and presenting relative and absolute effects, presenting the ratio of the means of intervention and control groups, and presenting the results in minimally important difference units. We outline the merits and limitations of each alternative and provide guidance for meta-analysts and guideline developers.
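As a rough illustration of three of the unit conversions named above, the following Python sketch computes, for a single study, the standardized mean difference, the ratio of means, and the difference expressed in minimally important difference (MID) units. It is a minimal sketch, not the authors' procedure: the function names and the example numbers are hypothetical, and the use of a pooled within-group standard deviation as the divisor is an assumption (the abstract says only that the difference is divided by the study's standard deviation).

```python
import math


def pooled_sd(sd_t, n_t, sd_c, n_c):
    """Pooled within-group standard deviation (assumed choice of divisor)."""
    return math.sqrt(
        ((n_t - 1) * sd_t**2 + (n_c - 1) * sd_c**2) / (n_t + n_c - 2)
    )


def standardized_mean_difference(mean_t, sd_t, n_t, mean_c, sd_c, n_c):
    """Difference in means expressed in standard deviation units."""
    return (mean_t - mean_c) / pooled_sd(sd_t, n_t, sd_c, n_c)


def ratio_of_means(mean_t, mean_c):
    """Mean in the intervention group divided by mean in the control group."""
    return mean_t / mean_c


def mid_units(mean_t, mean_c, mid):
    """Difference in means divided by the instrument's MID."""
    return (mean_t - mean_c) / mid


# Hypothetical study: intervention mean 12 (SD 4, n 50), control mean 10 (SD 4, n 50),
# on an instrument whose MID is assumed to be 0.5 points.
smd = standardized_mean_difference(12.0, 4.0, 50, 10.0, 4.0, 50)
rom = ratio_of_means(12.0, 10.0)
mids = mid_units(12.0, 10.0, 0.5)
print(f"SMD = {smd:.2f}, ratio of means = {rom:.2f}, MID units = {mids:.1f}")
```

Each quantity is unit-free, which is what allows results from different instruments to be combined; they differ in what the reader must know to interpret them (the population's variability, the control-group mean, or the instrument's MID).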

Introduction

Key points

• Summary of Findings tables provide succinct presentations of evidence quality and magnitude of effects.
• Summarizing the findings of continuous outcomes presents special challenges to interpretation that become daunting when individual trials use different measures for the same construct.
• The most commonly used approach to providing pooled estimates for different measures, presenting results in standard deviation units, has limitations related to both statistical properties and interpretability.
• Potentially preferable alternatives include presenting results in the natural units of the most popular measure, transforming into a binary outcome and presenting relative and absolute effects, presenting the ratio of the means of intervention and control groups, and presenting results in preestablished minimally important difference units.
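One way to see how a result in standard deviation units might be transformed into a binary outcome is sketched below. It assumes that the outcome is normally distributed with equal standard deviations in both groups, so that a shift of d standard deviations moves the proportion of patients beyond a fixed threshold from p to Φ(Φ⁻¹(p) + d). That distributional assumption, the function name, and the example values are illustrative and are not taken from the article.

```python
from statistics import NormalDist


def response_from_smd(smd, control_response):
    """Convert an SMD to response proportions under a normality assumption.

    control_response is the proportion of control patients beyond the
    threshold; a positive smd shifts the intervention group toward response.
    Returns (intervention response proportion, absolute difference).
    """
    std_normal = NormalDist()  # mean 0, standard deviation 1
    z_control = std_normal.inv_cdf(control_response)
    intervention_response = std_normal.cdf(z_control + smd)
    return intervention_response, intervention_response - control_response


# Hypothetical example: half of control patients respond, pooled SMD of 0.5.
p_intervention, abs_diff = response_from_smd(0.5, 0.5)
print(f"intervention response = {p_intervention:.3f}")
print(f"absolute difference   = {abs_diff:.3f}")
print(f"relative effect       = {p_intervention / 0.5:.2f}")
```

The absolute difference depends on the assumed control-group proportion as well as on the SMD, so the same pooled result yields different absolute effects for populations with different baseline response.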

The first 12 articles in this series introduced the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach to systematic reviews and guideline development [1], discussed the framing of the question [2], presented GRADE's concept of quality of evidence and how to apply it [3], [4], [5], [6], [7], [8], [9], presented GRADE's approach to resource use considerations [10], described how to make overall ratings of confidence [11], and discussed Summary of Findings (SoF) tables presenting the results of binary outcomes [12]. In this thirteenth article, we address issues specific to SoF tables that report results of continuous outcomes.

Our recommendations will differ according to whether

1. investigators have all used the same measure that is familiar to the target audiences
2. investigators have all used the same or very similar measures that are less familiar to the target audiences
3. investigators have used different measures

Options when investigators have all used the same measure that is familiar to the target audiences

In the simplest situation, authors of primary studies have all used the same measure of the continuous outcome of interest, and the target audiences will easily interpret that outcome. This is likely to be true, for instance, of durations of events, such as hospitalization or symptoms for conditions such as sore throat, otitis media, or influenza. For such outcomes, the SoF table should include a weighted difference of means.
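A weighted difference of means can be sketched as follows. The snippet above does not say how the weights are chosen, so this example assumes fixed-effect inverse-variance weighting, one common choice; the function name and the study data are hypothetical.

```python
import math


def pooled_mean_difference(studies):
    """Inverse-variance weighted mean difference with a 95% confidence interval.

    studies: iterable of (mean_t, sd_t, n_t, mean_c, sd_c, n_c) tuples,
    all measured in the same natural units (e.g., days in hospital).
    """
    weighted_sum = 0.0
    total_weight = 0.0
    for mean_t, sd_t, n_t, mean_c, sd_c, n_c in studies:
        diff = mean_t - mean_c
        variance = sd_t**2 / n_t + sd_c**2 / n_c  # variance of the difference
        weight = 1.0 / variance
        weighted_sum += weight * diff
        total_weight += weight
    pooled = weighted_sum / total_weight
    se = math.sqrt(1.0 / total_weight)
    return pooled, (pooled - 1.96 * se, pooled + 1.96 * se)


# Two hypothetical trials reporting duration of symptoms in days.
trials = [
    (5.0, 2.0, 100, 6.0, 2.0, 100),
    (4.0, 2.0, 50, 6.0, 2.0, 50),
]
estimate, (lower, upper) = pooled_mean_difference(trials)
print(f"pooled difference = {estimate:.2f} days (95% CI {lower:.2f} to {upper:.2f})")
```

Because the pooled estimate stays in the outcome's natural units, it can be reported in the table without further transformation.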

Table 1 presents examples of such outcomes from systematic reviews in

Options when investigators have all used the same or very similar measures that are less familiar to the target audiences

Transparency becomes more challenging when clinicians and patients are unfamiliar with the units of the outcome measure. For instance, Table 2 presents data derived from a systematic review addressing the impact of compression stockings for people taking long flights [16]. Outcomes include the presence of edema. Because each study used the same measurement tool for assessing edema, it is possible to make the pooled difference between the groups (the “weighted mean difference”) of 4.7 units more

Options when investigators have used different measures

Reviewers face further challenges when studies measure the same concept but use different measurement instruments. For instance, one set of trials may have measured depression using the Beck Depression Inventory-II [22], and another set may have used the Hamilton Rating Scale for Depression [23]. Under these circumstances, providing pooled estimates of effect and making results interpretable mandates use of one of five available approaches. Table 3 summarizes the merits of each approach and our

Reflections on the interpretation of the five methods

The prior discussion makes evident that there is no ideal method for making results of continuous variables interpretable, particularly when studies have used different measurement tools for the same construct (e.g., pain, physical function, emotional function). Given the sometimes questionable assumptions that each approach makes, it would be reassuring if the methods led to essentially the same inferences. This is true for the respiratory rehabilitation example: all approaches suggest a

Recommendations for enhancing interpretability in meta-analyses in which primary studies use different instruments to measure the same underlying construct

We have described five approaches to enhancing the interpretability of continuous variables in meta-analyses in which primary studies have used different instruments. Review authors will have to tailor their approach to the individual situation but may find the following guides helpful:

1. Using more than one presentation is likely to be both informative and, if the clinical message is similar, reassuring. It can also reduce the risk of biased selection of which presentation to use when the

Conclusion

Summarizing continuous variables in ways that are both valid and interpretable is challenging. To achieve these goals, systematic review authors and guideline developers should carefully consider the approaches we have suggested.

References (36)

G. Guyatt et al. GRADE guidelines: 1. Introduction-GRADE evidence profiles and summary of findings tables. J Clin Epidemiol (2011)
G.H. Guyatt et al. GRADE guidelines: 2. Framing the question and deciding on important outcomes. J Clin Epidemiol (2011)
H. Balshem et al. GRADE guidelines: 3. Rating the quality of evidence. J Clin Epidemiol (2011)
G.H. Guyatt et al. GRADE guidelines: 4. Rating the quality of evidence—study limitations (risk of bias). J Clin Epidemiol (2011)
G.H. Guyatt et al. GRADE guidelines: 5. Rating the quality of evidence—publication bias. J Clin Epidemiol (2011)
G.H. Guyatt et al. GRADE guidelines 6. Rating the quality of evidence—imprecision. J Clin Epidemiol (2011)
G.H. Guyatt et al. GRADE guidelines: 7. Rating the quality of evidence–inconsistency. J Clin Epidemiol (2011)
G.H. Guyatt et al. GRADE guidelines: 8. Rating the quality of evidence—indirectness. J Clin Epidemiol (2011)
G.H. Guyatt et al. GRADE guidelines: 9. Rating up the quality of evidence. J Clin Epidemiol (2011)
M. Brunetti et al. GRADE guidelines 10 - Considering resource use and rating the quality of economic evidence. J Clin Epidemiol (2013)
G. Guyatt et al. GRADE guidelines 11 - Making an overall rating of evidence for a single outcome and for all outcomes. J Clin Epidemiol (2013)
G.H. Guyatt et al. GRADE guidelines 12 - Preparing summary of findings tables (SOF) - binary outcomes. J Clin Epidemiol (2013)
G.H. Guyatt et al. Methods to explain the clinical significance of health status measures. Mayo Clin Proc (2002)
R. Jaeschke et al. Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials (1989)
S. Suissa. Binary methods for continuous outcomes: a parametric alternative. J Clin Epidemiol (1991)
R. Dworkin et al. Interpreting the clinical importance of treatment outcomes in chronic pain clinical trials: IMMPACT recommendations. J Pain (2008)
T. Furukawa. From effect size into number needed to treat. Lancet (1999)
G. Guyatt et al. How can quality of life researchers make their work more useful to health workers and their patients? Qual Life Res (2007)

The GRADE system has been developed by the GRADE Working Group. The named authors drafted and revised this article. A complete list of contributors to this series can be found on the Journal of Clinical Epidemiology Web site.
