RCTWikiExerciseHigh evidence score

Randomized controlled trial of yoga and exercise in multiple sclerosis

Read full paper →
Authors
B. S. Oken, Shirley S. Kishiyama, Daniel P. Zajdel, DN Bourdette, Jane Carlsen, Mitchell Haas, C. Hugos, Dale F. Kraemer, Jean M. Lawrence, Michele Mass
Journal
Neurology
Year
2004
Citations
589

TL;DR

This study found that both a 6-month program of Iyengar yoga and a 6-month program of stationary bicycle exercise significantly reduced fatigue in people with multiple sclerosis compared to a control group, suggesting these interventions are worth testing for fatigue management.

What they tested

This study investigated the effects of two different interventions on individuals with multiple sclerosis (MS):

1. **Iyengar Yoga:** Participants attended a weekly Iyengar yoga class and were instructed to practice at home. Iyengar yoga is a form of Hatha yoga that emphasizes precision, alignment, and the use of props to help students achieve poses safely and effectively.

2. **Aerobic Exercise:** Participants attended a weekly exercise class using a stationary bicycle and were instructed to exercise at home. This intervention focused on cardiovascular fitness.

These two active interventions were compared against a **waiting-list control group**. Participants in the control group received no active intervention during the 6-month study period but were offered the interventions afterward.

The researchers measured several outcomes to assess the impact of these interventions:

**Cognitive Function:** Specifically focusing on attention.

**Physiologic Measures of Alertness:** Objective measures related to wakefulness.

**Fatigue:** Assessed using two different self-report questionnaires.

**Mood:** Assessed using two different self-report questionnaires.

**Quality of Life:** Assessed using a comprehensive health-related quality of life questionnaire.

Who was studied

The study recruited **69 subjects** in total.

All participants had a diagnosis of **clinically definite Multiple Sclerosis (MS)**. This means their diagnosis was confirmed based on established neurological criteria, indicating a clear and consistent pattern of MS symptoms.

Their disability level was restricted to an **Expanded Disability Status Score (EDSS) less than or equal to 6.0**. The EDSS is a scale used to quantify disability in MS, ranging from 0 (normal neurological exam) to 10 (death due to MS). An EDSS of 6.0 indicates that the person requires a cane, crutch, or brace to walk about 100 meters with or without resting. This specific range means the study focused on individuals with mild to moderate disability, who were still ambulatory, rather than those with very severe or very mild MS.

The study was conducted in a research setting, likely an outpatient clinic or university research center, given the nature of the interventions and assessments. Of the 69 subjects recruited and randomized, **12 subjects did not complete the full 6-month intervention**, indicating a dropout rate of approximately 17.4%.

How they measured it

The study used a battery of standardized instruments and measures to assess the various outcomes at baseline and after the 6-month intervention period:

**Cognitive Measures:** A battery of tests specifically designed to assess **attention**. The abstract does not specify the exact tests used (e.g., Symbol Digit Modalities Test, Paced Auditory Serial Addition Test), but indicates a focus on this cognitive domain.

**Physiologic Measures of Alertness:** These are objective measures, likely involving electrophysiological recordings (e.g., EEG, evoked potentials) or psychomotor vigilance tasks, designed to quantify a person's level of wakefulness and responsiveness. The specific instruments are not detailed in the abstract.

**Profile of Mood States (POMS):** A widely used psychological questionnaire that assesses transient, fluctuating mood states. It typically measures dimensions such as tension, depression, anger, vigor, fatigue, and confusion.

**State-Trait Anxiety Inventory (STAI):** A self-report questionnaire that distinguishes between "state anxiety" (a temporary condition of anxiety in response to a specific situation) and "trait anxiety" (a more stable, general predisposition to be anxious).

**Multi-Dimensional Fatigue Inventory (MFI):** A comprehensive self-report questionnaire designed to assess different dimensions of fatigue. It typically includes subscales for general fatigue, physical fatigue, mental fatigue, reduced activity, and reduced motivation. The abstract specifically mentions "general fatigue" from the MFI as an outcome.

**Short Form (SF)-36 Health-Related Quality of Life:** A generic, multi-purpose health survey with 36 questions. It measures eight health domains: physical functioning, role limitations due to physical health, bodily pain, general health, vitality (energy/fatigue), social functioning, role limitations due to emotional problems, and mental health. The abstract specifically highlights "Energy and Fatigue (Vitality)" on the SF-36 as an outcome.

These instruments were chosen because they are validated and commonly used in research to assess the subjective and objective experiences of individuals with chronic conditions like MS. Using multiple measures for concepts like fatigue (MFI and SF-36 Vitality) helps to provide a more robust and comprehensive picture of the effect.

Methodology

This study employed a **Randomized Controlled Trial (RCT)** design, which is considered the gold standard for evaluating the effectiveness of interventions. In an RCT, participants are randomly assigned to different groups, ensuring that, on average, the groups are similar in all characteristics at the start of the study, except for the intervention they receive. This minimizes bias and allows researchers to attribute any observed differences in outcomes directly to the intervention.

**Randomization:** Subjects were "randomly assigned to one of three groups." This means each participant had an equal chance of being placed into the Iyengar yoga group, the stationary bicycle exercise group, or the waiting-list control group. The purpose of randomization is to create groups that are comparable in terms of known and unknown confounding factors (e.g., age, disease severity, lifestyle habits, genetic predispositions) at baseline. If the groups are balanced, any differences observed at the end of the study are more likely due to the intervention itself rather than pre-existing differences between the groups.

**Blinding:** The abstract does not mention blinding. Given the nature of the interventions (yoga classes, exercise classes), it would have been impossible to blind the participants or the instructors to their assigned group. Participants knew whether they were doing yoga, exercise, or nothing. This lack of participant blinding introduces a potential for **placebo effects** or **expectancy bias**, where participants' knowledge of receiving an active intervention (or being in the control group) could influence their self-reported outcomes. While it might have been possible to blind the outcome assessors (the individuals administering the cognitive tests and questionnaires) to the participants' group assignments, the abstract does not specify if this was done. Without assessor blinding, there's a risk of **detection bias**, where assessors' knowledge of group assignments could subtly influence how they administer tests or interpret responses.

**Duration:** The intervention period lasted **6 months**. This is a reasonably long duration for behavioral interventions, allowing sufficient time for potential physiological and psychological adaptations to occur and for participants to integrate the practices into their routines. Assessments were performed at baseline (before the intervention started) and at the end of the 6-month period.

**Statistical Approach:** The abstract states that "Both active interventions produced improvement in secondary measures of fatigue compared to the control group" and that there was "significant improvement." While specific statistical tests (e.g., ANOVA, t-tests, ANCOVA) and p-values are not provided in the abstract, the use of the term "significant" implies that statistical analyses were conducted to compare the changes in outcome measures between the groups, accounting for baseline differences and variability.

**What this design can and cannot prove:**

**Can Prove:** As an RCT, this study design has the potential to establish a **causal relationship** between the interventions (yoga or exercise) and the observed changes in outcomes (fatigue). If the randomization was successful and the groups were balanced at baseline, and if the observed effects are statistically significant, then it's reasonable to conclude that the interventions *caused* the improvements in fatigue.

**Cannot Prove:**

* **Mechanism of Action:** While it can show *what* happened, the study design itself doesn't explain *how* yoga or exercise reduced fatigue (e.g., through improved physical fitness, stress reduction, better sleep, social interaction).

* **Generalizability to all MS patients:** The study focused on individuals with clinically definite MS and an EDSS <= 6.0. The findings may not directly apply to individuals with very early-stage MS, very advanced MS, or other neurological conditions.

* **Optimal Dose/Type:** The study tested specific forms (Iyengar yoga, stationary cycling) and frequencies (weekly class + home practice). It cannot determine if other types of yoga or exercise, or different frequencies/intensities, would be more or less effective.

* **Long-term Effects:** A 6-month study cannot definitively prove the sustained, long-term benefits or risks beyond that period.

**Major Methodological Weaknesses:**

1. **Lack of Blinding:** As discussed, the inability to blind participants and the likely lack of assessor blinding introduces a significant risk of bias, particularly for subjective self-reported outcomes like fatigue, mood, and quality of life. Participants in the active intervention groups might report greater improvements simply because they expect to feel better, or control group participants might report less improvement due to disappointment.

2. **High Dropout Rate:** With 12 out of 69 subjects (17.4%) not completing the intervention, there's a risk of **attrition bias**. If participants who dropped out were systematically different from those who completed the study (e.g., those who dropped out found the interventions ineffective or too difficult, or those in the control group became frustrated), the results could be skewed. The abstract doesn't provide details on why participants dropped out or which groups they belonged to.

3. **Reliance on Self-Report:** While validated, many key outcomes (fatigue, mood, quality of life) are subjective and rely entirely on participants' self-reporting. This makes them more susceptible to placebo effects and reporting biases compared to objective measures.

4. **Limited Detail in Abstract:** The abstract lacks specific statistical details (e.g., p-values, effect sizes, confidence intervals, mean changes) for the observed improvements. This

Test it on yourself

Run a structured exercise experiment

The research gives you a prior. Your own data tells you what actually works for you.

Randomized controlled trial of yoga and exercise in multiple sclerosis | Steady Practice | SteadyPractice