Latest Posts

Example: Comprehensive Statistical Plan for a Interventional Study Involving 3 or more Groups

Research Inquiry

"Suppose my research study is to see the effect on increase in Hb (Hemoglobin) by different treatment protocols. I took 300 study participants and recorded their base Hb before starting treatment (100 No Treatment, 100 Oral Iron, 100 IV Iron). Now, after treatment completion, I have taken the Hb again in all 3 groups. How should I start my data analysis?"


Statistical Analysis Plan

Gemini: To analyze clinical data where you have a "Pre" and "Post" measurement across multiple groups, you should follow this structured "Pipeline of Evidence."

1. Data Preparation: Calculating the 'Delta' (Delta)

In this study, your primary interest is not just the final Hb level, but the improvement caused by the iron.

  • Calculate the Change: 

  • For every participant, create a new variable: Hb Increase= Post-Treatment Hb - Baseline Hb.

  • Check Normality: 

  • Use a Histogram to ensure the distribution of the increase follows a "Bell Curve." ANOVA assumes that the data within each group is normally distributed.


2. Baseline Comparability (The "Fair Start" Check)

Before proving your treatment worked, you must prove the groups were equal at the start.

  • Run a One-Way ANOVA on Baseline Hb: 

  • If the p-value is> 0.05, it confirms that your randomization was successful and no group had a "head start" with significantly higher Hb levels.

  • If p < 0.05: 

  • You may need to use ANCOVA (Analysis of Covariance) to adjust for these initial differences.

3. Primary Analysis: One-Way ANOVA

Since you are comparing the Mean Increase across three independent groups (Control vs. Oral vs. IV), a One-Way ANOVA is the gold standard.

  • Null Hypothesis (H_0): There is no difference in the mean Hb increase between the three protocols.

  • Alternative Hypothesis (H_1): At least one protocol results in a significantly different (likely higher) Hb increase than the others.

4. Post-Hoc Testing (Tukey HSD)

An ANOVA only tells you if a difference exists; it doesn't tell you where. If your ANOVA p-value is < 0.05$, you must run a Tukey HSD test to make pairwise comparisons:

  • IV Iron vs. Oral Iron: Is the expensive/invasive IV route significantly better than the pill?

  • Oral Iron vs. Control: Is the oral treatment significantly better than doing nothing?

5. Interpreting Effect Size (Eta-Squared)

Don't just look at p-values. Calculate Eta-squared. This tells you what percentage of the total increase in Hb can be directly attributed to the treatment protocol rather than individual patient variation.


Data Structure Checklist

To use the ANOVA tool effectively, your spreadsheet should look like this:

Participant IDGroupBaseline HbPost-Treatment HbHb Increase (The Test Variable)
001IV Iron9.212.53.3
002Oral Iron9.510.81.3
003Control9.49.50.1


FAQ

Question: 
When we enter data in protocol A protocol B and protocol C, what should we enter,
Actual Hb numbers after treatment?
or
Increment in Hb after treatment?

Answer:
In statistical analysis, the "correct" data to enter depends entirely on the specific research question you are asking. 
However, for a clinical study comparing treatment protocols, there is a clear gold standard.

The Recommended Approach: Enter the "Increment" (Change Score): 

Increment in Hb (Post-Treatment minus Baseline) into the ANOVA tool.


Why this is better:

  • Accounts for Individual Differences: Some participants might start with an Hb of 7.0 and others at 9.0. Looking only at the final number ignores the "distance traveled" by each patient.

  • Measures Effectiveness Directly: The "Increment" represents the actual effect of the Iron protocol. ANOVA on final numbers alone might be biased if one group accidentally started with higher Hb levels than another.

  • Standard Research Practice: In medical literature, this is often referred to as a "Change Score Analysis" or "Mean Change from Baseline".


When would you enter "Actual Hb Numbers"?

You would only enter the actual post-treatment numbers if:

  1. You have already proven that the baseline (starting) Hb was exactly the same for all three groups.

  2. You are only interested in the "final state" of the patient, regardless of where they started.


Summary Comparison

If you enter Actual HbIf you enter Hb Increment (Recommended)
Answers: "Which group has the highest Hb right now?"Answers: "Which treatment caused the most growth?"
Ignores the starting point of the patient.Corrects for the starting point of the patient.
Risks "Baseline Bias" (if one group started higher).Provides a cleaner measure of treatment "Potency."

The "Pro" Workflow for your Blog

  1. Step 1: Use the ANOVA tool on the Baseline Hb to ensure the groups started fairly (p > .05).

  2. Step 2: Use the ANOVA tool on the Increment (Post - Pre) to prove which treatment is the most effective.

  3. Step 3: If Step 2 is significant, use the Tukey HSD link to see if IV Iron grew Hb significantly more than Oral Iron.


Hope Now you understand that how to plan your data analysis for Interventional / Experimental Research study involving 3 or more than 3 groups with different treatment protocols?

Would you like to preform One-Way ANOVA on your data set? Click Here