ASL-4.10 Spearman’s rank correlation coefficient

Term / concept	Definition / short explanation
Spearman’s rank correlation (r_s)	Non-parametric measure of the strength and direction of a monotonic relationship between two variables based on ranks (range −1 to +1).
Rank difference (d)	For each pair, d = (rank of X) − (rank of Y). Used in the standard formula r_s = 1 − (6 Σ d²) / (n (n² − 1)) when no ties are present.
Ties	Equal values receive the average of the ranks they occupy. Ties require tie-adjusted methods (use technology for accuracy).

📌 1. What r_s measures and when to use it

Definition: r_s measures the degree to which two variables move together in a monotonic way using their ranks rather than raw values.
Use when: data are ordinal, relationship is monotonic (not necessarily linear), or when outliers / non-normality make Pearson’s r unreliable.
Interpretation: r_s ≈ +1 strong monotonic increase; ≈ −1 strong monotonic decrease; ≈ 0 no monotonic association.

🌍 Real-World Connection

r_s is common in survey analysis (Likert-scale responses), ecology (rank abundance), and other fields where the data naturally come as ranks or where robustness to extremes matters.

📌 2. Step-by-step computation (explicit)

Rank X values from 1 to n (smallest = 1). Average ranks for ties.
Rank Y values the same way.
Input the ranks for each value in the order of the normal values in your GDC table menu
Calculate the Linear Regression of the model and use the r value to show correlation.

📐 IA spotlight

For an IA choose ordinal or ranked data (e.g., customer preference ranks). Show hand-ranking for a subset, then use GDC for the full dataset and discuss ties and limitations.

Worked example (no ties)

Observations (n = 6):
X: 10, 20, 30, 40, 50, 60
Y: 8, 25, 22, 49, 53, 48

Ranks: Rx = 1,2,3,4,5,6. Ry = 1,3,2,5,6,4. Then d and d² computed and Σ d² = 8.

Substitute: r_s = 1 − (6 × 8) / (6 (6² − 1)) = 1 − 48 / 210 ≈ 0.771 → strong positive monotonic association.

📌 3. Interpreting results & practical checks

Magnitude: use context & sample size: |r_s| > 0.8 often strong, 0.5–0.8 moderate, 0.3–0.5 weak, below 0.3 negligible (guideline).
Direction: sign tells increase/decrease in ranks.
Statistical significance: compute p-value using technology and interpret in context — small n reduces power.
Ties & robustness: r_s is less sensitive to extreme values or outliers than Pearson’s r but many ties reduce discrimination and require tie-corrected methods.

Correlation Analysis definition, formula and step by step procedure

📝 Paper 1 Strategy

State method: explicitly say “Spearman’s rank correlation (r_s)” and justify why it is chosen (ordinal / monotonic / robust to outliers).
Show ranks & Σ d²: if doing by hand show ranking steps (including average ranks for ties) — method marks are awarded even if numeric slip occurs later.
Interpret in context: one clear sentence: “r_s ≈ 0.77 indicates a strong positive monotonic association; p = … (if given) shows whether association is statistically significant.”

📌 4. Limitations & when not to use r_s

Non-monotonic relationships: if relationship is curved (e.g., U-shape) r_s may be near zero despite a clear association — use scatterplots first.
Large number of ties: reduces effective variability — prefer other analyses or use technology with tie corrections.
Causation: r_s measures association only — it does not establish cause.

🔍 TOK Perspective

Consider how the choice of measure (Pearson vs Spearman) affects knowledge claims. What assumptions are hidden when we assert “strong correlation”?

🌐 EE Focus

An EE could compare rank-based correlations across countries (e.g., GDP rank vs life-satisfaction rank), discussing data quality, ties, and interpretation challenges.

📌 Quick checklist before submitting

Have you shown ranks (and average ranks for ties) or stated you used technology?
Did you compute Σ d² clearly and substitute into the formula (or state GDC was used)?
Did you include a contextual interpretation and mention statistical significance if asked?