Smaller Classes, Fewer Teachers

A Framework for Quantifying the Cost of Teacher Misallocation

https://petercourtney.co.za

2025-11-26

Introduction

Motivation: The class size puzzle

STR = 32.5, yet ECS = 43.3

Teachers are employed but not fully deployed.

Why schools choose inefficiency 🔎

What I measure: learner-experienced class size (ECS)

\[ \text{ECS} = \frac{\sum_{i=1}^{n} \text{CS}_i^2}{\sum_{i=1}^{n} \text{CS}_i} = \overline{\text{CS}} + \frac{\sigma^2_{\text{CS}}}{\overline{\text{CS}}} \]

Learner-weighted mean class size. Larger classes count more.

Example: Classes of {30,30} vs {50,10} both average 30, but ECS = 30 vs 34.3.

Algebra 🔎

Deployment algebra: the missing metrics

STR = SCR / TCR

  • STR: Student-Teacher Ratio (conventional)
  • SCR: Student-Class Ratio (proxy for class size)
  • TCR: Teacher-Class Ratio (utilisation intensity)
  • Realised teaching load = 1/TCR

TCR ≈ 1.22 nationally → teachers work ≈ 82% of time.

Example: Same STR of 30 can mean: - TCR=2.0 → classes of 60 (50% utilisation) - TCR=1.0 → classes of 30 (full utilisation)

Details 🔎

Data

  • 5.4 million learners in 9,512 public primary schools
  • 2023 cross-section; panel 2018–2023 for mechanisms
  • Final sample: ≈78% of all public primary learners

Novel feature: Reconstructed class-level timetables from learner-class assignments.

Direct measurement of teacher deployment, not survey self-reports.

Attrition 🔎

Literature

Four gaps:

  1. Misallocation literature (Walter 2020; Fagernäs & Pelkonen 2017) focuses on between-school STR variation
    • My finding: Within-school labour slack (18.8% ECS reduction) > spatial gains (5.8 pp)
  2. TCR remains “remarkably understudied” (Bennell 2022)
  3. Production function puzzle (Hanushek 1986; Glewwe & Kremer 2009)
    • Why don’t teacher hiring interventions improve learning?
    • If class size weakly affects learning, the appropriate margin is cost minimisation
    • My contribution: Hiring → absorbed into slack, not smaller classes
  4. Management literature (Bloom et al. 2015) omits efficient deployment
    • My proposal: Allocation efficiency as essential management capacity metric

Scheduled vs unscheduled absence

Key distinction: Not timetabled ≠ Absent when timetabled

Standard absence measures conflate these:

World Bank SDI: Class absence = 1 - (Teachers in classrooms / Total employed)

  • Cannot distinguish teachers on free periods from those absent
  • 44% of SSA teachers “absent”; 48% are at school but not timetabled (Bold et al. 2017; Bennell 2022)
  • Country-level: TCR vs SDI absence (R² = 0.38)

Policy implication: Administrative timetabling reforms require different instruments than behavioural interventions.

Three Core Contributions

1. Measurement - Quantify R22.3bn fiscal leakage from teacher mis-deployment - Distinguish scheduled vs unscheduled absence - Labour under-utilisation > spatial misallocation

2. Identification - Demographic shocks reduce inefficiency (β₂ = -0.269) - IV: Resource pressure causally reduces inefficiency (δ_FD = -4.5) - Latent capacity explains 33.5% of class size variance

3. Policy - Cut ECS by 25% (44 → 33) or release R22–30bn annually - District-level reforms sufficient; no provincial transfers needed

Methods

Method: optimisation as measurement

Solve for minimum ECS under nested constraints. Six scenarios reveal binding margins.

  • S1–S2: Technical efficiency (balance within/between grades)
  • S3: Allocative—capital (bind by classrooms)
  • S4: Allocative—labour (full teacher utilisation)
  • S5–S6: Spatial (pool across districts/provinces)

This is a measurement exercise, not an algorithm pitch.

Solver Details 🔎

Results

Baseline descriptives: provincial heterogeneity

  • National: ECS = 43.3, TCR = 1.21, STR = 32.5
  • TCR variation: 1.14 (NC, FS) to 1.24 (Limpopo)
  • Limpopo: Highest STR + TCR → largest classes (49.6)

Provincial constraint decomposition 🔎

Scenario Definitions

Representative school examples: S0 | S1 | S2 | S3 | S4 | S5 | S6 🔎

Cascading Impact of Optimisation Scenarios on Class Size

National Results: Key Insights

  • S4 (full teacher utilisation): −18.8% ECS. Core inefficiency.
  • S5 (district pooling): Most spatial gains captured.
  • S6 (province pooling): Negligible additional benefit.

Poverty quintile scenarios: efficiency ≠ equity

  • Baseline gap: Q1–Q2 classes 32% larger than Q5
  • Paradox: Wealthier Q5 schools show largest gains (−20.2%)—hold more slack
  • Persistent inequality: Efficiency gains proportional; relative gaps preserved

Policy implication: Efficiency reforms alone cannot close equity gaps.

Racial scenarios: persistent inequality

  • Baseline gap: Black learners 29% larger classes than white (44.3 vs 34.4)
  • Post-optimisation: Gap persists (33.4 vs 25.8)
  • White-learner schools hold more idle classroom capacity

Interpretation: Efficiency reforms reduce absolute crowding, not structural inequality.

Explaining the Patterns

What explains crowding? LMG decomposition

  • S4 reducibility: 33.5% of explained variance (R² = 52.3%)
  • Traditional factors small: Province 6.3%, Quintile 6.4%, Race 5.5%
  • Latent capacity > STR in explaining crowding

Crowding co-occurs with spare teacher capacity → deployment choices, not binding constraints.

What explains reducibility? Scenario variance decomposition

  • S4 (Labour): R² = 0.433. ECS explains 48.2%; crowded schools hold most capacity.
  • S5 (District pooling): Province dominates (72.6%). Spatial inefficiency local.
  • S1–S2, S6: Low R². Schools already technically efficient; provincial pooling minor.

Reallocation elasticity

Demographic shocks

Question: Are high TCRs forced or discretionary?

Test logic:

  • Constraints bind → shocks raise inefficiency

  • Discretionary slack → shocks reduce inefficiency (capacity revealed)

Design: Within-school FE, 2018–2023 - DV: Δ inefficiency - IVs: Δ grade-mix, Δ phase-mix, Δ enrolment

Identification caveat

Not quasi-random. Potential confounds: cohort dynamics, management quality, time-varying unobservables.

Inference: - β₂ < 0 across phases inconsistent with binding constraints - If truly binding: β > 0 - Schools can reallocate when pressed (even if costly); capacity exists but latent

Results: patterns consistent with adaptive capacity

  • β₁ (Grade-Mix) = −0.381*** (10 pp shift → 3.81 pp inefficiency reduction)
  • β₂ (Phase-Mix) = −0.256*** (10 pp shift → 2.56 pp reduction)
  • β₃ (Enrolment) = −0.007*** (scale effects present, composition dominates)

All coefficients negative → adaptive capacity exists, including across phases. Inconsistent with binding constraints.

Robustness 🔎

Alternative explanations: ruled out

17 mechanisms investigated. Five structural rigidities predict β > 0 if binding. All rejected:

  • Grade-specific match capital (β₁ = −0.381)
  • Qualification constraints (β₂ = −0.256)
  • Period indivisibilities (both β₁, β₂ < 0)
  • Cross-phase synchronisation (β₂ = −0.256)
  • Enrolment volatility buffering (β₃ = −0.007)

Caveat: Standby rosters create endogeneity. Estimates may be lower bounds.

Full table 🔎

Causal Evidence: Instrumental Variables Strategy

⚠️ Preliminary Work

IV Strategy: Does increased staffing increase inefficiency?

Endogeneity problem: - Reverse causality: Inefficient schools receive compensatory staffing - Selection: Management quality affects both staffing and deployment - Omitted variables: Neighbourhood, principal quality, union strength

Solution: Instrument STR with Post Provisioning Norm (PPN) - Bureaucratic formula mechanically determines allocations - Within-school grade-mix changes drive variation - Plausibly exogenous to management

Instrument: Post Provisioning Norm (PPN)

PPN formula: f(enrolment, grade-mix, quintile, provincial norms)

For identification, only grade composition matters: - We exploit variation in grade-mix weights only - Enrolment, quintile, and provincial norms are controlled for (absorbed by FE and controls) - Within-school grade composition changes trigger mechanical allocation adjustments - Schools don’t control cohort composition

Why valid: - Predicted STR insulated from management decisions - School FE + enrolment controls absorb time-invariant quality

PPN Weights and Identification Source

We exploit variation driven solely by grade-specific weights in the PPN formula.

Grade Max Class Size Period Load (%) Funding Level Weight
R 35 96 0 0
1-4 35 96 100 1.190
5-6 40 96 100 1.042
7 37 96 100 1.126
  • We only use the grade weights for the analysis.
  • Variation arises when cohorts move between weight bands (e.g., a large cohort moving from Grade 4 to 5 reduces teacher allocation).

Threats to Validity: Strategic Manipulation

Endogeneity is defined when cohort composition is correlated with efficiency, and this seems improbable.

The biggest threats to manipulation:

  1. Pausing student promotion where the student would otherwise enter a lower weight grade the following year (e.g., retaining Grade 4s).
  2. Expelling learners who would otherwise be promoted to a lower weight grade.
  3. Enrolling learners disproportionately at higher weight grades.

Why these refute the instrument: If schools manipulate composition to maximise weights, the instrument (grade mix) becomes correlated with unobserved management quality (the error term), violating the exclusion restriction.

I test for the above and find that almost all learner begin in Grade 1 or R and exit in Grade 7

Specification

\[ \begin{aligned} \text{First stage:} \quad & \Delta \text{STR}_{\text{actual}} = \alpha + \beta \cdot \Delta \text{STR}_{\text{predicted}} + \delta_t + \varepsilon \\ \text{2SLS:} \quad & \Delta \text{Inefficiency} = \alpha + \delta \cdot \Delta \widehat{\text{STR}} + \delta_t + \varepsilon \end{aligned} \]

First-differences specification. Clustering at school level.

Key identifying assumptions:

  1. Relevance: PPN formula strongly predicts actual STR changes (F-statistic)
  2. Exclusion restriction: PPN affects inefficiency only through STR
  3. Exogeneity: Grade-mix changes uncorrelated with time-varying management quality

Identification: threats & tests

Main threat: Exclusion restriction—could PPN affect inefficiency directly?

Test: Control for grade-mix, enrolment, etc. β stable → exclusionlikely holds.

Validation: - Balance tests: Instrument ⊥ pre-determined covariates - Falsification: Future shocks don’t predict past outcomes - Robustness: 17 specifications (FE, clustering, functional forms) - Result: F > 30 across all specs; direction robust; no pre-trends

IV Results

First-Differences
2SLS Coefficient -4.5***
Standard Error 0.111
Interpretation ΔSTR ↑ → ΔIneff ↓
First-stage F-stat 60.4
N (school-years) 28,703

Key finding: Resource pressure → forced efficiency. Resource slack → expand slack.

FD isolates behavioural response. Slack is discretionary, not structural.

Economic interpretation

Core finding: Resource pressure (↑STR) causally reduces inefficiency (δ_FD < 0)

If constraints bind: Fewer teachers → harder to optimise → δ > 0. We observe opposite.

Mechanism: Additional teachers bargained into lighter workloads, not new classes.

Explains disappointing input elasticities: Hiring expands capacity → absorbed into slack.

Policy implication: R22.3bn behaviourally defended. Administrative reforms needed, not fiscal expansion.

Robustness: 17 specification checks

Instrument validity: Balance, falsification, weak-IV robust CI, overidentification (✓)

Specification: 7 FE combinations, 4 clustering schemes, 5 functional forms (✓)

Heterogeneity: Effect stronger in small schools, low-STR contexts (✓)

Key finding: F > 30 across all specs; direction robust; no pre-trends

Full robustness table 🔎

Economic Impact & Policy

Fiscal leakage: the cost of idle time

  • R22.3bn annually: S4 teacher under-utilisation (18.2% of time)
  • R29.6bn annually: Total including S5 spatial misallocation
  • Exceeds national spending on school nutrition + learning materials

Assumptions 🔎 | Secondary extension 🔎

Policy design: three tractable levers

  1. Enforce contact-time norms: −18.8% ECS or free ≈18% capacity (≈R22bn)

  2. Activate idle classrooms: −10.2% ECS

  3. District pooling: Shift hiring from school to district level. Captures −24.2% cumulative without provincial disruption.

Conclusion

Summary

  1. Allocation, not scarcity: Labour under-utilisation dominant (LMG 33.5%; S4 −18.8%)

  2. Local reallocation suffices: District pooling captures spatial gains (−24.2%); provincial adds ≈0.4pp

  3. Administrative reforms feasible:

    • Adaptive capacity exists (β₂ = −0.269 across phases)
    • Can reduce ECS 25% or free 25% capacity (R22–30bn)

Appendix slides

Why would schools choose inefficiency?

Core mechanism: teachers value free periods above smaller classes.

  • Rigid wage schedules: Compressed salary structure limits pecuniary differentiation. Non-pecuniary compensation (lighter teaching loads) becomes the primary margin of adjustment.
  • Union bargaining: Educator unions negotiate workload norms. Enforcement is weak, creating space for discretionary interpretation of contact-time requirements.
  • Measurement vacuum: Until recently, class-level timetabling data was unavailable. What isn’t measured can’t be managed—or negotiated away.
  • Rational precaution: Schools maintain standby rosters to cover unscheduled absence, creating endogenous slack between scheduled and unscheduled absence.

Political economy equilibrium: High TCR represents a negotiated outcome where teachers capture rents through reduced contact time rather than higher wages. Principals lack incentives or instruments to enforce utilisation norms when crowding doesn’t trigger accountability penalties.

Algebra reference sheet

  • Core Identity: STR = SCR / TCR
    • STR (Student-Teacher Ratio) = Learners / Teachers
    • SCR (Student-Class Ratio) = Learners / Classes
    • TCR (Teacher-Class Ratio) = Teachers / Classes
  • Implication: Realised Teaching Load = 1 / TCR.
    • A TCR of 1.22 means teachers are timetabled for 1/1.22 ≈ 82% of periods.
  • Numerical Example: School with 300 learners, 10 teachers (STR = 30).
    • TCR = 1.0 (Full Utilisation): 10 teachers → 10 classes. SCR = 300/10 = 30.
    • TCR = 1.5 (Low Utilisation): 10 teachers → 10/1.5 ≈ 7 classes. SCR = 300/7 ≈ 43.
    • Same staffing, different class sizes.
  • Experienced Class Size (ECS): \[ \text{ECS} = \frac{\sum_{i} \text{CS}_i^2}{\sum_{i} \text{CS}_i} = \overline{\text{CS}} + \frac{\sigma^2_{\text{CS}}}{\overline{\text{CS}}} \]
    • ECS is the learner-weighted average class size, accounting for variance (\(\sigma^2_{CS}\)).
  • Why ECS > SCR (The “Wedge”):
    • Variation in class sizes means more learners experience larger classes.
    • Example: Classes of 50 and 10.
      • Simple average (SCR) = (50+10)/2 = 30.
      • Learner-weighted average (ECS) = (50×50 + 10×10)/(50+10) = 43.3.

Sample attrition and representativeness

  • Final coverage: ≈ 5.4 million learners (78% of public primary), 9,512 schools
  • Largest attrition: multigrade exclusion (≈ 12% of sample)
  • External validity concern: Multigrade schools likely face distinct (possibly larger) inefficiency challenges
  • Robustness check: Repeat analysis including multigrade (relaxing constraints) → ongoing work
  • ↩︎️ Back to Slide 7

Fiscal leakage: assumptions and sensitivity

  • Baseline: Mean educator salary R42,000/month; 12 months; S4 inefficiency 18.2%; extrapolate to full educator workforce
  • Sensitivity: ±10% salary → R20.0–24.5bn range; exclude principals vs include → R21.9–24.8bn
  • Coverage: Primary only (R22.3bn) vs primary + secondary extrapolation (≈ R62bn, assuming similar TCR)
  • Alternative framing: Leakage as % of GDP (≈ 0.3–0.9%), % of total budget (≈ 1%), % of education budget (7.6–10.1%)
  • Comparison: R22.3bn > National School Nutrition Programme (≈ R8bn) + LTSM budget (≈ R12bn)
  • ↩︎️ Back to Slide 26

Mechanisms: alternative specifications

  • Baseline FE: Grade‑mix β₁ = −0.141, Phase‑mix β₂ = −0.269, Enrolment β₃ = −0.002
  • Robustness checks:
    • Cluster SEs at district level: SEs increase slightly; significance unchanged
    • Include school‑level controls (baseline TCR, quintile, urban): Coefficients attenuate 10–15%; signs/significance robust
    • Quantile regression (median): β₁ ≈ −0.11, β₂ ≈ −0.21 (smaller but consistent pattern)
    • Split sample by quintile: β₂ < 0 in all quintiles; magnitude largest in Q1–Q3 (poorer schools)
  • Conclusion: Negative coefficients robust; adaptive capacity present across specifications
  • ↩︎️ Back to Slide 23

Alternative explanations: full table

Table 1: Alternative Explanations for Teacher Allocation Inefficiency: Empirical Tests and Findings
Mechanism Type Predicted Effect (if binding) Empirical Finding
Structural Rigidities (Grade-Level)
Grade-specific match capital Structural (grade-level) β₁ > 0 if binding: grade-mix shocks raise inefficiency Rejected: β₁ = -0.141, schools reassign teachers across individual grades
Structural Rigidities (Phase-Level)
Qualification constraints (phase-specific) Structural (phase-level) β₂ > 0 if binding: phase-mix shocks raise inefficiency (strongest barrier) Strongly rejected: β₂ = -0.269 (nearly twice grade-mix effect), teachers reassigned readily across Foundation/Intermediate boundaries despite institutional divisions
Period indivisibilities Structural (grade/phase-level) β₁, β₂ > 0 if binding: compositional shocks raise inefficiency Rejected: both β₁ = -0.141 and β₂ = -0.269, schools revert to generalist models
Cross-phase synchronisation Structural (phase-level) β₂ > 0 if binding: phase-mix shocks raise inefficiency Strongly rejected: β₂ = -0.269, largest negative effect despite distinct phase structures (breaks, period lengths, pedagogy)
Structural Rigidities (Aggregate)
Enrolment volatility buffering Structural (aggregate) β₃ > 0 if binding: enrolment shocks raise inefficiency Rejected: β₃ = -0.002, negligible response to ±14.77% volatility
School-Level Operational Constraints
Infrastructure constraints School-level operational No prediction for compositional response (operates at school level) Orthogonal to compositional shocks; infrastructure explains ≈8% inefficiency vs ≈19% for teacher under-utilisation
Protected administrative roles School-level operational No prediction for compositional response (operates at school level) Orthogonal to compositional shocks; may contribute to baseline inefficiency but does not prevent adaptive reallocation at either grade or phase level
Health accommodations School-level operational No prediction for compositional response (operates at school level) Orthogonal to compositional shocks; may contribute to baseline inefficiency but does not prevent adaptive reallocation at either grade or phase level
Pull-out remediation programmes School-level operational No prediction for compositional response (operates at school level) Orthogonal to compositional shocks; concentrated in wealthier schools; does not prevent adaptive reallocation
Feeding-scheme logistics School-level operational No prediction for compositional response (operates at school level) Orthogonal to compositional shocks; may contribute to baseline inefficiency but does not prevent adaptive reallocation
School-Level Governance
SGB substitution School-level governance No prediction for compositional response (operates at school level) Orthogonal to compositional shocks; both SGB and department staff appear reallocated when shocks arrive
Endogenous Precautionary Mechanisms
Standby rosters for unscheduled absence Endogenous precautionary Ambiguous: precautionary buffers may correlate with shocks Partially supported: creates endogeneity between scheduled and unscheduled absence; capacity for reallocation at both grade and phase levels still exists; warrants further investigation
Measurement Artefacts
Data classification artefacts Measurement artefact β₁, β₂ ≈ 0 if pure artefact: measurement error attenuates estimates Rejected: negative coefficients at both grade and phase levels suggest real capacity; measurement error would attenuate toward zero
Policy Equilibria (Supported)
Non-pecuniary compensation Policy equilibrium Consistent with β₁, β₂ Consistent: schools reduce inefficiency when pressure arrives, including substantial phase-level adjustments
Union bargaining and political economy Policy equilibrium Consistent with β₁, β₂ Consistent: capacity exists at all levels but withheld through bargaining
Discretionary specialisation Policy equilibrium Consistent with β₁, β₂ Consistent: schools revert from grade and phase specialisation under pressure; strong phase-mix response indicates phase boundaries not binding
Institutional inertia Policy equilibrium Consistent with β₁, β₂ Consistent: capacity latent at all levels absent forcing mechanisms; phase-level adjustments particularly revealing

IV Robustness: Full specification table

# Category Test Description
Instrument Validity Tests
1 Instrument Validity Balance on observables Instrument orthogonal to 15 pre-determined covariates (size, socioeconomic status, lagged outcomes)
2 Instrument Validity Falsification tests Future PPN shocks don't predict past outcomes (lag placebos)
3 Instrument Validity Weak instrument robust CI Anderson-Rubin and CLR confidence sets (robust to weak IV)
4 Instrument Validity Reduced form on placebos No effect on time-invariant school characteristics
5 Instrument Validity Exclusion restriction probes Control for PPN sub-components (enrolment, weights, district rates)
6 Instrument Validity Overidentification tests Grade-specific instruments yield consistent estimates
7 Instrument Validity First-stage heterogeneity Strong F-statistics across districts and time periods
Specification Robustness Tests
8 Specification Robustness Alternative fixed effects No FE, Year FE, District FE, School FE, Province×Year FE, District×Year FE (7 specifications)
9 Specification Robustness Alternative clustering School-level (primary), District-level, Two-way (School+Year), Two-way (District+Year)
10 Specification Robustness Subsample stability Balanced panel only, exclude large districts, exclude outliers (STR/size/class size), by quintile, leave-one-province-out
11 Specification Robustness Alternative functional forms Level-level (primary), log-log, level-log, log-level, quadratic
12 Specification Robustness Asymmetry tests STR increases vs decreases (gains vs losses)
13 Specification Robustness Specification ladder Progressive inclusion of controls and fixed effects (Spec 1 → 4)
14 Specification Robustness Control function approach Alternative estimator using residuals as control
Additional Validation Tests
15 Heterogeneity Analysis Effect modification By school size terciles, initial STR level, baseline class size, quintile, urban/rural, province, time period
16 Mediation Analysis Class count as mediator Tests whether STR affects inefficiency through class formation decisions (Baron & Kenny framework with FE)
17 Attrition & Sample Selection Sample attrition tests Validates representativeness of final sample (78% coverage)

Provincial scenarios: constraint decomposition

  • ECS Values (Left): Limpopo faces the largest classes (49.6), whilst Northern Cape has the smallest (37.0)
  • Capital Slack (S3): Limpopo & Eastern Cape show high reducibility from activating idle classrooms
  • Labour Slack (S4): Limpopo & Mpumalanga show high reducibility from activating surplus teachers
  • Interpretation: Provinces differ fundamentally in where their inefficiencies lie—suggesting tailored policy interventions

International comparisons: TCR in SSA

  • Documented TCRs: Tanzania 2.5 (Asim, Chugunov, and Gera 2019) (40% utilisation), South Africa 1.22 (82% utilisation), Ethiopia ≈ 1.4 (Bennell 2022 inference from SDI)
  • Vietnam policy: Explicitly targets TCR > 1 for extended contact hours and in-service training
  • Interpretation: Scheduled absence widespread; SDI class-absence rates partially reflect timetabling inefficiency
  • South Africa advantage: EMIS timetabling software → technical efficiency (S1–S2) high; problem is utilisation (S4)
  • Implication: SSA systems without timetabling tools may face both technical and allocative inefficiency → larger potential gains
  • ↩︎️ Back to Literature

Secondary school extension: subject specialisation amplifies inefficiency

  • Primary TCR ≈ 1.22 (generalist model; some discretionary specialisation). Secondary TCR likely higher:
    • Subject specialisation mandated (≈ 15+ subjects; average school writes 7 subjects in matriculation)
    • Period indivisibilities bind harder (6–8 periods/day; subjects allocated 4–6 periods/week)
    • Teacher qualifications phase-specific (FET trained; cannot teach foundation)
  • Preliminary estimates: If secondary TCR ≈ 1.5 (speculative), fiscal leakage ≈ R40bn additional (total ≈ R62bn)
  • Data challenge: LURITS does not include subject identifiers for secondary; requires EMIS timetable microdata linkage
  • Policy: Subject rationalisation (reduce subject offerings per school; pool teachers at district level for rare subjects)
  • ↩︎️ Back to Fiscal Leakage

The Allocation Solver

  • Objective: Minimise Experienced Class Size (ECS) for a fixed number of teachers. This is equivalent to minimising the sum of squared learners per class across all grades (Σ n₉²/k₉).
  • Method: A greedy algorithm using a max-priority queue. This approach is provably optimal as the objective function is convex.
  • Algorithm:
    1. Initialisation: Assign one class to each grade with enrolled learners.
    2. Iteration: For the remaining classes, calculate the “marginal gain”—the reduction in the objective function—of adding one more class to each grade.
    3. Allocation: Add the next class to the grade with the highest marginal gain. The gain from adding a class to a grade with n learners and k classes is n² / (k(k+1)).
    4. Repeat: Update the marginal gain for the affected grade and repeat until all classes are allocated.
  • Implementation:
    • High-performance C++ solver via Rcpp for national-scale computation.
    • Efficient O(K log G) complexity, where K is total classes and G is number of grades.

↩︎️ Back to Slide 8

Representative school: status quo (Scenario 0)

Baseline timetable at 82nd percentile reducibility. School: 1,220 learners, 27 classes, 7 grades, ECS 47.8.

Within‑grade dispersion modest (e.g., Grade 3: 35–44). Between‑grade imbalance small. Key: 27 classes but more teachers available (see S4). Red-bordered cell shows next improvement target.

↩︎️ Back to Scenario Definitions

Within-Grade Equalisation (S1)

↩︎️ Back to Scenario Definitions

Between-Grade Allocation (S2)

↩︎️ Back to Scenario Definitions

Classroom-Constrained Allocation (S3)

↩︎️ Back to Scenario Definitions

Educator-Optimal Allocation (S4)

↩︎️ Back to Scenario Definitions

District-Level Pooling (S5)

↩︎️ Back to Scenario Definitions

Province-Level Pooling (S6)

↩︎️ Back to Scenario Definitions

Asim, Salman, Dmitry Chugunov, and Ravinder Gera. 2019. Fiscal Implications of Free Education. World Bank, Washington, DC. https://doi.org/10.1596/31466.
Bennell, Paul. 2022. “Missing in Action? The World Bank’s Surveys of Teacher Absenteeism in Sub-Saharan Africa.” Comparative Education 58 (4): 489–508. https://doi.org/10.1080/03050068.2022.2083342.
Bold, Tessa, Deon Filmer, Gayle Martin, Ezequiel Molina, Christophe Rockmore, Brian Stacy, Jakob Svensson, and Waly Wane. 2017. What Do Teachers Know and Do? Does It Matter? Evidence from Primary Schools in Africa. World Bank, Washington, DC. https://doi.org/10.1596/1813-9450-7956.