A Framework for Quantifying the Cost of Teacher Misallocation
https://petercourtney.co.za
2025-11-26
STR = 32.5, yet ECS = 43.3
Teachers are employed but not fully deployed.
\[ \text{ECS} = \frac{\sum_{i=1}^{n} \text{CS}_i^2}{\sum_{i=1}^{n} \text{CS}_i} = \overline{\text{CS}} + \frac{\sigma^2_{\text{CS}}}{\overline{\text{CS}}} \]
Learner-weighted mean class size. Larger classes count more.
Example: Classes of {30,30} vs {50,10} both average 30, but ECS = 30 vs 34.3.
STR = SCR / TCR
TCR ≈ 1.22 nationally → teachers work ≈ 82% of time.
Example: Same STR of 30 can mean: - TCR=2.0 → classes of 60 (50% utilisation) - TCR=1.0 → classes of 30 (full utilisation)
Novel feature: Reconstructed class-level timetables from learner-class assignments.
Direct measurement of teacher deployment, not survey self-reports.
Four gaps:
Key distinction: Not timetabled ≠ Absent when timetabled
Standard absence measures conflate these:
World Bank SDI: Class absence = 1 - (Teachers in classrooms / Total employed)
Policy implication: Administrative timetabling reforms require different instruments than behavioural interventions.
1. Measurement - Quantify R22.3bn fiscal leakage from teacher mis-deployment - Distinguish scheduled vs unscheduled absence - Labour under-utilisation > spatial misallocation
2. Identification - Demographic shocks reduce inefficiency (β₂ = -0.269) - IV: Resource pressure causally reduces inefficiency (δ_FD = -4.5) - Latent capacity explains 33.5% of class size variance
3. Policy - Cut ECS by 25% (44 → 33) or release R22–30bn annually - District-level reforms sufficient; no provincial transfers needed
Solve for minimum ECS under nested constraints. Six scenarios reveal binding margins.
This is a measurement exercise, not an algorithm pitch.
Policy implication: Efficiency reforms alone cannot close equity gaps.
Interpretation: Efficiency reforms reduce absolute crowding, not structural inequality.
Crowding co-occurs with spare teacher capacity → deployment choices, not binding constraints.
Question: Are high TCRs forced or discretionary?
Test logic:
Constraints bind → shocks raise inefficiency
Discretionary slack → shocks reduce inefficiency (capacity revealed)
Design: Within-school FE, 2018–2023 - DV: Δ inefficiency - IVs: Δ grade-mix, Δ phase-mix, Δ enrolment
Not quasi-random. Potential confounds: cohort dynamics, management quality, time-varying unobservables.
Inference: - β₂ < 0 across phases inconsistent with binding constraints - If truly binding: β > 0 - Schools can reallocate when pressed (even if costly); capacity exists but latent
All coefficients negative → adaptive capacity exists, including across phases. Inconsistent with binding constraints.
17 mechanisms investigated. Five structural rigidities predict β > 0 if binding. All rejected:
Caveat: Standby rosters create endogeneity. Estimates may be lower bounds.
Endogeneity problem: - Reverse causality: Inefficient schools receive compensatory staffing - Selection: Management quality affects both staffing and deployment - Omitted variables: Neighbourhood, principal quality, union strength
Solution: Instrument STR with Post Provisioning Norm (PPN) - Bureaucratic formula mechanically determines allocations - Within-school grade-mix changes drive variation - Plausibly exogenous to management
PPN formula: f(enrolment, grade-mix, quintile, provincial norms)
For identification, only grade composition matters: - We exploit variation in grade-mix weights only - Enrolment, quintile, and provincial norms are controlled for (absorbed by FE and controls) - Within-school grade composition changes trigger mechanical allocation adjustments - Schools don’t control cohort composition
Why valid: - Predicted STR insulated from management decisions - School FE + enrolment controls absorb time-invariant quality
We exploit variation driven solely by grade-specific weights in the PPN formula.
| Grade | Max Class Size | Period Load (%) | Funding Level | Weight |
|---|---|---|---|---|
| R | 35 | 96 | 0 | 0 |
| 1-4 | 35 | 96 | 100 | 1.190 |
| 5-6 | 40 | 96 | 100 | 1.042 |
| 7 | 37 | 96 | 100 | 1.126 |
Endogeneity is defined when cohort composition is correlated with efficiency, and this seems improbable.
The biggest threats to manipulation:
Why these refute the instrument: If schools manipulate composition to maximise weights, the instrument (grade mix) becomes correlated with unobserved management quality (the error term), violating the exclusion restriction.
I test for the above and find that almost all learner begin in Grade 1 or R and exit in Grade 7
\[ \begin{aligned} \text{First stage:} \quad & \Delta \text{STR}_{\text{actual}} = \alpha + \beta \cdot \Delta \text{STR}_{\text{predicted}} + \delta_t + \varepsilon \\ \text{2SLS:} \quad & \Delta \text{Inefficiency} = \alpha + \delta \cdot \Delta \widehat{\text{STR}} + \delta_t + \varepsilon \end{aligned} \]
First-differences specification. Clustering at school level.
Key identifying assumptions:
Main threat: Exclusion restriction—could PPN affect inefficiency directly?
Test: Control for grade-mix, enrolment, etc. β stable → exclusionlikely holds.
Validation: - Balance tests: Instrument ⊥ pre-determined covariates - Falsification: Future shocks don’t predict past outcomes - Robustness: 17 specifications (FE, clustering, functional forms) - Result: F > 30 across all specs; direction robust; no pre-trends
| First-Differences | |
|---|---|
| 2SLS Coefficient | -4.5*** |
| Standard Error | 0.111 |
| Interpretation | ΔSTR ↑ → ΔIneff ↓ |
| First-stage F-stat | 60.4 |
| N (school-years) | 28,703 |
Key finding: Resource pressure → forced efficiency. Resource slack → expand slack.
FD isolates behavioural response. Slack is discretionary, not structural.
Core finding: Resource pressure (↑STR) causally reduces inefficiency (δ_FD < 0)
If constraints bind: Fewer teachers → harder to optimise → δ > 0. We observe opposite.
Mechanism: Additional teachers bargained into lighter workloads, not new classes.
Explains disappointing input elasticities: Hiring expands capacity → absorbed into slack.
Policy implication: R22.3bn behaviourally defended. Administrative reforms needed, not fiscal expansion.
Instrument validity: Balance, falsification, weak-IV robust CI, overidentification (✓)
Specification: 7 FE combinations, 4 clustering schemes, 5 functional forms (✓)
Heterogeneity: Effect stronger in small schools, low-STR contexts (✓)
Key finding: F > 30 across all specs; direction robust; no pre-trends
Enforce contact-time norms: −18.8% ECS or free ≈18% capacity (≈R22bn)
Activate idle classrooms: −10.2% ECS
District pooling: Shift hiring from school to district level. Captures −24.2% cumulative without provincial disruption.
Allocation, not scarcity: Labour under-utilisation dominant (LMG 33.5%; S4 −18.8%)
Local reallocation suffices: District pooling captures spatial gains (−24.2%); provincial adds ≈0.4pp
Administrative reforms feasible:
Core mechanism: teachers value free periods above smaller classes.
Political economy equilibrium: High TCR represents a negotiated outcome where teachers capture rents through reduced contact time rather than higher wages. Principals lack incentives or instruments to enforce utilisation norms when crowding doesn’t trigger accountability penalties.
| Mechanism | Type | Predicted Effect (if binding) | Empirical Finding |
|---|---|---|---|
| Structural Rigidities (Grade-Level) | |||
| Grade-specific match capital | Structural (grade-level) | β₁ > 0 if binding: grade-mix shocks raise inefficiency | Rejected: β₁ = -0.141, schools reassign teachers across individual grades |
| Structural Rigidities (Phase-Level) | |||
| Qualification constraints (phase-specific) | Structural (phase-level) | β₂ > 0 if binding: phase-mix shocks raise inefficiency (strongest barrier) | Strongly rejected: β₂ = -0.269 (nearly twice grade-mix effect), teachers reassigned readily across Foundation/Intermediate boundaries despite institutional divisions |
| Period indivisibilities | Structural (grade/phase-level) | β₁, β₂ > 0 if binding: compositional shocks raise inefficiency | Rejected: both β₁ = -0.141 and β₂ = -0.269, schools revert to generalist models |
| Cross-phase synchronisation | Structural (phase-level) | β₂ > 0 if binding: phase-mix shocks raise inefficiency | Strongly rejected: β₂ = -0.269, largest negative effect despite distinct phase structures (breaks, period lengths, pedagogy) |
| Structural Rigidities (Aggregate) | |||
| Enrolment volatility buffering | Structural (aggregate) | β₃ > 0 if binding: enrolment shocks raise inefficiency | Rejected: β₃ = -0.002, negligible response to ±14.77% volatility |
| School-Level Operational Constraints | |||
| Infrastructure constraints | School-level operational | No prediction for compositional response (operates at school level) | Orthogonal to compositional shocks; infrastructure explains ≈8% inefficiency vs ≈19% for teacher under-utilisation |
| Protected administrative roles | School-level operational | No prediction for compositional response (operates at school level) | Orthogonal to compositional shocks; may contribute to baseline inefficiency but does not prevent adaptive reallocation at either grade or phase level |
| Health accommodations | School-level operational | No prediction for compositional response (operates at school level) | Orthogonal to compositional shocks; may contribute to baseline inefficiency but does not prevent adaptive reallocation at either grade or phase level |
| Pull-out remediation programmes | School-level operational | No prediction for compositional response (operates at school level) | Orthogonal to compositional shocks; concentrated in wealthier schools; does not prevent adaptive reallocation |
| Feeding-scheme logistics | School-level operational | No prediction for compositional response (operates at school level) | Orthogonal to compositional shocks; may contribute to baseline inefficiency but does not prevent adaptive reallocation |
| School-Level Governance | |||
| SGB substitution | School-level governance | No prediction for compositional response (operates at school level) | Orthogonal to compositional shocks; both SGB and department staff appear reallocated when shocks arrive |
| Endogenous Precautionary Mechanisms | |||
| Standby rosters for unscheduled absence | Endogenous precautionary | Ambiguous: precautionary buffers may correlate with shocks | Partially supported: creates endogeneity between scheduled and unscheduled absence; capacity for reallocation at both grade and phase levels still exists; warrants further investigation |
| Measurement Artefacts | |||
| Data classification artefacts | Measurement artefact | β₁, β₂ ≈ 0 if pure artefact: measurement error attenuates estimates | Rejected: negative coefficients at both grade and phase levels suggest real capacity; measurement error would attenuate toward zero |
| Policy Equilibria (Supported) | |||
| Non-pecuniary compensation | Policy equilibrium | Consistent with β₁, β₂ | Consistent: schools reduce inefficiency when pressure arrives, including substantial phase-level adjustments |
| Union bargaining and political economy | Policy equilibrium | Consistent with β₁, β₂ | Consistent: capacity exists at all levels but withheld through bargaining |
| Discretionary specialisation | Policy equilibrium | Consistent with β₁, β₂ | Consistent: schools revert from grade and phase specialisation under pressure; strong phase-mix response indicates phase boundaries not binding |
| Institutional inertia | Policy equilibrium | Consistent with β₁, β₂ | Consistent: capacity latent at all levels absent forcing mechanisms; phase-level adjustments particularly revealing |
| # | Category | Test | Description |
|---|---|---|---|
| Instrument Validity Tests | |||
| 1 | Instrument Validity | Balance on observables | Instrument orthogonal to 15 pre-determined covariates (size, socioeconomic status, lagged outcomes) |
| 2 | Instrument Validity | Falsification tests | Future PPN shocks don't predict past outcomes (lag placebos) |
| 3 | Instrument Validity | Weak instrument robust CI | Anderson-Rubin and CLR confidence sets (robust to weak IV) |
| 4 | Instrument Validity | Reduced form on placebos | No effect on time-invariant school characteristics |
| 5 | Instrument Validity | Exclusion restriction probes | Control for PPN sub-components (enrolment, weights, district rates) |
| 6 | Instrument Validity | Overidentification tests | Grade-specific instruments yield consistent estimates |
| 7 | Instrument Validity | First-stage heterogeneity | Strong F-statistics across districts and time periods |
| Specification Robustness Tests | |||
| 8 | Specification Robustness | Alternative fixed effects | No FE, Year FE, District FE, School FE, Province×Year FE, District×Year FE (7 specifications) |
| 9 | Specification Robustness | Alternative clustering | School-level (primary), District-level, Two-way (School+Year), Two-way (District+Year) |
| 10 | Specification Robustness | Subsample stability | Balanced panel only, exclude large districts, exclude outliers (STR/size/class size), by quintile, leave-one-province-out |
| 11 | Specification Robustness | Alternative functional forms | Level-level (primary), log-log, level-log, log-level, quadratic |
| 12 | Specification Robustness | Asymmetry tests | STR increases vs decreases (gains vs losses) |
| 13 | Specification Robustness | Specification ladder | Progressive inclusion of controls and fixed effects (Spec 1 → 4) |
| 14 | Specification Robustness | Control function approach | Alternative estimator using residuals as control |
| Additional Validation Tests | |||
| 15 | Heterogeneity Analysis | Effect modification | By school size terciles, initial STR level, baseline class size, quintile, urban/rural, province, time period |
| 16 | Mediation Analysis | Class count as mediator | Tests whether STR affects inefficiency through class formation decisions (Baron & Kenny framework with FE) |
| 17 | Attrition & Sample Selection | Sample attrition tests | Validates representativeness of final sample (78% coverage) |
Σ n₉²/k₉).n learners and k classes is n² / (k(k+1)).Rcpp for national-scale computation.O(K log G) complexity, where K is total classes and G is number of grades.Baseline timetable at 82nd percentile reducibility. School: 1,220 learners, 27 classes, 7 grades, ECS 47.8.
Within‑grade dispersion modest (e.g., Grade 3: 35–44). Between‑grade imbalance small. Key: 27 classes but more teachers available (see S4). Red-bordered cell shows next improvement target.