South Korea's post-war transformation is one of the defining growth miracles of the twentieth century — yet rigorous subnational research on how it unfolded has been held back by data that is too coarse (province-level), too infrequent (census every 5–10 years), recorded in mixed Korean and classical Chinese scripts, fragmented across dozens of provincial archives, and scrambled by repeated administrative boundary changes. MIRACLE is assembling, digitising, and harmonising ~2 million pages of municipal records into the first consistent township-year panel for this era.
MIRACLE is a multi-year effort to build a comprehensive database that traces South Korea's development path at the township level. The goal is to bring all available subnational resources — administrative records, spatial archives, institutional data — into one place, linked by time-consistent identifiers that track townships through two major boundary reorganisations (1963, 1973). A natural starting point is the municipal statistical yearbooks (시군 통계연보), published annually by county governments, which provide the richest and most consistent administrative data for this period.
MIRACLE starts with South Korea's municipal statistical yearbooks, but the ambition extends in two directions. First, within Korea, we plan to incorporate additional administrative sources — expressway construction logs, agricultural extension records, Korea Forest Service archives, colonial-era household registries, and local personnel files — to deepen the panel and enable research designs that link infrastructure, agricultural modernisation, and environmental policy to local institutional conditions.
Second, across countries, the infrastructure we build is designed to accommodate other growth miracle economies with comparable subnational statistical traditions. If similar municipal records exist for Taiwan, or district-level yearbooks for post-war Japan, they belong in the same framework. The goal is a comparative subnational data platform for studying rapid development wherever it has occurred.
MIRACLE draws on multiple administrative source types. Municipal statistical yearbooks form the backbone; additional archival layers extend the platform to forestry, industrial, and institutional data.
Published annually by county governments. Township-level data on demographics, agriculture, industry, infrastructure, public finance, and education. Dispersed across provincial archives — never systematically compiled.
Korea Forest Service spatial archives. Shapefiles available from 조선임야분포도 (1910) and forest type maps from 1974. Enables studying one of history's largest reforestation programmes.
Foreign loan records linking firms to locations and financing sources. Geocoding firms and mapping industrial networks.
Colonial-era and early-Republic household records. Pre-treatment institutional measures including clan concentration and land ownership.
From archive to analysis-ready panel in six steps:
Systematic survey of provincial archives, university libraries, and government collections to locate surviving yearbook volumes. Mapping what exists, what is missing, and where physical copies are held.
Building partnerships with municipalities, counties, and provincial archives. Physical scanning of bound volumes into high-resolution page images — the raw input for digitisation.
Custom pipeline fine-tuned for mixed Hangul/Hanja archival tables. 87% pilot accuracy, targeting 92–95%. This is what makes the project feasible — these documents were previously unusable at scale.
| 읍면 | 합계 | 논 (답) | 밭 (전) | ✓ | ||
|---|---|---|---|---|---|---|
| 소계 | 1모작 | 2모작 | ||||
| 남해 | 8,054 | 5,849 | 1,230 | 4,619 | 2,205 | ✓ balanced |
| 이동 | 10,993 | 7,544 | 1,175 | 6,369 | 3,449 | ✓ balanced |
| 삼동 | 12,785 | 7,349 | 1,239 | 6,110 | 5,436 | ✓ balanced |
| 남면 | 11,470 | 6,012 | 857 | 5,155 | 5,458 | ✓ balanced |
| 고현 | 8,310 | 5,680 | 743 | 4,937 | 2,630 | ✓ balanced |
| 창선 | 13,173 | 7,901 | 2,311 | 5,590 | 5,272 | ✓ balanced |
See structured output table above.
Definitions, units, and table structures changed across editions and municipalities. We build crosswalks reconciling these into consistent time series.
Two major reorganisations (1963, 1973) plus dozens of smaller changes. We construct time-consistent miracle_id identifiers.
Every township linked to satellite, elevation, slope, soil, and transport network data. 196 Namhae-gun villages fully geocoded.
The dataset is organised into modules by domain, each a flat township-year panel. Merge across modules using Core Keys. CSV & Stata formats, with full codebook and variable documentation.
| miracle_id | year | prov | muni | twp | pop | hh | paddy_ha | schools | road_km |
|---|---|---|---|---|---|---|---|---|---|
| KR-48-840-010 | 1970 | 경남 | 남해군 | 남해읍 | 28,412 | 5,680 | 1,245 | 7 | 23.4 |
| KR-48-840-010 | 1975 | 경남 | 남해군 | 남해읍 | 25,891 | 5,320 | 1,198 | 8 | 31.7 |
| KR-48-840-010 | 1980 | 경남 | 남해군 | 남해읍 | 22,105 | 5,010 | 1,152 | 8 | 38.2 |
| KR-47-720-030 | 1970 | 경북 | 영주시 | 풍기읍 | 31,550 | 6,140 | 1,870 | 9 | 18.6 |
| Module | Description | ETA |
|---|---|---|
| Core Keys miracle_id · province · municipality · township · concordances | Geographic identifiers and boundary concordances across the 1963/1973 reorganisations. | 2026 |
| Demographics population · households · age structure | Population counts, household numbers, demographic composition. | 2026 |
| Agriculture paddy area · crop output · livestock | Cultivated area, output (harmonised to metric units), livestock. | 2026 |
| Industry establishments · employment · output | Industrial establishments, manufacturing employment, sectoral output. | 2027 |
| Infrastructure roads · electricity · water · telecom | Road length, electrification, public utilities. | 2027 |
| Public Finance revenues · expenditures · transfers | Municipal revenue/expenditure, central transfers, fiscal capacity. | 2027 |
| Education schools · enrolment · teachers | School counts, enrolment, teachers, educational infrastructure. | 2027 |
| Geospatial shapefiles · centroids · boundaries | GIS boundary files with consistent township geometries. | 2027 |
| Institutions clan concentration · bureaucratic capacity | Pre-treatment institutional measures from 1930 registries and personnel files. | 2028 |
Work in progress and working papers using MIRACLE data:
If you are using or interested in using MIRACLE data, we would like to hear from you. Get in touch.
Digitisation proceeds province by province, constrained by the uneven survival of physical yearbooks across Korea's provincial archives. Hover over each province for details on coverage, year range, and scanning status.
Last updated March 2026
Interactive coverage map → — township-level digitisation progress across 191 municipalities.
Hiring research assistants for 2026–27. Get in touch.




For early access, collaboration, or questions — [enable JavaScript]
@techreport{seol2026miracle,
author = {Seol, BooKang and Lee, Changkeun and Yang, Hyunjoo},
title = {{MIRACLE}: Subnational Economic Data for
South Korea's Developmental Period, 1960--1989},
institution = {London School of Economics},
year = {2026}
}