Disasters · spreadsheet failures, examined architecturally

Reinhart-Rogoff
(2010)

An AVERAGE formula stopped at row 44.
Five countries: Australia, Austria, Belgium, Canada, Denmark: were left out.
The corrected figure flipped sign.

The paper claimed countries with debt above 90% of GDP saw −0.1% median real growth. The corrected calculation: roughly +2.2%. In the three years between publication and discovery, the paper had been cited by the European Commissioner for Economic Affairs, the UK Chancellor, and the US House Budget Committee: all justifying austerity programs.

Sources · Herndon, Ash & Pollin (2013), “Does High Public Debt Consistently Stifle Economic Growth?” PERI Working Paper 322 · Reinhart & Rogoff (2010), “Growth in a Time of Debt,” NBER Working Paper 15639 · authors' subsequent corrections, NYT 2013-04-17.

Diagnostic · what actually happenedpublic record · cited

The model

The working paper computed, across twenty advanced economies and roughly two centuries of data, the relationship between sovereign debt levels and real GDP growth. The headline finding aggregated countries into four debt-to-GDP buckets and reported median and mean growth in each. A single Excel workbook held the country-year observations, the bucket assignments, and the bucketed averages.

The change

In the worksheet that computed the high-debt-bucket average, the formula referenced rows L30:L44 : fifteen rows. The data block actually extended through row 49: twenty rows. Five countries that belonged in the high-debt bucket were silently excluded from the average. The omitted rows held growth observations for Australia, Austria, Belgium, Canada, and Denmark.

The propagation

The bucketed average was the paper's headline number. The headline number became Figure 2. Figure 2 became a one-line summary in the abstract. The one-line summary entered the policy bloodstream: the European Commissioner cited the 90%-of-GDP threshold; the Chancellor of the Exchequer cited the negative-growth finding; the House Budget Committee cited it in its 2013 budget proposal. The number was the empirical anchor for austerity arguments across two continents.

The reveal

Three years after publication, a graduate student at the University of Massachusetts, Thomas Herndon, requested the original spreadsheet for a replication assignment. He reproduced the calculation and found his numbers did not match. He emailed the authors. They replied with the workbook. The discrepancy was the range. Herndon, Ash, and Pollin published the correction.

The cost

The cost is contested and depends on what counter-factual one assigns to policy choices that would have been made differently. Estimates of the fiscal-multiplier effect of the austerity programs that cited the paper run from hundreds of billions of dollars in lost GDP to single-digit percentage points of recession depth in several European economies. We do not litigate the counter-factual here. The architectural point stands independent of it.

The architectural cause

The architectural cause is not that the researchers were careless. The researchers were among the most careful in their field. The architectural cause is that Excel offered no structural way to express the invariant that the range must cover all of the rows that hold high-debt observations.There is no type system that would have known “this range should be a list of country-years with debt > 90%” and there is no assertion engine that would have checked “count of bucketed observations equals count of high-debt country-years in the source.” The substrate gave the operator no way to encode the invariant they were trying to enforce. The operator did everything right within the bounds of what the substrate allowed. The bounds were wrong.

Specimen · the range, reconstructed

RR-public-debt.xlsxreconstructed from public corrections · for illustration

L50|=AVERAGE(L30:L44)[ should have been =AVERAGE(L30:L49) ]

Row

Country (high-debt observations)

Real GDP growth

United States

−1.8%

…

United Kingdom

−2.4%

Australia ← omitted from the average

+3.7%

Austria ← omitted from the average

+1.4%

Belgium ← omitted from the average

+1.5%

Canada ← omitted from the average

+2.5%

Denmark ← omitted from the average

+2.4%

With L30:L44 (15 rows)

−0.1%

With L30:L49 (20 rows)

+2.2%

The figures above are reconstructed from the published corrections (Herndon, Ash & Pollin, 2013). The exact row numbers and country values in the original spreadsheet have been simplified for legibility; the architectural shape: a range that should have extended five rows further : is preserved.

The same model, in typed IR

The omitted countries
would have names.

In Flatland the high-debt bucket isn't a cell range; it's a computed driver that depends on each country-year by name: sum(united_states, ..., denmark). Dropping Denmark isn't a range boundary quietly stopping five rows early; it's deleting a named reference, visible in the formula and in the git diff. A first-class assertion rides alongside it: a coverage_gap driver compares the observations actually in the bucket against the count the model expects, and reports pass or fail on every compile. When the range falls short, the assertion turns red and the answer carries its own failure flag. The number still computes, but it cannot come out clean.

The deeper invariant, that the bucket should automatically cover every country-year where debt exceeds 90%, is the direction a typed substrate points. Flatland enforces the named composition and the assertion today; the automatic population predicate is where the substrate is heading.

The point isn't that Reinhart and Rogoff would have caught it with a different tool. The point is that the substrate they worked in offered no way to make the omission legible. Financial Reasoning Infrastructure does.

Try the substrate · sixty seconds

Compile a model
whose assertions can fire.

Three questions. A typed compile. A real Excel export. The same six-sheet artifact Reinhart and Rogoff would have produced: but with assertions that can fail loudly when the math is wrong.

Open the Workshop →Read /why typed

This page is commentary on a public-record event, reconstructed from the cited public sources above. It is not legal or financial advice, and Flatland is not affiliated with Carmen Reinhart, Kenneth Rogoff, Harvard University, or any party named here. The architectural analysis describes how a typed compilation substrate represents this class of model; it is not a claim about what any individual did or should have done.

Reinhart-Rogoff
(2010)

The omitted countries
would have names.

Compile a model
whose assertions can fire.

Product

Listed on

For

Learn

Comparisons

Disasters

Docs

Discover

Company

Legal

Reinhart-Rogoff(2010)

The omitted countrieswould have names.

Compile a modelwhose assertions can fire.

Product

Listed on

For

Learn

Comparisons

Disasters

Docs

Discover

Company

Legal

Reinhart-Rogoff
(2010)

The omitted countries
would have names.

Compile a model
whose assertions can fire.