🧩 Why 25% of the Puzzle Was Never Going to Be Proof
In 2023, a dataset was leaked that was widely claimed by some to “prove” mass harm from COVID-19 vaccination. The narrative that followed suggested tens of thousands of vaccine-related deaths and attempted to draw sweeping conclusions about national mortality trends. But there was one critical problem, visible from the start — it wasn’t the whole dataset.
In fact, it wasn’t even close.
The file contained approximately 4.1 million dose entries, which represented about 25% of all vaccine doses administered in New Zealand at the time. That’s not a sampling error — it’s a structural one.
❌ What Was Missing?
This data represented only a portion of New Zealand’s vaccination records, specifically from what’s known as the Pay Per Dose programme — a system used to reimburse GPs, pharmacies, and some contracted providers for each dose they administered. It excluded:
-
Hospital and DHB-administered doses,
-
Most early Dose 1 and Dose 2 entries,
-
And critically, 75% of all recorded vaccine doses nationwide.
This means it was missing:
-
Most of the early vaccine rollout, which disproportionately included the elderly and clinically vulnerable,
-
Many of the higher-risk dose windows that would be needed to assess real-world mortality signals,
-
And nearly all of the denominator data needed to calculate anything statistically valid (like per-capita mortality or excess death rates).
📉 What Could It Actually Tell Us?
Not much.
Yes, you could see how many people in that specific system had died. But that alone doesn’t tell you:
-
How many people survived, which is required to calculate rates,
-
How many deaths would be expected based on age and time since dose,
-
Or whether the patterns were different to other datasets — because there was nothing to compare it to.
You can’t determine cause of death from a timestamp. And you can’t draw population-level conclusions from a non-random 25% of the data — especially not when the data is already structurally skewed.
🧠 The Real Lesson
There are valid questions to ask about health outcomes, vaccine safety, and mortality. But these questions require cohort-based, age-stratified, and statistically complete data — not a leaked subset of a reimbursement spreadsheet.
The government has released datasets under the Official Information Act that include:
-
Deaths by age, dose number, and time since last dose,
-
National mortality trends, and
-
Vaccine administration records across the full population.
Those are the tools of proper analysis. The stolen data? It wasn’t proof — it was a quarter of a jigsaw puzzle, missing all the edges, and held upside down.