The Flynn Effect: Why Average IQ Scores Rose for a Century

Throughout most of the 20th century, raw IQ test scores in many countries rose at an extraordinary pace — roughly three points per decade on average. This pattern, now known as the Flynn effect, is one of the most debated and consequential findings in the entire history of intelligence research. It tells us that whatever IQ tests measure, the number they return for a given population is not fixed. It shifts with time, environment, and circumstance. Understanding why has reshaped how researchers think about intelligence, nutrition, education, and the meaning of a test score.

1. What the Flynn Effect Is

The Flynn effect is named after political scientist James R. Flynn (1934–2020), a New Zealand–based researcher who spent decades documenting the phenomenon. Flynn did not discover the original data — earlier researchers such as Bryn Beynon had noticed pieces of it — but he systematically assembled evidence from multiple countries and forcefully argued for its significance in a pair of influential papers published in 1984 and 1987.

The core observation is simple: when a new IQ test is normed, test publishers set the mean score at 100 for the current population. But if you give an older version of the same test to people today, they tend to score noticeably higher than the old norms would predict. Conversely, people from earlier decades who took newer versions of tests often score lower than their reported scores suggested. The raw performance has gone up; the norm has been reset to hide the change.

Flynn quantified this by comparing norming samples across time. He found that in the United States, average raw scores rose by roughly 3 IQ points per decade from the early 20th century through at least the 1980s. Similar patterns appeared in the Netherlands, France, Norway, Japan, and many other countries. In some nations, the gains were even larger in the early period.

2. The Scale of the Gains

To appreciate how substantial the Flynn effect is, consider what a generational comparison implies.

Decade Comparison	Approximate Raw Score Increase
1 generation (~25 years)	~7 – 8 IQ points
2 generations (~50 years)	~15 IQ points
Full 20th century (100 years)	~30 IQ points

A 30-point shift over a century is enormous by any psychometric standard. If modern norms were held constant and applied backward, the average person in the early 1900s would test somewhere around IQ 70 by today's standards — a score currently classified as extremely low. This is obviously not a realistic picture of cognitive capability in earlier eras; it instead tells us that the tests are measuring something strongly influenced by environmental conditions that changed dramatically over the 20th century.

The gains were not uniform across all subtests. Abstract reasoning tests — particularly those involving pattern recognition and fluid reasoning, such as Raven's Progressive Matrices — showed the largest gains. Tests of crystallized intelligence (vocabulary, general knowledge, arithmetic) showed smaller gains. This pattern is one of the most important clues about the underlying cause.

3. Proposed Explanations

No single explanation commands universal agreement among researchers. The scientific consensus is that the Flynn effect is almost certainly multifactorial — produced by several interacting causes operating at different points across the century.

Better Nutrition

Severe malnutrition, especially protein deficiency and micronutrient deficiencies (iodine, iron) in early childhood, reliably depresses performance on cognitive tests. The dramatic improvements in nutrition across the 20th century — particularly in industrializing and developing countries — are a strong candidate for a portion of the gains. Evidence that micronutrient supplementation affects test performance in deficient populations supports this idea, though it is evidence about undernourished populations rather than a general-population effect.

Expanded Formal Education

Formal schooling trains several habits of mind that IQ tests reward: following standardized instructions, thinking in terms of abstract categories, applying rules to hypothetical scenarios, and approaching problems analytically. As years of schooling increased across populations during the 20th century, test performance likely benefited. The pattern of largest gains on fluid reasoning tests (which involve abstract problem-solving) is consistent with this — schooling trains the kind of abstract, context-free thinking these tests require.

Test Familiarity

Modern populations are more test-sophisticated than their predecessors. Standardized testing in schools, cognitive puzzles in games and entertainment, and general cultural familiarity with multiple-choice formats may all contribute to performance on IQ-style tasks. This does not mean the gains are "fake" — test-taking fluency is a real skill — but it suggests part of the gain reflects procedural familiarity rather than changes in underlying reasoning capacity.

Reduced Disease Burden

Childhood illness, especially infections that cause fever and interrupt brain development during sensitive periods, has declined sharply across the 20th century. Reduced parasite load in particular has been proposed as an underappreciated driver of the gains. Research by Christopher Eppig and colleagues found a substantial correlation between infectious disease burden and national IQ averages — though disentangling cause and effect in such cross-national data is notoriously difficult.

Environmental Toxin Reduction

Lead exposure is neurotoxic at any level, and childhood lead exposure from leaded paint and leaded gasoline was widespread in industrialized countries for much of the 20th century. The removal of lead from gasoline and paint — largely completed in the United States by the 1990s — has been associated with cognitive gains in some research. Some investigators argue that lead reduction alone could account for a meaningful fraction of the Flynn effect in affected countries.

Smaller Family Sizes and Broader Cognitive Environments

The 20th century also brought smaller family sizes, denser and more complex urban environments, broader exposure to abstract media (reading, television, later the internet), and increased cognitive demands in many occupations. Any of these could plausibly support improved performance on abstract reasoning tests.

4. What the Flynn Effect Does NOT Mean

Because the Flynn effect involves rising IQ scores, it is sometimes misread as evidence that intelligence itself is rising, or that we can deliberately raise IQ scores. Neither claim is well-supported.

The gains are not straightforward evidence of rising general intelligence. Flynn himself was explicit about this distinction. The gains are heavily concentrated in abstract reasoning tasks — the kind of thinking that modern schooling and modern environments specifically reward. Other cognitive capacities, including practical knowledge and verbal ability, showed much smaller gains. This selective pattern suggests the tests are capturing improvements in specific learned skills, not a general rise in underlying cognitive capacity.

The Flynn effect does not support the idea that specific interventions reliably raise IQ. The effect unfolded across entire populations over decades, driven by broad secular changes in health, education, and environment. Researchers have not been able to isolate any single ingredient and use it to reproduce the effect on demand. Individual cognitive training programs have not been shown to produce anything resembling Flynn-magnitude gains.

Flynn scores are not comparable across eras without correction. This is the practical measurement implication of the effect: a score of 100 in 1960 and a score of 100 in 2000 are not the same thing in terms of raw test performance. Test publishers renorm regularly (roughly every 15–20 years) to keep the mean at 100, which erases the visible trend from standard score reports.

5. The Possible Reversal: A 'Negative Flynn Effect'

Perhaps the most surprising development in this area of research is evidence that the Flynn effect has slowed, stalled, or even reversed in some wealthy countries since roughly the 1990s.

Norwegian researchers Bernt Bratsberg and Ole Rogeberg published a widely-cited 2018 study showing that among Norwegian men born after the mid-1970s, test scores on military conscript data began declining. Similar patterns have been reported in Denmark, Finland, Sweden, France, the Netherlands, and the United Kingdom — essentially the countries where the positive Flynn effect was most clearly documented.

The causes of this possible reversal are debated. Proposals include:

Compositional shifts in who takes the tests
Reduced educational emphasis on the specific analytical skills IQ tests reward
Changes in leisure activities (passive screen time replacing active reading)
Immigration patterns affecting national norming samples
Measurement artifacts in specific datasets

The evidence is not yet definitive, and patterns differ markedly across countries. Some researchers caution that the apparent reversal in Norway reflects within-family data and may not generalize. Still, the prospect that the secular rise has plateaued is an active area of investigation.

6. Why the Flynn Effect Matters for Interpreting IQ Scores

The practical implications for anyone reading an IQ score are significant.

First, the Flynn effect is the main reason IQ tests are renormed every 15–20 years. If you take a test normed 30 years ago and apply today's raw score to those old norms, you will receive an inflated score — higher than your performance would yield on a current norming sample. This inflation is sometimes called the "outdated norms" problem and is clinically relevant: using old norms can lead to underestimating cognitive difficulties in individuals being evaluated.

Second, the Flynn effect reinforces that an IQ score is not a pure measure of fixed innate capacity. It captures performance on a specific set of tasks at a specific point in historical time, using norms calibrated to a specific population. The score is real and informative, but it is not a timeless quantity.

Third, the pattern of gains — concentrated in fluid reasoning and abstract pattern recognition — is a reminder that the cognitive skills IQ tests most heavily reward are substantially learnable through education and environmental exposure. This is not the same as saying "IQ can be trained up"; it means that access to quality education and rich cognitive environments at a population level is associated with better average test performance at a population level.

Frequently Asked Questions

What is the Flynn effect in simple terms?

The Flynn effect is the documented rise in average IQ test scores that occurred across most of the 20th century in many countries. Raw scores went up by approximately three points per decade in the United States and similarly in many other nations. Because test publishers regularly reset the mean to 100, this rise is invisible in reported scores but shows up when older and newer norming samples are compared directly.

Who discovered the Flynn effect?

The pattern was documented and popularized by political scientist James R. Flynn, whose analyses of multi-country data published in 1984 and 1987 drew widespread attention. Flynn spent the rest of his career refining and debating the explanation. He published extensively on the topic until his death in 2020, including the books 'What Is Intelligence?' and 'Are We Getting Smarter?'

Does the Flynn effect mean people are getting smarter?

Not in a straightforward sense. The gains are heavily concentrated in abstract reasoning and pattern-recognition tasks, with much smaller gains in crystallized knowledge. Flynn himself argued that the scores reflect improvements in abstract analytical thinking skills that modern education and environments cultivate — not a general rise in underlying intelligence. Many cognitive capacities not well-captured by standard IQ tests may not have changed at all.

Has the Flynn effect stopped?

Evidence from several Western European countries suggests that the positive trend has slowed or reversed since roughly the 1990s. Norwegian military conscript data and several other national datasets show declining raw scores in cohorts born after the mid-1970s. The causes are debated. The effect has not clearly reversed in all countries, and some researchers question whether measurement artifacts explain part of the pattern.

Why does the Flynn effect matter for IQ testing today?

It is the central reason IQ tests must be periodically renormed. Using outdated norms produces inflated scores — a person being evaluated on a test normed decades ago may receive a score several points higher than they would on a current test. This is clinically significant. It is also a conceptual reminder that IQ scores are norm-referenced, historically calibrated measures, not timeless reflections of fixed capacity.

Did James Flynn think one cause explained the whole effect?

No. Flynn consistently resisted single-cause explanations, even though he regarded some factors (particularly education and the spread of abstract thinking) as especially important. He argued the effect was almost certainly multifactorial — driven by cumulative environmental improvements including nutrition, health, schooling, and general cognitive complexity — and that teasing apart the contributions of each factor remained a challenge for future research.

Summary

The Flynn effect is one of the most revealing findings in the history of cognitive science. It demonstrates that population-level IQ test performance is not a fixed quantity — it moved substantially across the 20th century in response to broad changes in health, nutrition, education, and environment. The gains were concentrated in abstract reasoning tasks, the rise has apparently slowed or reversed in some countries since the 1990s, and no single explanation commands consensus. For anyone reading an IQ score today, the effect is a reminder that any score is a norm-referenced, historically situated measurement — informative, but not a timeless verdict about innate capacity.

Brambin offers an eight-dimension cognitive profile designed for self-exploration. It is not a clinical assessment and is not intended for diagnosis or educational placement. Treat any online score — ours included — as a starting point for curiosity, not a verdict.

The Flynn Effect: Why Average IQ Scores Rose for a Century