Analysis

Race: A Primitive Model in a Precision Age

The human genome contains three billion base pairs. The concept of race compresses this into five categories. We sorted ourselves using a system with less resolution than a medieval map — and never bothered to upgrade it. Forget regurgitating old tribal memes. It's the British-created era of AI.

Alex Coppen

27 Feb 2026 — 12 min read

Somewhere in the deep architecture of how we think about ourselves, there sits an idea so familiar it has become invisible. It is older than modern science. It predates the discovery of DNA by centuries. It was never formally tested against the structure of biology, never subjected to the rigours we would demand of any other scientific classification, and never revised when the data came in. It simply persisted — carried forward by habit, convenience, and the overwhelming confidence of the human eye.

The idea is race.

If you proposed race today, as a fresh scientific taxonomy, it would be rejected on arrival. Not because it is offensive. Because it is sloppy.

Consider the basic requirements for any biological classification to be taken seriously. It must demonstrate discrete boundaries between categories. It must show internal coherence within those categories. It must display external distinctness — members of one group being reliably distinguishable from members of another. And it must possess predictive power — the classification must tell you something useful about the thing being classified.

Species meet this standard. Blood types meet it. Genetic disorders meet it. Race does not meet a single one of these criteria with any scientific precision. It fails on boundaries, coherence, distinctness, and prediction — all four. And yet it remains, like a Georgian drainage system beneath a modern city, unexamined because it was installed before anyone thought to ask whether it worked.

Three Billion Letters, Five Crude Bins

The human genome is a document of extraordinary complexity. Approximately three billion nucleotide base pairs encode the biological instructions for building and operating a human being. Across the global population, millions of positions within this sequence vary between individuals. Each person can be understood as a unique coordinate in an immensely high-dimensional genetic space.

Modern population genetics can measure similarity across this space with remarkable precision. When it does, it finds structure. Real, measurable, reproducible structure. Individuals cluster statistically with others who share recent common ancestry. These clusters correlate with geography, because geography shaped mating patterns, migration routes, and environmental pressures over millennia.

None of this is controversial. The structure exists. The question is whether the traditional concept of race describes it accurately.

It does not.

The traditional racial framework established by Johann Friedrich Blumenbach takes this vast, high-dimensional landscape — thousands of distinguishable populations, millions of ancestry combinations, continuous gradients of similarity stretching across continents — and compresses it into roughly five categories (Caucasian, the white race; Mongolian, the yellow race; Malayan, the brown race; Ethiopian, the black race; and American, the red race), and sometimes more. It is the equivalent of reducing a photograph containing sixteen million colours down to five crayons and claiming you have captured the image. You can still see something. A vague outline. A smudge of recognisable form. But the overwhelming majority of real information has been destroyed in the compression.

Population genetics works with populations — groups of individuals sharing statistically greater genetic similarity due to common ancestry and historical interbreeding. Sardinians form a real genetic population. So do the Yoruba, the Han Chinese, the Basques, the Inuit, the Icelanders. These are measurable, distinguishable, historically traceable groupings. They emerge naturally from the data, not from intuition or appearance.

Race ignores almost all of this fine structure. It lumps the Yoruba with the Maasai, the Dinka with the Zulu — populations containing more internal genetic biodiversity than exists across the entirety of Europe and Asia combined. It places a Japanese individual and a Mongolian herder into the same box, despite centuries of distinct population history. It is not merely imprecise. It is operating at a resolution so low it actively obscures the real biology.

Why Your Eyes Deceive You

There is an uncomfortable mechanical truth about why race feels so obviously real: the human perceptual system evolved to detect precisely the kinds of signals race is built upon.

Skin pigmentation. Facial bone structure. Hair morphology. Eye shape. These are the features your visual cortex processes fastest and most fluently, because they were survival-relevant in ancestral environments. Recognising kin, identifying strangers, reading threat and alliance from physical cues — these were matters of life and death for hundreds of thousands of years. Evolution sculpted a perceptual engine exquisitely tuned to detect exactly these surface differences.

And those differences are real. Nobody with functioning vision would confuse an individual of deep West African ancestry with a Scandinavian. The phenotypic variation is genuine, measurable, and biologically grounded. Your eyes are not lying to you.

But they are giving you a spectacularly narrow slice of the total picture.

The traits driving racial perception — pigmentation, facial structure, hair form — are controlled by a remarkably small number of genes. Skin colour, the single most powerful visual signal in racial classification, is governed primarily by variants in perhaps six to ten key genes: MC1R, SLC24A5, SLC45A2, TYR, OCA2, KITLG, and a handful of others. Out of roughly twenty thousand protein-coding genes in the human genome, the entire visible basis for racial classification rests on a fraction so small it barely registers as a rounding error.

This creates a profound mismatch. Your perceptual system treats these few genes as though they represent the whole organism. They do not. They represent a tiny, environmentally selected subset — the biological equivalent of judging an entire library by its cover art.

Two individuals who look strikingly similar may be genetically distant. Two individuals who look dramatically different may be genetically close. Dark skin evolved independently in Sub-Saharan Africa, South India, indigenous Australia, and Melanesia — populations with vastly different overall genetic profiles arriving at the same phenotypic solution through convergent evolution. The visual signal says "same." The genome says "very different histories."

This is not a philosophical objection. It is a measurement problem. Perception is sampling approximately 0.001% of the variation space and treating it as representative. Modern genomic analysis samples hundreds of thousands to millions of sites. The difference in resolution is not incremental. It is several orders of magnitude.

The Coastline Problem: Where One Race Ends and Another Begins

Ask a simple question: where does the European race end and the Asian race begin?

Draw the line. Be precise.

You cannot do it, because no such line exists in biology. Human genetic variation across Eurasia changes gradually with distance — a continuous gradient called a cline. Populations in Turkey share genetic characteristics with both European and Central Asian groups. Populations in the Caucasus blend imperceptibly into both. Uyghurs carry ancestry components from East Asian and West Eurasian lineages in varying proportions. There is no cliff edge. There is no wall. There is a long, gentle slope extending over thousands of kilometres, and wherever you choose to draw your boundary is a decision imposed upon a system which contains no such boundary itself.

This is sometimes called the coastline problem in mathematics. The measured length of a coastline depends entirely on the resolution of your measurement. Zoom in, and more detail appears. Zoom out, and it simplifies. There is no objectively correct answer, because the underlying structure is fractal — detailed at every scale.

Human population structure behaves the same way. Tell a clustering algorithm to find three groups in global genetic data, and it will produce something resembling traditional continental categories. Tell it to find five, and new subdivisions appear. Ten, and finer structure emerges. Twenty, and you begin resolving regional populations. There is no natural number of human groups encoded in the genome. The number you get depends entirely on the resolution you choose.

Race assumes the answer is somewhere between three and seven, or maybe a dozen or two. Population genetics shows the real answer is: it depends on what you are trying to measure, and at what scale. The structure is real. The categorical boundaries are not.

The Purity Illusion: Every Lineage Is a Mixture

As we pointed out last week, consider a child born to a Jamaican father and a red-haired Scottish mother. What race is this child?

The question seems straightforward until you examine the premises it requires. It assumes the father carries a fixed, coherent "Jamaican" racial identity — but Jamaican populations are themselves complex mixtures of West African, European, and Indigenous Caribbean ancestry.

It assumes the mother carries a fixed "Scottish" identity — but modern Scots descend from at least three distinct ancient populations: Western European hunter-gatherers, early Anatolian farmers, and Eurasian steppe pastoralists. The child is not a blend of two pure types. The child is a new recombination drawn from parents who were themselves recombinations of earlier mixtures.

This is not an edge case. This is the universal condition.

Every population on earth, without exception, is a composite of earlier populations. Europeans are at minimum a three-way mixture. South Asians carry ancestry from multiple ancient Eurasian lineages. Sub-Saharan Africans contain the deepest and most complex internal diversity of any human group on the planet. There is no "pure" reference model anywhere in the species. There never was. Race requires fixed types from which deviation can be measured. Biology provides only continuous recombination from which no fixed reference point can be extracted.

The concept of purity is not merely politically charged — it is biologically incoherent. It presupposes a static original form, when the actual mechanism of sexual reproduction guarantees permanent recombination. Every generation reshuffles the deck. There is no card which remains in the same position across successive hands.

Albino Paradox and the Short-Circuit

Albinism provides one of the most striking illustrations of how superficial racial classification truly is. Interestingly in the same way sign language collapses the "language prescribes reality" fallacy in the social sciences.

An albino individual of deep West African ancestry — carrying mutations in genes like TYR or OCA2 which disable melanin production — will present with very pale skin, light hair, and light eyes. Visually, the most powerful racial signal has been switched off. Yet genetically, this individual remains overwhelmingly closest to their ancestral West African population. Their genome overall reflects thousands of generations of shared ancestry. Only the melanin production pathway has been disrupted.

Racial classification, confronted with this individual, simply breaks down.

The visual heuristic produces one answer. The genome produces another. They cannot be reconciled within a categorical framework, because the framework assumes visible traits reliably indicate overall genetic identity. Albinism demonstrates, with clinical clarity, they do not.

Pigmentation is a biochemical process. Disable the relevant enzymes, and it disappears — regardless of ancestry, regardless of population history, regardless of every other gene in the genome. The trait most central to racial perception is controlled by a handful of molecular switches. It is mechanistic, not categorical. It is chemistry, not essence.

Monogenesis Is Settled: Only Ever One Origin

In the nineteenth century, a genuine scientific debate existed between monogenesis — the idea of a single human origin — and polygenesis, the idea of multiple independent origins for different human groups. Polygenesis, had it been true, would have provided a biological foundation for discrete racial categories. Separate origins would imply deep, fundamental divergence.

Modern genetics has closed this question permanently.

All living humans descend from a common ancestral population in Africa, roughly two hundred to three hundred thousand years ago. The evidence is overwhelming and convergent: shared DNA architecture across all populations, shared rare mutations, genetic biodiversity patterns consistently pointing to Africa as the deepest source. Non-African populations descend from a subset of this African population — groups which migrated outward beginning approximately fifty to seventy thousand years ago.

This timeline matters enormously.

Fifty to seventy thousand years is very short in evolutionary terms. It is not remotely long enough to produce separate subspecies. For comparison, recognised subspecies in animals like wolves or tigers diverged over hundreds of thousands to millions of years with minimal gene flow between them. Humans have never experienced anything approaching this degree of isolation. Migration, trade, conquest, intermarriage — gene flow has been continuous throughout human history, maintaining the species as a single, interconnected reproductive network.

If races were genuinely separate biological lineages, genetic distances between human groups would be vastly greater than they are. They would approach the distances seen between subspecies in other animals. They do not. The numbers are not even close. Human genetic differentiation between continental populations, measured by the standard metric FST, is modest — far below the thresholds used to define subspecies in zoology.

We are one species with shallow variation, not multiple lineages with deep separation.

What a Precision Framework Actually Looks Like

If race is the wrong model, what is the right one? The answer already exists. Population genetics has provided it for decades. A higher resolution scientific framework describes human variation using populations, ancestry components, and genetic clustering — precise, measurable, and biologically real concepts which capture the actual structure without imposing artificial categorical limits unjustifiable in scientific terms.

An ancestry analysis does not declare you belong to a race. It reports probability distributions: percentages of your genome statistically associated with different historical populations. Forty per cent Northwest European. Thirty-five per cent West African. Twenty-five per cent Indigenous Caribbean. These proportions reflect real genetic inheritance from real historical populations. They are not a lexical shell game. They arise from measurable patterns in the DNA.

The accurate unit of human biological classification below the species level is the population — a group of individuals with statistically greater genetic similarity due to shared ancestry and historical interbreeding. The Yoruba are a real genetic population. So are Sardinians, Han Chinese, Icelanders, Tamils, and Inuit. These populations are genetically measurable, statistically distinguishable, and historically traceable.

But even these populations overlap. They mix. They change over time. They exist within a nested evolutionary hierarchy — continental ancestry groups containing regional populations, containing local populations, containing families, containing individuals. Race collapses this entire hierarchy into a single crude layer. It is a model with no zoom function, permanently stuck on the lowest magnification, insisting the blurred image it produces is all there is to see.

Human biodiversity is not accurately represented as red, yellow, black, white, brown.

The Evolved Compression Engine

So why does race persist? Why did humans hold on to this particularly simplistic classification in the age of massive technological advancement, and why does it feel so intuitively correct despite its scientific crudeness?

The answer lies not in biology but in cognitive architecture.

The human brain is, at its most fundamental level, a prediction engine operating under severe resource constraints. It cannot process every stimulus from scratch. It cannot hold the full complexity of a three-billion-base-pair genome in working memory. It must compress, categorise, and approximate — constantly, rapidly, and automatically.

This is not a defect. It is the core operating principle of biological intelligence. Without categorical compression, you could not recognise objects, learn language, or navigate the physical world. The same cognitive machinery recognising a chair as "chair-like" without analysing its molecular structure is the machinery sorting human beings into perceived types based on a handful of visible features.

Paul Bloom captured this with precision: our capacity to categorise people is not an arbitrary quirk of the mind but a specific instance of a general cognitive process — extracting predictive structure from experience and applying it to novel instances. You have experience with chairs. You generalise. You have experience with different-looking humans. You generalise. The mechanism is identical.

Unfortunately for the lunatics in the social sciences, stereotypes are more accurate than they are not accurate. As Bloom puts it:

Our ability to stereotype people is not some sort of arbitrary quirk of the mind, but rather it's a specific instance of a more general process, which is that we have experience with things and people in the world that fall into categories and we could use our experience to make generalizations of novel instances of these categories. So everyone here has a lot of experience with chairs and apples and dogs and based on this, you could see these unfamiliar examples and you could guess—you could sit on the chair, you could eat the apple, the dog will bark.

The problem is not the mechanism. The problem is the resolution. Evolved categorical cognition was optimised for small-scale ancestral environments where populations were relatively homogeneous and encounters with dramatically different phenotypes were rare. In those conditions, coarse visual categorisation was adequate. The error rate was tolerable. The cost of higher-resolution processing was not justified by the marginal survival benefit.

We are now operating this Stone Age compression engine in a global high-tech city environment it was never designed for, and treating its output as though it were high-fidelity scientific data. It is as though we had inherited a telescope from the seventeenth century and were using it to map exoplanets, insisting the blurry images it produced were definitive.

What the Genome Demands of Us

The human genome does not organise itself into races. It organises itself into a continuously mixing network of populations with varying degrees of genetic similarity, shaped by migration, isolation, adaptation, and recombination over deep time. The structure is real. The clusters are measurable. The variation is genuine and biologically meaningful.

But it is not categorical. It is not discrete. It is not fixed. And it is not captured — even approximately well — by the crude bins we have been using for three centuries.

We possess, right now, the instruments and the mathematics to describe human biological variation with extraordinary precision. Whole-genome sequencing can place any individual within the global ancestry landscape with a resolution our ancestors could not have imagined. Population genetics can trace migration histories, quantify admixture, identify adaptation signatures, and map the fine structure of human diversity at scales ranging from continental to village-level.

We have the high-resolution camera. We have had it for years. And yet we persist in describing the world using five crayons.

The concept of race is not wicked, in and of itself. It is not virtuous either. It is not a conspiracy and it is not a liberation. It is a primitive classification model — a first draft produced by a species whose perceptual hardware operates at dramatically lower resolution than the biological reality it attempts to describe. It was the best approximation available before we had the tools to do better. We now have those tools. The approximation has outlived its explanatory usefulness.

Human variation is one of the most extraordinary features of our species. It deserves a framework worthy of its complexity — not a relic from an era when we could barely see past our own village walls. The genome has been sequenced. The populations have been mapped. The gradients have been measured. The science is not ambiguous.

No, race is not a "social construct." Populations do not prescribe reality through word use.
No, stereotypes are not "harmful." They are how the brain functions.
No, human suspicion of other groups is not "racism." It is survival selection.
No, science is not "racist." It is a system of precision and error reduction.
No, race is not a good way of conducting human politics. It produces violence.

It is time to read the data at the resolution it was written in.