3 Persuasion as a Major Transition in Evolution

Chapter Overview

This chapter situates persuasion within deep evolutionary history. The capacity to influence the beliefs and actions of others is a biological phenomenon as much as a cognitive or cultural one: the biological record shows it has repeatedly restructured life at the level of the organism, the colony, and the species. The chapter surveys the evidence from eusocial insects, cooperative mammals, and primate societies before turning to human language as the most radical persuasive technology yet evolved.

Somewhere in France, roughly 17,000 years ago, a group of people entered a cave and by torchlight painted the walls with horses, bison, and deer. They mixed pigments, coordinated effort, returned across multiple sessions. Those images are still legible to us today. Anatomically modern Homo sapiens had existed for at least 60,000 years before those paintings appeared, and for most of that time left almost nothing behind. Then, around 40,000 years ago, the record changes: art, burial with grave goods, long-distance trade, explosion of tool types. The question this chapter tries to answer is what changed, and why it took so long.

Language is the obvious candidate, but its role only becomes legible once we understand the problem it solved. Look at what actually appears in the record at 40,000 years ago. Cave art that several people had to plan and return to across sessions. Ornaments traded across hundreds of kilometres. Burials staged by a community. Every entry on the list is an act of cooperation, of many individuals coordinating their behaviour around something none of them could produce alone. The puzzle of the cultural explosion is at bottom a puzzle about cooperation, and more precisely about its scale: how large a group can be brought to act as one, and what sets the limit. Laying out the pieces of that puzzle, and seeing how they fit together, is much of what this book sets out to do.

Evolution has been working on that question for hundreds of millions of years. Bacteria cooperate. Ants cooperate. Chimpanzees cooperate. Each time the intent is the same, to bring other agents to act in ways that serve a shared objective: survival, reproduction, resources, defence. And each time the mechanism is the same in kind, a signal passed from one agent to another that changes what the receiver does. What differs from one case to the next is how far the signal reaches, and with it how large a cooperating group can grow. The means of communication sets the ceiling on the scale, and that relationship, between a mode of signalling and the size of the unit it can hold together, is the spine of this chapter.

Prehistoric cave paintings on the walls of the Lascaux cave in France, showing horses painted approximately 17,000 years ago. — Cave paintings at Lascaux, France, painted roughly 17,000 years ago. The artists mixed pigments, coordinated effort across sessions, and produced images that survived intact to the present day — the oldest surviving evidence of the kind of shared representation that only language makes possible. *Photo: JoJan. CC BY 4.0.*

Start with our own species, measured against its relatives. The same regression that places a chimpanzee in a troop of a few dozen, and most monkeys in groups smaller still, predicts a human group of about 150. Dunbar [16] built it by measuring the ratio of neocortex volume to whole-brain volume across 36 primate genera and setting it against each species’ typical group size; the correlation is strong, r = 0.77, the more neocortex a species carries, proportionally, the larger the group it lives in. The cognitive load of tracking who is who, who helped, who cheated, who allied with whom, climbs with group size, and a bigger neocortex is what pays for it. Our outsized neocortex buys the largest face-to-face group of any primate, about 150, a figure now called Dunbar’s number [1] that recurs with striking consistency: in hunter-gatherer bands, Neolithic villages, military companies, and the working social networks people keep today.

But 150 is a ceiling other primates never reach, and the reason is the mechanism they bind their groups with. Apes hold a group together by grooming, one pair at a time, and grooming already consumes about a fifth of their waking hours; the time budget caps them at fifty to eighty. Language lifts that ceiling. Talk reaches several listeners at once, costs little time per bond, and carries on while hands and eyes are busy elsewhere [15], which is what lets a human sustain 150 relationships where grooming stalls below a hundred; Section 3.9 returns to how it does so. And the logic runs past 150. Writing, institutions, and shared story push the number beyond any face-to-face limit, into the thousands and then the millions, the shared fictions that Section 3.9.9 examines.

And scale here means more than the size of a group. Each time communication reached further, it did more than enlarge a circle of cooperation; it fused the members into a higher order of living thing, a body out of single cells, a colony out of solitary insects, a society out of strangers. A stronger medium of communication is what every higher level of life is assembled from, and the larger the unit, the more powerful the medium it took to hold it together. That is the through-line Section 3.2 takes up next and Section 3.11 follows to its end, from the chemistry of a single cell to the grammar of a civilisation.

3.1 Persuasion, Communication, and Coordination

Three concepts recur throughout this chapter and are often conflated in ways that obscure the biology.

Persuasion is the intent: the goal of aligning another agent’s beliefs, preferences, or actions with one’s own, or with some shared objective. It is a property of the agent, located in the goal the signal serves. A honeybee performing the waggle dance is engaged in communication as a means to an end: she is trying to recruit foragers to a flower patch. A chimpanzee coalition partner submitting to a dominant male exchanges submission: that is how alliances are secured. The intent to achieve a shared or aligned outcome is what distinguishes persuasion from noise.

Communication is the means by which persuasion is pursued. A pheromone trail, a waggle dance, a dominance display, an argument, a narrative: these are the mechanisms through which one agent attempts to shift the state of another. Communication is in service of persuasion. The signal is the instrument; the alignment it produces is the goal. This ordering matters, because it changes how we ask questions. The question worth asking about any signal is what outcome it was selected to achieve, and how reliably it achieves that. Evolution selects for persuasive outcomes. A signal that produces reliable alignment, even through manipulation or chemical suppression, is as evolutionary successful as one that produces it through accurate information transfer.

Coordination is the result at the group level when persuasion succeeds across multiple agents. Individuals act in ways that are mutually consistent and collectively productive because each has been brought to expect and rely on the behaviour of the others. Coordination is harder than cooperation: it requires shared models of what each party will do, over and above shared goals. Language is what made coordination possible beyond the bounds of direct observation, because language can represent plans, obligations, and the expected behaviour of parties who are absent, distant, or not yet born.

Persuasion is the intent, communication is the mechanism, coordination is the outcome. What this chapter traces is how evolution built progressively more powerful mechanisms for pursuing that intent, from chemical suppression in insect colonies to recursive grammar in human societies, each capable of achieving coordination at a larger scale than the last.

3.2 The Major Transitions Framework

Every cell in your body carries the same genome, yet 37 trillion of them divide labour across tissues with no central command. A honeybee colony directs fifty thousand workers toward one reproductive goal. A civilisation coordinates billions of strangers. The same problem recurs at every scale: how do formerly independent agents come to act as one?

Maynard Smith and Szathmáry [31] argued that the history of life is punctuated by eight moments when this problem was solved by building a new level of organisation. They named them major transitions:

Table 3.1: The eight major transitions identified by Maynard Smith and Szathmáry. In each, agents that had competed for their own replication came to serve the survival of the larger unit they composed, held together by a medium of communication that grew stronger at every step.

Transition	Independent agents	New unit	Communication medium
1	Replicating molecules	Molecules in a compartment	Physical enclosure
2	Independent genes	Chromosomes	Physical linkage
3	RNA (gene and enzyme)	DNA and protein	Molecular code
4	Prokaryotic cells	The eukaryotic cell	Chemical regulation
5	Asexual clones	Sexual populations	Genetic exchange
6	Single cells	Multicellular organisms	Hormonal gradients
7	Solitary insects	Eusocial colonies	Pheromones and touch
8	Primate societies	Human language societies	Language and culture

Read down the list and one structure repeats. In every case, agents that had been competing for their own replication came to serve a shared objective: the survival of the larger unit they now composed. Genes stopped racing one another and replicated together inside the chromosome. Single cells gave up independent life and specialised inside a body.

An objective shared among formerly independent agents cannot enforce itself. Something has to align them, carrying from one agent to the next the information that their fate now lies with the larger unit. That something is communication, in the sense set out in Section 3.1: persuasion is the intent, the alignment of an agent toward a shared objective; communication is the means; coordination is the result.

The right-hand column of the table names the medium each transition relied on. Read as a sequence, those media are the argument of this chapter.

Each is stronger than the one before, able to align more agents, across greater distance, around ever more abstract objectives. And each, by binding its own level into a single unit, hands the next transition the platform it needs to begin.

Begin with the chromosome. A gene is a stretch of genetic material that carries instructions and can be copied. In the earliest life, genes floated as separate molecules, each copied on its own, and each free to copy itself as fast as it could. That freedom was the problem. A gene that replicated faster than its neighbours would flood the next generation with copies of itself, even when those copies did nothing for the cell, or dragged it down. A crowd of free-floating genes pulls itself apart. The chromosome ends the race by chemically binding the genes end to end into a single strand, copied once per division and handed on whole. No gene can run ahead, because none is copied on its own any more. Bound to a common fate, a gene that helps the cell now helps its own copies too, and the cell becomes worth building. Linkage bought the first cooperation in the history of life, and a genome that finally held together was the platform the next transition needed: stable enough to divide labour among the molecules it carried.

That labour first divided inside the cell. The earliest life ran on RNA, a single molecule forced to do two incompatible jobs. It had to hold information, which rewards a molecule that sits still and stays stable, and it had to drive chemical reactions, which rewards a molecule that folds into restless, reactive shapes. No molecule can be both at once. RNA did each job badly because it could never stop doing the other.

The transition broke the deadlock by splitting the two jobs apart. DNA took the archive and grew stable and inert; proteins took the chemistry and grew fast and versatile. Each became superb at its one task and, in the same stroke, helpless without the other. DNA now held every instruction and could act on none of them. Proteins could act on anything and remembered nothing. Between them lay a gap no chemistry could bridge, because the two speak in different alphabets. What spanned it was a shared code. The genetic code is a fixed table both sides honour: DNA spells out its instructions three letters at a time, each three-letter codon standing for one building block, and the protein machinery reads the same table to assemble what the instructions specify. The code let memory and action work as one system instead of two stranded halves, and that system, a stable archive commanding a versatile workforce, was the toolkit the next transition would wrap inside a single complex cell.

Communication next learned to cross open space. A chemical signal is a molecule one body releases that drifts to another and changes its behaviour by latching onto a receptor, the way a key turns a lock. One cell could now reach into another without touching it. The eukaryotic cell, the elaborate cell that every animal, plant, and fungus is built from, was born from exactly such a reach. Around 1.5 billion years ago one bacterium swallowed another and, instead of digesting it, kept it alive as a captive. The prisoner, ancestor of the mitochondrion, could wring far more energy from oxygen than its captor, and over time the chemical signals that had once run its own life were turned to the host’s purposes: the molecules that once governed the captive now answer the host’s demand for energy. A free-living organism had been reduced to an organ, held in place by a chemical leash. That leash then scaled up. In a body of many cells, all sharing one genome, a cell becomes nerve or muscle or bone by reading the concentration of signalling molecules around it, taking its orders from a gradient. Signalling let chemistry hold billions of separate cells to a single plan while leaving each one intact, and that reach across the gaps between distinct bodies was what the next transitions would extend further still: first to two organisms, then to swarms of them.

Sexual reproduction reached the first of those, the exchange between two whole organisms. An asexual lineage is a closed line: every offspring is a copy of one parent, so a good gene that arises in it can never travel to another line, and a population’s best discoveries stay locked in the separate lineages that happened on them, each dying out with the line that carried it. Sex forces those lines open. By drawing genes from two parents and shuffling them, a process called recombination, it gathers variants that arose apart into a single offspring, letting a population pool its scattered gains far faster than any lone line could accumulate them. With sex, coordination crossed a threshold it had never reached before, from within a single organism to between two of them, and the leap from two to many would follow in the colony.

Chemistry then breached the wall of the single body. In a eusocial colony the cooperating units are whole organisms, thousands of them, held together by the same kind of signal that once governed cells. A honeybee queen secretes a pheromone into the shared air and surfaces of the hive, picked up by the workers as they groom her and feed one another. It shuts down their ovaries and turns them to foraging and defence, so that fifty thousand insects spend their lives on the queen’s reproduction as though they were organs of her body. Touch carries its own traffic across the colony: a returning forager’s waggle dance is read by followers pressed against her in the dark, a message felt rather than seen, decoded in full in Section 3.4.1. One chemical, broadcast through a shared medium, could now bend an entire society of separate organisms to a single purpose. And there its power stopped. The signal fires a fixed response; a worker cannot question it, and the queen cannot use it to ask for anything she has not asked before. Coordination had reached the scale of a whole society, yet the message stayed frozen, and thawing it was the task that fell to the medium that came next.

Language broke that freeze. Every medium so far ran on a fixed signal and an automatic answer: linkage cannot be renegotiated, a pheromone admits no exception, a gradient gives the same order every time. A word is different. Its tie to its meaning is arbitrary and learned, so it can be combined, qualified, taken back, and argued over in the act of speaking. And it can point away from the present moment: to the past, to a future that has not arrived, to an obligation that exists only because people say it does. A plan for tomorrow’s hunt. A promise to repay next season. A rule that binds people who are not in the room. For the first time a signal could name a shared objective, invite the listener to weigh it, and be talked out of it, so cooperation could reach strangers and stretch across time. Section 3.9 takes up how language crossed this threshold, and Section 3.9.3 traces the climb from simple signals to the recursive grammar that makes such open-ended messages possible. And language opened the way to one further medium that nothing before it could sustain: culture.

A worker bee must grow its response to the queen afresh in every generation, because the message is written in its genes. A human child is born only with the capacity for language and takes its content, a story, a norm, a name to trust or avoid, from the people nearby. Once a shared objective could live in a story instead of a gene, it could spread between strangers within a single lifetime and pile up across generations, each one starting from where the last left off. Culture carried coordination past the family and past the single lifespan, out to the scale of tribes, religions, and states. It is the engine behind the shared fictions, money, nations, gods, that Section 3.9.9 examines, and the twin inheritance of genes and culture that Section 3.9.8 develops.

One line runs the length of the chapter: every time cooperation reached a new scale, a stronger medium of communication had to come first. Physical linkage bound the genome. A shared code joined memory to action. Chemistry built the cell and the body, and sex let two lineages pool what each had found. Pheromones held the colony to one purpose. Language tore the message loose from the present, and culture carried it beyond any single life. The sections that follow take these media in turn, opening with the chemical, tactile, and gestural signals of insect colonies and primate troops in Section 3.4, and closing on the recursive grammar, and the shared fictions, that let human societies reach a scale no other species has come near.

3.3 From Genes to Culture: Bio-cultural Co-evolution

Each of those transitions shared a common feature: the communication system that enabled the transition co-evolved with the biological machinery that executed it. In humans, this co-evolution took an unusual form, one in which culture itself became a selective pressure on the genome.

This account situates the evolution of language within a broader bio-cultural co-evolution. The genetic capacity for language, specialised cortical regions, precise articulatory control, sensitivity to syntactic structure, was shaped by selection pressures that were themselves cultural: the demands of hunting coordination, pantomimic storytelling, and finally conversational argumentation. Each cultural advance in communication created new selection pressure on the biological substrate; each biological advance in communicative capacity opened new cultural possibilities. The result, visible in the archaeological record at 40,000 years ago, was a communication system of unprecedented power, one that could coordinate the construction of hunting parties and shared fictions, institutions, and normative systems.

The mechanism connecting cultural practice to genetic change is sometimes called the Baldwin effect. When a behaviour is culturally learned and repeatedly beneficial, individuals who learn it faster or more reliably have social advantages: they coordinate better, secure more alliance partners, and survive more successfully. Over many generations, this selective advantage favours genetic variants that facilitate the learning. [3, 14] Applied to language: children who could acquire grammar more efficiently were better cooperators, and so better survivors. Over time, the biological machinery for grammar acquisition, the neural architecture of Broca’s and Wernicke’s areas, the sensitivity to phonemic contrasts, the timing of the critical period, became more precisely tuned. The innate grammar-acquisition device was itself shaped by the cultural practice of using language. The boundary between innate and learned dissolves here: the innate is what the learned, repeated over thousands of generations, selected into the genome.

The division of labour between genome and culture follows from this logic. As Tooby and Cosmides [43] argued, the genome may as well store the vocabulary in the “cultural environment”: the words themselves are learned from the surrounding community, acquired culturally rather than encoded in DNA. This is precisely the right division of labour given that cultural evolution is far faster than genetic evolution. The grammar provides a stable generative engine; the vocabulary, which must track cultural innovations such as new tools, new social roles, new institutions, can evolve and diversify at the speed of culture.

The palaeontological record shows exactly when these co-evolutionary pressures produced their decisive outcome. But to understand what language changed, it helps first to see what it replaced: the persuasion machinery that evolution had been building for hundreds of millions of years before any hominin arrived.

3.4 Persuasion in Non-Human Animals

Strip away language, and what you find underneath is chemistry, vibration, touch, and movement. The problem of getting independent organisms to act together has been solved many times over, long before anything recognisable as a mind appeared on earth. Bacteria coordinate through chemical gradients. Slime moulds aggregate when food runs short. The biological record of cooperation runs back further than we can easily imagine, and the machinery it produced is still operating everywhere around us.

What makes other species valuable for understanding persuasion is precisely that they simplify the problem. In humans, every act of influence is buried under language, learned convention, institutional role, and the accumulated etiquette of thousands of years of social life. The underlying structure is nearly impossible to examine directly. A bee colony strips it away. An ant raiding column strips it away. A naked mole rat tunnel does it in mammals. What remains, once the cultural layers are gone, is the biological core: signals that shift behaviour toward a shared objective, simple enough for a biologist to catalogue.

The picture that emerges, across hundreds of millions of years and dozens of independent lineages, is both reassuring and instructive. Reassuring because the cooperation problem, however hard it looks, has proven solvable again and again. Instructive because each solution freezes at a specific point. The waggle dance can specify a location to within three degrees of bearing, but it has no past tense and no future tense. Queen pheromones can suppress ovulation across an entire colony, but they cannot propose an exception. Seeing where each system stops is the most direct route into understanding what language eventually had to become.

3.4.1 Eusocial Insects: Division of Labour at the Extreme

3.4.1.1 Honeybees: Democratic Persuasion at the Hive

Consider what a forager bee is doing when she returns to the hive. She has found a patch of white clover two kilometres to the north-northeast. She knows its location relative to the current sun angle, its distance, its quality. She needs to convey all of this to fifty thousand individuals who have never seen it, using no words, no map, and no shared prior knowledge of the site. What she does is dance.

On the vertical face of the comb, she runs a figure-eight. The angle of the waggle run from vertical encodes the bearing from the sun; its duration encodes distance. Karl von Frisch [19] spent decades decoding this system. The direction turns out to be accurate to within three degrees, the distance to within ten per cent, precise enough to send a raiding party to a flower patch a few square metres across, two kilometres away, a location every departing bee is visiting for the first time.

What lifts it above a simple announcement is that the audience evaluates. When multiple foragers return simultaneously from different sites, followers switch from one dancer to another before committing. Seeley [39] catalogued at least seventeen distinct signal types in the colony, the waggle dance among them, and showed that the colony’s foraging decisions emerge from something structurally resembling a debate: advocates perform, receivers assess, collective behaviour follows.

Apply the same architecture to a harder problem and you see how far it can reach. When a swarm must relocate, scouts disperse independently, inspect candidate cavities, and return to dance for their preferred option, more vigorously the better the site. Other scouts visit the advertised sites; if persuaded, they switch allegiance and begin dancing for a different location. Seeley [38] showed that this process, run by individuals with no knowledge of the full option set, reliably converges on the best available site. The colony is, in effect, conducting a vote in a language made entirely of movement.

Running beneath all of this is a continuous chemical negotiation. The queen’s mandibular pheromones suppress worker ovarian development moment to moment; worker policing, enforced through chemical recognition of worker-laid eggs, keeps individual reproductive ambition in check [36]. Fifty thousand individuals share one reproductive agenda because they are bathed, continuously, in signals that make any other arrangement physiologically difficult to sustain.

3.4.1.2 Ants: Distributed Intelligence Through Chemical Language

Where the bee writes in movement, the ant writes in scent: a medium with different properties altogether. A pheromone trail persists after the ant who laid it has moved on. It self-reinforces: more followers deposit more chemical, strengthening the signal. And it self-erases: when a food source depletes and fewer ants return, the trail volatilises and fades. A leafcutter ant (Atta cephalotes) returning from a rich patch drags her gaster along the substrate, leaving a marker whose intensity encodes food quality [46]. Each ant contributes one data point; the aggregate becomes a resource allocation, updated continuously, the colony’s overall state distributed across every interaction rather than held in any single ant.

Army ants (Eciton burchellii) apply the same logic to a structural problem. When a raiding column encounters a gap in the forest floor, workers anchor themselves to each other and to the substrate, forming a bridge of living bodies. Each ant responds only to the traffic it feels through vibrations in its own legs: heavy use widens the bridge; light use contracts it. The structure assembles when needed, shifts as the column’s demands shift, and dissolves when the column passes, collectively intelligent and individually oblivious [46].

3.4.1.3 Beyond Insects: Naked Mole Rats and Eusocial Spiders

By the 1970s, biologists had an elegant explanation for eusociality: haplodiploidy. In Hymenoptera, females share three-quarters of their genes with sisters rather than the usual half, making worker altruism arithmetically rational. The phenomenon appeared confined to one peculiar corner of the insect family tree.

Then Jennifer Jarvis published her 1981 paper [28] on the naked mole rat (Heterocephalus glaber), and the elegant explanation began to look less general. The naked mole rat is a diploid mammal with conventional genetics, yet colonies of up to 300 individuals organise around a single breeding queen, all other females reproductively suppressed. The queen’s mechanism of enforcement has nothing of the pheromone cloud drifting through a hive. She shoves. She jostles subordinate females for hours at a stretch, and the cumulative physiological effect of her physical presence suppresses the hormonal surges that would otherwise trigger ovulation in them. The ecological explanation is straightforward: the soil of East African savannahs is hard, dry, and yields food only to coordinated burrowing by many individuals. Extreme inbreeding has made colony-mates genetically close enough that altruism stays profitable.

A naked mole rat, a wrinkled, nearly hairless mammal, the first confirmed eusocial mammal species. — A naked mole rat (*Heterocephalus glaber*) — the first mammal confirmed as eusocial, living in colonies of up to 300 individuals dominated by a single breeding queen. Unlike insect eusociality, the naked mole rat’s cooperativity cannot be explained by haplodiploidy; extreme inbreeding and the ecology of underground burrowing drive relatedness instead. *Photo: VigilancePrime / Wikimedia Commons. CC BY-SA 3.0.*

The colonial spider Anelosimus eximius in tropical South America arrives at the same destination by yet another route. Thousands of individuals share a communal web; when prey hits the silk and struggles, the vibration propagates outward and spiders from across the structure converge on the source. The web is the communication channel: a shared physical medium that broadcasts the signal of struggle simultaneously to every individual in the colony and produces coordinated collective action from a physics problem [46].

Bee, mole rat, spider: different phyla, different genetics, different media, same outcome. That convergence is the important finding. Cooperation is a robust solution to a persistent problem, achievable through chemistry, through movement, through vibration, through the physical structure of a shared web. Evolution finds it whenever the pressure is strong enough. In the primates, the pressure was strong enough, and so was something else: the cognitive capacity to track other individuals as individuals, to remember who had helped and who had defected, to plan across time. That changes what persuasion has to do.

3.4.2 Primate Social Persuasion: Grooming, Coalitions, and Politics

Among mammals, the primates provide the richest evidence for sophisticated social persuasion, communication aimed at both coordinating immediate behaviour and managing long-term relationships and social standing.

3.4.2.1 Chimpanzees: The Emergence of Political Persuasion

De Waal’s [45] detailed observations at Arnhem Zoo remain the canonical account of chimpanzee social politics. Male chimpanzees compete for alpha status through a sustained process of coalition building rather than brute force; an outright fight risks injury to both parties. An ambitious male will systematically groom potential allies, sharing food and providing social support in return for backing in future confrontations. The grooming sessions function as negotiations: the sender offers a costly signal (time, grooming effort, food) that communicates commitment and creates a social debt. The receiver is expected to reciprocate support in future conflicts. The channel is tactile (grooming) and gestural; the message is alliance offer; the effect is the creation of a coalition that shifts the balance of power.

A chimpanzee in a contemplative posture, illustrating the social intelligence that De Waal documented in coalition politics at Arnhem Zoo. — A chimpanzee displaying the contemplative posture associated with social assessment — scanning the group, evaluating alliances, tracking who groomed whom. De Waal documented that alpha status in chimpanzee groups depends not on physical dominance alone but on sustained coalition management: who is owed a favour, who might defect, who needs to be appeased. *Photo: Thomas Lersch. CC BY-SA 3.0.*

De Waal documented that the most successful alpha males at Arnhem were those best at coalition management: grooming many partners, being reliably supportive to allies, and repairing relationships quickly after conflicts. Two males who had fought were often observed, within an hour, presenting themselves to each other for reconciliation. Uninvolved third parties would also approach the loser of a fight and groom or embrace them, a behaviour de Waal interpreted as post-conflict consolation. The consoler needs to recognise the loser’s distress and respond to it, which requires something like empathy, the capacity to model another individual’s emotional state and be moved to act on that model.

Chimpanzees also demonstrate strategic deception, a form of persuasion that involves deliberately misrepresenting information to influence a receiver’s behaviour. Males hide erections when approaching dominant individuals to avoid aggression, and individuals have been observed leading competitors away from food sources they have located, then circling back alone. Byrne and Whiten [8] catalogued dozens of such cases and proposed the “Machiavellian intelligence” hypothesis: that the demands of managing complex social relationships, tracking alliances, reading intentions, deceiving rivals, building coalitions, drove the expansion of primate neocortex. Social intelligence, in this view, is persuasive intelligence.

3.4.2.2 Baboons and Cooperative Sentinels

Baboons (Papio spp.) organise in large troops of 20 to over 100 individuals with multi-level dominance hierarchies that are reproduced daily through communication rather than continuous physical contest. Subordinates signal submission through vocalisation, postural appeasement displays, and grooming directed upward in the hierarchy; dominants signal status through gait, gaze, and priority-of-access behaviours. The social order is reproduced through the ongoing persuasive acts of its participants, maintained through influence as much as force.

The system goes considerably further than status display. Males use specific vocalisations, “grunts” directed at females before approaching them or their infants. The probability that a female will tolerate an approach is measurably higher after such grunts than in their absence, and the effect is stronger for males with a consistent history of non-aggressive behaviour toward that female. This is the full signal-model-decision sequence in operation: the receiver tracks the sender’s past behaviour, updates her model of his intentions, and regulates her response accordingly. [9, 10]

Females also engage in strategic alliance formation across matrilineal kin groups. A female facing a rival will selectively groom a high-ranking male ally in the days preceding likely conflict, then benefit from his proximity during the confrontation itself. The investment in grooming functions as a deposit in a social account, and the subsequent conflict is the withdrawal. Tracking these relationships across dozens of individuals, over weeks and months, requires a social memory that begins to look less like the fixed-response systems of insects and more like genuine political cognition.

Cooperative sentinel behaviour in meerkats (Suricata suricatta), studied extensively by Clutton-Brock and colleagues [11], illustrates how honest alarm signalling can evolve in the absence of close kinship. Sentinel individuals at elevated positions produce graded alarm calls: the call type encodes both the type of predator and the level of urgency, allowing group members to calibrate their response. The sender (sentinel) invests in costly vigilance; the message (alarm call) is graded and specific; the channel is acoustic; the effect is coordinated predator avoidance across the group. The system is maintained by reputation: sentinels who produce false alarms are less likely to receive reciprocal sentinel behaviour from groupmates.

3.4.2.3 Lions: Cooperative Hunting as Persuasion in Action

Lion (Panthera leo) cooperative hunting provides an example in a non-primate predator. Stander [41] documented that lionesses take on specialised roles in group hunts: “wings” flank the prey while “centres” drive it, with the role division emerging from the spatial positions individuals adopt before the hunt begins. The channel is purely visual (positional and postural signals); the message is the individual’s chosen role; the effect is coordinated group action that achieves prey capture beyond the capacity of any individual.

This coordination is accomplished without anything resembling symbolic communication. The information necessary to divide labour and time attacks is transmitted entirely through movement and position in space. The channel and message complexity need not be high to produce sophisticated collective outcomes. What matters is the reliability of the signal-response relationship.

An Asiatic lion pride resting together, illustrating the social coordination of cooperative hunters. — An Asiatic lion pride resting in Gir, India. The cooperative hunting strategies documented by Stander [41] in African lionesses depend on individuals reading and responding to each other’s spatial positions before a hunt begins. No vocalisation is required; position is the signal. *Photo: Rutvijsinh1991 / Wikimedia Commons. CC BY-SA 4.0.*

3.4.3 Semantic Signals: The Vervet Monkey

The examples above, chemical trails, waggle dances, alarm calls, cooperative hunts, all involve signals that influence behaviour. But are any of them semantic in the sense that a word is semantic: referring to a specific object or category in the world, independently of the immediate context? This question is sharpest in the case of the East African vervet monkey (Chlorocebus pygerythrus).

Vervets produce three acoustically distinct alarm calls, one for each of their major predators: a call for pythons, a call for martial eagles, and a call for leopards. Each call triggers a predator-appropriate response in listening monkeys. The leopard call causes them to climb higher into a tree (useless against an eagle); the eagle call causes them to look upward and move into dense undergrowth (useless against a leopard); the snake call causes them to stand upright and scan the ground. Each response is calibrated to the specific predator encoded in the call.

One interpretation is that the calls convey only degree of danger, and that the listening monkey looks around, identifies the predator itself, and responds accordingly. Seyfarth, Cheney and Marler [40] ruled out this explanation by playing recorded calls to monkeys in the absence of any actual predator. The monkeys still responded in the predator-appropriate way, climbing, scanning upward, or standing to scan the ground, demonstrating that the call itself drives the response — the predator’s physical presence is irrelevant.

The acid test for semantic representation was a habituation study. If an alarm signal is not followed by an objective threat, animals cease to react to it: they habituate. Vervets have two other calls, wrr and chutter, both used to signal the presence of a neighbouring group of monkeys. If a vervet is habituated to the wrr call, does that habituation transfer to the chutter call? If it does, this means the animal has coded both calls as referring to the same category of event, another group of monkeys, a hallmark of semantic representation.

Habituation does transfer between wrr and chutter, but only when both calls come from the same individual, not from different monkeys. Vervets can identify individuals by voice; what appears to be semantic transfer may partly be individual-recognition transfer. Seyfarth and Cheney concluded that the calls do function as semantic representations, but that the system is entangled with individual identity — a coupling that human language has largely severed.

What, then, are the limits of this system? They are severe, and illuminating by contrast. Vervets do not use calls to refer to objects that are not present: a vervet will not produce the python call to warn a companion about a snake seen yesterday, or at a distant location. The calls are anchored in the here and now; they lack displacement, one of the most important properties that Hockett [27] identified as distinguishing human language from animal communication systems. Vervets also use alarm calls as a means of deception, producing a false call to drive competitors away from a food item, but this deception is conspicuously crude: the deceiving monkey shows no sign of alarm, and bystanders can often see exactly what it is doing. Animals have calls only for objects of immediate biological interest: predators, rivals, food. The vervet call encodes the present threat only: yesterday’s leopard, tomorrow’s eagle, the conditional bargain: all are beyond its reach. A potentially complete representation system, one capable of referring to anything including hypotheticals and abstractions, was the decisive step. That is the step language made. But producing signals that benefit others looks, at first glance, like a losing strategy, which raises the question of why cooperative signalling evolves at all.

3.4.4 The Prisoner’s Dilemma and Why Cooperative Persuasion Evolves

The vervet case raises a theoretical question: why would any individual evolve to produce honest signals that benefit others? If cooperation is individually costly, why does it not collapse under defection? The Prisoner’s Dilemma frames this precisely.

A fundamental theoretical puzzle underlies all the examples above: why would natural selection produce organisms that are persuadable, that respond to the signals of others in ways that may benefit the sender?

The Prisoner’s Dilemma (PD) formalises the problem. Two individuals can each choose to cooperate (C) or defect (D). If both cooperate, each receives a reward R. If one defects while the other cooperates, the defector receives the temptation payoff T (highest), while the cooperator receives the sucker’s payoff S (lowest). If both defect, each receives the punishment payoff P. The payoff ordering T > R > P > S means that regardless of what the other player does, defection yields a higher immediate payoff. In a single interaction, rational actors defect, and cooperation collapses.

The iterated Prisoner’s Dilemma changes this dramatically. When the same individuals interact repeatedly, future payoffs discount the present gain from defection. Axelrod [2] ran computer tournaments in which strategies submitted by game theorists competed in iterated PD. The winner, across two tournaments, was the simplest possible strategy: tit-for-tat. Cooperate on the first move, then do whatever the other player did last round. Tit-for-tat is nice (it never defects first), retaliatory (it punishes defection immediately), forgiving (it returns to cooperation as soon as the other player does), and transparent (the other player can easily learn its rule). These properties, Axelrod showed, are exactly what is needed to sustain cooperation in populations of self-interested agents.

The evolutionary implication is deep: in any species with repeated interactions and the ability to recognise individuals, the incentive structure shifts towards cooperative signalling. Persuasion, the use of signals to alter another’s behaviour in ways that benefit the sender, becomes evolutionarily stable when receiver and sender interact repeatedly, because receivers who respond to honest signals and withdraw from exploitative relationships outcompete those who do not. Nowak [32] synthesised the evolutionary routes to cooperation, kin selection, direct reciprocity, indirect reciprocity, network reciprocity, and group selection, and showed that in each case, the underlying mechanism is a signalling system that makes cooperative intent legible and defection costly.

This is the deep evolutionary reason why the social world is saturated with persuasion: honest communication is a stable equilibrium in populations of agents who interact repeatedly and track reputations. Human language built directly on this foundation.

The logic plays out in ways that are easy to observe. In humans, the split-or-steal format of televised game shows provides a near-ideal natural experiment: two strangers face a single iterated-PD-like choice, with large sums of money at stake, no future interaction, and a live audience. Most players defect. But occasionally a player reframes the game entirely, announcing in advance that they will always steal and then promising to share the winnings afterward, converting a one-shot PD into a credible commitment device and demonstrating, in front of cameras, how reputation and pre-announced strategy can substitute for repeated interaction:

The same dynamic appears in non-human animals, with cooperation maintained through memory and reciprocity. Vampire bats (Desmodus rotundus) return to the roost after nightly foraging and regurgitate blood meals to roostmates who failed to feed, selectively favouring past donors and withholding from past defectors. The relationship is stable because the bats interact nightly and recognise individuals. David Attenborough’s narration of this system in The Trials of Life makes the iterated-PD structure unusually explicit for a wildlife documentary:

Reciprocity, however, requires repeated contact and individual recognition. That leaves open the deeper question: why do organisms produce costly signals at all in the first place, even in contexts where no future interaction is possible?

3.5 Kin Selection and the Logic of Cooperative Persuasion

A worker bee will sting an intruder and die doing it. A meerkat sentinel stands exposed on a rock, calling loudly, making itself conspicuous to exactly the predators it is warning others about. A Belding’s ground squirrel produces a loud alarm call when it spots a hawk, drawing the predator’s attention to itself. These are costly acts that benefit the group at the individual’s expense. The answer begins with genetics. Hamilton [24] showed that altruistic behaviour can evolve when the recipient is a sufficiently close relative, formalised as Hamilton’s rule: rb > c, where r is genetic relatedness, b is the benefit to the recipient, and c is the cost to the actor. This elegant inequality, derived across two landmark 1964 papers, unified the previously puzzling phenomena of worker sterility, altruistic alarm calls, and cooperative breeding under a single quantitative framework.

In Hymenoptera (bees, ants, wasps), haplodiploidy, the mechanism by which females develop from fertilised eggs and males from unfertilised ones, means that full sisters share three-quarters of their genes (r = 0.75), a relatedness higher than that between a mother and her offspring (r = 0.5) [44]. The arithmetic here is worth pausing on. A worker bee who sacrifices reproduction helps raise sisters who share 75% of her genes. The inclusive fitness gain, the benefit to shared genes flowing through those sisters, can easily exceed the direct fitness cost of forgoing personal reproduction. This is why worker sterility evolves readily in Hymenoptera and not in diploid species: the haplodiploidy coefficient r = 0.75 for sisters is higher than the r = 0.5 for siblings in diploid organisms, and Hamilton’s rule requires rb > c. Higher r makes the left-hand side larger, more than enough to outweigh the cost of sterility.

This connection between kinship and cooperability extends beyond arithmetic: it suggests that the trustworthiness of a persuasive signal, the probability that a receiver will act on it, is itself subject to evolutionary selection. Signalling systems evolve towards honesty when sender and receiver interests are sufficiently aligned, and towards manipulation when they diverge. Krebs and Dawkins [29] articulated this as the deep tension underlying all animal communication: signals that reliably benefit both parties are maintained by selection; signals that exploit receivers at their expense drive counter-adaptation. The evolutionary tension between honest persuasion and deceptive manipulation is the most ancient form of the propaganda arms race.

3.6 Costly Signals: Why Honest Communication Persists

The problem with honesty, from an evolutionary standpoint, is that it is always vulnerable to cheating. If a signal of high quality can be faked by a low-quality sender, selection favours fakers and the signal loses its informativeness. So why does honest communication persist at all?

Zahavi [47] provided the answer: signals that impose a genuine cost on the sender cannot be cheaply faked. A peacock’s tail is expensive to grow, metabolically costly to carry, and makes the bird more visible to predators. A weak peacock that grew such a tail would pay the cost without the fitness benefits that a strong peacock enjoys. The tail is informative precisely because it is burdensome, its very extravagance the guarantee of its honesty. Zahavi called this the handicap principle [48].

The principle extends well beyond feathers. Male deer fight with antlers that are costly in bone, time, and injury risk. Bowerbirds construct elaborate structures that signal building ability and aesthetic sense. Meerkats stand on exposed rocks and call loudly, advertising their sentinel role at personal risk. In each case, the signal is credible because a low-quality individual cannot afford it: the cost enforces honesty.

Human persuasion inherits this logic directly. A speaker who travels far to deliver a message signals commitment. An orator who publicly stakes a prediction signals confidence. A leader who accepts costly obligations, redistributing food, taking on dangerous tasks, absorbing the first risks of collective action, signals alignment of interest with followers. In political science this surfaces as the costly commitment literature; in economics as signalling games. The underlying biology is the same across every domain: costly acts are credible precisely because they are costly.

Reputational commitment extends this further. Publicly announced promises are hard to break without social cost; the public nature of the promise is itself the enforcement mechanism. Much of what we call rhetoric, the choice of a bold claim, the decision to speak on record rather than off, is the deployment of costly signals in the reputational domain.

3.7 The Arms Race: Honest Signals and Their Mimics

Zahavi’s handicap principle predicts honesty. But it predicts something else too: mimicry. Wherever an honest signal becomes reliably responded to, a selection pressure arises for cheap imitations, signals that mimic honesty rather than instantiating it. Krebs and Dawkins [29] named this the manipulation side of the evolutionary tension. Every signal system, given time, generates an arms race between honest senders and deceptive imitators, and between credulous receivers and skeptical ones.

The evidence is everywhere in nature. Orchids that mimic the appearance and scent of female wasps deceive male wasps into pseudocopulation, achieving pollination without offering any reward. Cuckoos produce eggs that mimic the host’s eggs closely enough to evade rejection. Firefly species Photuris females mimic the flashing patterns of other firefly species to lure and eat males who mistake them for mates. In each case, receivers eventually evolve counter-adaptations: finer discrimination, subtler recognition criteria, resistance to the previously exploited signal.

The arms race between sender manipulation and receiver resistance is the deepest structural feature of communication, older than nervous systems. It predicts something that will become central to the rest of this book: that persuasion and resistance to persuasion co-evolve. Every technique of influence that becomes widely deployed generates, over time, corresponding immunities. The history of mass media is one version of this cycle: from print propaganda to the media literacy it eventually prompted; from television advertising to commercial skepticism; from email spam to spam filters. What changes with AI-generated persuasion is the speed at which new sender strategies can be deployed, potentially outrunning the evolutionary pace at which receiver resistance develops. That asymmetry is examined in Chapter 7.

Hamilton’s rule explains cooperation within kin. Costly signalling explains cooperation among non-kin who can observe each other’s costly acts. Neither reaches the scale of modern human societies. Something else is needed, something that allows cooperation among complete strangers, across time, at continental scale. That gap is where the human record begins.

3.8 The Human Evolutionary Record

To understand language as a genuine major transition, it helps to ground the argument in the palaeontological record. The timeline below reveals a puzzle: the extraordinary delay between the appearance of our lineage and the appearance of our culture [31].

Table 3.2: Key events in human evolutionary history. Mya = million years ago; kya = thousand years ago.

Period	Species	Brain size	Key marker
4 Mya	Australopithecines	~1/3 modern	Upright posture; ape-sized brains
1.5 Mya	Homo erectus	Brain doubled	Hand axe — unchanged for 1 million years
250 kya	Earliest H. sapiens	Near-modern	Slightly more skilled toolmaking
100 kya	Fully modern H. sapiens	Modern	Fully modern large-brained humans, tools still conservative
40 kya	H. sapiens	Modern	Burst of innovation: cave paintings, burials, trade

A line chart showing hominin brain size in cubic centimetres from roughly 3.5 million years ago to the present, with a steep rise beginning around 500,000 years ago. — Trends in hominin brain size from *Australopithecus* to *Homo sapiens*, plotted against time. The steep acceleration over the last 500,000 years, followed by a slight decrease in the Holocene, underlines both the extraordinary speed of encephalization and the puzzle of the “cultural explosion” delay visible in the table above. *Source: DeSilva et al. (2021), Frontiers in Ecology and Evolution. CC BY 4.0.*

At 4 million years ago, the australopithecines walked upright but their brains were ape-sized, roughly one-third the volume of a modern human brain. By 1.5 million years ago, Homo erectus had appeared with a brain that had doubled, and was carrying a remarkable stone tool: the hand axe. What astonishes is the hand axe’s extraordinary stability. Virtually identical specimens have been found across Africa, Europe, and Asia, manufactured to the same specification for approximately one million years and across many thousands of generations [31]. Cultural transmission was clearly operating: the design was copied faithfully, but it was copying without cumulative improvement. This is culture, but not yet cumulative culture.

By 250,000 years ago the earliest Homo sapiens were present, with slightly more skilled toolmaking. By 100,000 years ago, fully modern large-brained humans were widespread, yet their tools remained broadly conservative. Then at 40,000 years ago there was a burst of innovation. Cave paintings appeared; the dead were buried with grave goods; shell ornaments were traded across hundreds of kilometres; tool types proliferated rapidly. Language is the obvious candidate for what changed. Only a communication system capable of displacement, of referring to the absent, the past, the hypothetical, could underpin simultaneous innovation in art, ritual, and long-distance trade, all of which require coordination around shared representations that do not exist in the immediate environment.

3.8.1 Big Game Hunting: Shared Interest and the Stages of Language

But what drove the evolution of language in the first place? Számadó [42] proposed an account centred on big game hunting, a peculiarly human activity unlike anything else in the primate repertoire. The coordination problem is specific: a large ungulate requires the whole group — more strength and coordination than any lone hunter commands — but only if roles are assigned before the animal is sighted. The flanker who will drive the prey, the blocker positioned at the ravine, the catcher waiting downstream: each must know the plan before it unfolds. Communicating that plan requires referring to events that have not yet happened, to locations currently out of sight, and to the roles of specific individuals, all properties that animal signals entirely lack. A chimpanzee’s grunt cannot distinguish “you go left while I go right” from “there is food ahead.”

Big game hunting creates shared interest in a way that most primate activities do not. All participants share the outcome whether the hunt succeeds or fails, which minimises the conflict between signaller and receiver that plagues other primate communication. Other primate activities, grooming, mating, food competition, place sender and receiver in partially opposed positions. Hunting aligns them. It also provides a natural check on dishonesty: who showed up, who ran, who held position is visible to all participants during and after the event.

This context created selection pressure that unfolded in two stages. The first was indexical and iconic signs: concrete objects or actions that referred directly to the prey, a skull or horn shown to indicate the target species, or a gesture mimicking the animal’s movement. These signs were easy to imitate, grounded in shared perceptual experience, and sufficient for the most basic task of recruitment: getting others to come along. Számadó’s model predicts that the earliest referential communication of this kind was about concrete objects, prey species and location, before extending to actions, roles, and plans. The second stage arose from the demands of planning itself. Communicating about events that have not yet happened, about positions to adopt before the animal arrives, drove selection toward more complex communication: compositional signs capable of specifying relations between agents, actions, and objects, rather than pointing at a present referent.

That compositional, planning-oriented communication almost certainly had its origins in the body rather than the voice.

3.8.2 Pantomime Predating Verbal Language

The sequence reconstructed by Számadó places gestural and pantomimic communication before verbal language in evolutionary time. Ferretti and Adornetti [18] develop this argument in detail: archaic hominins employed pantomime as a primary persuasive medium, a nonverbal, mimetic, non-conventionalized form of communication that represented events and stories through coordinated body movement and relied on shared mental imagery. Unlike the limited gestural repertoires of other apes, pantomime is inherently narrative: it can depict an absent entity, a sequence of events, a causal chain. Experimental evidence shows that gesture dominated over vocalization in early human communicative acts, and that gesture has significantly greater potential than vocalisation for bootstrapping a shared communicative system from scratch.

The reason gesture comes first is straightforward: mimicry of visible actions is simpler than arbitrary acoustic symbols. You can mime a running antelope with your hands; you cannot easily vocalise it without prior convention. Gesture is transparent to the receiver in a way that sound is not, because the form of the gesture shares properties with what it depicts. A plausible neural substrate for this is the mirror neuron system, first characterised in macaques: cells in the premotor cortex that fire both when an action is performed and when it is observed in another individual. [34] Because gesture is visible and imitable, and because the same neural circuits are activated both in producing and perceiving an action, gesture would naturally be the first medium for communication about the actions of agents. Sound became the dominant channel later, once the conventions were established, because of its advantages in range, darkness, and multitasking.

The emergence of conversational language built on this gestural base, adding the dimension that pantomime alone cannot achieve: turn-taking argumentation.

3.8.3 Conversational Language as Reciprocal Persuasion

Ferretti and Adornetti [18] locate the distinguishing feature of modern Homo sapiens in conversation: the turn-taking exchange in which both parties alternately produce and respond to communicative acts, each trying to shift the other’s beliefs or actions. Conversation is a negotiation, a reciprocal exchange in which each move shapes the next.

What makes conversation cognitively special is turn-taking itself. Each speaker must model what the other person understood from the previous turn and respond to that model, to what was registered rather than simply to what was said. This demands theory of mind operating in real time: I need to track your current mental state alongside my own intention as I produce the next utterance. Pantomime, however elaborate, does not require this because there is no conversational floor to manage, no obligation to respond to the other’s interpretation rather than repeating one’s own display.

Conversation, on this account, was the evolutionary trigger for grammar. The demands of reciprocal persuasion, composing novel arguments in real time, responding to objections, specifying precisely which object or action or time-point is at issue, required a combinatorial system capable of generating an unbounded number of distinct messages from a finite vocabulary. Consider what is needed to say “if you bring the spears to the ridge, I will drive the prey toward you from the south.” That sentence requires tense, conditionals, reference to locations currently out of sight, and subject-predicate structure linking specific agents to specific roles. Simple declaratives and requests, the kind pantomime can approximate, fall short of what coordinated argument requires. The pressures driving grammar were argumentative: exchanging reasons, proposing and rejecting plans, negotiating roles and obligations in real time.

Through this process, human communication became multimodal: integrating both speech and gesture as complementary channels. Speech took on the primary grammatical burden, the combinatorial, recursive system for specifying relations among arguments, while gesture retained its role in grounding reference, expressing emphasis, and conveying spatial and iconic information that resists easy encoding in syntax. The result is a hybrid system in which the full meaning of an utterance is often distributed across both channels, but whose core propositional structure is carried by the spoken word.

The genetic architecture that makes all this possible co-evolved gradually with the cultural practices it enabled. The question that remains is what this co-evolved system could do that no prior communication system could, and why that difference matters so much for cooperation.

3.9 Language: The Human Major Transition

The communication systems surveyed so far, chemical gradients, waggle dances, alarm calls, and coalition politics, share a fundamental constraint. They all operate in the present. A vervet alarm call signals a leopard here and now. A grooming session cements an alliance with the individual directly in front of you. Pheromone trails point to food that currently exists. No signal in any of these systems can refer to what happened yesterday, what might happen tomorrow, or what would have happened if someone had behaved differently.

Language broke that constraint. It differs from every prior communication system in kind, through three properties absent in all animal communication [27]:

Combinatorial productivity: a finite set of phonemes combines into an unbounded number of morphemes, which combine into an unbounded number of sentences, each capable of expressing a distinct meaning.
Displacement: language can refer to entities and events that are not present in the immediate environment, including objects in the past or future, distant locations, and purely hypothetical situations.
Propositional structure: language encodes both the identity of referents and the relations between them — causal, conditional, and normative relations.

These properties together mean that language can coordinate behaviour around representations of the world, including representations of social rules, obligations, and sanctions, rather than around present stimuli alone. A honeybee’s waggle dance communicates the location of a food source with extraordinary precision, but it cannot communicate that a certain flower patch is morally off-limits, or that a nestmate who visited it owes an apology. Language can. This is what makes language the pivot of this book: every mechanism of persuasion examined in the chapters that follow, attitude change, framing, narrative, political rhetoric, AI-generated content, operates through it. Without displacement, without propositional structure, without the ability to say if you do this I will do that or they did something terrible last year, none of those mechanisms exist. Persuasion at the scale that humans practise it may be the primary reason language evolved.

That connection runs in both directions. Language made large-scale persuasion possible; the selection pressure for large-scale cooperation made language evolutionarily advantageous. The two drove each other. The following sections trace how, starting with the social scaling problem language solved and ending with the way cultural transmission turned language into a system capable of persuading entire civilisations.

3.9.1 Language as an Enabler of Social Scale

Dunbar [15] proposed that language evolved primarily as a form of social grooming, a way of maintaining cooperative alliances among individuals who could not physically groom all their allies simultaneously. In non-human primates, physical grooming is the primary mechanism for building and maintaining social bonds, but it is costly: it occupies roughly 20% of a typical primate’s waking day, and it can only be directed at one individual at a time. These constraints impose a hard ceiling on group size, estimated at roughly 50–80 individuals for most non-human primates.

Language lifts this ceiling. Because vocal interaction can be directed at multiple individuals simultaneously, requires far less time per bond maintained, and can occur during other activities (foraging, walking, resting), it allows cohesive social groups of up to approximately 150, the figure derived from the neocortex regression [16], one that recurs with unusual regularity across hunter-gatherer bands, Neolithic village sizes, military unit structures, and functional social networks studied today.

Two chimpanzees grooming each other, illustrating the social bonding mechanism that language replaced at larger group sizes. — Chimpanzees grooming in Higashiyama Zoo. Physical grooming is the primary mechanism for maintaining social bonds in non-human primates, occupying roughly 20 per cent of waking time and proceeding strictly one pair at a time. It cannot scale beyond groups of roughly 50–80. Language, Dunbar argued, is what replaced it. *Photo: Nattanan23 / Wikimedia Commons. CC BY-SA 3.0.*

But language does more than scale up grooming. It changes the content of social bonding. Where grooming can only communicate presence and tolerance, language can communicate endorsement or condemnation of absent third parties, a capability with profound implications for cooperation. Knowing that a particular individual cheated in a trade three villages away, and being able to transmit that information to all one’s trading partners, creates a reputational infrastructure that extends cooperative norms far beyond the bounds of personal acquaintance. Nowak [33] termed this mechanism indirect reciprocity: cooperation with strangers is sustained through reputation rather than direct exchange, and reputation requires language.

Maynard Smith and Szathmáry [31] spell out what the social intelligence account implies for the content of language. In humans, it would have been selectively advantageous to communicate about time (yesterday’s hunt, tomorrow’s plan), possession (who owns what, who owes a favour), beliefs and desires (what she thinks, what he wants), tendencies (who can be relied upon, who defects), obligations (who owes what to whom), truth and probability (was she really there? how likely?), and above all hypotheticals and counterfactuals (what would happen if we cooperated; what would have happened if he had not run). As they put it: “the intellectual arms race took place within the species itself.” Every one of these communicative domains is socially consequential in exactly the way the social brain hypothesis would predict: they are the content that matters for managing alliances, predicting others’ behaviour, and coordinating complex collective action. Non-human primates, limited to signals about present and perceptible states, cannot engage with any of them.

Every communication system examined so far transmits capacity and content through the same channel: genetics. Each generation re-evolves the waggle dance, the alarm call, the grooming repertoire from scratch. Language breaks this pattern. The capacity is genetic, but the content (stories, norms, beliefs) is culturally inherited, spreads to strangers, changes within a generation, and compounds across centuries. Before tracing how that cultural transmission works, there is a prior question: is the language used to transmit beliefs the same thing as the thinking that produces them?

3.9.2 Language and Thought: Distinct Systems

If language evolved primarily for persuasion rather than reasoning, we should expect language and thought to be partially separable, distinct systems with distinct evolutionary histories that happen to interact. The evidence bears this out. A persistent misconception treats the two as the same thing, as if thinking were simply internal speech. Language and thought are biologically distinct systems, and this distinction matters profoundly for understanding what language does as a persuasive technology. Fedorenko, Piantadosi and Gibson [17] review the full body of evidence and conclude that language is primarily a communicative technology — a tool for coordinating minds, with reasoning running on largely separate, dissociated circuits.

The clearest evidence comes from patients with severe aphasia, the selective loss of language following damage to the left hemisphere’s language areas. Such patients may lose virtually all ability to produce or comprehend speech, yet retain the ability to perform arithmetic, solve spatial puzzles, follow complex non-verbal instructions, and reason causally about the world. The language system, even when catastrophically damaged, does not take reasoning down with it.

The complementary pattern is equally informative. Certain forms of frontal lobe damage or thought disorder leave linguistic fluency intact, the patient producing grammatical, well-formed sentences, while producing severe impairments in decision-making, planning, and logical inference. A person can speak perfectly and reason very poorly. Together, these two patterns constitute a double dissociation: each system can be selectively damaged while the other is preserved, which is the strongest evidence that they are anatomically and computationally distinct.

The structure of language itself reinforces this conclusion. If language had evolved primarily as a tool for thinking, we would expect it to be optimised for the demands of reasoning: precision, unambiguity, completeness. Instead, the statistical structure of every known human language reflects the pressures of communication between a sender and a receiver [17]. Across languages, the most frequently used words are the shortest: word length is predicted more strongly by contextual predictability than by meaning complexity. Grammatically related elements cluster together within sentences, reducing the listener’s memory load; analyses of large corpora show that actual sentences are consistently shorter in dependency length than random arrangements of the same words would be. And languages tolerate, even exploit, ambiguity: in predictable contexts a shorter ambiguous expression transmits the same information as a longer unambiguous one. Private reasoning, which has no receiver, would gain nothing from ambiguity. Language’s tolerance of it is a signature of its communicative function.

Developmental evidence runs in the same direction. Prelinguistic infants track object permanence, attribute intentions to agents, and compute causal chains before they have the syntactic or lexical resources to describe any of this. Going the other way, children acquire grammatical patterns in domains where their conceptual understanding lags, producing passive constructions or embedded clauses in contexts where they do not fully grasp the logical relationship being expressed. The two systems develop on their own schedules, consistent with distinct biological programmes.

Language enables the persuasive achievements described in the remaining sections of this chapter: shared fictions, institutions, normative systems. The mechanism is transmissive: language carries representations between minds, aligning the mental states of distinct agents. Persuasion via language is other-directed — a technology for coordination across individuals, across time.

3.9.3 From Signals to Recursion: The Hierarchy of Linguistic Capacity

Language emerged piecemeal: a hierarchy of increasing computational power, each level enabling communicative and cognitive capacities inaccessible to the level below it. Mapping this hierarchy clarifies what is shared between humans and other animals, what is uniquely human, and why the uniquely human components were the decisive step for cooperation at scale.

Level 1 — Signals. The most primitive communicative acts are signals: outputs that reliably trigger specific responses in receivers, with the signal-response relationship fixed by biology. The vervet alarm calls (Section 3.4.3) are paradigm cases: three acoustically distinct calls, each eliciting a predator-appropriate flight response, each biologically specified. The honeybee’s waggle dance encodes direction and distance to food with extraordinary precision. Meerkat sentinel calls grade continuously with threat level. In each case, the signal is tied to an immediate, perceptible state of the world: a real predator, a real food source, a real threat. What signals enable: rapid, reliable coordination of behaviour around present stimuli. What they cannot do: refer to the absent, the categorical, or the arbitrary. There is no vervet signal for yesterday’s python.

Level 2 — Symbols. A symbol is an arbitrary sign-referent relationship: the acoustic or gestural form of the symbol bears no iconic resemblance to what it refers to. The English word cat sounds nothing like a cat; the word red is not itself red; the sign for apple in American Sign Language does not look like an apple. This arbitrariness is the crucial enabling property. A signal system tied to iconic or indexical resemblance can only refer to things that can be imitated or pointed at. An arbitrary symbol can refer to anything, including things that cannot be perceived, imitated, or pointed at: obligations, possibilities, mathematical objects, the future. Washoe’s acquisition of arbitrary ASL signs demonstrates that the symbol-forming capacity extends to other species: trained apes can acquire a limited set. What symbols enable: naming, the assignment of a stable label to a category that can then be communicated across individuals and across time.

Level 3 — Vocabulary expansion. Once the symbol principle is established, the vocabulary can grow without limit. Each new word extends the referential scope of the system without requiring new signal infrastructure. Vocabulary can track cultural innovation: new concepts get new names (algorithm, democracy, copyright), and those names can spread through a community within a generation, far faster than any genetic process. This is why Tooby and Cosmides [43] argued that the genome stores the capacity to learn words rather than the words themselves: cultural evolution generates vocabulary faster than genetic evolution ever could. What vocabulary expansion enables: the categorical mapping of the world, including the shared conceptual maps that underlie coordinated action among strangers who have never met.

Level 4 — Simple combinations. Placing two symbols in relation (big train, tickle Washoe, my milk) produces a qualitatively richer output than either symbol alone. The combination encodes a proposition: an assertion about how two entities stand in relation to one another. This is the level at which the two-word child, the language-trained chimpanzee, and Genie all operate (see Section 3.9.6). The semantic territory covered by two-word combinations is already substantial: attribution of properties (red book), possession (my milk), location (walk street), agent-patient relations (Adam checker). What combinations enable: propositional content, the expression of states of affairs rather than the naming of objects alone. What they cannot express: the difference between the dog bit the man and the man bit the dog, because at this level word-order rules are absent or inconsistent, and embedding is impossible.

Level 5 — Syntax. Syntax is the system of rules that governs how symbols can be combined: which structural roles they can play, in which order, with which agreement relations. The decisive syntactic innovation, present in all known human languages and absent in all animal communication systems, is the subject-predicate distinction (see Section 3.9.4): the separation of the argument slot (what is being talked about) from the predicate slot (what is being said about it). This allows a vocabulary of N nouns and M verbs to generate N × M distinct propositions rather than N + M distinct signals. The expressive capacity grows multiplicatively rather than additively. What syntax enables: unbounded messages from finite means, the ability to say, and understand, sentences never before uttered.

Level 6 — Recursion. Recursion is the embedding of one linguistic structure inside another of the same type, without principled limit. A sentence can contain a relative clause that contains another relative clause: the dog that the man who the woman hit saw ran. A verb of mental state can take a propositional complement that contains another propositional complement: She knew that he believed that they had agreed to leave. Conditional and counterfactual reasoning is structured recursively: if he had known that she would have done what she said she would never do, he would not have…. Recursion is what allows language to represent thoughts about thoughts, the basis of full-blown theory of mind: knowing what they believe someone else believes about what you believe.

[26]

What recursion enables: the entire domain of embedded social reasoning, the contractual, legal, narrative, and moral reasoning that human cooperation depends on. A contract (“if you do X, I will do Y, unless Z obtains, in which case…”) is a recursively embedded conditional. A legal argument is a chain of recursively embedded propositions about what others did, intended, agreed to, and were permitted to do. A novel embeds one character’s consciousness inside another’s, nested within a narrator’s, nested within the author’s representation of a world that never existed. None of this is possible without recursion.

Level 7 — Abstract reference and displacement. The final level is the capacity to refer to entities and events that are not present in the immediate perceptual environment, and beyond that to entities that may not exist at all. Displacement [27] is the ability to speak of the past and future, of distant locations, of hypotheticals and counterfactuals. Abstract reference extends this to entities that have no spatiotemporal location whatsoever: obligations, rights, probabilities, mathematical objects, moral duties, social roles. A corporation is not a physical object; democracy is not a perceptual category; justice does not exist at any particular location. Yet these abstractions coordinate the behaviour of millions of people who have never met.

What abstract reference enables is precisely the institutional infrastructure that distinguishes human civilisation from the cooperation of every other species: laws, markets, religions, scientific communities, states. Every one of these institutions is, in the analysis of Section 3.9.9, a shared fiction, a collectively held representation of something that has no physical existence but that generates real coordination through belief. The capacity to represent and communicate about non-present, non-perceptible, non-existent entities is the communicative foundation of everything that makes human social organisation unique.

The hierarchy as an evolutionary ladder. The levels are descriptive categories, but they also map onto distinct evolutionary and developmental stages. Animal communication systems generally reach Level 2 (symbols, in trained apes) or remain at Level 1. Proto-language, in the two-year-old child, the trained ape, and the language-deprived human like Genie, operates at Levels 2–4. Full human language achieves Levels 5–7. The transitions between levels are sharp thresholds: each requires qualitatively new neural architecture. It is the jump from Level 4 to Level 5, from combination to syntax, that constitutes the major transition in human evolution, and it is Levels 6 and 7 that make human persuasion qualitatively unlike anything seen elsewhere in the animal kingdom.

Table 3.3: The hierarchy of linguistic capacity, from signals to abstract reference. Each level is a precondition for the next. Human language occupies all seven levels; no non-human system is known to exceed Level 2 without sustained human training.

Level	Capacity	Example	What it enables
1	Signals	Vervet alarm calls	Immediate behavioural coordination
2	Symbols	Washoe’s ASL signs	Naming arbitrary categories
3	Vocabulary expansion	Cultural words (algorithm)	Tracking cultural innovation
4	Simple combinations	Big train; Tickle Washoe	Propositional content
5	Syntax	Subject-predicate structure	Unbounded messages from finite vocabulary
6	Recursion	She knew that he believed that…	Theory of mind; contracts; narrative
7	Abstract reference	Obligations, rights, justice	Institutions; law; shared fictions

3.9.4 Grammar, Syntax, and Semantics

The most fundamental structural feature of human language, one that has no counterpart in any animal communication system, is the subject-predicate distinction. Consider four concepts: dog-running, dog-sleeping, lion-running, lion-sleeping. A system without grammar would need a separate signal for each of these four combinations. Human language does something more powerful: it provides two nouns (dog, lion) and two verbs (run, sleep), and allows any noun to be combined with any predicate. To say “the dog is sleeping” is a predication, an assertion that a property holds of an entity. Since many properties can be predicated of each entity, and many entities can be referred to by each noun, the range of things that can be communicated with a vocabulary of a given size grows multiplicatively rather than additively. The subject-predicate distinction is a universal feature of all known human languages, and it is the basis of our ability to produce and understand an indefinitely large number of sentences from a finite vocabulary. A vervet monkey cannot do this: its alarm calls form a closed inventory, each signal tied to one meaning.

Why is only the capacity to learn language innate, and not the vocabulary itself? If language were adaptive, would it not be more efficient to transmit the vocabulary genetically, rather than requiring each child to learn it from scratch? Tooby and Cosmides [43] offer a compelling answer: if the vocabulary is learned, we can acquire names for cultural innovations: screwdriver, constitution, algorithm. Cultural evolution is far faster than genetic evolution; long before any appreciable number of words could be genetically assimilated, dialects and distinct languages were already present and diverging. The genome may as well store the vocabulary in the “cultural environment”, meaning that the capacity to learn words is heritable, while the words themselves are maintained and transmitted culturally. This is a precise instance of the bio-cultural co-evolution described in Section 3.9.8: the biological endowment creates the receptacle; the cultural environment fills it.

The innateness of grammar’s deep structure presents a different puzzle. The surface rules of different languages vary enormously, but their deep structural properties, subject-predicate organisation, recursive embedding, and argument structure, are universal. If these structural principles cannot be learned from the input (bottom-up), nor derived from general reasoning principles (top-down), their existence requires explanation. As Bates and colleagues argued, there are logically only two possibilities: either universal grammar was installed directly by the Creator, or our species underwent a cognitive mutation of unprecedented magnitude, a Big Bang of the language faculty.

[4]

Any evolutionary biologist will recognise the problem here. We have been told too often that the eye could not have evolved by natural selection, because any alteration to its structure would destroy its function. Yet we know of many functional intermediates between a simple pigment spot and the vertebrate eye, each fully adequate for the organism’s needs at that stage. Complex organs are built incrementally, each stage functional in its own right. The question for language is whether comparable intermediates can be identified. The snag is that the intermediates no longer exist, unlike the eye, for which we have living comparative examples across phyla. But the evidence from cases of partial linguistic capacity, specific language impairment, aphasia, sign languages at different stages of grammaticalisation, suggests that grammatical competence is a matter of degree: partial grammar exists, and partial grammar is far better than none.

3.9.5 Language in the Brain: Genetic and Lesion Evidence

The neurological evidence for language as a biologically specific faculty, distinct from general cognitive capacity applied to communication, comes from two sources: cases of selective brain damage and cases of selective genetic impairment.

Diagram of the left hemisphere of the human brain showing Broca's area in the inferior frontal gyrus and Wernicke's area in the posterior superior temporal gyrus, with the arcuate fasciculus connecting them. — The principal cortical regions involved in language production and comprehension. Broca’s area (frontal lobe, left hemisphere) governs the grammatical sequencing and articulation of speech; Wernicke’s area (posterior superior temporal lobe) governs the comprehension of spoken language. Both are connected by the arcuate fasciculus. Damage to each produces distinct aphasia syndromes, illustrating the anatomical modularity of the language faculty. *Diagram: UX Stalin. CC BY-SA 4.0.*

Patients with damage to the temporal segment of the left lingual gyrus suffer from colour anomia. They experience colour normally and are in full command of word morphology, they can produce and comprehend sentences, but they are unable to pair colour names with colours: they may pair yellow with grass and green with banana. Given a colour name, they point to the wrong colour. The link between word and concept is selectively impaired. Patients with damage to the anterior and mid-temporal cortices recognise objects correctly but cannot name them; they say, “I know it, but I cannot say the name.” Oddly, this naming difficulty is more pronounced for natural objects than for human-made artefacts, suggesting that different parts of the brain represent categories of objects differently. Damage to the left anterior temporal lobe can selectively impair the ability to retrieve the names of unique persons, while leaving common-noun retrieval intact. Each of these dissociations reveals a distinct sub-component of the language system, a modularity within language itself that is anatomically grounded.

[13]

The biological specificity of language is shown most directly by a British family studied by Gopnik [22]. Across three generations, 16 of 30 family members were affected by a peculiar language disorder (dysphasia). The pattern of inheritance, affecting some members of a sibship while sparing others, is consistent with a single dominant autosomal gene. The disorder is not due to imitation of a disordered parent: children affected while one parent is normal still show the impairment. These individuals are not globally impaired: they tell jokes, converse, and some are mathematically capable. There is no general failure to handle hierarchical structures. The impairment is specific to one aspect of morphology: the affected individuals cannot generalise grammatical rules.

The nature of the deficit is beautifully illustrated by Gopnik’s examples. Affected children write sentences such as:

She remembered when she hurts herself the other day. Carol is cry in the church. On Saturday I went to nanny house with nanny and Carol.

In each case the child fails to mark tense or possession using the appropriate morphological change: hurt should be hurt (past), cry should be crying, nanny house should be nanny’s. When shown a picture of an imaginary creature called a wug and then a picture of several such creatures, a normal child immediately says wugs. The dysphasic child cannot: they can learn individual examples of plurals and past tenses, just as we all learn that the past of go is went, but they cannot generalise the rule to new cases. Grammar, for them, is a collection of memorised facts rather than a productive system.

One child demonstrated this precisely. On a Monday she wrote: On Saturday I watch TV. Her teacher corrected this to watched. The following week she wrote: On Saturday I wash myself and I watched TV and I went to bed. She had learned that the past of watch is watched as a particular fact; she had not extended the rule to wash; and she already knew went as a unique memorised form. The productive morphological rule, the ability to generalise, is the specific thing that is missing.

This case carries several implications for understanding what language is. First, it shows that there can be intermediates between perfect linguistic competence and none: these individuals have substantial language, just not generative morphology. This is evidence against the “all-or-nothing” view of language evolution, and for the possibility that the faculty evolved incrementally. Second, the impairment is specific to language, with no general cognitive defect, suggesting that at least some grammatical knowledge is instantiated in neural structures that are biologically dedicated to language rather than shared with domain-general reasoning.

Subsequent molecular research identified a gene, FOXP2, a transcription factor that regulates neural development, as implicated in heritable language disorder. The human version of FOXP2 differs from the chimpanzee version by two amino acid substitutions that appear to have arisen recently in our evolutionary history, and disruption of this gene produces impairments specifically in the sequencing and articulation of speech. It is one of the first concrete links identified between a specific genetic variant and a specific component of the human language faculty.

[23, 30]

3.9.6 Proto-language: Apes, Children, and Genie

The best way to understand what is distinctive about full human language is to examine what precedes it, what we can call proto-language: a communication system that conveys meanings but lacks the productive grammar that makes human language unbounded. The concept is brought into sharp focus by comparing two sets of two-word utterances from very different sources.

(12) Big train; Red book; Adam checker; Mommy lunch; Walk street; Go store; My milk; Pretty boat; Mama honey; Pig Mommy.

(13) Tickle Washoe; Open blanket; Roger ticket; You drink; Go in; In hat; Clothes Mrs. Gardner; Listen dog; Sign me; Hurry gimme.

These two sets are close to indistinguishable. Both draw on a small vocabulary of noun-like and verb-like terms assembled into pairs. They cover the same semantic territory: the attribution of qualities (big train, red book), possession (my milk, clothes Mrs. Gardner), location of actions (walk street, go in), and agent-patient relations (Adam checker, tickle Washoe). The apparent syntax, to the extent that word order carries meaning, is indistinguishable across the two samples.

Yet (12) is from children at the two-word stage of language acquisition, and (13) is from the chimpanzee Washoe, trained in American Sign Language [20]. The surface similarity masks a fundamental difference in motivation. Children at the two-word stage are in the business of categorising the world for its own sake: a child will say red book with no request implied, simply to predicate a property of an object. Washoe’s utterances, by contrast, are overwhelmingly communicative appeals about objects or actions she wants: requests to be tickled, to have a blanket opened, to receive food or attention. The ape is using proto-language instrumentally, as a tool for getting things. The child is using it representationally, as a tool for mapping reality, even when there is nothing to be gained.

This difference, categorisation for its own sake versus communication about immediate wants, marks the threshold between a communicative system and a representational one. Only a fully representational system can build the shared maps of reality, including maps of obligations, norms, past events, and hypothetical futures, that make complex cooperation possible.

Genie: the human case. In 1970, a 13-year-old girl was discovered in Los Angeles who had been confined and severely isolated since approximately 18 months of age, denied normal language input for over eleven years. The linguist Susan Curtiss [12] documented her linguistic development after her rescue. Genie acquired vocabulary rapidly, her capacity to learn words was intact, but her grammar remained permanently limited. Representative utterances from her early speech: Want milk; Mike paint; Applesauce buy store; I want Curtiss play piano. These are slightly more elaborated than the two-word utterances of (12) or (13): she can string together more than two content words and can express a desire for another person’s action. But tense marking, morphological agreement, and the recursive embedding that characterises adult grammar are absent. There is no generative phrase structure in the linguistic sense.

Genie’s language, like Washoe’s, like the two-word child’s, is what we can call proto-language. It is a communication system that is older, in an evolutionary sense, than full human language, phylogenetically ancient. It is present in very young children before grammar has developed, in language-trained apes who have never been exposed to grammar, and in a human who was denied the critical developmental window during which grammar normally takes hold. There appears to be no critical period for proto-language: Genie acquired it at age 13 without difficulty. There is, however, a critical period for the grammatical component that elevates proto-language into full human language, and that window had closed for Genie before she was found.

Genie’s case thus provides a natural experiment that dovetails with the genetic and lesion evidence reviewed in Section 3.9.5. The morphological generalisation ability damaged in Gopnik’s dysphasia family, the specific naming and categorisation abilities impaired by focal lesions, and the syntactic component that Genie could never acquire, all point to the same conclusion: human language is a family of biologically specific capacities, only some of which are shared with other primates, and only some of which can develop without normal early exposure.

The proto-language capacity appears to be the shared substrate, the platform on which, in our lineage, a new grammatical architecture was built. The question that follows naturally is: how does that architecture arise? Can it emerge from cultural processes alone, without genetic change? The evidence from pidgins and creoles is the clearest answer available.

3.9.7 Pidgins, Creoles, and the Speed of Cultural Evolution

A pidgin is a contact language that emerges spontaneously when speakers of mutually unintelligible languages must communicate, most commonly under conditions of trade or colonial labour. Pidgins draw their vocabulary primarily from one dominant language, but strip away most of its morphology and syntactic complexity. They have no regular tense system, little or no agreement, minimal embedding, and no native speakers. Pidgin speakers share no common language; the pidgin is their improvised solution to an immediate communicative need. It is, in the technical sense, a proto-language: its utterances resemble those of Washoe and of Genie far more than they resemble the output of a native English or Japanese speaker.

A creole is what happens in the next generation. When children grow up in a community where a pidgin is the primary medium of communication, they do not simply learn the pidgin they are exposed to. They systematically expand it. They add consistent grammatical morphology, stable word order, tense-aspect-modality markers, and recursive embedding, until the resulting language has the full expressive power of any natural human language. The proportion of purely grammatical items (articles, prepositions, tense markers, complementisers, relativising particles) rises from near zero in the pidgin to approximately 50 per cent in established creoles, a spontaneous creation of a grammatical system from proto-linguistic raw material — a spontaneous grammatical innovation, assembled within a single generation. And it happens within a single generation, without instruction, driven by the same biological endowment that allows any child to acquire whatever language their community speaks.

Why study pidgins and creoles? Because they constitute a natural experiment in language creation: a case in which we can observe, in historical time, the transition from proto-language to full language. Most of the evidence bearing on the origin of language is indirect, inferred from fossils, comparative anatomy, or computational models. Creolisation gives us a direct window.

The Hawaiian case. The clearest documented instance is Hawaiian Creole English. In the late nineteenth and early twentieth centuries, plantation workers arrived in Hawaii from China, Japan, the Philippines, Korea, Portugal, and Puerto Rico. With no common language, they developed a pidgin, simplified English mixed with fragments of their native tongues, for essential communication. Their children, immersed in this pidgin from birth, spontaneously produced Hawaiian Creole English: a fully grammatical language with consistent syntactic rules, a complete tense-aspect-modality system, and the recursive structures entirely absent from the pidgin input. The jump from proto-language to full language occurred in a single generation [5, 6].

Cultural evolution is faster than genetic evolution by many orders of magnitude. A human generation is approximately 25–30 years, far too short for any significant genetic change. The Hawaiian children’s brains were genetically identical to their parents’. The cultural environment changed: the children were immersed in communicative interaction, even in a structurally impoverished form, and their language faculty, the biological endowment shared by all Homo sapiens, supplied the grammatical architecture that the input lacked.

This means that the transition from proto-language to full language requires only children, a community, and enough time for the first creole-speaking generation to emerge — genetic change is unnecessary. The genetic endowment creates the capacity; the cultural environment triggers its full expression. This is a precise and historically documented instance of what Tooby and Cosmides [43] argued for vocabulary: that cultural transmission is the appropriate vehicle for the content of language, while the capacity to acquire language is the appropriate vehicle for genetics. Creolisation extends this argument from words to grammar itself.

For the study of persuasion, the creolisation finding carries a consequential implication. The most powerful persuasive technology in human evolutionary history, fully grammatical language, is as much a cultural product as a biological one. The capacity is genetic; the grammatical realisation is cultural. Groups that create the conditions for creolisation, dense communicative interaction across language boundaries, will produce, within a generation, the full expressive and persuasive power of human language. The biology is universal; the culture is the trigger. If that is true of grammar itself, it raises a further question: what happens when the content that language transmits, rituals, beliefs, and norms, begins to exert selection pressure back on the genome?

3.9.8 From Genetic to Cultural Transmission: A New Mode of Persuasion

The persuasive mechanisms reviewed in the preceding sections, pheromone trails, alarm calls, waggle dances, grooming exchanges, and dominance displays, share a fundamental property: the capacity for these behaviours is encoded genetically. A honeybee’s ability to produce and respond to the waggle dance, a meerkat’s alarm call repertoire, a chimpanzee’s disposition to engage in coalition politics, all are heritable in the strict biological sense. Each generation must re-evolve the relevant neural architecture through genetic reproduction.

Language introduces a qualitatively different mode of transmission. The capacity for language is genetically encoded, specialised cortical regions, precise articulatory control, sensitivity to syntactic structure, but the content of what language transmits is culturally inherited. A story, a ritual, a legal code, a religious belief is transmitted culturally rather than genetically — through imitation, teaching, and symbolic communication, and can spread horizontally across individuals who share no kinship at all. This distinction between the vehicle (genetic) and the payload (cultural) is the heart of what Boyd and Richerson [7] termed dual inheritance theory: human evolutionary history is the story of two inheritance systems, genetic and cultural, operating simultaneously and shaping one another.

The consequences for persuasion are profound. Boyd and Richerson [7] argued that culturally transmitted traits, including systems of ritual, belief, and social norm, are subject to selection in their own right. A group that is successful because of its system of ritual experiences two simultaneous effects: by cultural evolution, its belief system spreads to neighbouring groups through imitation, prestige, or conquest; and by genetic selection, it favours individuals within the group who are constitutionally more susceptible to those beliefs:

“There is between-group selection for culturally inherited systems of belief that favour the success of groups, and there is individual selection for the genetically inherited ability to be influenced by ritual.”

— Boyd & Richerson, Culture and the Evolutionary Process [7], 1985

This idea, that political authority requires the engineering of shared belief alongside the exercise of force, has a long history in political philosophy. Plato made it explicit in The Republic [35], arguing that the legislator’s central task is precisely this cultural management of belief:

“All he [the legislator] needs to do is to find out what belief is most beneficial to the state, and then use all the resources at his command to ensure that throughout their lives, in speech, story and song, the people all sing to the same tune.”

The candour here is remarkable: the primary instrument of social order is the alignment of belief through narrative, music, and repetition, persuasion systematically deployed across an entire culture. Rousseau [37], writing two millennia later, placed a related insight at the heart of The Social Contract: legitimate political authority rests on something deeper than force: it requires the transformation of natural individuals into citizens who genuinely identify with the general will, a transformation that is, at its core, an act of cultural persuasion through shared institutions, rituals, and civic education.

What Boyd and Richerson formalised in evolutionary terms, Plato and Rousseau had identified through political philosophy: stable large-scale cooperation requires the active management of belief, and the most powerful tool for that management is cultural transmission: narrative, ritual, education, and the arts.

This creates a powerful positive feedback loop. Groups that develop more compelling persuasion systems, narratives that evoke strong emotion, rituals that produce collective effervescence, institutions that generate trust, tend to outcompete groups with weaker ones. Within those successful groups, individuals who are more susceptible to narrative persuasion cooperate more reliably and contribute more consistently to group success, leaving more descendants. Over many generations, this dual process would have shaped the human brain to be exquisitely receptive to story, ritual, and authority — because groups containing highly susceptible individuals outcompeted those that were less so, even when individual susceptibility carried personal costs.

This coevolutionary logic explains something otherwise puzzling: why human beings are so readily moved by fiction. A novel’s characters are invented, the events imagined — yet readers experience genuine emotion, update moral intuitions, and sometimes change behaviour. A brain that evolved to treat culturally transmitted narratives as reliable guides to social reality will respond to fiction exactly this way.

3.9.9 Shared Narratives as Cooperative Technology

The most powerful application of language as persuasion technology is the creation and transmission of shared narratives, collectively held stories that coordinate behaviour by establishing what is valued, what is sanctioned, and what will be rewarded or punished.

Harari [25] argued that what distinguishes Homo sapiens from other animals is the capacity to cooperate flexibly in very large groups through shared fictions, collectively believed stories about entities that have no physical existence but that generate coordinated behaviour through belief alone. Money is real because enough people believe it is; a corporation has legal personhood; a nation-state commands loyalty across millions of strangers. Every institution is, in this analysis, a persuasive achievement, maintained by the ongoing, distributed persuasion of its members that the institution is real and worth sustaining.

Why should narrative outperform explicit argument as a persuasion medium? Arguments are more precise, more falsifiable, easier to scrutinise — and these are virtues of logical validity, valuable for the record, less decisive for uptake. Narratives carry several structural advantages over propositional arguments, each corresponding to a feature of how human memory and motivation actually work.

Memory is the first advantage. Events organised as a narrative, with causal links, a protagonist, a goal, an obstacle, and a resolution, are remembered far better than the same information presented as a list. The causal spine of a story provides retrieval cues at every step; each event implies the next. A study that shows something is true is forgotten more readily than a story about someone for whom it was true.

Emotional encoding is the second. Narrative engages the neural systems that process real experience: physiological arousal, perspective-taking, and affective response during story comprehension are measurable and resemble responses to actual events. Explicit arguments primarily engage evaluative processing, which is also the processing most likely to generate counter-arguments. Stories get in before the critical faculties organise a response.

There is also the matter of causal simulation. Narratives invite the reader to mentally simulate events, run counterfactuals, and predict outcomes, activating the same predictive machinery used to navigate real social situations. An abstract claim that cooperation pays leaves the assertion at the level of assertion. A story in which the cooperator thrives and the defector suffers runs the inference as an embodied experience — proprioceptive, emotional, and memorable in ways bare assertion rarely achieves.

Finally, and perhaps most durably, repeated exposure to narratives defines categories of person: hero, villain, traitor, martyr. Once those categories are established, specific arguments arrive pre-sorted. A person who has absorbed a thousand stories about the loyal son and the treacherous brother does not need to evaluate fresh arguments about kin betrayal. The moral conclusion is already indexed to identity.

The specific narrative forms human cultures developed are among the most effective persuasive technologies ever invented, and two examples show why.

Cain and Abel encodes a set of cooperative norms in narrative form. Abel is the favoured son; Cain, consumed by envy, kills him and attempts to deny responsibility. God’s punishment, the mark of Cain and the curse of perpetual wandering, makes explicit the norm that kinship solidarity is sacred and that betrayal of a brother has cosmic consequences. The story works differently from logical argument — it does something more effective. It attaches intense emotion (guilt, divine wrath, permanent exclusion from community) to the act of kin betrayal, stamping the norm into the listener’s moral imagination. Anyone who knows the story has been persuaded about the gravity of what Cain did, and by extension about what they must not do.

Shakespeare’s Macbeth encodes the norm that ambition unpaired with moral scruple is self-destructive. Macbeth is not punished by an external authority in the first instance; he is destroyed from within by guilt and paranoia. The play’s persuasive power lies precisely in this: it does not say “murder is wrong because society will catch you.” It says “murder reshapes the murderer’s inner world in ways that make life unliveable.” This is a far more powerful persuasive move, because it makes the violation of cooperative norms intrinsically aversive — something the violator does to themselves, binding rather than merely punitive.

Both stories are instances of what Gellner [21] described as the fundamental mechanism by which social orders reproduce themselves through persuasion:

“The way in which you restrain people from doing a wide variety of things, not compatible with the social order of which they are members, is that you subject them to ritual. The process is simple: you make them dance round a totem pole until they are wild with excitement, and become jellies in the hysteria of collective frenzy: you enhance their emotional state by any device, by all the locally available audio-visual aids, drugs, dance, music, and so on: and once they are really high, you stamp upon their minds the type of concept or notion to which they subsequently become enslaved.”

— Ernest Gellner, Plough, Sword and Book [21], 1988

The pattern extends well beyond ancient or canonical texts. Harari [25] points out that money is probably the most widely shared fiction in human history, a story about value that works only because everyone simultaneously believes it. The twenty-dollar bill in your pocket has no intrinsic worth; its power is entirely a function of collective belief, sustained by institutions, laws, and the continuous retelling of a narrative about what money is and what backs it. The same analysis extends to nation-states, legal systems, and corporations: none of these entities exist in the physical world, yet they command the behaviour of billions of people. Each is a persuasive achievement, a shared story maintained by the ongoing work of institutions, rituals, and education.

Religious narratives follow the same structure at a larger scale. The story of the Exodus, slaves liberated by divine intervention, bound by covenant to live by a set of moral rules, has been retold in Jewish, Christian, and Islamic traditions for three thousand years, influencing legal systems, political philosophy, and social norms across cultures that share virtually nothing else. The persuasive work runs through narrative entirely. Nobody who absorbs the Exodus story as a child is offered a syllogism. The story does it: emotion, dramatic stakes, a protagonist to identify with, a resolution that embeds the norm at the level of feeling rather than proposition.

The sophistication of modern narrative, the novel, cinema, serialised drama, is an elaboration of this same mechanism. Stories function as slow-release persuasion, shaping the moral intuitions and cooperative dispositions of audiences over timescales far longer than any direct persuasive appeal could achieve. The fact that Macbeth has been performed continuously for four centuries, in hundreds of languages, to audiences with no shared kinship or community, and that it continues to shape moral intuitions about ambition and guilt, is evidence of the extraordinary persistence and reach of narrative persuasion as a social technology. But canonical narratives are only the most visible layer of language-as-persuasion. Beneath them, moment to moment, a more continuous process does the same cooperative work.

3.9.10 Gossip, Reputation, and the Maintenance of Cooperation

The everyday form of language-as-persuasion is gossip: the informal exchange of information about the cooperative or defecting behaviour of third parties. Dunbar [15] estimated that roughly 65% of human conversation concerns social topics: who did what to whom, who can be trusted, who broke a norm. It is the maintenance mechanism of a reputation-based cooperative system. The group’s collective persuasive acts (praise, censure, ridicule, warning) police free-riding in a manner structurally parallel to worker policing in honeybee colonies, but operating through language rather than chemical signals, and scalable to groups of any size connected by communication networks.

The ethnographic record bears this out. In small-scale forager societies, public shaming and ridicule of individuals who claim more than their share, what anthropologists call “levelling mechanisms”, are among the most frequently observed cooperative enforcement tools. The mechanism is purely linguistic: no physical coercion is required. The threat of being talked about badly is sufficient to suppress many forms of free-riding. Individuals who acquire a reputation for generosity receive more food-sharing partnerships, more alliance support, and better coalition options. The reputational currency is maintained by talk.

This extends into modern institutions in ways that are sometimes invisible because they are so familiar. Professional references are gossip formalised into a hiring institution. Yelp reviews are gossip about businesses. Academic citation patterns carry reputational information about the credibility of researchers. Social media pile-ons are the digital equivalent of public shaming around the campfire, operating at a scale that no forager band could achieve but following the same structural logic: a defector is identified, talked about widely, and excluded from future cooperative exchanges. The technology changes; the underlying mechanism of reputation management through language remains constant.

What changes with scale is the symmetry of the process. In a band of 150, gossip is roughly symmetrical: anyone can talk about anyone. Scale the group, add writing, then print, then broadcast, and the reach becomes radically asymmetric: a sender with access to a platform can address millions simultaneously, while no individual can mount an equivalent reputational response. The mechanism is the same as in the forager band. The structural conditions have shifted fundamentally. That asymmetry is the starting point for everything in the rest of this book. Before asking what it means for modern persuasion, though, it is worth stepping back to ask how cooperation scaled in the first place, and whether language achieved it through argument or through something older.

3.10 Reason and Ritual: Two Modes of Cooperative Persuasion

Large-scale cooperation takes two distinct forms, and placing two examples side by side makes the contrast vivid.

A termite mound (Macrotermes bellicosus) can stand four metres tall, house several million individuals, maintain internal temperature to within one degree Celsius, and circulate air through a ventilation system that rivals modern mechanical engineering. The mound emerges from millions of local interactions: each termite responding to chemical and tactile signals from its immediate neighbours, the colony’s blueprint distributed across the interactions rather than held in any mind. This is cooperation through ritual: decentralised, self-organising, with order crystallising from the accumulated effect of local signalling acts.

A tall cathedral termite mound in Litchfield National Park, Australia, illustrating the scale of structure achieved through emergent collective behaviour with no central planner. — A *Macrotermes* termite mound reaching approximately five metres — built by millions of individuals through purely local chemical and tactile signals, the structure emerging from decentralised interactions. *Photo: J. Brew, Litchfield National Park, Australia. CC BY-SA 2.0.*

The Large Hadron Collider at CERN tells a different story. Over ten thousand scientists and engineers from more than one hundred countries coordinated the design, construction, and operation of a 27-kilometre ring of superconducting magnets buried beneath the Franco-Swiss border. Every component was specified in advance; every interface was explicitly agreed upon; every experiment was designed, pre-registered, and reviewed by committees. Large Language Models are a comparably planned achievement: teams of thousands, enormous investment, explicit architectural decisions at every level. This is cooperation through reason: explicit argumentation, centralised planning, a detailed blueprint agreed upon before any physical construction begins.

The interior of the Large Hadron Collider tunnel at CERN, showing the blue superconducting magnet array running through the 27-kilometre ring. — The LHC tunnel at CERN — 27 kilometres of superconducting magnets coordinated by more than ten thousand scientists from over one hundred countries. Every interface was explicitly specified; every experiment pre-registered and reviewed. This is cooperation through reason at its most extreme: a shared blueprint so detailed that the entire structure could be agreed upon before any physical construction began. *Photo: Julian Herzog. CC BY 4.0.*

The contrast between these two modes runs through the entire evolutionary history of cooperation. Eusocial insects cooperate almost entirely through ritual: chemical gradients, probabilistic responses to local signals, collective intelligence without any planning at any level. Human societies use both simultaneously. Historically, Gellner [21] argued that ritual has been the primary mechanism for creating large cooperative groups: communal, emotionally charged performance that stamps shared beliefs onto participants without requiring explicit argument. Reason, the explicit construction and exchange of arguments, is a more recent and more fragile achievement. It depends on specific institutional supports: writing, formal education, deliberative institutions.

Both depend on persuasion. They differ in whether the logic driving the outcome is accessible to the participants, and whether the result was intended by any of them. Language is the technology that made both possible at scale: without it, neither institutional reasoning nor shared ritual can bind strangers.

3.11 Persuasion Across Scales

Life kept finding the same solution to the same problem. Every time independent organisms needed to act as one, cells within a body, workers within a colony, hunters within a band, citizens within a state, they needed a way to make cooperative intent legible and defection costly. Each time the cooperation problem got harder, the communication system had to get more powerful. Chemicals. Dances. Grooming. Alarm calls. Coalition politics. Pantomime. Grammar. Shared fiction.

The table below traces that progression.

Table 3.4: Persuasion as the enabling mechanism of successive cooperative transitions.

Scale	Organism	Communication Channel	Key Persuasive Function	Transition Enabled
Cell → organism	Multicellular animals	Chemical (hormones, receptors)	Differentiation, division of labour	Multicellularity
Individual → colony	Eusocial insects, naked mole rats	Pheromones, mechanical signals	Recruitment, suppression of defection	Superorganism
Individual → group	Primates, lions, meerkats	Tactile, acoustic, gestural	Alliance building, alarm, coordination	Cooperative foraging and defence
Band → tribe	Early Homo	Proto-language, gesture	Reputation management, norm transmission	Coordinated hunting, territory
Tribe → civilisation	Homo sapiens	Symbolic language, writing	Shared fictions, institutions	States, markets, religions
Nation → globe	Modern humans	Print, broadcast media, internet	Mass persuasion, public opinion	Global coordination
Human → AI	Emerging	Large language models	Personalised persuasion at scale	To be determined

The final row is still being written. Large Language Models represent a new kind of communicative agent: one capable of generating persuasive messages at industrial scale, tailored to individual recipients, across every channel simultaneously.

This returns us to where the chapter began: seventeen thousand years ago, in a cave in the Vézère valley, someone mixed iron oxide with animal fat and pressed a hand to the rock. The question posed at the opening was why, given that anatomically modern humans had existed for 200,000 years, the cultural explosion of symbolic art, long-distance trade, and shared ritual happened so recently and so suddenly. The answer the chapter has been building toward: the accumulation of enough shared fiction. Language gave humans displacement and propositional structure. Culture gave them the ability to build on what previous generations had worked out. At some threshold, those compounding layers produced communities that could hold the same story in their heads long enough to coordinate around it, to paint the same animals on the same walls season after season, to trade the same ochre across hundreds of kilometres, to bury their dead with the same rituals. The cave recorded that threshold rather than creating it — evidence that something fundamental had changed in the minds that made it.

Whether the arrival of AI-generated persuasion at scale represents an analogous threshold, and what the consequences will be, is among the central questions this book tries to address.

Language and Persuasion

The connection between language and persuasion is developed further in Chapter 3, where different academic disciplines, linguistics, social psychology, economics, and neuroscience, are examined for the specific mechanisms by which messages change minds. The evolutionary framing developed here provides the deepest context for why those mechanisms exist at all: they are the accumulated solutions to the problem of cooperation across scale, refined over hundreds of millions of years of selection.

[1]

Aiello, L.C. and Dunbar, R.I.M. 1993. Neocortex size, group size, and the evolution of language. Current Anthropology. 34, 2 (1993), 184–193.

[2]

Axelrod, R. 1984. The evolution of cooperation. Basic Books.

[3]

Baldwin, J.M. 1896. A new factor in evolution. The American Naturalist. 30, 354 (1896), 441–451. https://doi.org/10.1086/276408.

[4]

Bates, E. and MacWhinney, B. 1989. Functionalism and the competition model. The crosslinguistic study of sentence processing. B. MacWhinney and E. Bates, eds. Cambridge University Press. 3–73.

[5]

Bickerton, D. 1983. Creole languages. Scientific American. 249, 1 (1983), 108–115.

[6]

Bickerton, D. 1990. Language and species. University of Chicago Press.

[7]

Boyd, R. and Richerson, P.J. 1985. Culture and the evolutionary process. University of Chicago Press.

[8]

Byrne, R.W. and Whiten, A. 1988. Machiavellian intelligence. Behavioral and Brain Sciences. 11, 2 (1988), 233–244.

[9]

Cheney, D.L. et al. 1995. The role of grunts in reconciling opponents and facilitating interactions among adult female baboons. Animal Behaviour. 50, (1995), 249–257.

[10]

Cheney, D.L. and Seyfarth, R.M. 1997. Reconciliatory grunts by dominant female baboons influence victims’ behaviour. Animal Behaviour. 54, (1997), 409–418.

[11]

Clutton-Brock, T.H. et al. 1999. Selfish sentinels in cooperative mammals. Science. 284, 5420 (1999), 1640–1644.

[12]

Curtiss, S. 1977. Genie: A psycholinguistic study of a modern-day wild child. Academic Press.

[13]

Damasio, H. et al. 1996. A neural basis for lexical retrieval. Nature. 380, 6574 (1996), 499–505. https://doi.org/10.1038/380499a0.

[14]

Dennett, D.C. 1995. Darwin’s dangerous idea: Evolution and the meanings of life. Simon & Schuster.

[15]

Dunbar, R. 1998. Grooming, gossip, and the evolution of language. Harvard University Press.

[16]

Dunbar, R.I.M. 1992. Neocortex size as a constraint on group size in primates. Journal of Human Evolution. 22, 6 (1992), 469–493.

[17]

Fedorenko, E. et al. 2024. Language is primarily a tool for communication rather than thought. Nature. 630, (2024). https://doi.org/10.1038/s41586-024-07522-w.

[18]

Ferretti, F. and Adornetti, I. 2021. Persuasive conversation as a new form of communication in Homo sapiens. Philosophical Transactions of the Royal Society B: Biological Sciences. 376, (2021), 20200196. https://doi.org/10.1098/rstb.2020.0196.

[19]

Frisch, K. von 1967. The dance language and orientation of bees. Harvard University Press.

[20]

Gardner, B.T. and Gardner, A.R. 1974. Comparing the early utterances of child and chimpanzee. Minnesota symposium on child psychology. A. Pick, ed. University of Minnesota Press. 3–23.

[21]

Gellner, E. 1988. Plough, sword and book: The structure of human history. University of Chicago Press.

[22]

Gopnik, M. 1990. Feature-blind grammar and dysphasia. Nature. 344, (1990), 715.

[23]

Gopnik, M. and Crago, M.B. 1991. Familial aggregation of a developmental language disorder. Cognition. 39, 1 (1991), 1–50. https://doi.org/10.1016/0010-0277(91)90058-C.

[24]

Hamilton, W.D. 1964. The genetical evolution of social behaviour, I and II. Journal of Theoretical Biology. 7, 1 (1964), 1–52.

[25]

Harari, Y.N. 2015. Sapiens: A brief history of humankind. Harper.

[26]

Hauser, M.D. et al. 2002. The faculty of language: What is it, who has it, and how did it evolve? Science. 298, 5598 (2002), 1569–1579. https://doi.org/10.1126/science.298.5598.1569.

[27]

Hockett, C.F. 1960. The origin of speech. Scientific American. 203, 3 (1960), 88–96.

[28]

Jarvis, J.U.M. 1981. Eusociality in a mammal: Cooperative breeding in naked mole-rat colonies. Science. 212, 4494 (1981), 571–573.

[29]

Krebs, J.R. and Dawkins, R. 1984. Animal signals: Mind-reading and manipulation. (1984).

[30]

Lai, C.S.L. et al. 2001. A forkhead-domain gene is mutated in a severe speech and language disorder. Nature. 413, 6855 (2001), 519–523. https://doi.org/10.1038/35097076.

[31]

Maynard Smith, J. and Szathmáry, E. 1995. The major transitions in evolution. Oxford University Press.

[32]

Nowak, M.A. 2006. Five rules for the evolution of cooperation. Science. 314, 5805 (2006), 1560–1563.

[33]

Nowak, M.A. and Sigmund, K. 2005. Evolution of indirect reciprocity. Nature. 437, (2005), 1291–1298.

[34]

Pellegrino, G. di et al. 1992. Understanding motor events: A neurophysiological study. Experimental Brain Research. 91, 1 (1992), 176–180. https://doi.org/10.1007/BF00230027.

[35]

Plato 2000. The republic. Cambridge University Press.

[36]

Ratnieks, F.L.W. 1988. Reproductive harmony via mutual policing by workers in eusocial Hymenoptera. American Naturalist. 132, (1988), 217–236.

[37]

Rousseau, J.-J. 1762. The social contract.

[38]

Seeley, T.D. 2010. Honeybee democracy. Princeton University Press.

[39]

Seeley, T.D. 1995. The wisdom of the hive: The social physiology of honey bee colonies. Harvard University Press.

[40]

Seyfarth, R.M. et al. 1980. Monkey responses to three different alarm calls: Evidence for predator classification and semantic communication. Science. 210, (1980), 801–803.

[41]

Stander, P.E. 1992. Cooperative hunting in lions: The role of the individual. Behavioral Ecology and Sociobiology. 29, 6 (1992), 445–454.

[42]

Számadó, S. 2010. Pre-hunt communication provides context for the evolution of early human language. Biological Theory. 5, 4 (2010).

[43]

Tooby, J. and Cosmides, L. 1990. Toward an adaptationist psycholinguistics. Behavioral and Brain Sciences. (1990), 760–762.

[44]

Trivers, R.L. and Hare, H. 1976. Haplodiploidy and the evolution of the social insects. Science. 191, 4224 (1976), 249–263.

[45]

Waal, F. de 1982. Chimpanzee politics: Power and sex among apes. Jonathan Cape.

[46]

Wilson, E.O. 1971. The insect societies. Harvard University Press.

[47]

Zahavi, A. 1975. Mate selection—a selection for a handicap. Journal of Theoretical Biology. 53, (1975), 205–214.

[48]

Zahavi, A. and Zahavi, A. 1997. The handicap principle: A missing piece of Darwin’s puzzle. Oxford University Press.