An Overview of Human Language

Let’s look at some highlights of human language—enough to get you started on a deeper study.

Getting Oriented

There are at least three ways to look at language:

We studied formal languages earlier in the course. Now let’s learn about human language.

Exercise: What do you think (human) language is? Share your definition with a partner. How do your definitions differ? What do they have in common?

How to Study Language

People have been studying language for thousands of years.

We’ve looked into the question of what language is, where it came from, why we have it, what we can do with it, and what language might tell us about ourselves. We might never get a complete story. These questions are really complex.

Answering these questions requires research from archaeology, anthropology, psychology, linguistics, computer science, philosophy, cognitive science, genetics, ethology, neuroscience, and many other fields. Focusing on a single field will get you an incomplete picture.

The study of language is inherently interdisciplinary.

Steven Mithen gets this! His 2024 book The Language Puzzle is really good. You should read it. It looks at language as a puzzle with many fragments:

Mithen identifies and assembles these fragments to form a compelling and modern understanding of language evolution.

Exercise: Read Mithen’s book.

Why Study Language

Because it’s fun! Also for personal enrichment and intellectual curiosity.

Professional Reasons

Language plays a huge role in computer science, cognitive science, and artificial intelligence. A good understanding of human language is crucial for developing natural language processing systems and effective language models.

It’s also fun to contrast the precision of formal languages and programming languages with the messiness, ambiguities, redundancies, and exceptions of evolved human languages. Maybe that’s why LLMs kind of work—all this stuff is rolled into probabilities and patterns in a network not unlike the human brain (with its long-distance connections and distributed processing).

Exercise: Research whether we could build conversational agents without resorting to neural networks. What would be the advantages and disadvantages of a rule-based system vs. a statistical one? Could the former approach even work?

A technical study of language can also lead to rewarding careers in education, speech therapy, language translation, teaching reading and writing, building text-to-speech and speech-to-text systems, coaching communication and social skills, and other fields that can make a huge difference in people’s lives.

Personal Reasons

But we don’t study language just to get a job.

Wielding language well can enhance communication, persuasion, and understanding across different contexts and cultures. It empowers you to persuade, inform, and connect with others effectively. It provides a buffer against those who would use language against you.

Academic Reasons

A holistic study of language and linguistics is important, because we want to get closer to answering our questions about language and to deepen our understanding. Like all fields, powerful theories arise which over time are found to be lacking or completely wrong. The study of language is no different. Here are three examples of once widely accepted theories that have been strongly challenged as new research and evidence have come to light:

  • Charles Hockett in the 1960s identified “16 design features” of language—9 we share with other primates (vocal-auditory channel, broadcast transmission, transitoriness, interchangeability, total feedback, specialization, semanticity, arbitrariness, discreteness, displacement), and 7 that are uniquely human (productivity, cultural transmission, duality of patterning, prevarication, reflexiveness, learnability). But recent research has found these features to be a “non-starter” as a framework for evolutionary linguistics.
  • Noam Chomsky from the 1960s onward argued for Universal Grammar (UG)—an innate Language Acquisition Device hardwired into the human brain, with a common deep structure underlying all human languages. The poverty of the stimulus argument held that children could not possibly learn grammar from input alone. But cross-linguistic typology has since revealed far more structural variation across languages than UG predicted, and modern approaches such as the Iterated Learning Model (ILM) have shown that general learning mechanisms may be sufficient.
  • Steven Pinker’s The Language Instinct (1994) argued that language is a Darwinian biological instinct shaped by natural selection specifically for communication, much like echolocation in bats. Critics find some aspects of this view outdated as (1) it is sympathetic to UG, (2) the use of the term “instinct” does not account for social interaction and cultural immersion, and (3) language evolves faster than a genetic explanation would seem to allow.
Exercise: Research and explain Hockett’s design features of language. Why are they now considered incompatible with modern research?
Exercise: Read this essay claiming there is no language instinct. What are the main arguments in this essay against Chomsky’s UG and Pinker’s language instinct view? Do you find them convincing?

Topics in Human Language

Following is a brief overview of some key topics in the study of human language. After this overview, we’ll be taking deeper dives into many of these topics, including words, sounds, structure and meaning, language diversity, and the relationship between language, thought, cognition, and culture.

Language Evolution

We clearly use language differently than our nearest relatives (the chimpanzees) who communicate with gestures, facial expressions, and vocalizations, and can learn ASL signs but can’t seem to organize all the signs recursively. Our LCA probably was very much like today’s chimpanzees.

The language we have today evolved with us. Mithen advocated the Hmmmmm hypothesis, which posits that language was preceded by “proto-language” that was Holistic, manipulative, multi-modal, musical, and mimetic.

Exercise: What do these terms mean?
Holistic: noncompositional; Manipulative: commands and suggestions rather than descriptive statements; Multi-modal: acoustic, gestural, facial, etc.; Musical: rhythm and melody important; Mimetic: imitative of actions and sounds.

But how did language evolve? Surely it was through a multitude of factors, all occurring over millions of years. A few historical highlights identified by Mithen in The Language Puzzle are:

Utterances

Modern language is characterized by combining words into utterances, according to various rules.

Utterances can be spoken, signed, or written.

Utterances have meanings which are influenced not only by the words themselves but also by prosody—intonation, stress, rhythm, tempo, and pauses in spoken language, and by body posture, movements, pauses, and facial expressions in signed languages.

CLASSWORK
Come up with utterances (sentences or phrases) using different forms of prosody to convey meaning changes such as statements vs. questions, emphasis, sarcasm, irony, etc.
Exercise: Consider, then (or do some research on), whether the rich prosody of sign language means that the evolution of the human vocal tract was not a necessary precondition for the emergence of language. Could we have evolved a fully expressive sign language without the vocal tract changes? What would that have looked like?

If utterances are made up of words, then what, exactly, are words?

Words

Exactly what counts as a word is kind of hard to pin down, and varies between languages. It’s typically understood to be a unit of meaning that can stand alone in an utterance. There’s a lot of variation among languages, especially in the orthographic representations—think of agglutinative languages like Finnish, Hungarian, Turkish, Japanese, Swahili, Korean, Quecha, and Tamil.

There is so much to study about words. We’ll cover words in much more detail later, but by way of introduction, here are some key concepts:

Putting words together into utterances leads to different dimensions of language study:

We’ll see how to classify words into lexical and grammatical words, and further into categories such as nouns, verbs, adjectives, prepositions, conjunctions, pronouns, and determiners. We’ll see word-like entities, like the English oops, shhh, hmm, uh, um, haha, tsk-tsk, yikes, oops. And (instinctive) screams, grunts, yawns, laughs, and other such vocalizations.

There’s another classification dimension: Some words are iconic, meaning their form resembles their meaning (e.g., buzz, bang, hiss, plop), and others are arbitrary (e.g., dog, cat).

The lexicon of a language is its inventory of words (and their meanings). Different languages have wide differences in the size of their lexicons! We’ll learn theories about why. Children acquire words at a remarkable pace—about 10 new words per day between ages 2–8—many of which are inferred from a single exposure in context. How? We’ll learn that too. And also how it is that new words are created every day.

Spoken language introduces a whole new set of elements to study. We’ll learn about phonemes (the smallest units of sound in a language that can distinguish meaning). Some spoken languages have only a few dozen phonemes, while others have over a hundred. The IPA (International Phonetic Alphabet) provides a standardized way to represent these sounds. And yes, we’ll learn all those crazy terms like bilabial, alveolar, velar, glottal, uvular, glottal, plosive, affricate, etc.

A lot of languages are written with the Latin alphabet, even when the sounds don’t match the letters perfectly. Watch the following video to see if the Latin alphabet is a good fit for Tlingit.

Exercise: How badly does the Latin alphabet fit English? How well does it fit Spanish? Why the difference? What other alphabets are used for English?
Exercise: Research the click sounds of Zulu. How are they represented in the Latin alphabet? How are they represented in the IPA?

We’ll also address the question of how words get their meanings. There are some theories, like Rosch’s Prototype Theory—that categories aren’t defined by certain conditions, but rather organized around prototypes (or best examples), such as a robin for the category "bird" (and not a penguin). Other members of the category are included based on their similarity to the prototype, leading to graded membership rather than a strict boundary. Then there’s Putnam’s Twin Earth Thought Experiment, which gives evidence for meaning not being entirely in one’s head (intensionally) but also (extensionally) within one’s environmental context. And another theory: Causal Theory of Reference, which suggests that words get their meaning anchored historically, through chains of use.

One more teaser for what is coming: Words are definitely related to each other. We’ll learn about synonyms, antonyms, hyponyms, hypernyms, homonyms, and polysemy.

Grammar

Words and word-like entities are combined (arranged) according to rules to form larger units like phrases and sentences. We need the rules because there’s a benefit to having not every arrangement of words be acceptable. The rules constitute a grammar. Where do the rules come from? How do we know if an utterance is well-formed?

Rules or Guidelines?

In programming languages, rules are explicit and prescriptive. In spoken natural language, we tend to treat them as guidelines, but in writing, they are often more rigidly followed.

A grammar will tell you where all the components of an utterance should go—based on the category (e.g., for English: noun, verb, adjective, noun phrase, verb phrase, subject, predicate, adverbial clause, prepositional phrase, etc.) of the word or phrase—of the components, but it won’t tell you what the utterance means. Here are two classic examples of perfectly grammatical (structurally sound) sentences:

Many (most?) human languages exhibit hierarchical structure, where smaller units combine to form larger units in a nested manner. They tend to all for recursion—the property of language that allows rules to be applied repeatedly, embedding structures within structures. This is what enables sentences to be infinitely long and complex, even with a finite set of rules and vocabulary.

We’ll cover these in much more detail later in the course.

Here’s a fascinating question: Where Does Grammar Come From? Chomsky thought language acquisition is too fast and input too impoverished for learning alone, putting forth the Poverty of the Stimulus argument: children know grammatical rules they’ve never seen exemplified. So he proposed a kind of innate Language Acquisition Device (LAD) for what became known as Universal Grammar. It was very popular for a while, but modern research, including the body of work by Tomasello and Christiansen, note that general learning mechanisms + social interaction are sufficient. The computer simulations of Kirby and colleagues support this view, showing that cultural transmission can lead to the emergence of structured language over generations.

Exercise: Read Tomasello’s Constructing a Language: A Usage-Based Theory of Language Acquisition. What is the main arguments against the need for an innate LAD? Do you find them convincing?
Exercise: Read about the Iterated Learning Model (ILM) and how it demonstrates the emergence of structured language from general learning mechanisms. Why must a bottleneck be involved? What does this suggest about the role of cultural transmission in language evolution?

There’s much more in the study of language acquisition that argues against the need for an innate Language Acquisition Device (LAD). Pidgins are rudimentary contact languages with no native speakers and minimal grammar. Creoles are pidgins that children acquire natively, with a spontaneously developed full grammar. And there’s a well-known case of deaf children in 1980s Nicaragua inventing a full sign language in one generation. Themselves. With no adults to mimic.

There seems to be a lot of evidence that given the right social conditions, grammar emerges.

Grammar is not innate. It emerges from the interaction of general learning mechanisms, social interaction, and cultural transmission.

Pragmatics

Pragmatics is the study of how context influences meaning, things like prosody, gestures, speaker attributes (age, gender, status), surroundings, relationship with the listener, context, innuendo, subtext, shared situational knowledge, and so on.

An interesting pragmatic concept isimplicature: a meaning that is implied but not explicitly stated. (Enables efficient and sometimes polite communication)

Paul Grice proposed the following maxims of implicature for cooperative conversation:

Efficient discourse is also aided by deixis: words or phrases whose reference (spatial, temporal, or personal) shifts with context, such as I, you, here, now, this, that, these, those, tomorrow, last week, the following.

Exercise: Read the Wikipedia article covering Grice’s maxims, and how they are often flouted and violated in human conversation. Written text is full of Grice violations, so how can LLMs make sense of them and manage to learn cooperative norms to make chatbots function pretty well? After all, they were never told any specific rules, and pre-trained models haven’t participated in a conversation before training ends.

Linguistic Diversity

There are ~7,000 languages in the world today, and many more that have gone extinct. They vary widely in their phonology (sound systems), morphology (word formation), syntax (sentence structure), semantics (meaning), and pragmatics (use in context). We’ll look later at many of these differences, and how they came to be. We’ll study vowel shifts, simplification, grammaticalization, borrowing, word shortening, reanalysis, contact, migration, splitting communities, and other mechanisms of language change.

Languages evolving into new languages is fascinating, and very often eye-opening. We’ll look at language family trees, such as this one:

Language Family Tree

The study of so many languages provides a good deal of evidence against Universal Grammar. We’ll be looking at many of the reasons why UG fell so out of favor. Here’s a quick sampling of observations that showed that several aspects thought to be universal simply aren’t so:

The World Atlas of Language Structures (WALS) is pretty cool, with many more examples. It shows that when linguists systematically catalogued features across thousands of languages, feature after feature that had been proposed as universal turned out to be highly variable (word order, case marking, use of morphology, how questions are formed, whether there’s a passive construction, tenses, and so on). The apparent universals mostly dissolved into tendencies or statistical preferences when real cross-linguistic data was gathered at scale.

Language and Thought

We can’t talk about language without considering its impact on thought. The question of whether and how much language influences thought (and vice versa) is complex and the subject of a lot of debate. Linguistic relativity asserts that language influences worldview or cognition. You’ll sometimes hear it called the “Sapir-Whorf hypothesis.” We’ll look at the weak and strong versions of this hypothesis.

We’ll also see how language may shape thought, by looking at:

Signs and Symbols

Human language is overwhelmingly symbolic. This is extraordinary! No other species uses symbols at scale. We’ve found ornamental shell beads dating back to ~100,000 BCE, and geometric ochre engravings dating back to ~75,000 BCE. These are likely symbolic artifacts, indicating that our ancestors were using symbols long before the emergence of fully developed language. The cave paintings at Lascaux and Chauvet (30,000–17,000 BCE) also suggest symbolic thought, as they depict animals and scenes that may have had cultural or spiritual significance.

Symbols are one of three kinds of signs, the others being icons (signs resembling what they represent) and indices (causally connected to their referent). But symbols are fascinating because they have no intrinsic connection to their referent. They require (a) shared convention, (b) memory, (c) the ability to hold “word” and “meaning” as linked but separate. The power is unlimited generativity — you can make a symbol for anything, even abstractions: justice, infinity, recursion, love, LLM, the meaning of life, etc.

We’ll study symbols later—where they come from, and specifically how they hook on to the world (what philosophers call the symbol grounding problem). There are a few theories about this, of course!

Even the concept of reference is pretty interesting. Before the word hooks onto its meaning, one has to learn or be taught what object is getting the word. Human infants develop a (proto-linguistic) joint attention at ~9 months. Without joint attention, ostensive definition (“that’s a dog”) doesn’t work.What’s fascinating is that pointing is surprisingly rare in non-human primates.

Connections to Formal Languages and LLMs

In the modern view of language, two things stand out:

What about formal languages and language models?

Formal languages are just words and rules. We covered them earlier.

LLMs, though, feel like something more:

When we study LLMs later, two key questions we’ll encounter are:

Recall Practice

Here are some questions useful for your spaced repetition learning. Many of the answers are not found on this page. Some will have popped up in lecture. Others will require you to do your own research.

  1. What are three views of language?
    Language as mathematics (formal language) • Language as an evolutionary communication system (human language) • Language as statistical patterns (LLMs)
  2. What are some of the disciplines that contribute to the study of language?
    Linguistics, anthropology, archaeology, psychology, neuroscience, genetics, computer science, philosophy, ethology, and more.
  3. What is Mithen’s central claim about the nature of human language?
    Language is not one thing, but rather a puzzle made of many pieces (words, grammar, symbols, sounds, social cognition, vocal anatomy, brain structure) that evolution assembled, imperfectly, over millions of years.
  4. What are three popular theories of language that have been seriously challenged and are considered somewhat incompatible with modern research?
    Hockett’s design features, Chomsky’s Universal Grammar, and Pinker’s language instinct.
  5. What is the Hmmmmm hypothesis?
    A theory of language evolution that posits a proto-language that was Holistic, Manipulative, Multi-modal, Musical, and Mimetic.
  6. When did hominins start moving from trees to open grasslands and why?
    Around 4 million years ago, due to global climate change leading to more arid conditions.
  7. What major physical evolutionary changes likely enabled the development of human language?
    Changes in brain size and structure, and especially the vocal tract.
  8. Once the capacity for generating and using words emerged, how did some words end up changing?
    Mishearings, mispronunciations, misunderstandings, social conventions, and a drive toward efficiency.
  9. What is an utterance?
    A unit of language made up of words that is spoken, signed, or written, and has meaning influenced by prosody and context.
  10. What are some types of prosody?
    Intonation, stress, rhythm, tempo, and pauses in spoken language; body posture, movements, pauses, and facial expressions in signed languages.
  11. What is morphology?
    The study of the structure and formation of words, including the analysis of morphemes, which are the smallest units of meaning.
  12. What is phonology?
    The study of the sound systems of languages, including the rules for combining and using phonemes, which are the smallest units of sound that can distinguish meaning.
  13. What is the difference between syntax and semantics?
    Syntax deals with structure and semantics deals with meaning.
  14. What is the difference between iconic and arbitrary words?
    Iconic words have a form that resembles their meaning (e.g., onomatopoeic words like "buzz"), while arbitrary words have a form that does not have a direct resemblance to their meaning (e.g., "dog").
  15. What is really annoying about the use of the Latin Alphabet to write English?
    The Latin Alphabet has far too few letters to match English phonemes, leading to many irregularities and inconsistencies in spelling and pronunciation.
  16. What is the origin of the sentence “The rat the cat the dog the cow tossed worried killed ate the malt”?
    It’s constructed from one of the stanzas of the poem This is the house that Jack built.
  17. What is Chomsky’s poverty of the stimulus argument and why is it considered less convincing today?
    The argument is that since children acquire grammatical rules they have never directly experienced (an impoverished stimulus), we must have an innate Language Acquisition Device. Modern research has shown that general learning mechanisms and social interaction may be sufficient to explain language acquisition.
  18. What do we now believe grammar emerges from?
    The interaction of general learning mechanisms, social interaction, and cultural transmission.
  19. In seven words or less, what is pragmatics?
    The study of how context influences meaning.
  20. What are Grice’s four maxims of cooperative conversation?
    Quantity (say enough, not too much), Quality (be truthful), Relation (be relevant), Manner (be clear).
  21. What is deixis? Give three examples.
    Deixis refers to words whose reference shifts with context. Examples: I, here, now.
  22. Approximately how many human languages are in use today?
    ~7,000
  23. What kinds of languages stand as arguments against universal grammar?
    Languages with an apparent lack of recursion, languages with an absence of numbers and counting, ergative-absolutive languages, topic-prominent languages, time without tense, tonal languages, sign languages.
  24. What is meant by linguistic relativity?
    The idea that language influences worldview or cognition.
  25. What is Peirce’s distinction between icons, indices, and symbols? Give an example of each.
    Icons resemble their referent (a photo, a map). Indices are causally connected to their referent (smoke → fire). Symbols are arbitrary and conventional (the word "dog"; red meaning stop).
  26. What is joint attention?
    The ability of two individuals to attend to the same thing simultaneously.
  27. How does human language differ from formal languages?
    Human language is an evolved communication system with words, grammar, symbols, social cognition, and more, while formal languages are artificial systems of symbols and rules only.
  28. How are language capabilities of LLMs not like human language?
    LLMs (currently, arguably) lack true understanding, intentionality, and grounding in the physical and social world.

Summary

We’ve covered:

  • Three views of language
  • How to study language
  • Why study language
  • Topics in the study of human languages
  • Language Evolution
  • Words
  • Grammar
  • Linguistic diversity
  • Signs and symbols
  • Human language connections to LLMs