One of the most ubiquitously Australian words is the word dag. A word known and loved by basically any Aussie.
It’s a light-hearted insult referring to someone who is unfashionable or socially awkward, basically a bit of a dork (Fig 1). But like most insults in Australian English it’s also used affectionately as a term of endearment (what does this say about how Australians relate to each other?). Typically in these cases, it is used to convey a sense of regard for the unashamedness of the dag in question – to express the lovable quality of someone who is just oblivious to certain social norms.
However, the origins of this this word are anything but loveable. According to the popular story (which appears to be supported by Macquarie Dictionary and The Australian National Dictionary), this usage is derived from the older meaning (attested in 1891) of the word dag to refer to a matted clot of wool and dung that forms around a sheep’s bum (Fig 2). By 1967 something ‘dirty and unkempt’ could be referred to as daggy and by the 1980s we were using the word for Figure 2 for the unfashionable yet loveable dad in Figure 1.
As an Australian, I am proud of my dagginess and am pleased to know our daggy little word has a pretty gross origin.
Of all the world’s languages, English is the most widely learnt by adults. Although Mandarin Chinese has the highest number of speakers overall, owing to the huge size of China’s population, second-language speakers of English outnumber those of Mandarin more than three times.
Considering that the majority of English speakers learn the language in adulthood, when our brains have lost much of their early plasticity, it’s just as well that some aspects of English grammar are pretty simple compared to other languages. Take for example the way we express the plural. With only a small number of exceptions, we make plurals by adding a suffix –s to the singular. The pronunciation differs depending on the last sound of the word it attaches to – compare the ‘z’ sound at the end of dogs to the ‘s’ sound at the end of cats, and the ‘iz’ at the end of horses – but it varies in a consistently predictable way, which makes it easy to guess the plural of an English noun, even if you’ve never heard it before.
That’s not the case in every language. Learners of Greek, for example, have to remember about seven common ways of making plurals. Sometimes knowing the final sounds of a noun and its gender make it possible to predict the plural, but other times learners simply have to memorise what kind of plural a noun has: for example pateras ‘father’ and loukoumas ‘doughnut’ both have masculine gender and singulars ending in –as, but in Standard Greek their plurals are pateres and loukoumathes respectively.
This is similar to how English used to work. Old English had three very common plural suffixes, -as, -an and –a, as well as a number of less common types of plural (some of these survive marginally in a few high-frequency words, including vowel alternations like tooth~teeth and zero-plurals like deer). The modern –s plural descends from the suffix –as, which originally was used only for a certain group of masculine nouns like stān, ‘stone’ (English lost gender in nouns, too, but that’s a subject for another blog post).
How did the -s plural overtake these competitors to become so overwhelmingly predominant in English? Partly it was because of changes to the sounds of Old English as it evolved into Middle English. Unstressed vowels in the last syllables of words, which included most of the suffixes which expressed the gender, number and case of nouns, coalesced into a single indistinct vowel known as ‘schwa’ (written <ə>, and pronounced like the ‘uh’ sound at the beginning of annoying). Moreover, final –m came to be pronounced identically to –n. This caused confusion between singulars and plurals: for example, Old English guman ‘to a man’ and gumum ‘to men’ both came to be pronounced as gumən in Middle English. It also caused confusion between two of the most common noun classes, the Old English an-plurals and the a-plurals. As a result they merged into a single class, with -e in the singular and -en in the plural.
This left Middle English with two main types of plural, one with –en and one with –(e)s. Although a couple of the former type remain to this day (oxen and children), the suffix –es was gradually generalised until it applied to almost all nouns, starting in the North of England and gradually moving South.
A similar kind of mass generalisation of a single strategy for expressing a grammatical distinction is often seen in the final stages of language death, as a community of speakers transition from a minority to a majority language as their mother tongue. Nancy Dorian has spent almost 50 years documenting the dying East Sutherland dialect of Scots Gaelic as it is supplanted by English in three remote fishing villages in the Scottish highlands. In one study the Gaelic speakers were divided into fluent speakers and ‘semi-speakers’, who used English as their first language and Gaelic as a second language. Dorian found that the semi-speakers tended to overgeneralise the plural suffix –an, applying it to words for which fluent speakers would have used one of another ten inherited strategies for expressing plural number, such as changing the final consonant of the word (e.g. phũ:nth ‘pound’, phũnčh ‘pounds’), or altering its vowel (e.g. makh ‘son’, mikh ‘sons’).
But why should the last throes of a dying language bear any resemblance to the evolution of a thriving language like English? A possible link lies in second language acquisition by adults. At the same time as these changes were taking place, English was undergoing intense contact with Scandinavian settlers who spoke Old Norse. During the same period English shows many signs of Old Norse influence. In addition to many very common words like take and skirt (which originally had a meaning identical to its native English cognate shirt), English borrowed several grammatical features of Scandinavian languages, such as the suffix –s seen in third person singular present verbs like ‘she blogs’ (the inherited suffix ended in –th, as in ‘she bloggeth’), and the pronouns they, their and them, which replaced earlier hīe, heora and heom. Like the extension of the plural in –s, these innovations appeared earliest in Northern dialects of English, where settlements of Old Norse speakers were concentrated, and gradually percolated South during the 11th to 15th centuries.
It’s possible that English grammar was simplified in some respects as a consequence of what the linguist Peter Trudgill has memorably called “the lousy language-learning abilities of the human adult”. Research on second-language acquisition confirms what many of us might suspect from everyday experience, that adult learners struggle with inflection (the expression of grammatical categories like ‘plural’ within words) and prefer overgeneralising a few rules rather than learning many different ways of doing the same thing. In this respect, Old Norse speakers in Medieval England would have found themselves in a similar situation to semi-speakers of East Sutherland Gaelic – when confronted with a number of different ways of expressing plural number, it is hard to remember for each noun which kind of plural it has, but simple to apply a single rule for all nouns. After all, much of the complexity of languages is unnecessary for communication: we can still understand children when they make mistakes like foots or bringed.
One of the peculiar habits that strikes a foreign visitor to a restaurant in the US (alongside heaps of ice in your drink and the sneaky habit of leaving sales tax off the price) is that menus typically list main course dishes as ‘entrees’. But ‘entrée’ is a French word that means something like ‘entry’ or ‘entrance’, so shouldn’t it be the same thing as appetizer or hors-d’oeuvre or starter? It seems like some fundamental misunderstanding of the term, like the rectangular chocolate ‘croissants’ shamelessly marketed outside of France. …
It has recently come to my attention that my vowels are weird. This was pointed out to me by a fellow American colleague who declared that, unexpectedly, we do not say Surrey the same way, and that my pronunciation has a “weird” vowel. I’ve already experienced confusion more than once from locals when I utter the word, and it’s enough to make me a little self-conscious.
I was already vaguely aware that Californians do some strange things with vowels. A bit of online digging revealed that as a San Francisco Bay Area native, I can blame my weird vowels on the Northern California vowel shift (outlined by Penny Eckert here). This sound change is what makes the surfer’s (and my) way of saying duuuude so distinctive (the vowel is fronted). My international friends make fun of the way I say “aw, man!”. Here, man for me becomes something like /miyn/. Even I have to admit it sounds pretty funny.
I like to think I am relatively aware of linguistic behavior, but as this experience showed me, we as linguists may not be as well-equipped as we think to recognize our own quirks.
In a 2016 twitter poll asking do you feel comfortable using gift as a verb? (ie: “I gifted that sweater to you”), 66% of respondents reported that they found this use ‘icky’. This phenomenon is known by linguists as ‘conversion’ or ‘zero-derivation’, because it involves taking a particular class of word, such as a verb, noun, or an adjective, and deriving another type of word from it without doing anything. This stands in contrast to common-and-garden ‘derivation’, where you convert the class of a word by changing its form somehow. For example, the verb sense becomes a noun sensation, which becomes an adjective sensational, which comes full circle to another verb sensationalise, all by the accumulation of suffixes. (The OED even lists a further derivation sensationalisation – but this sort of style has its own equally vociferous critics, showing you can’t win when it comes to linguistic taste).
In English verbs, nouns and adjectives all tend to look much the same, which makes it possible to zero-derive by stealth. It wasn’t always this way. Take for example the verb stone, a 12th-century example of the noun-as-verb phenomenon, derived from the noun stone (in its Middle English form stōn). Back then, the infinitive wasn’t simply stōn, but stōnen – the suffix –en was obligatory for all infinitives, and makes it clear that the word is no longer being used as a noun. Or compare the noun fight with the identical verb. In Old English the basic form of the noun was feoht, but there was no corresponding verb form feoht: instead, it was feohteð ‘he fights’, fihtest ‘you fight’, fuhton ‘we/you (pl)/they fought’, feaht ‘I/he/she fought’, or one of many other forms, depending on various grammatical properties such as subject (who is doing the fighting?), number (how many people are fighting?) and mood (is the fighting real, hypothetical, or an instruction?). As Old English evolved into its modern form, most of these inflectional suffixes were lost, encouraging a rise in the number of zero-derivations entering the language.
The laissez-faire attitude of English can be clearly recognised when comparing how languages deal with new words such as the recently-coined verb ‘to google’. Some languages, like Middle English stōnen, merely adapt the company’s name to express the grammatical categories which are important in the language (e.g. German du googlest, ‘you google’, ich habe gegoogelt ‘I googled’), while other languages add extra pieces of word to explicitly flag up the conversion, e.g. Greek γκουγκλίζω or γκουγκλάρω (pronounced ‘googlizo’/‘googlaro’), where the final syllable -o indicates that the subject of the verb is ‘I’, but the -iz- preceding it can’t be attributed any meaning beyond ‘I’m a verb!’.
In English, meanwhile, pretty much anything goes: in addition to verbs which have become nouns, we have numerous nouns becoming verbs (e.g. father, storm), adjectives becoming verbs (round, smooth), adjectives becoming nouns (intellectual), and liberal rules governing compounds, which let us treat nouns as if they were adjectives (stone wall). English has such a devil-may-care attitude to conversion that even whole phrases can become nouns or adjectives: basically, it’s a free-for-all.
This has been going on in English for a very long time, so why do examples like gifting make people feel ‘icky’? Partly it’s because we associate coinages like impact, action and workshop with corporate jargon – although some of these are actually of considerable age (impact was a verb before it became a noun, and it started out life even earlier as an adjective), their use boomed in the decades following the second world war, as management increasingly came to be seen as a scientific discipline. Another objection is that we already have words for verbs like to gift, namely give, which makes gift feel like an overelaborate solution to a non-problem, the linguistic equivalent of bic’s infamous ‘for her’ range of pens.
Nevertheless, zero-derivation can come in handy when a word has acquired a different or narrower meaning than the word it originally derived from. Gift originally referred to an action or instance of giving, in addition to the thing being given, but it now almost exclusively refers to something given for free in a spirit of goodwill. You can give someone a black eye, hepatitis, or the creeps, but it would be the height of irony to call these things gifts. Correspondingly, to gift has a more specific meaning than to give, and is much more concise than to give as a gift, just like texting someone is more concise than sending a text message to them, and friending someone is more concise than adding them as a friend on Facebook.
At some point a couple centuries back, somebody in North America got the idea to take the perfectly serviceable regular past tense form of dive — namely dived — and turn it into the irregular form dove (like drove). You can plot its course to ultimate victory in this graph derived from Google Books:
But they didn’t finish the job, because the past participle is still dived: someone who would say She dove into the water would still say She hasn’t dived into the water. So something of a Frankenstein’s monster, grafted together from different parts. I recently stumbled across another such monstrosity. The verb pet — for me at least — has an irregular past tense like let or set, where nothing changes: Yesterday she pet the cat and let it out. But its past participle is a ‘regular’ one with –ed: She hasn’t petted the cat or let it out. So both of them are kind of hybrids, with past tense forms transformed into irregular verbs but past participle forms left to be like regular verbs:
regular verb
hybrid verb
irregular verb
she thrives she frets
she dives she pets
she drives she lets
she thrived she fretted
she dove she pet
she drove she let
she has thrived she has fretted
she has dived she has petted
she has driven she has let
Which is bizarre: why take perfectly good regular verbs and change them into one-off oddities?
Wh- words like which, whom and why get a lot of knickers in a twist, as attested by this oatmeal comic on when to use who vs whom, or the age-old debate about the correct use of which vs that (on which see this blog post by Geoffrey Pullum). But in Old English the wh- words formed a complete and regular system which would have been easy to get the hang of. They were used strictly as interrogative pronouns – words that we use for asking questions like who ate all the pies? – rather than relative pronouns, which give extra information about an item in the sentence (Jane, who ate all the pies, is a prolific blogger) or narrow down the reference of a noun (women who eat pies are prolific bloggers). They developed their modern relative use in Middle English, via reinterpretation of indirect questions – in other words, sentences like she asked who ate all the pies, containing the question who ate all the pies?, served as the template for new sentences like she knew who ate all the pies, where who functions as a relative.
Originally, the new relative pronoun whom (in its Middle English form hwām) functioned as the dative case form of who, used when the person in question is the indirect object of a verb or after prepositions like for. For direct objects, the accusative form hwone was used instead. So to early Middle English ears, the man for whom I baked a pie would be fine, while the man whom I baked in a pie would be objectionable (on grammatical as well as ethical grounds). Because nouns also had distinct nominative, dative and accusative forms, the wh- words would have posed no special difficulty for speakers. But as English lost distinct case forms for nouns, the pronoun system was also simplified, and the originally dative forms started to replace accusative forms, just as who is now replacing whom. This created a two-way opposition between subject and non-subject which is best preserved in our system of personal pronouns: we say he/she/they baked a pie, but I baked him/her/them(in) a pie.
Thus hwone disappeared the way of hine, the old accusative form of he. Without the support of a fully-functioning case system in the nouns, other case forms of pronouns were reinterpreted. Genitive pronouns like my and his were transformed into possessive adjectives (his pie is equivalent to the pie of him, but you can no longer say things like I thought his to mean ‘I thought of him’). The wh- words also used to have an instrumental case form, hwȳ, meaning ‘by/through what?’, which became an autonomous word why.
Although him and them are still going strong, whom has been experiencing a steady decline. Defenders of ‘whom’ will tell you that the rule for deciding whether to use who or whom is exactly the same as that for he and him, but outside the most formal English, whom is now mainly confined to fixed phrases like ‘to whom it may concern’. For many speakers, though, it has swapped its syntactic function for a sociolinguistic one by becoming merely a ‘posh’ variant of who: in the words of James Harding, creator of the ‘Whom’ Appreciation Society, “those who abandon ‘whom’ too soon will regret it when they next find themselves in need of sounding like a butler.”
When I was asked if I could write a blog post, my first thought was “Well, I could do.” And I immediately did an internal double-take, as I had uttered something which, for me, should not be a possible English sentence. My North American English ear ought to reject this sort of orphaned “do” (“I could do”, “I will do”, “I should have done”, where North Americans would just leave it out), which struck me as some sort of diseased outgrowth when I first heard it in Britain some years ago. (For more information, see here.) After many years in the UK I believe I have retained my native pronunciation and native vocabulary — though sometimes just to be polite I stand “in a queue” rather than “on line” (as one does where I come from) — so why should my syntax change, of all things? It’s like I’ve kept up surface appearances (sound and words) but undergone some internal metamorphosis in my syntactic structures. Scary.
I’m certainly not alone in being a dialect contact situation, and watching my own language change in response to that. Is the way that it happened to me part of some general pattern? Or are other people affected differently, say, changing their pronunciation while jealously maintaining their syntax?
The death of the dual, or how to count sheep in Slovenian
One reason why translation is so difficult – and why computer translations are sometimes unreliable – is that languages are more than just different lists of names for the same universal inventory of concepts. There is rarely a perfect one-to-one equivalence between expressions in different languages: the French word mouton corresponds sometimes to English sheep, and at other times to the animal’s meat, where English uses a separate word lamb or mutton.
This was one of the great insights of Ferdinand de Saussure, arguably the father of modern linguistics. It applies not only in the domain of lexical semantics (word meaning), but also to the categories which languages organise their grammars around. In English, we systematically use a different form of nouns and verbs depending on whether we are referring to a single entity or multiple entities. The way we express this distinction varies: sometimes we make the plural by adding a suffix to the singular (as with hands, oxen), sometimes we change the vowel (foot/feet) and occasionally we don’t mark the distinction on a noun at all, as with sheep (despite the best efforts of this change.org petition to change the singular to ‘shoop’). Still, we can often tell whether someone is talking about one or more sheep by the form of the agreeing verb: compare ‘the sheep are chasing a ball’ to ‘the sheep is chasing a ball’.
Some languages make more fine-grained number distinctions. The English word sheep could be translated as ovca, ovci or ovce in Slovenian, depending on whether you’re talking about one, two, or three or more animals, respectively. Linguists call this extra category between singular and plural the dual. The difference between dual and plural doesn’t show up just in nouns, but also in adjectives and verbs which agree with nouns. So to translate the sentence ‘the beautiful sheep are chasing a ball’, you need to ascertain whether there are two or more sheep, not just to translate sheep, but also beautiful and chase.
According to some, having a dual number makes Slovenian especially suited for lovers (could this explain the Slovenian tourist board’s decision to title their latest campaign I feel sLOVEnia?). But putting such speculations aside, it’s hard to see what the point of a dual could be. We rarely need to specify whether we are talking about two or more than two entities, and on the rare occasions we do need to make this information explicit, we can easily do so by using the numeral two.
This might be part of the reason why many languages, including English, have lost the dual number. Both English and Slovenian ultimately inherited their dual from Proto-Indo-European, the ancestor of many of the languages of Europe and India. Proto-Indo-European made a distinction between dual and plural number in its nouns, adjectives, pronouns, and verbs, but most of the modern languages descended from it have abandoned this three-way system in favour of a simpler opposition between singular and plural. Today, the dual survives only in two Indo-European languages, Slovenian and Sorbian, both from the Slavic subfamily.
In English the loss of the dual was a slow process, taking place over thousands of years. By the time the predecessor of English had split off from the other Germanic languages, the plural had replaced the dual everywhere except the first and second-person pronouns we and you, and verbs which agreed with them. By the earliest written English texts, it had lost the dual forms of verbs altogether, but still retained distinct pronouns for ‘we two’ and ‘you two’. By the 15th century, these were replaced by the plural forms, bringing the dual’s final demise.
Grammatical categories do not always disappear without a trace – in some languages the dual has left clues of its earlier existence, even though no functional distinction between dual and plural remains. Like English, German lost its dual, but in some Southern German dialects the dual pronoun enk (cognate with Old English inc, ‘to you two’) has survived instead of the old plural form. In modern dialects of Arabic, plural forms of nouns have generally replaced duals, except in a few words mostly referring to things that usually exist in pairs, like idēn ‘hands’, where the old dual form has survived as the new plural instead. Other languages show vestiges of the dual only in certain syntactic environments. For example, Scottish Gaelic has preserved old dual forms of certain nouns only after the numeral ‘two’: compare aon chas ‘one foot’, dà chois ‘two feet’, trì casan ‘three feet’, casan ‘feet’.
Although duals seem to be on the way out in Indo-European languages, it isn’t hard to find healthy examples in other language families (despite what the Slovenian tourist board might say). Some languages have even more complicated number systems: Larike, one of the languages spoken in Indonesia, has a trial in addition to a dual, which is used for talking about exactly three items. And Lihir, one of the many languages of Papua New Guinea, has a paucal number in addition to both dual and trial, which refers to more than three but not many items. This system of five number categories (singular/dual/trial/paucal/plural) is one of the largest so far discovered. Meanwhile, on the other end of the spectrum are languages which don’t make any number distinction in nouns, like English sheep.