The quarterly update of the Oxford English Dictionary is always an occasion for rejoicing among hardcore lexicography buffs. The latest update is an even bigger bumper crop than usual, with a whopping 1,400 new words, senses, and subentries online. If you skim through the public list, you’ll see that a very large number have to do with the words arse, ass, bum, and butt, including related phrases and compounds. You might even say there’s a metric buttload of such additions. (Buttload, by the way, was already added to the OED back in June 2009, citing examples going back to 1988 — none of the metric variety, unfortunately.)
English-usage authority Bryan A. Garner shook Language Twitter by suggesting that only philistines pronounced pubes as a single syllable.
More than a few of us responded with tweets of bewilderment and skepticism, likely confusing everyone around us as we muttered “PYOO-beez. PYOOBZ. PYOO-beez??” at our screens. Continue reading
Soon after Samuel Johnson’s Dictionary was published in 1755, so the story goes, he was approached by a pair of prudish readers who commended him for omitting ‘improper’ words. Johnson, according to one account, replied to the women: ‘What! my dears! then you have been looking for them?’
Today you can find improper words in any good dictionary – but only the main set. Fuck is there, but not fuckish, fuckfaced, fuck-nutty, fuck my old boots!, or fuck the dog and sell the pups. You’ll see shit in the usual sources, but good luck finding shit-breath, shit factory, shit-squirting, shit out, or shit on the dining room table.* Regular dictionaries just don’t cover the remarkable range of taboo vocabulary, nor should they.
For this we turn to specialist slang dictionaries. These do not shy from obscenity but embrace it in all its mutable monstrousness (I say this as someone who loves monsters, and mutants). And the best slang dictionary in existence – it defines, expertly, all the phrases above and thousands like them – is Green’s Dictionary of Slang (GDoS). Last year it went online. If that’s news to you, prepare for a treat.
This is a guest post by Orin Hargraves, an independent lexicographer, language researcher, and past president of the Dictionary Society of North America. Orin is the author of several language reference books, including It’s Been Said Before: A Guide to the Use and Abuse of Clichés (Oxford) and Slang Rules!: A Practical Guide for English Learners (Merriam-Webster).
A few years ago I wrote about how collocations in fiction skew the statistics of collocations in a corpus because of their extremely frequent use; Ben Zimmer expanded on the idea in a later New York Times piece. In summary, the point is that a number of collocations would not be statistically significant were it not for their appearance in fiction. This is because writers of fiction—particularly writers of the amateur, unedited fiction that appears online—tend to reuse the same tropes and phrases so much that these effectively become clichés, formulaic ways of expressing the same (rather tired) ideas and events.
All of that came to light when I was working with the Oxford English Corpus, a well balanced and carefully curated corpus that, at the time, had about two billion words of English. These days I’m working with the enTenTen13 corpus, a web-crawled corpus of nearly 20 billion words, owned and made available by Sketch Engine. Sketch Engine’s web-crawler roves the Internet indiscriminately, pulling text from wherever it can be found. Like some grandmother aghast in Greenville, the web-crawler regularly comes upon sites with pornographic content. The difference between the grandmother and the web-crawler is that while she may avert her gaze in shock and dismay, the web-crawler grabs the text, parses and tags it, and adds it to the corpus. The result is that enTenTen13 houses a steaming, pulsating trove of pornographic writing.
This week, the Oxford English Dictionary (OED) is out with its latest update. Among its crop of over 600 new words, phrases, and senses, some sweary entries flashed us the come-to-bed eyes on Strong Language—and we don’t mean continental grip, dead rubber, or additions to the many meaning of come, as suggestive as they may sound. From mild abuses to sexual euphemisms to derogatory slang, we’ve got the highlights here.