Collocations of ‘cock’: What corpus linguistics tells us about porn writing

This is a guest post by Orin Hargraves, an independent lexicographer, language researcher, and past president of the Dictionary Society of North America. Orin is the author of several language reference books, including It’s Been Said Before: A Guide to the Use and Abuse of Clichés (Oxford) and Slang Rules!: A Practical Guide for English Learners (Merriam-Webster).

*

A few years ago I wrote about how collocations in fiction skew the statistics of collocations in a corpus because of their extremely frequent use; Ben Zimmer expanded on the idea in a later New York Times piece. In summary, the point is that a number of collocations would not be statistically significant were it not for their appearance in fiction. This is because writers of fiction—particularly writers of the amateur, unedited fiction that appears online—tend to reuse the same tropes and phrases so much that these effectively become clichés, formulaic ways of expressing the same (rather tired) ideas and events.

All of that came to light when I was working with the Oxford English Corpus, a well balanced and carefully curated corpus that, at the time, had about two billion words of English. These days I’m working with the enTenTen13 corpus, a web-crawled corpus of nearly 20 billion words, owned and made available by Sketch Engine. Sketch Engine’s web-crawler roves the Internet indiscriminately, pulling text from wherever it can be found. Like some grandmother aghast in Greenville, the web-crawler regularly comes upon sites with pornographic content. The difference between the grandmother and the web-crawler is that while she may avert her gaze in shock and dismay, the web-crawler grabs the text, parses and tags it, and adds it to the corpus. The result is that enTenTen13 houses a steaming, pulsating trove of pornographic writing.

Continue reading

‘Taint your balls, ‘taint your ass, but ’tis in the OED

This week, the Oxford English Dictionary (OED) is out with its latest update. Among its crop of over 600 new words, phrases, and senses, some sweary entries flashed us the come-to-bed eyes on Strong Language—and we don’t mean continental grip, dead rubber, or additions to the many meaning of come, as suggestive as they may sound. From mild abuses to sexual euphemisms to derogatory slang, we’ve got the highlights here.

Continue reading

The first fuckwit

The recent launch of the second edition of the Australian National Dictionary (AND) gave me a chance to indulge in my long-time hobby of looking up the swear words. I’m looking forward to sharing some of my favourite home-grown colourful language in a future post, but I want to start with an entry that gives me the kind of pride that others expended on the Olympic Games last month.

The entry for fuckwit (p. 647) includes the note:

Used elsewhere but recorded earliest in Australia

That’s right. Australia is the home of the fuckwit. The earliest citation in the AND and the Oxford English Dictionary is from Alex Buzo’s 1970 play The Front Room Boys. The earliest non-Australian citation in the OED is from a 1992 article in Making Music magazine from America.

The second edition of the AND expands the citations for fuckwit, makes a clearer distinction between nominal and adjectival use, and (most importantly) adds an earlier citation for fuckwitted. Here are the entries, along with the earliest few citations:

Continue reading

How “fuck” went mainstream

We’re delighted to share an extract from the new book From Skedaddle to Selfie: Words of the Generations by Allan Metcalf. It will be published next month by Oxford University Press, which describes it as “a lively look at the words that have come to define different generations in history” – including fuck.

Even major dictionaries declined to include fuck until quite recently, yet it now appears without fuss in an impressive range of cultural domains. So how did fuck make the leap? In the text below, Metcalf traces the word’s emergence out of largely disreputable use into ever more mainstream contexts.

Continue reading

Frogshit

Paul Dickson wrote a great book called Family Words. He collects terms used only by a very small circle of people, like a family or group of friends. My favorite family word is, to put it mildly, total frogshit.

Frogshit is a synonym for bullshit used mainly by my friend John and a few other friends. Sometimes he actually sends me a picture of a frog pooping, but that’s a matter for the courts.

Frogshit is a natural coinage born in the rich fertilizer of slang. When you’re talking about bullshit, you can’t go wrong with any kind of shit. Of course, horseshit is also a tried and true term for nonsense or drivel. Before batshit took on an insane meaning, it was also a word for bullshit. This Oxford English Dictionary use shows that meaning was still around in the eighties:

1985   D. Koontz Door to December (1994) iii. xxii. 274   Why would men of science..associate with a purveyor of bat shit and bunkum?

The OED also includes several examples of dog shit as meaning “Rubbish, nonsense”:

Continue reading