Collocations of ‘cock’: What corpus linguistics tells us about porn writing

This is a guest post by Orin Hargraves, an independent lexicographer, language researcher, and past president of the Dictionary Society of North America. Orin is the author of several language reference books, including It’s Been Said Before: A Guide to the Use and Abuse of Clichés (Oxford) and Slang Rules!: A Practical Guide for English Learners (Merriam-Webster).

*

A few years ago I wrote about how collocations in fiction skew the statistics of collocations in a corpus because of their extremely frequent use; Ben Zimmer expanded on the idea in a later New York Times piece. In summary, the point is that a number of collocations would not be statistically significant were it not for their appearance in fiction. This is because writers of fiction—particularly writers of the amateur, unedited fiction that appears online—tend to reuse the same tropes and phrases so much that these effectively become clichés, formulaic ways of expressing the same (rather tired) ideas and events.

All of that came to light when I was working with the Oxford English Corpus, a well balanced and carefully curated corpus that, at the time, had about two billion words of English. These days I’m working with the enTenTen13 corpus, a web-crawled corpus of nearly 20 billion words, owned and made available by Sketch Engine. Sketch Engine’s web-crawler roves the Internet indiscriminately, pulling text from wherever it can be found. Like some grandmother aghast in Greenville, the web-crawler regularly comes upon sites with pornographic content. The difference between the grandmother and the web-crawler is that while she may avert her gaze in shock and dismay, the web-crawler grabs the text, parses and tags it, and adds it to the corpus. The result is that enTenTen13 houses a steaming, pulsating trove of pornographic writing.

Continue reading

Profanity in American Sign Language

Sign languages are as expressive and systematic as spoken languages, and that includes taboo words. As Benjamin Bergen writes in What the F, ‘Signers use rules of grammar, some of them specific to profanity, just like speakers of spoken languages.’ There’s also great variation in how a given idea may be conveyed – not just between sign languages or their dialects but within them.

If Cory O’Brien’s terrific post on the middle finger in American Sign Language made you want to increase your rude repertoire in ASL, look no further. The video below features a host of signers demonstrating some favourite insults and profanities. It also shows how much fun swearing can be.

Continue reading

Mooch mouth: Scaramucci takes public profanity to a new level

Last October, when the Trump/Pussygate story broke, I said it would “go down in history as a watershed moment in public profanity.” Now, six months into the Trump administration, we’ve come to another banner day for profanity, as the New Yorker‘s Ryan Lizza reports on a remarkable conversation with Anthony Scaramucci, aka the Mooch, Trump’s incoming communications director. Scaramucci unloaded on White House chief of staff Reince Priebus, with a side order of profanity devoted to Trump’s chief strategist Steve Bannon. And news outlets all over the world are reporting on it.

Continue reading

Taking a turn in “cock” alley

It’s the Year of the Cock. No, no, not that Year of the Cock, when TIME named Donald Trump its 2016 Person of the Year. Today marks Lunar New Year, and for many of its Chinese celebrants, 2017 is the Year of the Rooster – or, if we’re not so prudish, Cock. But what’s all this cockeyed rooster/cock cockamamie about, anyway? 

cock.jpg
Cock-a-diddle-do? Image courtesy of pixabay.com.

Continue reading

Some observations on the phonaesthetics of tits, cunts, cocks, and spunk

Following on my initial observations on the phonology of cusswords, and connecting to Iva’s recent post on cooters and hooters, I’d like to spend some time looking at some phonaesthetic clusters in words relating to private parts and emissions. Continue reading