Collocations of ‘cock’: What corpus linguistics tells us about porn writing

This is a guest post by Orin Hargraves, an independent lexicographer, language researcher, and past president of the Dictionary Society of North America. Orin is the author of several language reference books, including It’s Been Said Before: A Guide to the Use and Abuse of Clichés (Oxford) and Slang Rules!: A Practical Guide for English Learners (Merriam-Webster).

*

A few years ago I wrote about how collocations in fiction skew the statistics of collocations in a corpus because of their extremely frequent use; Ben Zimmer expanded on the idea in a later New York Times piece. In summary, the point is that a number of collocations would not be statistically significant were it not for their appearance in fiction. This is because writers of fiction—particularly writers of the amateur, unedited fiction that appears online—tend to reuse the same tropes and phrases so much that these effectively become clichés, formulaic ways of expressing the same (rather tired) ideas and events.

All of that came to light when I was working with the Oxford English Corpus, a well balanced and carefully curated corpus that, at the time, had about two billion words of English. These days I’m working with the enTenTen13 corpus, a web-crawled corpus of nearly 20 billion words, owned and made available by Sketch Engine. Sketch Engine’s web-crawler roves the Internet indiscriminately, pulling text from wherever it can be found. Like some grandmother aghast in Greenville, the web-crawler regularly comes upon sites with pornographic content. The difference between the grandmother and the web-crawler is that while she may avert her gaze in shock and dismay, the web-crawler grabs the text, parses and tags it, and adds it to the corpus. The result is that enTenTen13 houses a steaming, pulsating trove of pornographic writing.

Continue reading

The further adventures of “AF”

In the two years since I first wrote about seeing “AF” — the abbreviation for the intensifier “as fuck” — in various interesting places, I’ve kept track of its spread from the fringes to the mainstream, or at least a major tributary of the mainstream, of popular culture. In April of this year, when I noted its use in New York subway advertisements by the food-delivery service FoodKick, I speculated that this was the first time AF had appeared in a commercial context. Well, I was wrong. It wasn’t the first. And it certainly hasn’t been the last.

“I’m feeling myself because my boobs are swoll AF”

Continue reading

The fresh prints of ‘bell-end’

When dance-lord Michael Flatley said he would perform at Donald Trump’s inauguration ball in January, someone cheekily redirected colossalbellend.com to Flatley’s website. (It now points to Trump’s Twitter page.) Reporting on the story, the Guardian noted: ‘Bellend is a British insult.’

Helpful, but short on detail. What kind of insult is bell-end? What does it mean, and how is it used? Where did it come from, and when, and why? And what’s bell end brie? If you gotta have more bell-end, you’re in the right place: Let Strong Language ring your bell.

Image macro of Christopher Walken in Saturday Night Live saying, "I gotta have more 'bell-end', baby!" (instead of "cowbell")

Continue reading

‘Taint your balls, ‘taint your ass, but ’tis in the OED

This week, the Oxford English Dictionary (OED) is out with its latest update. Among its crop of over 600 new words, phrases, and senses, some sweary entries flashed us the come-to-bed eyes on Strong Language—and we don’t mean continental grip, dead rubber, or additions to the many meaning of come, as suggestive as they may sound. From mild abuses to sexual euphemisms to derogatory slang, we’ve got the highlights here.

Continue reading

Sweary links #23

Strong Language contributor Jonathon Green (@misterslang), the author of Green’s Dictionary of Slang, has a new project of special interest to SL readers: Slang Family Trees. “The aim,” writes Jonathon, “is to look at some of slang’s primary themes and show the way the lexis assesses given topics on a semantic basis.” The trees are constructed with mind-mapping software and appear as .pdf files. To get started, see vagina, penisand drunk.

 

*
To drive awareness on International Women’s Day about how women are paid on average 25 percent less than men, J. Walter Thompson London created an outdoor campaign that uses censorship to show how offensive the world can seem with 25 percent missing. (Via Little Black Book)
 Find your purse?
*

Continue reading