What Are the Most Commonly Used Movie Clichés? A Statistical Analysis
Exploring the cliché phrases that dominate movies.
Welcome to the 188 new subscribers who’ve signed up for Stat Significant since last week’s article! I hope you enjoy this week’s deep dive into the fascinating world of movie clichés.
If you want to chat about data and statistics, need help with analytics, or are looking to produce data-driven content, shoot me an email at daniel@statsignificant.com or connect with me on LinkedIn.
Introducing Stephen Follows’ Decoding the World Through Data: A Data-Driven Newsletter for Movie Lovers
There aren’t many writers exploring pop culture through the lens of data. A few months ago, I discovered
’ newsletter Decoding the World Through Data, and I was immediately impressed. He’s written some awesome data essays about the film industry, including:For my post this week, I am featuring one of Stephen’s pieces on the most commonly used movie clichés. I really enjoyed this essay, and I hope you do too. If you like what you read, check out his newsletter.
Intro: Clichés in the Pantheon of Movies
Last week, a group of friends and I watched Last Action Hero (1993). The script started its life as a satire of dumb action moves (its original title was "Extremely Violent") but was so heavily re-written during development and production that (a) its original writers lost their full writing credit, and (b) it became the exact thing it was looking to send up (i.e. a forgettable dumb action movie).
During the many dull moments we had to chat while the movie draaagggggggged on, the conversation turned to dialogue clichés. Last Action Hero included a number of classics, including "This is not happening", "I'm just doing my job", and "Did you hear something?".
This movie gets a pass on clichés as its intent is to be a semi-parody, so some of those uses could be aimed at being self-aware and (intended at least) for comedic effect.
But what of the whole pantheon of movies? How many cite familour clichés? And which are the ones coming in and out of fashion?
I turned to my database of subtitle files to find out. I generated my long list of 138 dialogue clichés after consulting with screenwriters, reading blogs, and talking it through with Jack Malvern from The Times. With my list in hand, I tracked their appearance in over 72,000 movies released since 1940.
Jack published his own piece in The Times (which you can read here). and my headline findings are below. As you can imagine, this project generated far more data than one blog article can encompass.
The Most Commonly Used Movie Clichés
Of the movie dialogue clichés I studied, "What the hell" was most frequently used, appearing in just over a third of all movies studied. A close second was "What are you doing here?" followed by "Honey, is that you?".
It's interesting to note that the top four clichés are all questions, albeit "What the hell" and "What the fuck" might be rhetorical in many cases.
The chart above compresses sixty years of movie history into a single graphic, so I thought it might be interesting to look at changing usage over time.
The Fastest Growing Movie Clichés
The ones growing in prominence include "Why are you doing this to me?", "What the fuck!" and "This is not happening" (one would imagine it's being pronounced "This is NOT happening", but sadly, we can't track intonation).
I was particularly fascinated by the rapid rise of "How hard can it be?!" as it was almost completely absent until the 1970s and then shot up with each decade.
The Fastest Declining Movie Clichés
At the other end of the spectrum is the rapid decline and almost extinction of "Follow that [vehicle]". This was mostly "Follow that car" but I included vans, taxis, trucks, bikes and the like. My guess is that this dialogue became such a meme that few films felt they could use it without breaking the fourth wall.
Also in sharp decline were "Is that clear?", "We meet again" and "It's no use".
Digging through the subtitles files, I can do so much more with this dataset, both on clichés specifically and more broadly. For example, I'm keen to compare the trends seen above with the Google corpus of one million books to see to what degree the effects are movie-specific and which reflect wider changes in the English language.
Notes: Constructing the Analysis
For this analysis, I looked at the English-language subtitle files for 72,405 fiction feature films (released between 1911 and 2022) kindly provided by the lovely folk at OpenSubtitles.com. Metadata came from OMDb, IMDb, The Numbers, Wikipedia, and my own analysis.
I built up a master list of clichés from my own experience, from conversations online, and with the help of Jack Malvern at The Times. This gave me 138 clichés to look for.
In order to ensure I was capturing every instance, I split each cliché into its multiple variants. For example, "You ain't seen nothing yet" included the following variants:
“You ain't seen nothing yet”
“You havent seen nothing yet”
“You have not seen anything yet”
“You aint seen anything yet”
I then counted a movie if it used the cliché even once. When I studied the prevalence of swearing in movies, I looked at both density and frequency (i.e. how many times it appears, not just the binary of at least one single use), but I couldn't think of a clear way of showing that on this topic. The swearing research only looked at three bad words, not 138.
Want to Read More of Stephen’s Writing?
Subscribe to his newsletter for a weekly dose of data-driven analyses about the film industry.
I had a former business part that whenever there was a bug in his code would say “what the hell”, I might forward this to him.
You have me wondering if it's possible to quantify this recent clip that went viral of Reese Witherspoon decrying the lazy trope of women asking "what do we do now?" https://x.com/womenpostingws/status/1864316564234756310