+

Cookies on the Business Insider India website

Business Insider India has updated its Privacy and Cookie policy. We use cookies to ensure that we give you the better experience on our website. If you continue without changing your settings, we\'ll assume that you are happy to receive all cookies on the Business Insider India website. However, you can change your cookie setting at any time by clicking on our Cookie Policy at any time. You can also see our Privacy Policy.

Close
HomeQuizzoneWhatsappShare Flash Reads
 

These Charts Reveal How Best-Selling Novels Have Changed Over Time

Jun 2, 2014, 21:26 IST

If you've ever wondered how best-selling authors went from Elinor Dashwood to Bella Swan, math - believe it or not - can help.

Advertisement

Tyler Vigen, the statistician who brought us hilarious spurious correlations, did some work with words, too. He analyzed several of the most popular novels from the early 1800s to today, focusing on elements like sentence length and punctuation.

"I chose these books because they were seven of the all-time best-selling novels (which sold more than 50 million copies) that were written in English ... that were spaced in time periods," Vigen told Business Insider via email.

Now, these books might not represent their respective time periods, but the data provides interesting insight nonetheless. Some of the charts exhibit clear trends, while others seem more random.

Tyler Vigen

Advertisement

Sentence length appears to be declining over time. It's also important to note the number of words and sentences in each book, shown in the chart below.

Tyler Vigen

"Sense and Sensibility" and "Twilight" contain about the same number of words - 119,000. But Jane Austen wrote roughly half as many sentences as Stephanie Meyer - 5,179 compared to 12,386, respectively.

While this trend could relate to later authors, in comparison, writing for young audiences, it's fair to say that Victorian England possessed a greater appreciation for paragraph-long sentences than we do today.

Tyler Vigen

Here, "unique" means "different from every previously encountered word in the book," according to Vigen.

Advertisement

"Twilight" included greater word variation than many other novels, including Austen's work. But the most interesting relationship appears when you consider words used per unique word, shown in the chart below.

Tyler Vigen

The lower the number in the last column, the more extensive the author's vocabulary. In this case, Mark Twain takes the top spot. For every nine words he wrote, one of them had never appeared in the "The Adventures of Tom Sawyer Before" before.

Tyler Vigen

Adjective frequency appears to be increasing over time.

Advertisement

Tyler Vigen

The decline of the semicolon in writing is a clear trend.

"Sense and Sensibility" included 1,572 semicolons - one every 3.3 sentences on average. In a book of about the same number of words, Meyer used only 224 (one every 55.3 sentences).

Commas and other punctuation marks present less of a trend.

Tyler Vigen

Tyler Vigen

Advertisement

Tyler Vigen

You are subscribed to notifications!
Looks like you've blocked notifications!
Next Article