Document Type

Thesis or dissertation

Date of this Version



Elaine Zanutto


This paper set out to analyze trends in data usage within mainstream American news. Using a sample of over 60,000 New York Times articles from 1981 to 2021, it measured data usage, article subjectivity, and article polarity. The purpose of the analysis was to test whether the prevailing narrative, that the past 20 years are the ‘data age’ and that data usage is greater than ever, holds within the context of print journalism. Public confidence in mainstream newspapers is currently low and readership is decreasing, so the value of journalists as collectors, interpreters, and presenters of data is of increasing importance. Contrary to the original hypotheses, the key finding is that data usage has not increased over the past 40 years, either in absolute terms or as a ratio to word count. No substantial trends in either data word usage or raw number usage could be detected. Further, the presence of data within New York Times articles was not found to have any strong correlation with changes in article polarity or subjectivity. These results raise critical questions about the narrative of the ‘data age’ and about why increased data availability has not resulted in increased data utilization in the context of the New York Times.


data, news, newspaper, data usage, data age, sentimentality, text analysis, polarity, journalism



Date Posted: 15 June 2021

