Quantitative Analysis of Culture Using Millions of Digitized Books
Author(s) -
JeanBaptiste Michel,
Yuan Shen,
Aviva Presser Aiden,
Adrian Veres,
Matthew K. Gray,
Joseph P. Pickett,
Dale Hoiberg,
Dan Clancy,
Peter Norvig,
Jon Orwant,
Steven Pinker,
Martin A. Nowak,
E Aiden
Publication year - 2010
Publication title -
science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 12.556
H-Index - 1186
eISSN - 1095-9203
pISSN - 0036-8075
DOI - 10.1126/science.1199644
Subject(s) - quantitative analysis (chemistry) , linguistics , chemistry , philosophy , chromatography
We constructed a corpus of digitized texts containing about 4% of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of 'culturomics,' focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. Culturomics extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom