Author: | Dominique Haughton, Mark-David McLaughlin, Kevin Mentzer, Changan Zhang | ISBN: | 9783319094267 |
Publisher: | Springer International Publishing | Publication: | October 5, 2015 |
Imprint: | Springer | Language: | English |
Movies will never be the same after you learn how to analyze movie data using key data mining, text mining and social network analytics concepts. These techniques can then be applied in many other contexts. Using movies as the application, the book opens a lively discussion of current developments in big data from a data science perspective. It is geared to applied researchers and practitioners and is meant to be practical. The reader takes a hands-on approach, running text mining and social network analyses with the software packages covered in the book: R, SAS, KNIME, Pajek and Gephi. The book also covers the nitty-gritty of building the datasets needed for the various analyses, including how to extract suitable Twitter data and how to create a co-starring network from the IMDb database under memory constraints. The authors also guide the reader through an analysis of movie attendance data using a realistic dataset from France.