Peering into the Internet Abyss: Using Big Data Audience Analysis to Understand Online Comments

John R. Gallagher ; Yinyin Chen University of Illinois Urbana-Champaign ; Kyle Wagner University of Minnesota System ; Xuan Wang University of Illinois Urbana-Champaign ; Jingyi Zeng University of Illinois Urbana-Champaign ; Alyssa Lingyi Kong Ernst & Young (Israel)

Abstract

This article offers a methodology for conducting large-scale audience analysis called “big data audience analysis” (BDAA). BDAA uses distant reading and thin description to examine a large corpus of text data from online audiences. In this article, that corpus is approximately 450,000 online reader comments. We analyze this corpus through sentiment analysis, statistical analysis, and geolocation to identify trends and patterns in large datasets. BDAA can better prepare TPC researchers for large-scale audience studies.

Journal
Technical Communication Quarterly
Published
2020-04-02
DOI
10.1080/10572252.2019.1634766
CompPile
Search in CompPile ↗
Open Access
Closed
Export

Citation Context

Cited by in this index (13)

  1. Journal of Business and Technical Communication
  2. Technical Communication Quarterly
  3. Computers and Composition
  4. Rhetoric Society Quarterly
  5. Technical Communication Quarterly
Show all 13 →
  1. Computers and Composition
  2. Technical Communication Quarterly
  3. Technical Communication Quarterly
  4. Technical Communication Quarterly
  5. Journal of Business and Technical Communication
  6. Computers and Composition
  7. Communication Design Quarterly
  8. Rhetoric & Public Affairs

References (71) · 25 in this index

  1. Technical Communication Quarterly
  2. Thinking about social justice: Interrogating the international in international technical…
    Connexions
  3. Journal of Technical Writing and Communication
  4. 10.1111/jcc4.12009
  5. A theory of persuasive computer algorithms for rhetorical code studies
    enculturation
Show all 71 →
  1. Writing through big data: New challenges and possibilities for data-driven arguments
    Composition Forum
  2. 10.3366/ijhac.2016.0162
  3. Computers and Composition
  4. Enthymeme as rhetorical algorithm
    Present Tense: A Journal of Rhetoric in Society
  5. Computers and Composition
  6. 10.2307/j.ctv65swg4
  7. 10.1177/2053951715622512
  8. 10.1111/jcom.2014.64.issue-4
  9. 10.7330/9781607328063
  10. Journal of Technical Writing and Communication
  11. Technical Communication Quarterly
  12. College Composition and Communication
  13. Etim B. (2017 June 13). The times sharply increases articles open for comments using Google’s technology. The…
  14. Automating inequality: How high-tech tools profile, police, and punish the poor
  15. Written Communication
  16. Computers and Composition
  17. Computers and Composition
  18. Technical Communication Quarterly
  19. Gardiner B. Mansfield M. Anderson I. Holder J. Louter D. & Ulmanu M. (2016 April 12). The dark side of Guardi…
  20. Custodians of the internet: Platforms, content moderation, and the hidden decisions that …
  21. The presentation of self in everyday life
  22. Converging fields, expanding outcomes: Technical communication, translation, and design a…
    Technical Communication
  23. Technical Communication Quarterly
  24. Revising the technical communication service course
    Programmatic Perspectives
  25. Journal of Business and Technical Communication
  26. 10.1007/978-3-319-51268-6_6
  27. 10.7208/chicago/9780226321370.001.0001
  28. Rhetorical allegorithms in bitcoin
    enculturation
  29. Jensen E. (2016). NPR website to get rid of comments. National Public Radio. Retrieved from https://www.npr.o…
  30. Journal of Technical Writing and Communication
  31. Technical Communication Quarterly
  32. 10.1177/2056305118765741
  33. 10.22329/il.v36i2.4672
  34. 10.2307/j.ctt20q22wm
  35. 10.21623/1
  36. Use what you choose: Applying computational methods to genre studies in technical communication
    Proceedings of the 34th ACM International Conference on the Design of Communication
  37. 10.1080/10646175.2012.695643
  38. Communication Design Quarterly
  39. Technical Communication Quarterly
  40. Journal of Technical Writing and Communication
  41. Graphs, maps, trees: Abstract models for literary history
  42. Distant reading
  43. Moss L. (2017 June 19). How The New York Times moderates 12 000 comments a day. Digiday. Retrieved from https…
  44. 10.1177/0361684314565777
  45. 10.1111/jcom.2017.67.issue-4
  46. Network sense: Methods for visualizing a discipline
  47. 10.18574/nyu/9781479833641.001.0001
  48. Weapons of math destruction: How big data increases inequality and threatens democracy
  49. Journal of Writing Research
  50. Hedge-O-Matic
    enculturation
  51. You can read the comments again: The faciloscope app and automated rhetorical analysis
    DHCommons Journal
  52. Pang B. & Lee L. (2005). Seeing stars: Exploiting class relationships for sentiment categorization with respe…
  53. 10.4159/harvard.9780674736061
  54. Communication Design Quarterly
  55. 10.1080/1461670X.2016.1161497
  56. Technical Communication Quarterly
  57. The intersectional internet: Race, sex, class and culture online
  58. 10.1080/1369118X.2014.940365
  59. Communication Design Quarterly
  60. Socher R. Perelygin A. Wu J. Y. Chuang J. Manning C. D. Ng A. Y. & Potts C. (2013). Recursive deep models for…
  61. Advances in the History of Rhetoric
  62. Southern L. (2017 March 16). The financial times: Readers who comment are 7 times more engaged. Digiday. Retr…
  63. Journal of Writing Analytics
  64. 10.1111/comt.1998.8.issue-1
  65. A genealogy of distant reading
    Digital Humanities Quarterly
  66. Journal of Technical Writing and Communication