Hand Collecting and Coding Versus Data-Driven Methods in Technical and Professional Communication Research

Claire Lauer Arizona State University ; Eva Brumberger ; Aaron Beveridge University of North Carolina at Greensboro

Abstract

Background: Qualitative technical communication research often produces datasets that are too large to manage effectively with hand-coded approaches. Text-mining methods, used carefully, may uncover patterns and provide results for larger datasets that are more easily reproduced and scaled. Research questions: 1. To what degree can hand collection results be replicated by automated data collection? 2. To what degree can hand-coded results be replicated by machine coding? 3. What are the affordances and limitations of each method? Literature review:We introduce the stages of data collection and analysis that researchers typically discuss in the literature, and show how researchers in technical communication and other fields have discussed the affordances and limitations of hand collection and coding versus automated methods throughout each stage. Research methodology: We utilize an existing dataset that was hand-collected and hand-coded. We discuss the collection and coding processes, and demonstrate how they might be replicated with web scraping and machine coding. Results/discussion: We found that web scraping demonstrated an obvious advantage of automated data collection: speed. Machine coding was able to provide comparable outputs to hand coding for certain types of data; for more nuanced and verbally complex data, machine coding was less useful and less reliable. Conclusions: Our findings highlight the importance of considering the context of a particular project when weighing the affordances and limitations of hand collecting and coding over automated approaches. Ultimately, a mixed-methods approach that relies on a combination of hand coding and automated coding should prove to be the most productive for current and future kinds of technical communication work, in which close attention to the nuances of language is critical, but in which processing large amounts of data would yield significant benefits as well.

Journal
IEEE Transactions on Professional Communication
Published
2018-12-01
DOI
10.1109/tpc.2018.2870632
CompPile
Open Access
Closed
Topics
Export

Citation Context

Cited by in this index (12)

  1. IEEE Transactions on Professional Communication
  2. IEEE Transactions on Professional Communication
  3. Computers and Composition
  4. Communication Design Quarterly
  5. Journal of Business and Technical Communication
Show all 12 →
  1. Business and Professional Communication Quarterly
  2. Technical Communication Quarterly
  3. Journal of Business and Technical Communication
  4. Technical Communication Quarterly
  5. Computers and Composition
  6. IEEE Transactions on Professional Communication
  7. IEEE Transactions on Professional Communication

References (29) · 6 in this index

  1. 10.4135/9781848607941.n13
  2. 10.4054/DemRes.2017.37.42
  3. Journal of Business and Technical Communication
  4. 10.1080/13645579.2011.625764
  5. textstem: Tools for Stemming and Lemmatizing Text Version 0.1.3
Show all 29 →
  1. Attention ecology: Trend circulation and the virality threshold
    Digital Humanities
  2. 10.37514/WRI-B.2017.0124
    Network Sense Methods for Visualizing a Discipline  
  3. Journal of Writing Research
  4. Topic modeling and digital humanities
    Digital Humanities
  5. Text Mining with R A Tidy Approach
  6. topicmodels: An R package for fitting topic models
    J Stat Softw  
  7. 10.2196/jmir.4612
  8. Content analysis: What texts talk about
    What Writing Does and How it Does it An Introduction to Analyzing Texts and Textual Practices
  9. Journal of Business and Technical Communication
  10. Journal of Business and Technical Communication
  11. Communication Design Quarterly
  12. 10.1007/s13142-014-0256-1
  13. Looking in the dustbin: Data janitorial work, statistical reasoning, and information rhetorics
    Comput Compos Online
  14. 10.4304/jetwi.1.1.60-76
  15. Technology and technical and professional communication through the lens of the MLA Job …
    Program Perspect
  16. 10.1080/0013188032000133548
  17. The evolution of technical communication: An analysis of industry job postings
    Tech Commun
  18. 10.1145/2133806.2133826
  19. Communication Design Quarterly
  20. Topic modeling: A basic introduction
    Digital Humanities
  21. Of horsemen and layered literacies: Assessment instruments for aligning technical and pr…
    Program Perspect
  22. 10.2196/jmir.9702
  23. Do curricula correspond to managerial expectations? Core competencies for technical comm…
    Tech Commun
  24. Analysis of the skills called for by technical communication employers in recruitment postings
    Tech Commun