From Assimilation to Autonomy: Rethinking Data Sovereignty in the Age of Large Language Models

Anirban Ray (University of North Carolina Wilmington); Jeremy Tirrell (University of North Carolina Wilmington); Addie Sayers (University of North Carolina Wilmington)
Journal: Technical Communication Quarterly
Published: 2025-07-03
DOI: 10.1080/10572252.2025.2490503
Open Access: Closed

Citation Context

References (68) · 3 in this index

  1. Axon S. (2024 July 26). X is training Grok AI on your data-here’s how to stop it. Ars Technica. https://arste…
  2. Belanger A. (2023 November 3). Artists May “poison” AI models before copyright office can issue guidance. Ars…
  3. Truth-telling: Critical inquiries on LLMs and the corpus texts that train them
    Composition Studies
  4. 10.1145/3583087
  5. 10.1177/1461444819865984
  6. David E. (2024 May 16). Reddit’s deal with OpenAI will plug its posts into “ChatGPT and new products”. The Ve…
  7. 10.1057/s41599-023-02096-w
  8. Dixit P. (2023 November 29). A “silly” attack made ChatGPT reveal real phone numbers and email addresses. Eng…
  9. Duplantier T. (2024 January 8). AI and ethical concerns for legal practitioners. LexisNexis. https://www.lexi…
  10. Edwards B. (2024a May 9). Stack overflow users sabotage their posts after OpenAI deal. Ars Technica. https://…
  11. Edwards B. (2024b May 31). Journalists “deeply troubled” by OpenAI’s content deals with vox the Atlantic. Ars…
  12. 10.1109/EuroSP57164.2023.00017
  13. Gilbertson A. & Reisner A. (2024 July 16). Apple Nvidia Anthropic used thousands of swiped YouTube videos to …
  14. Technical Communication Quarterly
  15. Hale C. (2024 July 15). Gemini AI platform accused of scanning Google Drive files without user permission. Te…
  16. Harding S. (2023 June 30). Reddit API changes are imminent. Here’s what’s happening to your favorite apps. Ar…
  17. Harding S. (2024 May 17). OpenAI will use reddit posts to train ChatGPT under new deal. Ars Technica. https:/…
  18. 10.1007/s10796-022-10352-8
  19. Holt K. (2024 June 27). Time strikes a deal to funnel 101 years of journalism into OpenAI’s gaping maw. Engad…
  20. 10.1177/2053951720982012
  21. Hunter T. (2024 June 6). Artists are fleeing Instagram to keep their work out of Meta’s AI. The Washington Po…
  22. Hurler K. (2023 June 20). John Oliver is the new face of the Reddit API protest. Gizmodo. https://gizmodo.com…
  23. 10.1371/journal.pone.0174698
  24. Knibbs K. (2024 August 8). One startup’s plan to fix AI’s “shoplifting” problem. WIRED. https://www.wired.com…
  25. The indigenous world 2020
  26. 10.22459/CAEPR38.11.2016
  27. 10.1177/01634437231174351
  28. 10.1007/s10462-024-10888-y
  29. LexisNexis. (2023 May 16). The power of artificial intelligence in legal research. LexisNexis Insights. https…
  30. 10.48550/ARXIV.2310.10383
  31. 10.48550/arXiv.2407.14933
  32. Good data
  33. Maiberg E. (2024 July 24). Google is the only search engine that works on Reddit Now thanks to AI deal. 404 M…
  34. Digital allotment and vanishing indians: IDSOV and LLMS
    American Indian Law Journal
  35. Mehrotra D. & Couts A. (2024 June 27). Amazon is investigating perplexity over claims of scraping abuse. Wire…
  36. Metz C. Kang C. Frenkel S. Thompson S. A. & Grant N. (2024 April 6). How tech giants cut corners to harvest d…
  37. Native Nations Institute. (2024). Indigenous data sovereignty and governance. The University of Arizona Nativ…
  38. 10.1080/13501763.2023.2172060
  39. 10.23962/10539/30360
  40. Journal of Business and Technical Communication
  41. Open Data Charter. (n.d.). ODC principles. Open Data Charter. Retrieved August 10 2024 from https://opendatac…
  42. Paul K. (2023 December 30). How social media’s biggest user protest rocked Reddit. The Guardian. https://www.…
  43. 10.1145/3539597.3575792
  44. 10.5281/ZENODO.2668475
  45. Reddit. (2024). Public content policy. https://support.reddithelp.com/hc/en-us/articles/26410290525844-Public…
  46. Journal of Technical Writing and Communication
  47. Roose K. (2024 July 19). The data that powers A.I. is disappearing fast. The New York Times. https://www.nyti…
  48. Roth E. (2024 February 22). Google cut a deal with Reddit for AI training data. The Verge. https://www.thever…
  49. For data-guzzling ai companies, the internet is too small
    The Wall Street Journal
  50. Shanklin W. (2024 July 24). AI search engines that don’t pay up can’t index reddit content. Engadget. https:/…
  51. Similarweb. (2024 July). Reddit.com. https://www.similarweb.com/website/reddit.com/#overview
  52. Sinclair S. & Rockwell G. (2024). Voyant tools (version 2.6.14).
  53. Snelling G. (2024 June 5). What is the Cara app and why are artists deleting instagram for it? Fast company. …
  54. Sokolsky E. & Sommese S. (2023 August 2). LexisNexis collaborates with Microsoft on product integrations and …
  55. Have I been trained?
  56. Thomson T. J. & Angus D. (2023 December 17). Data poisoning: How artists are sabotaging AI to take revenge on…
  57. Vincent J. (2022 October 25). Shutterstock will start selling AI-Generated stock imagery with help from OpenA…
  58. 10.1007/s12525-024-00693-4
  59. Walsh K. (2023 August 18). Understanding CC licenses and generative AI. Creative commons. https://creativecom…
  60. Indigenous data sovereignty and policy
  61. 10.1002/ajs4.141
  62. Weisenberger T. M. Milton D. C. Enright H. A. & Kim J. (2024). Case tracker: Artificial intelligence copyrigh…
  63. 10.4135/9781529798227
  64. White J. (2023 December 22). How strangers got my email address from ChatGPT’s model. The New York Times. htt…
  65. Wiggers K. (2024 May 6). Stack overflow signs deal with OpenAI to supply data to its models. TechCrunch. http…
  66. 10.1093/oso/9780197582794.003.0006
  67. 10.1016/j.hcc.2024.100211
  68. 10.1007/s44163-024-00121-8