AI companies train language models on YouTube’s archive, making family-and-friends videos a privacy risk2024-09-23

In an article for The Conversation, Ethan Zuckerman and I argue for greater attention to the privacy issues at stake when companies train their AI models with YouTube data. Here are the concluding paragraphs:

"The intentions of a YouTube uploader simply aren’t as consistent or predictable as those of someone publishing a book, writing an article for a magazine or displaying a painting in a gallery. But even if YouTube’s algorithm ignores your upload and it never gets more than a couple of views, it may be used to train models like ChatGPT and Gemini.

"As far as AI is concerned, your family reunion video may be just as important as those uploaded by influencer giant Mr. Beast or CNN."

The full article is available without a paywall at The Conversation.

Last updated:
2025-01-02

Text licensing:
By Ryan McGrady and Ethan Zuckerman. The Conversation uses CC BY-ND 4.0 (you can use/republish this, but please don't modify it).

Media licensing:
American oystercatcher family at Fort Tilden, by Ryan McGrady, CC BY-SA 4.0.