<img height="1" width="1" style="display:none" src="https://www.facebook.com/tr?id=2006193252832260&amp;ev=PageView&amp;noscript=1">

1 Min Read

Reddit IPO Reveals That Your User Data Is Training AI

Featured Image

Wondering how to get started with AI? Take our on-demand Piloting AI for Marketers Series.

Learn More

Reddit is going public. And, as part of its IPO filing, the company revealed it has data licensing agreements with AI companies.

The agreements, worth more than $200 million, presumably give these companies' AI models the ability to train legally on Reddit data.

Reddit hasn't revealed all the AI companies involved. But Reuters found out that one of the deals is with Google to the tune of $60 million per year.

What's going on here?

I got the answers on Episode 85 of The Artificial Intelligence Show from Marketing AI Institute founder/CEO Paul Roetzer.

The future of AI models is licensed and synthetic data

"This is going to be worth a lot of money," says Roetzer. "The future of these models is going to be licensed and synthetic data."

The most powerful AI models learn by ingesting enormous quantities of information. How they get at that information is the source of controversy. (Some companies like OpenAI are being sued for their use of copyright material.)

But by training on licensed content, or training on AI-generated content, AI companies avoid legal issues and get unique datasets that give them an advantage over other models. That makes proprietary datasets like Reddit's worth a lot of money to AI companies.

Companies with proprietary data are sitting on a goldmine

Access to unique data is how AI models stay competitive, says Roetzer. So it's no wonder Google and others are inking these types of deals. 

It's also why Elon Musk turned off access to the API of X, formerly Twitter. Now, only his AI model, Grok, can train on that data.

This likely impacts media companies and online sites with proprietary data, says Roetzer. Because this data represents a goldmine to AI companies.

"If you have a bunch of proprietary data, licensing of data is gonna be huge moving forward."

Related Posts

Turn Google Analytics Data into Actionable Recommendations with AI

Paul Roetzer | June 20, 2021

Company Spotlight: PaveAI. Use machine learning and data science to turn Google Analytics data into actionable recommendations and reports.

This AI Tool Gets Retail Customers to Buy More—and Get Smarter Over Time

Paul Roetzer | July 12, 2018

AI platform Bluecore says it increases revenue, sales and marketing ROI by using machine learning to tell what retail customers will buy next. We interviewed the company to learn more.

Are You Sending the Right Signals to Recruit AI Talent to Your Organization?

Mike Kaput | March 26, 2018

Companies are falling behind in the race for acquiring AI talent—often, not by choice or ignorance, but because they aren’t sending the right signals to the talent marketplace. Here’s a few ways you can recruit top AI talent.