<img height="1" width="1" style="display:none" src="https://www.facebook.com/tr?id=2006193252832260&amp;ev=PageView&amp;noscript=1">

2 Min Read

Big Problems Discovered with AI Training

Featured Image

Wondering how to get started with AI? Take our on-demand Piloting AI for Marketers Series.

Learn More

AI companies like OpenAI are coming under fire for how AI tools are trained…

Reddit, which is often scraped to train language models, just announced it would charge for API access, in order to stop AI companies from training models on Reddit data without compensation.

Twitter recently made a similar move. And Elon Musk publicly threatened to sue Microsoft for, he says, “illegally using Twitter data” to train models.

Don’t be surprised if other companies follow suit…

An investigative report by the Washington Post recently found that large language models from Google and Meta trained on data from major websites like Wikipedia, The New York Times, and Kickstarter.

The report raises concerns that models may be using data from certain sites improperly. In one example, the Post found models had trained on an ebook piracy site—and so likely did not have permission to use the data it trained on. Not to mention, the copyright symbol appeared more than 200 million times in the data set the Post studied.

What concerns does this raise for marketing and business professionals using these tools?

I spoke to Marketing AI Institute co-founder and CEO Paul Roetzer on Episode 44 of the Marketing AI Show to learn more.

  1. This will change the value proposition of putting data out there for free. Expect to see companies with proprietary data either train their own AI models and products (like Quora has) or charge for access to the data via API. Some might do both, says Roetzer. It also changes the value equation of putting data out there for free. In the past, you gave free access to your data in exchange for valuable benefits like more users or traffic. That equation may now change, as free access means you could be training a model that replaces the need for your site or brand.
  2. AI training will need to change. In Europe, it’s looking like AI companies are struggling to train models in ways that don’t violate European law. Everywhere, it also appears AI companies are training models on copyrighted material. AI companies may get hit with massive penalties or legal actions—or dodge regulations entirely. But one thing is clear, no matter what happens. “The way they build these models is going to have to evolve,” says Roetzer.
  3. Business leaders need to be prepared. “You have to address the fact that you may be using technology that was built illegally,” says Roetzer. That doesn’t mean you’ll get in trouble for using the technology. (It’s highly doubtful, but please check with a lawyer.) But you are going to likely train custom versions of models moving forward, models that are trained largely on compliant data that you legally own. And, prepare to hear about legal cases hitting big AI companies, even some you might use, moving forward.

Don’t get left behind…

You can get ahead of AI-driven disruption—and fast—with our Piloting AI for Marketers course series, a series of 17 on-demand courses designed as a step-by-step learning journey for marketers and business leaders to increase productivity and performance with artificial intelligence.

The course series contains 7+ hours of learning, dozens of AI use cases and vendors, a collection of templates, course quizzes, a final exam, and a Professional Certificate upon completion.

After taking Piloting AI for Marketers, you’ll:

  1. Understand how to advance your career and transform your business with AI.
  2. Have 100+ use cases for AI in marketing—and learn how to identify and prioritize your own use cases.
  3. Discover 70+ AI vendors across different marketing categories that you can begin piloting today.

Learn More About Piloting AI for Marketers

Related Posts

Inside the E/Acc Movement That's Speeding Up AI Development

Mike Kaput | January 9, 2024

A new movement is taking over Silicon Valley—and it has big implications for AI’s future.

Google Removes Gender Pronouns from Smart Compose

Ashley Sams | November 29, 2018

Our team reads tons of AI and mahine learning articles to share the best ones with you. This week we're reading all about Google's removal of gender pronouns in its Smart Compose Gmail feature, which smart speaker is best and how China uses cheap labor to support their AI ambitions.

Are You Sending the Right Signals to Recruit AI Talent to Your Organization?

Mike Kaput | March 26, 2018

Companies are falling behind in the race for acquiring AI talent—often, not by choice or ignorance, but because they aren’t sending the right signals to the talent marketplace. Here’s a few ways you can recruit top AI talent.