
As most AI execs scramble for more data, Mark Zuckerberg says there's actually something more 'valuable'

Apr 21, 2024, 23:43 IST
Business Insider
Mark Zuckerberg seems pretty chill about the amount of data out there for AI. Josh Edelson/AFP via Getty Images
  • Meta CEO Mark Zuckerberg weighed in on the AI data race in a new interview.
  • As the AI arms race heats up, many tech companies are scrambling for new data sources.

Meta CEO Mark Zuckerberg has a hot take on Big Tech's race for AI training data: It's not about the data.

"The thing that I think is going to be more valuable is the feedback loops rather than any kind of upfront corpus," Zuckerberg said in an interview with the Command Line, a tech industry newsletter.

Feedback loops are used to retrain and improve AI models over time based on their previous outputs. These algorithms let AI models know when they make an error, for example, and provide them with data to adjust their future performance.

"Having a lot of people use it and then seeing how people use it and being able to improve from there is actually going to be a more differentiating thing over time," he said.

Sourcing new data for their insatiable AI models to consume, which theoretically will make them smarter, is now an obsession for companies racing to dominate AI.

Companies like OpenAI, Google, Amazon, Meta, and others have considered some wild solutions. Meta, for instance, was so desperate for data at one point that it considered buying the publishing company Simon & Schuster and even weighed risking copyright lawsuits for more material, The New York Times reported.

Another solution to the problem of limited data is just creating new data, something Big Tech calls "synthetic data." Synthetic data is artificially generated and designed to mimic data generated by real-world events. Zuckerberg's into it.

"I think there's going to be a lot in synthetic data, where you are having the models trying to churn on different problems and see which paths end up working, and then use that to reinforce," he said.

Anthropic, the maker of the chatbot Claude, has also fed internally generated data into its models. And ChatGPT maker OpenAI is considering it, although CEO Sam Altman said at a conference last May that the key is having a model "smart enough to make good synthetic data."

And while Zuckerberg sees feedback loops as the key to building powerful AI models, there are also risks in relying on them. Feedback loops could reinforce a model's mistakes, limitations, and biases if the model isn't trained on "good data" to begin with.
