Big Tech needs to get creative as it runs out of data to train its AI models. Here are some of its wildest solutions.
Lakshmi Varanasi, Apr 8, 2024, 03:11 IST
Big Tech is scouring the internet for new data sources to train its AI models. Gilnature/Getty Images
OpenAI, Meta, Google, and other Big Tech firms train their AI models using online data.
But AI models learn so fast that all that data could run out by 2026.
More is more when it comes to AI. The more data AI systems are trained on, the more powerful they will be.
But as the AI arms race heats up, tech giants like Meta, Google, and OpenAI face a problem: They're running out of data to train their models.
Many leading AI systems have been trained on the vast supply of online data. But by 2026, all the high-quality data could be exhausted, according to Epoch, an AI research institute.
So major tech companies are searching for new data sources to keep their systems learning. Here's a look at some of the most creative options they are considering.
Google considered tapping consumer data available in Google Docs, Sheets, and Slides.
Google considered using data from Google Docs, Sheets, and Slides for training its AI systems. Shutterstock
Splurging on the publishing house Simon & Schuster
Simon & Schuster's New York City headquarters in 2016. Robert Alexander/Getty Images
Generating synthetic data
OpenAI is exploring synthetic data to train its systems. RICHARD JONES/SCIENCE PHOTO LIBRARY/Getty Images
Whisper, a speech recognition tool that transcribes YouTube videos
YouTube wants to create AI-generated music. Getty Images
Photobucket: A treasure trove of photos from Myspace and Friendster
Photobucket, which hosted photos on Myspace, might be licensing its data to tech companies. eHowTech/YouTube