ET Infographic | Running out of data

With foundation models running out of data for training, companies are placing their bets on the use of synthetic data for training the models. But based on a recent research paper, “indiscriminate use of model-generated content in training causes...

ETtech
With foundation models running out of data for training, companies are placing their bets on the use of synthetic data for training the models. When Meta released the latest iteration of its open-source model, Llama 3.1 405B, it also updated the model’s licence to generate synthetic data which can be further used to train proprietary small models. On paper, this sounds plausible.

AI charticle 1

But recent studies cast doubts on this model. Based on a research paper published in science journal ‘Nature’, “indiscriminate use of model-generated content in training causes irreversible defects in the resulting models”.


AI charticle 2

ET takes a look.
Download
The Economic Times Business News App
for the Latest News in Business, Sensex, Stock Market Updates & More.
Download
The Economic Times News App
for Quarterly Results, Latest News in ITR, Business, Share Market, Live Sensex News & More.
READ MORE
ADVERTISEMENT

READ MORE:

LOGIN & CLAIM

50 TIMESPOINTS

More from our Partners

Loading next story
Business News › Tech › AI › ET Infographic | Running out of data
Text Size:AAA
Success
This article has been saved

*

+