Searched for
AI DATA SCRAPING
Egocentric data collection fuels AI robotics growth in IndiaIndian startups are entering the lucrative egocentric data collection business. This data, captured from a first-person view, is vital for ...
Gold loan startups find new shine; Infy Q4 profit jumpsHappy Friday! Gold loan fintechs are moving towards building their own loan books. This and more in today's ETtech Morning Dispatch.
Humyn Labs to deploy $20 million to scale human data layer for physical AI, roboticsThe physical AI startup is trying to solve a key constraint faced by robotics and physical AI companies — limited availability of high-qual...
Personal data isn't collected: MHA on security agencies using open-source intelligence from public sourcesSecurity agencies are utilizing open-source intelligence from public sources like social media for information gathering, assuring no priva...
Where is defining gadget of AI age? Jony Ive, former design chief of Apple, is working on his toughest assignment yet, give AI a physiqueThe AI era needs a defining gadget. Jony Ive, who shaped the smartphone, is now working with OpenAI on a new AI device. This aims to redefi...
Is internet more AI-driven than human? Report reveals the way forwardPer a report by cybersecurity firm HUMAN Security, in 2025, automated internet traffic grew 23.51% YoY, almost eight times faster than the ...
New York Times sues Perplexity AI for 'illegal' copying of contentThe New York Times has filed a lawsuit against AI startup Perplexity. The Times claims Perplexity copied and used its articles without perm...
Reddit accuses 'data scraper' companies of stealing its informationReddit is taking a firm stand against four data scraping companies, including SerpApi, Oxylabs, and AWMProxy, by initiating legal proceedin...
Reddit locks out Wayback machine to stop AI from scraping old postsReddit has restricted the Internet Archive’s Wayback Machine from extensively capturing its content due to concerns over unauthorized AI da...
Cloudflare introduces default blocking of AI data scrapersCloudflare has introduced a new setting allowing websites to block AI bots from scraping content without permission. The move aims to prote...
Companies alert as along come AI web spidersAI crawlers are computer programs that collect data from websites to train large language models. Enterprises are increasingly blocking AI ...
Canadian news media are suing OpenAI for copyright infringement, but will they win?The lawsuits claim that OpenAI "scraped" large amounts of content from media sites without permission. They have also claimed that the AI c...
Tech giants push to dilute Europe's AI ActThe EU has invited companies, academics, and others to help draft the code of practice, receiving nearly 1,000 applications, an unusually h...
ET Explainer: Cloudflare's new tool aims to block AI bots from scraping website contentCloudflare has introduced a new tool to block AI bots from scraping website content. The tool aims to protect content publishers from unaut...
Reddit to update web standard to block automated website scrapingAI startups face scrutiny for bypassing Reddit's updated scraping rules. Plagiarism accusations against firms like Perplexity highlight the...
BBC blocks OpenAI data scraping, to harness Generative AIThe BBC's director of nations, Rhodri Talfan Davies, believes that the use of artificial intelligence (AI), specifically Generative AI (Gen...
Google training Bard with scraped web data? Here’s everything you may want to knowGoogle has acknowledged training its AI systems using publicly available web data, prompting concerns over privacy, copyright infringement ...