Google crawls 3X more of the web than OpenAI, & how that matters
Google's extensive web crawling capabilities, significantly exceeding competitors like OpenAI, Microsoft, Anthropic, and Meta, could grant ...
19 Jan, 2026, 08.49 PM IST
As AI data scrapers sap websites' revenues, some fight back
AI crawlers from major tech firms are scraping vast amounts of web content without permission, undermining traffic and revenues for publish...
14 Nov, 2025, 03.58 PM IST
Future of search lies in indexing for AI: Perplexity CEO Aravind Srinivas
In an exclusive interview with ET's Samidha Sharma, Srinivas said that traditional web indexes were designed for users clicking through lin...
01 Oct, 2025, 01.24 PM IST
The great Indic data hunt
Race to build Indic models has heated up, but challenge over availability of Indic language data persists.
16 Sep, 2025, 11.17 AM IST
Cloudflare accuses Perplexity of using stealth crawling techniques to evade network blocks
In a blog post, Cloudflare alleged that Perplexity initially uses a declared user agent for its bots, and later switches to undeclared, gen...
05 Aug, 2025, 10.55 AM IST
AI-led ad frauds skim billions from brands one click at a time
From China to Egypt, the murky world of fake clicks is thriving. So-called “click farms” are inflating website traffic numbers and duping a...
01 Aug, 2025, 06.00 AM IST
Cloudflare introduces default blocking of AI data scrapers
Cloudflare has introduced a new setting allowing websites to block AI bots from scraping content without permission. The move aims to prote...
02 Jul, 2025, 09.50 AM IST
Cloudflare launches tool to help website owners monetise AI bot crawler access
The tool allows website owners to choose whether artificial intelligence crawlers can access their material and set a price for access thro...
01 Jul, 2025, 03.41 PM IST
Companies alert as along come AI web spiders
AI crawlers are computer programs that collect data from websites to train large language models. Enterprises are increasingly blocking AI ...
15 Dec, 2024, 06.00 AM IST
Canadian news media are suing OpenAI for copyright infringement, but will they win?
The lawsuits claim that OpenAI "scraped" large amounts of content from media sites without permission. They have also claimed that the AI c...
02 Dec, 2024, 01.39 PM IST
The data that powers AI is disappearing fast
Web data restrictions impact AI models like ChatGPT. MIT's study finds 25% of top-quality data restricted. Smaller firms use synthetic data...
22 Jul, 2024, 01.41 PM IST
ET Explainer: Cloudflare's new tool aims to block AI bots from scraping website content
Cloudflare has introduced a new tool to block AI bots from scraping website content. The tool aims to protect content publishers from unaut...
09 Jul, 2024, 05.40 PM IST
Reddit to update web standard to block automated website scraping
AI startups face scrutiny for bypassing Reddit's updated scraping rules. Plagiarism accusations against firms like Perplexity highlight the...
26 Jun, 2024, 09.17 AM IST
As Google pushes deeper into AI, publishers see fresh challenges
Since May, Google has begun rolling out a new form of search powered by generative AI, after industry observers questioned the tech giant's...
19 Oct, 2023, 10.43 PM IST