As AI data scrapers sap websites' revenues, some fight back
AI crawlers from major tech firms are scraping vast amounts of web content without permission, undermining traffic and revenues for publish...
14 Nov, 2025, 03.58 PM IST
The great Indic data hunt
The race to build Indic models has heated up, but the challenge of Indic language data availability persists.
16 Sep, 2025, 11.17 AM IST
Cloudflare accuses Perplexity of using stealth crawling techniques to evade network blocks
In a blog post, Cloudflare alleged that Perplexity initially uses a declared user agent for its bots, and later switches to undeclared, gen...
05 Aug, 2025, 10.55 AM IST
AI search pushing an already weakened media ecosystem to the brink
Generative AI tools like ChatGPT are reducing traffic to news websites by providing direct summaries, threatening media revenue models. As ...
04 Aug, 2025, 08.54 AM IST
Cloudflare introduces default blocking of AI data scrapers
Cloudflare has introduced a new setting allowing websites to block AI bots from scraping content without permission. The move aims to prote...
02 Jul, 2025, 09.50 AM IST
Cloudflare launches tool to help website owners monetise AI bot crawler access
The tool allows website owners to choose whether artificial intelligence crawlers can access their material and set a price for access thro...
01 Jul, 2025, 03.41 PM IST
OpenAI asks Indian court to throw out book publishers' challenge in copyright battle
OpenAI has requested an Indian court to dismiss a plea from a group of book publishers who claim its ChatGPT service infringes on their cop...
28 Jan, 2025, 11.26 AM IST
Companies alert as along come AI web spiders
AI crawlers are computer programs that collect data from websites to train large language models. Enterprises are increasingly blocking AI ...
15 Dec, 2024, 06.00 AM IST
Canadian news media are suing OpenAI for copyright infringement, but will they win?
The lawsuits claim that OpenAI "scraped" large amounts of content from media sites without permission. They have also claimed that the AI c...
02 Dec, 2024, 01.39 PM IST
Why OpenAI's SearchGPT won't kill Google Search anytime soon
Whether GenAI-based search can ever dethrone incumbents like Google Chrome is still a far sight. But if it does, it could cause a tectonic ...
06 Aug, 2024, 12.23 PM IST
Apple claims to not use unethical data to train its AI
Apple explained in the paper that its main aim is to assist users with everyday tasks. “Our models are designed to help users with daily ac...
30 Jul, 2024, 06.16 PM IST
The data that powers AI is disappearing fast
Web data restrictions impact AI models like ChatGPT. MIT's study finds 25% of top-quality data restricted. Smaller firms use synthetic data...
22 Jul, 2024, 01.41 PM IST
Pay for content, slice for responsible AI
AI deals with media outlets like WSJ and FT help tech firms avoid conflicts. Google pays WSJ for new AI content. Need for regulation in tra...
01 May, 2024, 10.43 PM IST
ETtech Explainer: OpenAI’s response to NYT’s copyright lawsuit
OpenAI has refuted some of these allegations and claimed that the fair use doctrine of copyright laws will apply in its case. ET explains t...
13 Jan, 2024, 09.59 AM IST
ETtech Explainer | NYT vs OpenAI: Why news publishers are fighting Big Tech over LLMs
Last week’s copyright infringement lawsuit by The New York Times (NYT) against ChatGPT-maker OpenAI has opened another battlefront betwee...
01 Jan, 2024, 06.01 AM IST
BBC blocks OpenAI data scraping, to harness Generative AI
The BBC's director of nations, Rhodri Talfan Davies, believes that the use of artificial intelligence (AI), specifically Generative AI (Gen...
07 Oct, 2023, 10.35 AM IST
What is web crawler GPTBot that OpenAI has newly released?
OpenAI's GPTBot plans to expand the horizons of AI through web crawling.
09 Aug, 2023, 11.11 AM IST