Google to allow AI opt-out to ease UK competition concernsGoogle is creating new search controls. Websites can now choose to avoid its generative AI features. This move addresses concerns from Brit...
18 Mar, 2026, 09.17 PM IST
Google crawls 3X more of the web than OpenAI, & how that mattersGoogle's extensive web crawling capabilities, significantly exceeding competitors like OpenAI, Microsoft, Anthropic, and Meta, could grant ...
19 Jan, 2026, 08.49 PM IST
As AI data scrapers sap websites' revenues, some fight backAI crawlers from major tech firms are scraping vast amounts of web content without permission, undermining traffic and revenues for publish...
14 Nov, 2025, 03.58 PM IST
Cloudflare accuses Perplexity of using stealth crawling techniques to evade network blocksIn a blog post, Cloudflare alleged that Perplexity initially uses a declared user agent for its bots, and later switches to undeclared, gen...
05 Aug, 2025, 10.55 AM IST
AI search pushing an already weakened media ecosystem to the brinkGenerative AI tools like ChatGPT are reducing traffic to news websites by providing direct summaries, threatening media revenue models. As ...
04 Aug, 2025, 08.54 AM IST
Cloudflare launches tool to help website owners monetise AI bot crawler accessThe tool allows website owners to choose whether artificial intelligence crawlers can access their material and set a price for access thro...
01 Jul, 2025, 03.41 PM IST
Companies alert as along come AI web spidersAI crawlers are computer programs that collect data from websites to train large language models. Enterprises are increasingly blocking AI ...
15 Dec, 2024, 06.00 AM IST
ET Explainer: Cloudflare's new tool aims to block AI bots from scraping website contentCloudflare has introduced a new tool to block AI bots from scraping website content. The tool aims to protect content publishers from unaut...
09 Jul, 2024, 05.40 PM IST
Reddit to update web standard to block automated website scrapingAI startups face scrutiny for bypassing Reddit's updated scraping rules. Plagiarism accusations against firms like Perplexity highlight the...
26 Jun, 2024, 09.17 AM IST
Multiple AI companies bypassing web standard to scrape publisher sites, licensing firm saysPerplexity likely bypassed web crawler blocks via the Robots Exclusion Protocol, as reported by Wired, using analytics to track AI traffic.
21 Jun, 2024, 08.47 PM IST
Pay for content, slice for responsible AIAI deals with media outlets like WSJ and FT help tech firms avoid conflicts. Google pays WSJ for new AI content. Need for regulation in tra...
01 May, 2024, 10.43 PM IST
BBC blocks OpenAI data scraping, to harness Generative AIThe BBC's director of nations, Rhodri Talfan Davies, believes that the use of artificial intelligence (AI), specifically Generative AI (Gen...
07 Oct, 2023, 10.35 AM IST
All the content that's fit to be paid forML deepens the trench between online content and its distribution. Publishers have been seeking a larger share of online advertising revenu...
28 Aug, 2023, 10.58 PM IST
What is webcrawler GPTBot that OpenAI has newly released?OpenAI's GPTBot plans to expand the horizons of AI through web crawling.
09 Aug, 2023, 11.11 AM IST
Tencent says 'loophole' allowed WeChat searches on Google, BingContent from China's most popular messaging app WeChat, including articles and videos on its popular public accounts page has opened to ext...
22 Oct, 2021, 04.03 PM IST
IBM, HP, HCL, Wipro, Oracle and TCS pitch for project to track suspects onlineProject O-sint is a strategic initiative by Delhi Police to let them snoop on social networks and the internet for tracking anti-social ele...
10 Jul, 2013, 11.45 AM IST
- Google lets online media keep stories, photos or video out of its index
Publishers have always been able to block Google from including their website content in the search engine index.
03 Dec, 2009, 03.04 AM IST