But it actually was always there... We already have tools to battle it to a degree

I don’t think traffic hogging and data scraping will ever go away. We were okay with Google doing it, but at least people used to click through to your site. Now you get beautiful summary boxes in your AI tool of choice…

Contextually, it’s great. But let’s be honest: how often do you really end up on the source page?

I think AI tools need to cite all the sources they use, but then again, none of us really do…


@cameronwilson.bsky.social on Bluesky

The Wikimedia Foundation, which owns Wikipedia, says its bandwidth costs have gone up 50% since Jan 2024 — a rise they attribute to AI crawlers. AI companies are killing the open web by stealing visitors from the sources of information and making them pay for the privilege

favicon bsky.app

So what’s the answer to this ongoing phenomenon? Here are some options (not an exhaustive list):

There are probably more resources… and a bunch of strategies you can use, like:

  • Robots.txt and AI-specific directives
  • CDN/WAF bot management
  • Rate limiting & request throttling
  • IP blocking & geofencing
  • Behavioral analysis
  • Honeypots & trap links
  • Dynamic DOM obfuscation
  • NoAI meta tags & HTTP headers
  • Terms of Service enforcement
  • Content poisoning
  • Anomaly monitoring & alerting
  • Captcha

This is a continuation of my “Traffic hogging… Has arrived…” post from linked in:

https://www.linkedin.com/feed/update/urn:li:activity:7313284435347443714/