Main Menu

GPTBot crawl

Started by ergophobe, November 10, 2024, 12:26:10 AM

Previous topic - Next topic

ergophobe

I was just looking at a few raw server logs and noticed that GPTBot was crawling like mad in October. A site with between 100-200 pages got

5700 so far in Nov
97,000 hits from GPTBot in October
2068 in September
1700 in August

I looked at some other mini sites with maybe a dozen pages and they got 37,000 to 47,000 hits in October and just a handful in September.

I can't imagine the crawl budget OpenAI must have

rcjordan

Debbie says they may be running full blast before the copyright lawsuits throttle them.

https://www.shacknews.com/article/141313/openai-needs-copyright-material

OpenAI insists it can't sufficiently train AI models without copyrighted material | Shacknews

---

Take a look....

https://www.google.com/search?q=ai+bots+courts+copyright

ai bots courts copyright - Google Search



ergophobe

Well.... they have my content. I wonder how hard it would be to seed an AI, like the SEO comps where people would compete to rank some previously unique phrase.

rcjordan


ergophobe

Ah yes. I thought we had had some discussion of that