Comment on What are You Working on Wednesday
dubbel@discuss.tchncs.de 8 months ago
Private project, not really security related: Crawling robots.txts to gather some statistics on which bots people are most often excluding - weirdly I couldn’t find any recent/regularly updated stats on this.
PaddleMaster@beehaw.org 8 months ago
That’s a neat project. Are you looking for trends, or something specific?
dubbel@discuss.tchncs.de 8 months ago
It started with a popular mastodon posts on how to block openai crawlers I think, and I’d like to know whether people are actually implementing it.
PaddleMaster@beehaw.org 8 months ago
That’s neat. I’m curious about this now. With “normal” search engines that have generally gone to shit, AI chat bots are on trend to give better results. If the robots.txt file is blocked from OpenAI, can I assume it hits other chatbots? And would that extend to Google/bing?