Comment on Pearson complaining about using Linux to access my course material

<- View Parent
Irelephant@lemm.ee ⁨2⁩ ⁨days⁩ ago

A small publisher’s ebook platform recently started blocking firefox for me, did a bit of digging and found that if pages aren’t requested with the right headers (which work in chrome and msedge) it will respond with a 302, suggesting you go to another page which takes a few minutes and then times out.

This is probably to stop scraping, and could be because I started testing some scraping scripts on it.

Anyway, this hasn’t even stopped me scraping, I just copied the headers and use those in my script.

source
Sort:hotnewtop