Robots.txt - Jim Nielsen’s Blog
I realized why I hadn’t yet added any rules to my robots.txt: I have zero faith in it.
I endorse this statement.
Readability is back, but now it’s called Mercury.
This tool for building ScrAPIs is an interesting development—the current trend for not providing a simple API (or even a simple RSS feed) is being interpreted as damage and routed around.
David Cole shares the ideas for projects he would like to develop further, but probably never will. I like this a lot (and there are some great ideas in here).
A handy step-by-step guide to scraping HTML to get data out. Useful for services (—cough—Twitter—cough—) that keep changing the rules of their API use.
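For a flavour of what scraping HTML involves, here is a minimal sketch using only Python's standard library. It is not taken from the linked guide; the `LinkExtractor` class, the `extract_links` helper, and the sample markup are all made up for illustration.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects (href, link text) pairs from anchor tags. Hypothetical helper."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the anchor currently being read, if any
        self._text = []     # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an anchor with an href.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Return a list of (href, text) tuples found in the given HTML string."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


# Made-up markup standing in for a fetched page.
SAMPLE_HTML = """<ul>
<li><a href="/one">First post</a></li>
<li><a href="/two">Second <em>post</em></a></li>
</ul>"""

if __name__ == "__main__":
    for href, text in extract_links(SAMPLE_HTML):
        print(href, text)
```

Real pages are messier than this, so a dedicated parsing library is usually the more comfortable choice once the markup gets complicated.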
A new feature on Matthew Somerville's brilliant train timetable site. Just put /fares at the end of any URL to get the cheapest available fare.