SafeSearch Certifying URLs

Feed URLs to Google’s SafeSearch to determine whether or not they point at questionable content.

Only three things in life are certain: death, taxes, and accidentally visiting a once family-safe web site that now contains text and images that would make a horse blush.

As you probably know if you’ve ever put up a web site, domain names are registered for finite lengths of time. Sometimes registrations accidentally expire; sometimes businesses fold and allow the registrations to expire. Sometimes other companies take them over.

Other companies might just want the domain name, some companies want the traffic that the defunct site generated, and in a few cases, the new owners of the domain name try to hold it “hostage,” offering to sell it back to the original owners for a great deal of money. (This doesn’t work as well as it used to because of the dearth of Internet companies that actually have a great deal of money.)

When a site isn’t what it once was, that’s no big deal. When it’s not what it once was and is now rated X, that’s a bigger deal. When it’s not what it once was, is now rated X, and is on the link list of a site you run, that’s a really big deal.

But how to keep up with all the links? You can go visit every link periodically and see if it’s still okay, or you can wait for the hysterical emails from site visitors, or you can just not worry about it. Or you can put the Google API to work.

This program lets you give provide a list of URLs and check ...

Get Google Hacks now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.