February 2006
Aside from broken links, orphan pages are the biggest problem plaguing webmasters. An orphan page is one that exists on the web server but that no other page links to. In other words, a visitor following links has no way to reach it.
The previous script checks (and lists) all the links on a site. You now need a way to compare that list of links against the list of files on your site to make sure that every page is visible to the outside world.
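Stripped of the details, the comparison described here is a set difference: the files on disk minus the files that some link points to. The following is a minimal sketch of that idea only, not the book's script. The helper names `list_site_files` and `find_orphans` and the sample file names are hypothetical, and the sketch assumes that both lists have already been reduced to paths relative to the site's document root.

```perl
use strict;
use warnings;
use File::Find ();

# Hypothetical helper: return every plain file under $root,
# as a path relative to $root.
sub list_site_files {
    my ($root) = @_;
    my @files;
    File::Find::find(
        {
            no_chdir => 1,    # keep $File::Find::name usable as a path
            wanted   => sub {
                return unless -f $File::Find::name;
                my $rel = $File::Find::name;
                $rel =~ s{^\Q$root\E/?}{};    # strip the root prefix
                push @files, $rel;
            },
        },
        $root
    );
    return sort @files;
}

# Hypothetical helper: files that no link points to (a set difference).
sub find_orphans {
    my ($files, $linked) = @_;
    my %is_linked = map { $_ => 1 } @$linked;
    return grep { !$is_linked{$_} } @$files;
}

# Hand-written sample data standing in for the two real lists.
my @files  = ('about.html', 'index.html', 'old/promo.html');
my @linked = ('index.html', 'about.html');

print "Orphan: $_\n" for find_orphans(\@files, \@linked);
# prints "Orphan: old/promo.html"
```

On a real site, the first list would come from something like `list_site_files` run on the document root, and the second from the output of a link checker. A hash lookup keeps the comparison linear in the number of files, which matters once a site grows to thousands of pages.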
    use strict;
    use warnings;
    use Getopt::Std;
    use URI;

    use File::Find ();
    use vars qw/*name/;    # Name of the file from find
    *name = *File::Find::name;

    use vars qw/$opt_s $opt_w/;

    # List of files on the website
    my @file_list;

    # Called by find for each file
    sub wanted
    {
        ...