May 2017
Beginner
552 pages
28h 47m
English
Lynx is a command-line web browser; it can dump a text version of a website as we will see in a web browser, instead of returning the raw HTML as wget or cURL does. This saves the step of removing HTML tags. The -nolist option shows the links without numbers. Parsing and formatting the lines that contain Rank is done with sed:
sed -e 's/ *Rank-\([0-9]*\) *\(.*\)/\1\t\2/'
These lines are then sorted according to the ranks.