Tools like RepeatMasker are used to find areas of the genome with low complexity; check out http://www.repeatmasker.org/ for more information IUPAC ambiguity codes may be useful to have in hand when processing other genomes; check out http://www.bioinformatics.org/sms/iupac.html for more information