The Art of SEO

by Eric Enge, Stephan Spencer, Rand Fishkin, Jessie Stricchiola

October 2009

Beginner

602 pages

19h 50m

English

O'Reilly Media, Inc.

Content preview from The Art of SEO

Duplicate Content Issues

Duplicate content can result from many causes, including licensing of content to or from your site, site architecture flaws due to non-SEO-friendly CMSs, or plagiarism. Over the past five years, however, spammers in desperate need of content have adopted the now much-reviled practice of scraping content from legitimate sources, scrambling the words (through a variety of complex processes), and repurposing the text on their own pages in the hope of attracting long tail searches and serving contextual ads (among various other nefarious purposes).

Thus, today we’re faced with a world of “duplicate content issues” and “duplicate content penalties.” Here are some definitions that are useful for this discussion:

Unique content: This is written by humans, is completely different from any other combination of letters, symbols, or words on the Web, and is clearly not manipulated through computer text-processing algorithms (such as Markov-chain-employing spam tools).
Snippets: These are small chunks of content such as quotes that are copied and reused; these are almost never problematic for search engines, especially when included in a larger document with plenty of unique content.
Shingles: Search engines look at relatively small phrase segments (e.g., five to six words) for the presence of the same segments on other pages on the Web. When there are too many shingles in common between two documents, the search engines may interpret them as duplicate content.
Duplicate content issues ...
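
The shingle comparison defined above can be illustrated with a small sketch. This is not how any particular search engine implements it; the engines do not publish those details. It assumes shingles are overlapping five-word sequences and that overlap is measured as the Jaccard similarity of the two shingle sets. The function names (`shingles`, `shingle_overlap`) and the sample strings are hypothetical, chosen only for illustration.

```python
import re


def shingles(text, size=5):
    """Return the set of overlapping word sequences (shingles) in text.

    Assumption: a shingle is `size` consecutive lowercase words.
    """
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < size:
        # Text shorter than one shingle: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def shingle_overlap(doc_a, doc_b, size=5):
    """Jaccard similarity of two documents' shingle sets, from 0.0 to 1.0."""
    a, b = shingles(doc_a, size), shingles(doc_b, size)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


# Hypothetical sample texts.
original = ("search engines look at relatively small phrase segments "
            "for the presence of the same segments on other pages")
scraped = ("search engines look at relatively tiny phrase segments "
           "for the presence of the same segments on other pages")
unrelated = ("a recipe for sourdough bread needs flour water salt "
             "and a healthy starter kept at room temperature")

print(shingle_overlap(original, original))   # identical text
print(shingle_overlap(original, scraped))    # one word changed
print(shingle_overlap(original, unrelated))  # no shared phrases
```

Changing a single word removes every shingle that contains it, so lightly scrambled text still shares many shingles with its source, while text that merely covers the same topic shares almost none. What score counts as "too many shingles in common" is a choice each engine makes and is not stated here.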

Publisher Resources

ISBN: 9780596809133
