Skip to Content
Hands-On Software Engineering with Golang
book

Hands-On Software Engineering with Golang

by Achilleas Anagnostopoulos
January 2020
Intermediate to advanced
640 pages
16h 56m
English
Packt Publishing
Content preview from Hands-On Software Engineering with Golang

The content extractor

The content extractor attempts to identify and extract all text from a document downloaded from a remote server. For instance, if the link is pointed to a plaintext document, then the extractor would emit the document content as is. On the other hand, if the link pointed to an HTML document, the extractor would strip off any HTML elements and emit the text-only portion of the document.

The emitted content is sent off to the content indexer component so it can be tokenized and update the Links 'R' Us full-text search index.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Hands-On Software Architecture with Golang

Hands-On Software Architecture with Golang

Jyotiswarup Raiturkar

Publisher Resources

ISBN: 9781838554491Supplemental Content