CHAPTER 5 Unstructured Content Processing

When this book was first proposed, it was meant to give readers a full picture of every feature available (and upcoming) in what was then known as Microsoft Syntex (Syntex). Since its origins years ago as part of Project Cortex, Syntex had grown into a standalone product offering a myriad of solutions for modern document management. It is no exaggeration to say that the tool set now known as Microsoft SharePoint Premium has become a suite in its own right rather than a single product.

Even so, the one feature most people associate with Microsoft SharePoint Premium, if they have heard of it at all, is its ability to extract key data points from a document and store them as metadata in fields associated with that document in a SharePoint library. Although it has grown more powerful over the years, it is still the capability most closely identified with Syntex. This chapter covers that feature exclusively.

What's in This Chapter

  • What is Microsoft SharePoint Premium and how can it improve your document management systems?
  • What is a content center and how can it be activated in your own environment?
  • What is an unstructured content processing model?
  • What is an extractor? An explanation? And how do they work together?
  • Where can you download and install Syntex sample models?
  • How can you create your own custom models?
  • What thoughts should go into planning your ...