O'Reilly logo

Intelligent Document Capture with Ephesoft - Second Edition by Jon Solove, Michael Muller, Ike Kavas, Pat Myers

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Multiple layouts for a single document type

Often, documents of a single type will vary widely in appearance. Invoices from different vendors, for example, will contain similar information, but the format of this information, the layout of the page, and the labels may all vary. When this happens, you must first assess the content of the samples. If the text on the documents is very similar, then it may not be necessary to include samples of each format for classification. Similarly, if the field labels are consistent, it may not be necessary to create new extraction rules for the content.

Note

When analyzing new documents of similar types, you must also decide if it is best to create a new document type (copy an existing document type) or include ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required