
Intelligent Document Capture with Ephesoft by Clifford Laurin, Michael Muller, Ike Kavas, Pat Myers


Fuzzy DB

When the "fuzzy database" is configured, Ephesoft can populate document fields with content from a row in an external database. Ephesoft automatically selects the row whose values match the most content in the current document. Ephesoft uses the Lucene full-text search engine to implement this feature.
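To make the row selection concrete, here is a minimal sketch of the idea in Python. It is not Ephesoft's implementation and does not use Lucene: plain token overlap stands in for Lucene's full-text scoring, and the vendor rows, the `City` column, the OCR text, and the function names are all hypothetical, chosen only for illustration.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def best_matching_row(document_text, rows):
    """Return the row whose column values share the most tokens with the document.

    Token overlap is a simplified stand-in for a full-text search score.
    Returns None when no row shares any token with the document.
    """
    doc_tokens = set(tokenize(document_text))
    best_row, best_score = None, 0
    for row in rows:
        # Collect the tokens from every column value in this row.
        row_tokens = set()
        for value in row.values():
            row_tokens.update(tokenize(str(value)))
        score = len(doc_tokens & row_tokens)
        if score > best_score:
            best_row, best_score = row, score
    return best_row


# Hypothetical vendor table; a real setup would read these rows from a database.
vendors = [
    {"VendorID": "V-1001", "VendorName": "Acme Office Supplies", "City": "Springfield"},
    {"VendorID": "V-1002", "VendorName": "Globex Industrial", "City": "Shelbyville"},
]

# Hypothetical OCR output for one invoice.
ocr_text = "INVOICE  Acme Office Supplies, 12 Main St, Springfield  Customer No. 48213"

match = best_matching_row(ocr_text, vendors)
print(match["VendorID"], match["VendorName"])
```

The values of the winning row are what would then be copied into the document's fields. A real full-text engine also weights rare terms more heavily and tolerates small spelling differences, which this sketch leaves out.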

Note

As of version 3.0, Ephesoft can also match an extracted field value to a column in a database.

Let's configure Ephesoft to populate fields on our invoice documents using information from a database. Assume that we have a database that contains vendor information, including each vendor's name and ID. This vendor ID differs from the customer number we extracted from the document. First, create two new fields: VendorID and VendorName.
