Book description
Entity Resolution and Information Quality presents topics and definitions, and clarifies confusing terminologies regarding entity resolution and information quality. It takes a very wide view of IQ, including its six-domain framework and the skills formed by the International Association for Information and Data Quality {IAIDQ).The book includes chapters that cover the principles of entity resolution and the principles of Information Quality, in addition to their concepts and terminology. It also discusses the Fellegi-Sunter theory of record linkage, the Stanford Entity Resolution Framework, and the Algebraic Model for Entity Resolution, which are the major theoretical models that support Entity Resolution. In relation to this, the book briefly discusses entity-based data integration (EBDI) and its model, which serve as an extension of the Algebraic Model for Entity Resolution. There is also an explanation of how the three commercial ER systems operate and a description of the non-commercial open-source system known as OYSTER. The book concludes by discussing trends in entity resolution research and practice. Students taking IT courses and IT professionals will find this book invaluable.
- First authoritative reference explaining entity resolution and how to use it effectively
- Provides practical system design advice to help you get a competitive advantage
- Includes a companion site with synthetic customer data for applicatory exercises, and access to a Java-based Entity Resolution program.
Table of contents
- Cover Image
- Table of Contents
- Front matter
- Copyright
- Dedication
- Foreword
- Preface
- Acknowledgements
- 1. Principles of Entity Resolution
- 2. Principles of Information Quality
- 3. Entity Resolution Models
- 4. Entity-Based Data Integration
- 5. Entity Resolution Systems
- 6. The OYSTER Project
- 7. Trends in Entity Resolution Research and Applications
- Bibliography
- Glossary
- Appendix A
- Index
Product information
- Title: Entity Resolution and Information Quality
- Author(s):
- Release date: January 2011
- Publisher(s): Morgan Kaufmann
- ISBN: 9780123819734
You might also like
book
40 Algorithms Every Programmer Should Know
Learn algorithms for solving classic computer science problems with this concise guide covering everything from fundamental …
book
Building Event-Driven Microservices
Organizations today often struggle to balance business requirements with ever-increasing volumes of data. Additionally, the demand …
book
Entity Information Life Cycle for Big Data
Entity Information Life Cycle for Big Data walks you through the ins and outs of managing …
book
Architecting Data-Intensive Applications
Architect and design data-intensive applications and, in the process, learn how to collect, process, store, govern, …