Metadata is data about data. In the context of data ingestion, it means discovering everything you need to know about the structure of your source data. At a minimum, this means having full details of the following:

  • Tables used
  • Columns available
  • Data types
  • Length of data in columns
  • Column nullability
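
As a rough illustration, the items in this list can be read programmatically from a database's own catalog. The sketch below uses Python's built-in sqlite3 module with an in-memory database purely as a stand-in source; the client table, its columns, and the describe_columns helper are hypothetical names invented for the example. Other database systems expose the same details through their own system catalog or INFORMATION_SCHEMA views, so treat this as one possible approach rather than the definitive method.

```python
import re
import sqlite3


def describe_columns(conn, table):
    """Return one dict per column: name, declared type, length, nullability.

    `table` is interpolated into the PRAGMA, so pass trusted names only.
    """
    rows = conn.execute(f"PRAGMA table_info({table})").fetchall()
    columns = []
    # Each row is (cid, name, declared type, notnull flag, default, pk position).
    for _cid, name, decl_type, notnull, _default, _pk in rows:
        # A declared type such as VARCHAR(50) carries its length in parentheses.
        match = re.search(r"\((\d+)", decl_type)
        columns.append({
            "column": name,
            "type": decl_type.split("(")[0].strip().upper(),
            "length": int(match.group(1)) if match else None,
            "nullable": not notnull,
        })
    return columns


# Hypothetical source table, created only so the example is self-contained.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE client ("
    "id INTEGER NOT NULL, name VARCHAR(50) NOT NULL, town VARCHAR(30))"
)

# Tables used
tables = [row[0] for row in conn.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
)]

# Columns available, data types, lengths, and nullability
client_columns = describe_columns(conn, "client")
for column in client_columns:
    print(column)
```

Note that SQLite records the declared length but does not enforce it, which is one reason the equivalent query against your real source system is the one to trust.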

For certain sources, you should also know:

  • Primary and unique keys
  • Indexes
  • Triggers
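
Keys, indexes, and triggers can usually be discovered in the same way. The following sketch again assumes SQLite (via Python's sqlite3 module) as a stand-in source; the tables, index names, trigger name, and the describe_constraints helper are all hypothetical, and the catalog queries will differ on other database systems.

```python
import sqlite3


def describe_constraints(conn, table):
    """Collect primary key columns, indexes, and triggers for one table.

    `table` is interpolated into the PRAGMAs, so pass trusted names only.
    """
    # In PRAGMA table_info, field 5 is the column's position in the
    # primary key (0 when the column is not part of it).
    info = conn.execute(f"PRAGMA table_info({table})").fetchall()
    key_rows = sorted((row for row in info if row[5] > 0), key=lambda row: row[5])
    primary_key = [row[1] for row in key_rows]

    # PRAGMA index_list yields (seq, name, unique flag, ...) per index;
    # PRAGMA index_info yields (seqno, cid, column name) per indexed column.
    indexes = {}
    for row in conn.execute(f"PRAGMA index_list({table})").fetchall():
        index_name, is_unique = row[1], bool(row[2])
        index_columns = [
            col[2] for col in conn.execute(f"PRAGMA index_info({index_name})")
        ]
        indexes[index_name] = {"unique": is_unique, "columns": index_columns}

    # Triggers are listed in sqlite_master against the table they fire on.
    triggers = [row[0] for row in conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'trigger' AND tbl_name = ? ORDER BY name",
        (table,),
    )]
    return {"primary_key": primary_key, "indexes": indexes, "triggers": triggers}


# Hypothetical schema, created only so the example is self-contained.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE client (
    id INTEGER PRIMARY KEY,
    email VARCHAR(100) NOT NULL,
    town VARCHAR(30)
);
CREATE UNIQUE INDEX ux_client_email ON client (email);
CREATE INDEX ix_client_town ON client (town);
CREATE TABLE client_audit (client_id INTEGER, changed_at TEXT);
CREATE TRIGGER trg_client_update AFTER UPDATE ON client
BEGIN
    INSERT INTO client_audit VALUES (NEW.id, datetime('now'));
END;
""")

client_constraints = describe_constraints(conn, "client")
print(client_constraints)
```

Knowing about triggers in particular matters when a source table is modified as a side effect of other activity, since that can change what an ingestion process sees between runs.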

Exactly how much of this information is essential will depend on the process you are building and on each individual project. Some of it will be merely useful, but other aspects could be key to a successful ...

