76 Patterns: Information Aggregation and Data Integration with DB2 Information Integrator
and manipulate that data. The data may be stored in files and accessed through
file I/O routines or may be stored in a database with more structured and
managed access methods. Although omitted for simplicity of representation, an
Application Server/Services node can be substituted for the Data
Server/Services node where access to the data is provided through an
application API rather than directly through the database management system.
The Population node is a specialized processing node designed and optimized
for reading data from and writing data to data stores, and for transforming the
data, often in sophisticated ways, as it passes through. Some Population nodes
are further specialized for handling the data under different circumstances,
such as efficient throughput of large batches of records that require extensive
transformation, or fast throughput of individual records in near real-time.
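The two specializations mentioned above can be contrasted with a minimal
sketch. It is illustrative only and does not represent any DB2 Information
Integrator API: the names populate_batch, populate_record, and to_int_amount,
the dictionary record shape, and the use of an in-memory list as the target
data store are all assumptions made for the example.

```python
from typing import Callable, Iterable, List

Record = dict
Transform = Callable[[Record], Record]


def to_int_amount(record: Record) -> Record:
    """Hypothetical transformation: convert the 'amount' field to an integer."""
    return {**record, "amount": int(record["amount"])}


def populate_batch(source: Iterable[Record], transform: Transform,
                   target: List[Record], batch_size: int = 1000) -> int:
    """Batch-oriented population: transform records and write them in chunks."""
    written = 0
    batch: List[Record] = []
    for record in source:
        batch.append(transform(record))
        if len(batch) >= batch_size:
            target.extend(batch)  # one bulk write per full batch
            written += len(batch)
            batch = []
    if batch:  # write the final, partially filled batch
        target.extend(batch)
        written += len(batch)
    return written


def populate_record(record: Record, transform: Transform,
                    target: List[Record]) -> None:
    """Near real-time population: transform and write one record immediately."""
    target.append(transform(record))
```

In this sketch the batch variant amortizes the cost of writing over many
records, while the per-record variant trades that efficiency for lower latency
on each individual record.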
Multiple data sources may be involved in the base Population runtime pattern
process, and reasonably sophisticated filtering, cleansing, and transformations
may occur within the Population function. The main point is that this process
occurs in a single step, with no intermediate data store between the sources
and the target.
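As a rough illustration of a single-step population, the following sketch
reads from two sources, then filters, cleanses, transforms, and writes each
record in one pass. The function name populate_single_step, the field names
(customer_id, name, country), and the in-memory lists standing in for the
source and target data stores are assumptions made for this example, not part
of the pattern definition.

```python
from typing import Iterable, List

Record = dict


def populate_single_step(sources: Iterable[Iterable[Record]],
                         target: List[Record]) -> int:
    """Read, filter, cleanse, transform, and write records in a single pass.

    Returns the number of records written to the target.
    """
    written = 0
    for source in sources:
        for record in source:
            # Filter: skip records that have no usable key.
            if record.get("customer_id") is None:
                continue
            # Cleanse: trim whitespace and normalize the capitalization of names.
            name = (record.get("name") or "").strip().title()
            # Transform: map the record to the target layout.
            target.append({
                "customer_id": record["customer_id"],
                "name": name,
                "country": (record.get("country") or "").upper(),
            })
            written += 1
    return written


# Hypothetical source data standing in for two separate data stores.
crm = [
    {"customer_id": 1, "name": "  alice smith ", "country": "us"},
    {"customer_id": None, "name": "ghost", "country": "us"},
]
billing = [{"customer_id": 2, "name": "BOB JONES", "country": "de"}]

warehouse: List[Record] = []
count = populate_single_step([crm, billing], warehouse)
```

No intermediate result is persisted between the read and the write, which is
the property that distinguishes this base pattern from a multi-step variation.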
3.4.3 Population: Multi Step variation pattern
The Application and Runtime patterns for the Population: Multi Step variation
pattern are described here.
Population: Multi Step variation application pattern
Figure 3-8 on page 77 represents the Population: Multi Step variation application
pattern.