Technical requirements
Transformations – making raw data more valuable
    Cooking, baking, and data transformations
    Transformations as part of a pipeline
Types of data transformation tools
    Apache Spark
    Hadoop and MapReduce
    SQL
    GUI-based tools
Data preparation transformations
    Protecting PII data
    Optimizing the file format
    Optimizing with data partitioning
    Data cleansing
Business use case transforms
    Data denormalization
    Enriching data
    Pre-aggregating data
    Extracting metadata from unstructured data
Working with change data capture (CDC) data
    Traditional approaches – data upserts and SQL views
    Modern approaches – the transactional data lake
Hands-on – joining datasets with AWS Glue Studio
    Creating a new data lake zone – the curated zone
    Creating a new IAM role for the Glue job
    Configuring a denormalization transform using AWS Glue Studio
    Finalizing the denormalization transform job to write to S3
    Create a transform job to join streaming and film data using AWS Glue Studio
Summary