2.1 Understanding dataset management service
2.1.1 Why deep learning systems need dataset management
2.1.2 Dataset management design principles
2.1.3 The paradoxical character of datasets
2.2 Touring a sample dataset management service
2.2.1 Playing with the sample service
2.2.2 Users, user scenarios, and the big picture
2.2.3 Data ingestion API
2.2.4 Training dataset fetching API
2.2.5 Internal dataset storage
2.2.6 Data schemas
2.2.7 Adding new dataset type (IMAGE_CLASS)
2.2.8 Service design recap
2.3 Open source approaches
2.3.1 Delta Lake and Petastorm with Apache Spark family
2.3.2 Pachyderm with cloud object storage
Summary