Chapter 4. Automating Database Deployment on Kubernetes with Helm

In the previous chapter, you learned how to deploy both single-node and multinode databases on Kubernetes by hand, creating one element at a time. We did things the “hard way” on purpose to help maximize your understanding of using Kubernetes primitives to set up the compute, network, and storage resources that a database requires. Of course, this doesn’t represent the experience of running databases in production on Kubernetes, for a couple of reasons.

First, teams typically don’t deploy databases by hand, one YAML file at a time. That can get pretty tedious. And even combining the configurations into a single file could start to get pretty complicated, especially for more sophisticated deployments. Consider the increase in the amount of configuration required in Chapter 3 for Cassandra as a multinode database compared with the single-node MySQL deployment. This won’t scale for large enterprises.

Second, while deploying a database is great, what about keeping it running over time? You need your data infrastructure to remain reliable and performant over the long haul, and data infrastructure is known for requiring a lot of care and feeding. Put another way, the task of running a system is often divided into “day one” (the joyous day when you deploy an application to production) and “day two” (every day after the first, when you need to operate and evolve your application while maintaining high availability).

These considerations around database deployment and operations mirror the larger industry trend toward DevOps, an approach in which development teams take a more active role in supporting applications in production. DevOps practices include the use of automation tools for continuous integration and continuous delivery (CI/CD), shortening the amount of time it takes for code to get from a developer’s desktop into production.

In this chapter, we’ll look at tools that help standardize the deployment of databases and other applications. These tools take an infrastructure as code (IaC) approach, allowing you to represent software installation and configuration options in a format that can be executed automatically, reducing the overall amount of configuration code you have to write. We’ll also emphasize data infrastructure operations in these next two chapters and carry that theme throughout the remainder of the book.

Deploying Applications with Helm Charts

Let’s start by taking a look at a tool that helps you manage the complexity of configuration: Helm. This package manager for Kubernetes is open source and a CNCF graduated project. The concept of a package manager is a common one across multiple programming languages, such as pip for Python, the Node Package Manager (NPM) for JavaScript, and RubyGems for Ruby. Package managers for specific operating systems also exist, such as Apt for Debian-based Linux distributions, or Homebrew for macOS. As shown in Figure 4-1, the essential elements of a package manager system are the packages, the registries where the packages are stored, and the package manager application (or client), which helps package developers publish packages to registries and allows users to locate, install, and update packages on their local systems.

Figure 4-1. Helm, a package manager for Kubernetes

Helm extends the package management concept to Kubernetes, with some interesting differences. If you’ve worked with one of the package managers listed previously, you’ll be familiar with the idea that a package consists of a binary (executable code) as well as metadata describing the binary, such as its functionality, API, and installation instructions. In Helm, the packages are called charts. Charts describe how to build a Kubernetes application piece by piece by using the Kubernetes resources for compute, networking, and storage introduced in previous chapters, such as Pods, Services, and PersistentVolumeClaims. For compute workloads, the descriptions point to container images that reside in public or private container registries.

Helm allows charts to reference other charts as dependencies, which provides a great way to compose applications by creating assemblies of charts. For example, you could define an application such as the WordPress/MySQL example from the previous chapter by defining a chart for your WordPress deployment that referenced a chart defining a MySQL deployment that you wish to reuse. Or, you might even find a Helm chart that defines an entire WordPress application including the database.
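To make this a bit more concrete, chart dependencies are declared in a chart’s Chart.yaml file. The following is a minimal, hypothetical sketch of how a custom WordPress chart might declare a dependency on the Bitnami MySQL chart; the chart name and version shown here are placeholders rather than values taken from a real chart:

# Chart.yaml (hypothetical excerpt)
apiVersion: v2
name: my-wordpress
version: 0.1.0
dependencies:
  - name: mysql
    version: 8.x.x
    repository: https://charts.bitnami.com/bitnami

Running helm dependency update in the chart directory would then download the referenced chart into its charts/ subdirectory.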

Kubernetes Environment Prerequisites

The examples in this chapter assume you have access to a Kubernetes cluster with a couple of characteristics:

  • The cluster should have at least three Worker Nodes, in order to demonstrate mechanisms Kubernetes provides to allow you to request Pods to be spread across a cluster. You can create a simple cluster on your desktop by using an open source distribution called kind. See the kind quick start guide for instructions on installing kind and creating a multinode cluster. The code for this example also contains a configuration file you may find useful to create a simple three-node kind cluster; a similar configuration is sketched after this list.

  • You will also need a StorageClass that supports dynamic provisioning. You may wish to follow the instructions in “StorageClasses” for installing a simple StorageClass and provisioner that expose local storage.
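If you’d like to define such a cluster yourself, a kind configuration along the following lines describes one control plane node and three Worker Nodes (this is a minimal sketch; the filename kind-config.yaml is just a suggestion):

# kind-config.yaml: one control plane node plus three Worker Nodes
kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
  - role: control-plane
  - role: worker
  - role: worker
  - role: worker

You would then create the cluster with kind create cluster --config kind-config.yaml.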

Using Helm to Deploy MySQL

To make things a bit more concrete, let’s use Helm to deploy the databases you worked with in Chapter 3. First, if it’s not already on your system, you’ll need to install Helm by using the documentation on the Helm website. Next, add the Bitnami Helm repository:

helm repo add bitnami https://charts.bitnami.com/bitnami

The Bitnami Helm repository contains a variety of Helm charts to help you deploy infrastructure such as databases, analytics engines, and log management systems, as well as applications including ecommerce, customer relationship management (CRM), and you guessed it: WordPress. You can find the source code for the charts in the Bitnami Charts repository on GitHub. The README for this repo provides helpful instructions for using the charts in various Kubernetes distributions.

Now, let’s use the Helm chart provided in the bitnami repository to deploy MySQL. In Helm’s terminology, each deployment is known as a release. The simplest possible release that you could create using this chart would look something like this:

# don’t execute me yet!
helm install mysql bitnami/mysql

If you execute this command, it will create a release called mysql using the Bitnami MySQL Helm chart with its default settings. As a result, you’d have a single MySQL node. Since you’ve already deployed a single node of MySQL manually in Chapter 3, let’s do something a bit more interesting this time and create a MySQL cluster. To do this, you’ll create a values.yaml file with contents like the following, or you can reuse the sample provided in the source code:

architecture: replication
secondary:
  replicaCount: 2

The settings in this values.yaml file let Helm know that you want to use options in the Bitnami MySQL Helm chart to deploy MySQL in a replicated architecture in which there is a primary node and two secondary nodes.

MySQL Helm Chart Configuration Options

If you examine the default values.yaml file provided with the Bitnami MySQL Helm chart, you’ll see quite a few options available beyond the simple selections shown here. The configurable values include the following:

  • Images to pull and their locations

  • The Kubernetes StorageClass that will be used to generate PersistentVolumes

  • Security credentials for user and administrator accounts

  • MySQL configuration settings for primary and secondary replicas

  • Number of secondary replicas to create

  • Details of liveness and readiness probes

  • Affinity and anti-affinity settings

  • Pod disruption budgets for managing high availability of the database

You’ll already be familiar with many of these concepts; others, like affinity and Pod disruption budgets, are covered later in the book.
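
To give a sense of how several of these options fit together, here is a hypothetical values.yaml that sets a few of them at once. The key names follow the Bitnami MySQL chart’s conventions at the time of writing, but you should confirm them against the output of helm show values bitnami/mysql for the chart version you’re installing. This file is only for illustration; the examples that follow continue to use the simpler file shown earlier:

# illustrative only; verify key names against the chart's own values.yaml
architecture: replication
auth:
  database: wordpress
  username: wordpress
secondary:
  replicaCount: 2
primary:
  persistence:
    size: 10Gi
global:
  storageClass: standard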

Once you’ve created the values.yaml file, you can start the cluster using this command:

helm install mysql bitnami/mysql -f values.yaml

After running the command, you’ll see the status of the install from Helm, plus instructions that are provided with the chart under NOTES:

NAME: mysql
LAST DEPLOYED: Thu Oct 21 20:39:19 2021
NAMESPACE: default
STATUS: deployed
REVISION: 1
TEST SUITE: None
NOTES:
…

We’ve omitted the notes here since they are a bit lengthy. They describe suggested commands for monitoring the status as MySQL initializes, how clients and administrators can connect to the database, how to upgrade the database, and more.

Use Namespaces to Help Isolate Resources

Since we did not specify a Namespace, the Helm release has been installed in the default Kubernetes Namespace unless you’ve separately configured a Namespace in your kubeconfig. If you want to install a Helm release in its own Namespace in order to work with its resources more effectively, you could run something like the following:

helm install mysql bitnami/mysql \
  --namespace mysql --create-namespace

This creates a Namespace called mysql and installs the mysql release inside it.

To obtain information about the Helm releases you’ve created, use the helm list command, which produces output such as this (formatted for readability):

helm list
NAME   NAMESPACE  REVISION  UPDATED   
mysql  default    1         2021-10-21 20:39:19

STATUS    CHART        APP VERSION
deployed  mysql-8.8.8  8.0.26

If you haven’t installed the release in its own Namespace, it’s still simple to see the compute resources that Helm has created on your behalf by running kubectl get all, because they have all been labeled with the name of your release. It may take several minutes for all the resources to initialize, but when complete, it will look something like this:

kubectl get all
NAME                    READY   STATUS    RESTARTS   AGE
pod/mysql-primary-0     1/1     Running   0          3h40m
pod/mysql-secondary-0   1/1     Running   0          3h40m
pod/mysql-secondary-1   1/1     Running   0          3h38m

NAME                              TYPE       CLUSTER-IP     EXTERNAL-IP  PORT     
service/mysql-primary             ClusterIP  10.96.107.156  <none>       ...  
service/mysql-primary-headless    ClusterIP  None           <none>       ...  
service/mysql-secondary           ClusterIP  10.96.250.52   <none>       ... 
service/mysql-secondary-headless  ClusterIP  None           <none>       ...  

NAME                               READY   AGE
statefulset.apps/mysql-primary     1/1     3h40m
statefulset.apps/mysql-secondary   2/2     3h40m

As you can see, Helm has created two StatefulSets, one for primary replicas and one for secondary replicas. The mysql-primary StatefulSet is managing a single MySQL Pod containing a primary replica, while the mysql-secondary StatefulSet is managing two MySQL Pods containing secondary replicas. See if you can determine which Kubernetes Worker Node each MySQL replica is running on by using the kubectl describe pod command.

From the preceding output, you’ll also notice two Services created for each StatefulSet: a headless Service and a Service with a dedicated cluster IP address. Since kubectl get all tells you about only compute resources and services, you might also be wondering about the storage resources. To check on these, run the kubectl get pv command. Assuming you have a StorageClass installed that supports dynamic provisioning, you should see PersistentVolumes that are bound to PersistentVolumeClaims named data-mysql-primary-0, data-mysql-secondary-0, and data-mysql-secondary-1.

In addition to the resources we’ve discussed, installing the chart has also resulted in the creation of a few additional resources that we’ll explore next.

Namespaces and Kubernetes Resource Scope

If you have chosen to install your Helm release in a Namespace, you’ll need to specify the Namespace on most of your kubectl get commands in order to see the created resources. The exception is kubectl get pv, because PersistentVolumes are one of the Kubernetes resources that are not Namespaced; that is, they can be used by Pods in any Namespace. To learn more about which Kubernetes resources in your cluster are Namespaced and which are not, run the command kubectl api-resources.
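For example, you can list only the cluster-scoped (non-Namespaced) resource types like this:

kubectl api-resources --namespaced=false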

How Helm Works

Did you wonder what happened when you executed the helm install command with a provided values file? To understand what’s going on, let’s take a look at the contents of a Helm chart, as shown in Figure 4-2. As we discuss these contents, it will also be helpful to look at the source code of the MySQL Helm chart you just installed.

Figure 4-2. Customizing a Helm release using a values.yaml file

Looking at the contents of a Helm chart, you’ll notice the following:

README file
This explains how to use the chart. These instructions are provided along with the chart in registries.
Chart.yaml file
This contains metadata about the chart such as its name, publisher, version, keywords, and any dependencies on other charts. These properties are useful when searching Helm registries to find charts.
values.yaml file
This lists out the configurable values supported by the chart and their default values. These files typically contain a good number of comments that explain the available options. For the Bitnami MySQL Helm chart, a lot of options are available, as we’ve noted.
templates directory
This contains Go templates that define the chart. The templates include a NOTES.txt file used to generate the output you saw previously after executing the helm install command, and one or more YAML files that describe a pattern for a Kubernetes resource. These YAML files may be organized in subdirectories (for example, the template that defines a StatefulSet for MySQL primary replicas). Finally, a _helpers.tpl file defines named template snippets that the other templates can reuse. Some of the templates may be used multiple times or not at all, depending on the selected configuration values.

When you execute the helm install command, the Helm client makes sure it has an up-to-date copy of the chart you’ve named by checking with the source repository. Then it uses the templates to generate YAML configuration code, overriding default values from the chart’s values.yaml file with any values you’ve provided. Finally, it applies this configuration to your currently configured Kubernetes cluster through the Kubernetes API, using the same cluster connection settings (kubeconfig) that kubectl uses.
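
To make the template mechanism concrete, here is a simplified, hypothetical excerpt of what a StatefulSet template might look like; the real templates in the Bitnami chart are considerably more elaborate, but the substitution works the same way:

# templates/secondary/statefulset.yaml (simplified, hypothetical excerpt)
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: {{ .Release.Name }}-secondary
spec:
  replicas: {{ .Values.secondary.replicaCount }}

With the values.yaml shown earlier, .Values.secondary.replicaCount resolves to 2, so the rendered manifest contains replicas: 2.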

If you’d like to see the configuration that a Helm chart will produce before applying it, you can use the handy template command. It supports the same syntax as the install command:

helm template mysql bitnami/mysql -f values.yaml

Running this command will produce quite a bit of output, so you may want to redirect it to a file (append > values-template.yaml to the command) so you can take a longer look. Alternatively, you can look at the copy we have saved in the source code repository.

You’ll notice that several types of resources are created, as summarized in Figure 4-3. Many of the resources shown have been discussed, including the StatefulSets for managing the primary and secondary replicas, each with its own service (the chart also creates headless services that are not shown in the figure). Each Pod has its own PersistentVolumeClaim that is mapped to a unique PersistentVolume.

Figure 4-3 also includes resource types we haven’t discussed previously. Notice first that each StatefulSet has an associated ConfigMap that is used to provide a common set of configuration settings to its Pods. Next, notice the Secret named mysql, which stores passwords needed for accessing various interfaces exposed by the database nodes. Finally, a ServiceAccount resource is applied to every Pod created by this Helm release.

Let’s focus on some interesting aspects of this deployment, including the usage of labels, ServiceAccounts, Secrets, and ConfigMaps.

Figure 4-3. Deploying MySQL using the Bitnami Helm chart

Labels

If you look through the output from the helm template, you’ll notice that the resources have a common set of labels:

  labels:
    app.kubernetes.io/name: mysql
    helm.sh/chart: mysql-8.8.8
    app.kubernetes.io/instance: mysql
    app.kubernetes.io/managed-by: Helm

These labels help identify the resources as being part of the mysql application and indicate that they are managed by Helm using a specific chart version. They also allow resources to be selected by label, which other resource definitions and tools frequently rely on.
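
For example, you can use a label selector with kubectl to list only the resources belonging to this release:

kubectl get all -l app.kubernetes.io/instance=mysql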

ServiceAccounts

Kubernetes clusters make a distinction between human users and applications for access control purposes. A ServiceAccount is a Kubernetes resource that represents an application and what it is allowed to access. For example, a ServiceAccount may be given access to some portions of the Kubernetes API, or access to one or more secrets containing privileged information such as login credentials. This latter capability is used in your Helm installation of MySQL to share credentials between Pods.

Every Pod created in Kubernetes has a ServiceAccount assigned to it. If you do not specify one, the default ServiceAccount is used. Installing the MySQL Helm chart creates a ServiceAccount called mysql. You can see the specification for this resource in the generated template:

apiVersion: v1
kind: ServiceAccount
metadata:
  name: mysql
  namespace: default
  labels: ...
  annotations:
secrets:
  - name: mysql

As you can see, this ServiceAccount has access to a Secret called mysql, which we’ll discuss shortly. A ServiceAccount can also have an additional type of Secret known as an imagePullSecret. These Secrets are used when an application needs to use images from a private registry.

By default, a ServiceAccount does not have any access to the Kubernetes API. To give this ServiceAccount the access it needs, the MySQL Helm chart creates a Role specifying the Kubernetes resources and operations the application may use, and a RoleBinding to associate the ServiceAccount with the Role. We’ll discuss ServiceAccounts and role-based access in Chapter 5.
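
While we’ll defer the details to Chapter 5, the general shape of such a pairing looks like the following minimal sketch; the names and rules here are illustrative rather than what the chart actually generates:

apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: mysql
  namespace: default
rules:
  - apiGroups: [""]
    resources: ["secrets"]
    verbs: ["get", "list"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: mysql
  namespace: default
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: mysql
subjects:
  - kind: ServiceAccount
    name: mysql
    namespace: default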

Secrets

As you learned in Chapter 2, a Secret provides secure access to information you need to keep private. Your mysql Helm release contains a Secret called mysql containing login credentials for the MySQL instances themselves:

apiVersion: v1
kind: Secret
metadata:
  name: mysql
  namespace: default
  labels: ...
type: Opaque
data:
  mysql-root-password: "VzhyNEhIcmdTTQ=="
  mysql-password: "R2ZtNkFHNDhpOQ=="
  mysql-replication-password: "bDBiTWVzVmVORA=="

The three passwords represent different types of access: the mysql-root-password provides administrative access to the MySQL node, while the mysql-replication-password is used for nodes to communicate for the purposes of data replication between nodes. The mysql-password is used by client applications to access the database to write and read data.
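
Note that these values are Base64 encoded rather than encrypted. If you need to retrieve one, for example to log in as the MySQL administrator, you can decode it with a command along these lines (the base64 flag may differ slightly depending on your operating system):

kubectl get secret mysql -o jsonpath='{.data.mysql-root-password}' | base64 --decode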

ConfigMaps

The Bitnami MySQL Helm chart creates Kubernetes ConfigMap resources to represent the configuration settings used for Pods that run the MySQL primary and secondary replica nodes. ConfigMaps store configuration data as key-value pairs. For example, the ConfigMap created by the Helm chart for the primary replicas looks like this:

apiVersion: v1
kind: ConfigMap
metadata:
  name: mysql-primary
  namespace: default
  labels: ...
data:
  my.cnf: |-
   
    [mysqld]
    default_authentication_plugin=mysql_native_password
    ...

In this case, the key is the name my.cnf, which represents a filename, and the value is a multiline set of configuration settings that represent the contents of a configuration file (which we’ve abbreviated here). Next, look at the definition of the StatefulSet for the primary replicas. Notice that the contents of the ConfigMap are mounted as a read-only file inside each Pod, according to the Pod specification for the StatefulSet (again, we’ve omitted some detail to focus on key areas):

apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: mysql-primary
  namespace: default
  labels: ...
spec:
  replicas: 1
  selector:
    matchLabels: ...
  serviceName: mysql-primary
  template:
    metadata:
      annotations: ...
      labels: ...
    spec:
      ...     
      serviceAccountName: mysql
      containers:
        - name: mysql
          image: docker.io/bitnami/mysql:8.0.26-debian-10-r60
          volumeMounts:
            - name: data
              mountPath: /bitnami/mysql
            - name: config
              mountPath: /opt/bitnami/mysql/conf/my.cnf
              subPath: my.cnf
      volumes:
        - name: config
          configMap:
            name: mysql-primary

Mounting the ConfigMap as a volume in a container results in the creation of a read-only file in the mount directory that is named according to the key and has the value as its content. For our example, mounting the ConfigMap in the Pod’s mysql container results in the creation of the file /opt/bitnami/mysql/conf/my.cnf.

This is one of several ways that ConfigMaps can be used in Kubernetes applications:

  • As described in the Kubernetes documentation, you could choose to store configuration data in more granular key-value pairs, which also makes it easier to access individual values in your application.

  • You can also reference individual key-value pairs as environment variables you pass to a container, as sketched after this list.

  • Finally, applications can access ConfigMap contents via the Kubernetes API.
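
As an illustration of the second approach, a container spec can surface a single key from a ConfigMap as an environment variable; the ConfigMap name and key below are hypothetical placeholders:

env:
  - name: MYSQL_EXTRA_FLAGS
    valueFrom:
      configMapKeyRef:
        name: mysql-extra-config
        key: extra-flags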

More Configuration Options

Now that you have a Helm release with a working MySQL cluster, you can point an application to it, such as WordPress. Why not see whether you can adapt the WordPress deployment from Chapter 3 to point to the MySQL cluster you’ve created here?

For further learning, you could also compare your resulting configuration with that produced by the Bitnami WordPress Helm chart, which uses MariaDB instead of MySQL but is otherwise quite similar.

Updating Helm Charts

If you’re running a Helm release in a production environment, chances are you’re going to need to maintain it over time. You might want to update a Helm release for various reasons:

  • A new version of a chart is available.

  • A new version of an image used by your application is available.

  • You want to change the selected options.

To check for a new version of a chart, execute the helm repo update command. Running this command with no options looks for updates in all of the chart repositories you have configured for your Helm client:

helm repo update
Hang tight while we grab the latest from your chart repositories...
...Successfully got an update from the "bitnami" chart repository
Update Complete. ⎈Happy Helming!⎈

Next, you’ll want to make any desired updates to your configured values. If you’re upgrading to a new version of a chart, make sure to check the release notes and documentation of the configurable values. It’s a good idea to test out an upgrade before applying it. The --dry-run option allows you to do this, producing output similar to that of the helm template command:

helm upgrade mysql bitnami/mysql -f values.yaml --dry-run

Using an Overlay Configuration File

One useful option for the upgrade is to specify the values you wish to override in a new configuration file and apply both the old and the new files, something like this:

helm upgrade mysql bitnami/mysql \
  -f values.yaml -f new-values.yaml

Configuration files are applied in the order they appear on the command line, so if you use this approach, make sure the file containing your overriding values appears after your original values file.

Once you’ve applied the upgrade, Helm sets about its work, updating only those resources in the release that are affected by your configuration changes. If you’ve specified changes to the Pod template for a StatefulSet, the Pods will be restarted according to the update policy specified for the StatefulSet, as we discussed in “StatefulSet lifecycle management”.

Uninstalling Helm Charts

When you are finished using your Helm release, you can uninstall it by name:

helm uninstall mysql

Note that Helm does not remove any of the PersistentVolumeClaims or PersistentVolumes that were created for this Helm chart, following the behavior of StatefulSets discussed in Chapter 3.
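
If you’re certain you no longer need the data, you can delete the remaining claims yourself by using the labels applied to the release, with something like the following. This permanently removes the data, and it assumes the claims carry the release’s instance label, which you can confirm with kubectl get pvc --show-labels:

kubectl delete pvc -l app.kubernetes.io/instance=mysql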

Using Helm to Deploy Apache Cassandra

Now let’s switch gears and look at deploying Apache Cassandra by using Helm. In this section, you’ll use another chart provided by Bitnami, so there’s no need to add another repository. You can find the implementation of this chart on GitHub. Helm provides a quick way to see the metadata about this chart:

helm show chart bitnami/cassandra

After reviewing the metadata, you’ll also want to learn about the configurable values. You can examine the values.yaml file in the GitHub repo, or use another option on the show command:

helm show values bitnami/cassandra

The list of options for this chart is shorter than the list for the MySQL chart, because Cassandra doesn’t have the concept of primary and secondary replicas. However, you’ll certainly see similar options for images, StorageClasses, security, liveness and readiness probes, and so on. Some configuration options are unique to Cassandra, such as those having to do with JVM settings and seed nodes (as discussed in Chapter 3).

One interesting feature of this chart is the ability to export metrics from Cassandra nodes. If you set metrics.enabled=true, the chart will inject a sidecar container into each Cassandra Pod that exposes a port that can be scraped by Prometheus. Other values under metrics configure what metrics are exported, the collection frequency, and more. While we won’t use this feature here, metrics reporting is a key part of managing data infrastructure we’ll cover in Chapter 6.
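
If you did want to try it, the override is a single value, which you could capture in a small values file like the following (check helm show values bitnami/cassandra for the related settings under metrics):

metrics:
  enabled: true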

For a simple three-node Cassandra configuration, you could set the replica count to 3 and set other configuration values to their defaults. However, since you’re overriding only a single configuration value, this is a good time to take advantage of Helm’s support for setting values on the command line, instead of providing a values.yaml file:

helm install cassandra bitnami/cassandra --set replicaCount=3

As discussed previously, you can use the helm template command to check the configuration before installing it, or look at the file we’ve saved on GitHub. However, since you’ve already created the release, you can also use this command:

helm get manifest cassandra

Looking through the resources in the YAML, you’ll see that a similar set of infrastructure has been established, as shown in Figure 4-4.

The configuration includes the following:

  • A ServiceAccount referencing a Secret, which contains the password for the cassandra administrator account.

  • A single StatefulSet, with a headless Service used to reference its Pods. The Pods are spread evenly across the available Kubernetes Worker Nodes, which we’ll discuss in the next section. The Service exposes the Cassandra ports used for internode communication (7000, with 7001 used for secure communication via TLS), administration via JMX (7199), and client access via CQL (9042).

Figure 4-4. Deploying Apache Cassandra using the Bitnami Helm chart

This configuration represents a simple Cassandra topology, with all three nodes in a single Datacenter and rack. This simple topology reflects one of the limitations of this chart—it does not provide the ability to create a Cassandra cluster consisting of multiple Datacenters and racks. To create a more complex deployment, you’d have to install multiple Helm releases, using the same clusterName (in this case, you’re using the default name cassandra), but a different Datacenter and rack per deployment. You’d also need to obtain the IP address of a couple of nodes in the first Datacenter to use as additionalSeeds when configuring the releases for the other racks.
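
A sketch of the overriding values for one of the additional releases might look like the following. The names clusterName and additionalSeeds are the options mentioned above, but you should confirm the exact keys, along with how the Datacenter and rack are specified, against helm show values bitnami/cassandra for your chart version; the seed address shown is a placeholder:

clusterName: cassandra
additionalSeeds:
  - 10.244.1.5   # replace with the address of a seed node in the first Datacenter
# plus the Datacenter and rack settings for this release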

Affinity and Anti-Affinity

As shown in Figure 4-4, the Cassandra nodes are spread evenly across the Worker Nodes in your cluster. To verify this in your own Cassandra release, you could run something like the following:

kubectl describe pods | grep "^Name:" -A 3
Name:         cassandra-0
Namespace:    default
Priority:     0
Node:         kind-worker/172.20.0.7
--
Name:         cassandra-1
Namespace:    default
Priority:     0
Node:         kind-worker2/172.20.0.6
--
Name:           cassandra-2
Namespace:      default
Priority:       0
Node:           kind-worker3/172.20.0.5

As you can see, each Cassandra node is running on a different Worker Node. If your Kubernetes cluster has at least three Worker Nodes and no other workloads, you’ll likely observe similar behavior. While this even allocation could happen naturally in a cluster that has an even load across Worker Nodes, that’s probably not the case in your production environment. To maximize the availability of your data, you’ll want to honor the intent of Cassandra’s architecture by running its nodes on different machines.

To encourage this isolation, the Bitnami Helm chart uses Kubernetes’s affinity capabilities, specifically anti-affinity. If you examine the generated configuration for the Cassandra StatefulSet, you’ll see the following:

apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: cassandra
  namespace: default
  labels: ...
spec:
  ...
  template:
    metadata:
      labels: ...
    spec:
      ...
      affinity:
        podAffinity:
         
        podAntiAffinity:
          preferredDuringSchedulingIgnoredDuringExecution:
            - podAffinityTerm:
                labelSelector:
                  matchLabels:
                    app.kubernetes.io/name: cassandra
                    app.kubernetes.io/instance: cassandra
                namespaces:
                  - "default"
                topologyKey: kubernetes.io/hostname
              weight: 1
        nodeAffinity:

As shown here, the Pod template specification lists three possible types of affinity, with only the podAntiAffinity being defined. What do these concepts mean?

Pod affinity
The preference that a Pod is scheduled onto a node where another specific Pod is running. For example, Pod affinity could be used to colocate a web server with its cache.
Pod anti-affinity
The opposite of Pod affinity—that is, a preference that a Pod not be scheduled on a node where another identified Pod is running. This is the constraint used in this example, as we’ll discuss shortly.
Node affinity
A preference that a Pod be run on a node with specific characteristics.

Each type of affinity can be expressed as either hard or soft constraints. These are known as requiredDuringSchedulingIgnoredDuringExecution and preferredDuringSchedulingIgnoredDuringExecution. The first constraint specifies rules that must be met before a Pod is scheduled on a node, while the second specifies a preference that the scheduler will attempt to meet but may relax if necessary in order to schedule the Pod.

IgnoredDuringExecution implies that the constraints apply only when the Pods are first scheduled. In the future, RequiredDuringExecution variants of these options (such as requiredDuringSchedulingRequiredDuringExecution) are planned. These will ask Kubernetes to evict Pods (that is, move them to another node) that no longer meet the criteria, for example, because their labels have changed.

Looking at the preceding example, the Pod template specification for the Cassandra StatefulSet specifies an anti-affinity rule using the labels that are applied to each Cassandra Pod. The net effect is that Kubernetes will try to spread the Pods across the available Worker Nodes.
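
For contrast, if you wanted the scheduler to refuse outright to place two Cassandra Pods on the same Worker Node, rather than merely prefer not to, the hard form of the same rule would look something like this sketch (note that the required form lists the affinity terms directly, without a weight):

        podAntiAffinity:
          requiredDuringSchedulingIgnoredDuringExecution:
            - labelSelector:
                matchLabels:
                  app.kubernetes.io/name: cassandra
                  app.kubernetes.io/instance: cassandra
              topologyKey: kubernetes.io/hostname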

Those are the highlights of looking at the Bitnami Helm chart for Cassandra. To clean things up, uninstall the Cassandra release:

helm uninstall cassandra

If you don’t want to work with Bitnami Helm charts any longer, you can also remove the repository from your Helm client:

helm repo remove bitnami

More Kubernetes Scheduling Constraints

Kubernetes supports additional mechanisms for providing hints to its scheduler about Pod placement. One of the simplest is the nodeSelector field, which is very similar to node affinity but has a less expressive syntax: it matches one or more node labels combined with AND logic. Since you may or may not have the required privileges to attach labels to Worker Nodes in your cluster, Pod affinity is often a better option. Taints and tolerations are another mechanism that can be used to configure Worker Nodes to repel specific Pods from being scheduled on those nodes.
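
As a rough sketch of what these look like in a Pod specification (the label and taint keys here are hypothetical):

spec:
  # nodeSelector: schedule this Pod only onto nodes carrying the matching label
  nodeSelector:
    disktype: ssd
  # tolerations: allow this Pod onto nodes tainted with dedicated=database:NoSchedule
  tolerations:
    - key: "dedicated"
      operator: "Equal"
      value: "database"
      effect: "NoSchedule"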

In general, you want to be careful to understand all of the constraints you’re putting on the Kubernetes scheduler from various workloads so as not to overly constrain its ability to place Pods. See the Kubernetes documentation for more information on scheduling constraints. We’ll also look at how Kubernetes allows you to plug in different schedulers in “Alternative Schedulers for Kubernetes”.

Helm, CI/CD, and Operations

Helm is a powerful tool focused on one primary task: deploying complex applications to Kubernetes clusters. To get the most benefit from Helm, you’ll want to consider how it fits into your larger CI/CD toolset:

  • Automation servers such as Jenkins automatically build, test, and deploy software according to scripts known as jobs. These jobs are typically run based on predefined triggers, such as a commit to a source repository. Helm charts can be referenced in jobs to install an application under test and its supporting infrastructure in a Kubernetes cluster.

  • IaC automation tools such as Terraform allow you to define templates and scripts that describe how to create infrastructure in a variety of cloud environments. For example, you could write a Terraform script that automates the creation of a new VPC within a specific cloud provider and the creation of a new Kubernetes cluster within that VPC. The script could then use Helm to install applications within the Kubernetes cluster.

While overlaps certainly occur in the capabilities these tools provide, you’ll want to consider the strengths and limitations of each as you construct your toolset. For this reason, we want to make sure to note that Helm has limitations when it comes to managing the operations of applications that it deploys. To get a good picture of the challenges involved, we spoke to a practitioner who has built assemblies of Helm charts to manage a complex database deployment. This discussion begins to introduce concepts like Kubernetes Custom Resource Definitions (CRDs) and the operator pattern, both of which we’ll cover in depth in Chapter 5.

As John Sanda notes in his commentary, Helm is a powerful tool for scripting the deployment of applications consisting of multiple Kubernetes resources, but can be less effective at managing more complex operational tasks. As you’ll see in the chapters to come, a common pattern used for data infrastructure and other complex applications is to use a Helm chart to deploy an operator, which can then in turn manage both the deployment and lifecycle of the application.

Summary

In this chapter, you’ve learned how a package management tool like Helm can help you manage the deployment of applications on Kubernetes, including your database infrastructure. Along the way, you’ve also learned how to use some additional Kubernetes resources like ServiceAccounts, Secrets, and ConfigMaps. Now it’s time to round out our discussion of running databases on Kubernetes. In the next chapter, we’ll take a deeper dive into managing database operations on Kubernetes by using the operator pattern.
