Chapter 1. Introduction to Cloud Native Data Infrastructure: Persistence, Streaming, and Batch Analytics

Do you work on solving data problems and find yourself facing the need to modernize? Is your cloud native application limited to the use of microservices and a service mesh? If you deploy applications on Kubernetes (sometimes abbreviated as “K8s”) without including data, you haven’t fully embraced cloud native. Every element of your application should embody the cloud native principles of scale, elasticity, self-healing, and observability, including how you handle data.

Engineers who work with data are primarily concerned with stateful services, and this will be our focus: increasing your skills to manage data in Kubernetes. Our goal in this book is to enrich your journey to cloud native data. If you are just starting with cloud native applications, there is no better time to include every aspect of the stack. This convergence is the future of how we will consume cloud resources.

So, what is this future we are creating together?

For too long, data has lived outside of Kubernetes, creating a lot of extra effort and complexity. We will get into valid reasons for this, but now is the time to combine the entire stack to build applications faster, at the needed scale. Based on current technology, this is very much possible. We’ve moved away from the past of deploying individual servers and toward the future where we will be able to deploy entire virtual datacenters. Development cycles that once took months and years can now be managed in days and weeks. Open source components can now be combined into a single deployment on Kubernetes that is portable from your laptop to the largest cloud provider.

The open source contribution isn’t a tiny part of this, either. Kubernetes and the projects we discuss in this book are under the Apache License 2.0 unless otherwise noted, and for a good reason. If we build infrastructure that can run anywhere, we need a license model that gives us the freedom of choice. Open source is both free-as-in-beer and free-as-in-freedom, and both count when building cloud native applications on Kubernetes. Open source has been the fuel of many revolutions in infrastructure, and this is no exception.

That’s what we are building: the near future reality of fully realized Kubernetes applications. The final component is the most important, and that is you. As a reader of this book, you are one of the people who will create this future. Creating is what we do as engineers. We continuously reinvent the way we deploy complicated infrastructure to respond to increased demand. When the first electronic database system was put online in 1960 for American Airlines, a small army of engineers made sure that it stayed online and worked around the clock. Progress took us from mainframes to minicomputers, to microcomputers, and eventually to the fleet management we do today. Now, that same progression is continuing into cloud native and Kubernetes.

This chapter will examine the components of cloud native applications, the challenges of running stateful workloads, and the essential areas covered in this book. To get started, let’s turn to the building blocks that make up data infrastructure.

Infrastructure Types

In the past 20 years, the approach to infrastructure has slowly forked into two areas that reflect how we deploy distributed applications (as shown in Figure 1-1):

Stateless services
These are services that maintain information only for the immediate lifecycle of the active request—for example, a service for sending formatted shopping cart information to a mobile client. A typical example is an application server that performs the business logic for the shopping cart. However, the information about the shopping cart contents resides external to these services. They need to be online for only a short duration from request to response. The infrastructure used to provide the service can easily grow and shrink with little impact on the overall application, scaling compute and network resources on demand when needed. Since we are not storing critical data in the individual service, that data can be created and destroyed quickly, with little coordination. Stateless services are a crucial architecture element in distributed systems.
Stateful services
These services need to maintain information from one request to the next. Disks and memory store data for use across multiple requests. An example is a database or filesystem. Scaling stateful services is more complex since the information typically requires replication for high availability. This creates the need for consistency and mechanisms to keep data in sync between replicas. These services usually have different scaling methods, both vertical and horizontal. As a result, they require different sets of operational tasks than stateless services. The sketch following Figure 1-1 makes this distinction concrete in code.
Figure 1-1. Stateless versus stateful services
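To make the distinction concrete, here is a minimal sketch in plain Python; the shopping-cart example, class names, and external store are hypothetical. The stateless formatter keeps nothing between requests and can be created or destroyed freely, while the stateful store must hold data across requests and is therefore the part that needs disks, replication, and careful operations.

    # A minimal sketch contrasting stateless and stateful services in plain Python.
    # The shopping-cart example, class names, and external store are hypothetical.

    class StatelessCartFormatter:
        """Formats cart data for a mobile client; holds no state between requests."""

        def __init__(self, cart_store):
            # The actual cart contents live in an external, stateful service.
            self.cart_store = cart_store

        def handle_request(self, user_id: str) -> dict:
            items = self.cart_store.get_items(user_id)  # fetch state from outside
            total = sum(item["price"] for item in items)
            # Nothing is retained here; this instance can be destroyed or replaced
            # at any time without losing data.
            return {"user": user_id, "items": items, "total": total}


    class StatefulCartStore:
        """Holds cart contents across requests; this is the part that needs disks,
        replication, and careful lifecycle management."""

        def __init__(self):
            self._carts = {}  # user_id -> list of items; state survives requests

        def add_item(self, user_id: str, item: dict) -> None:
            self._carts.setdefault(user_id, []).append(item)

        def get_items(self, user_id: str) -> list:
            return self._carts.get(user_id, [])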

In addition to the way information is stored, we’ve also seen a shift toward developing systems that embrace automated infrastructure deployment. These recent advances include the following:

  • Physical servers have given way to virtual machines (VMs) that are easy to deploy and maintain.

  • VMs, in turn, have given way to containers, which are simpler and focused on specific applications.

  • Containers have allowed infrastructure engineers to package an application’s operating system requirements into a single executable.

The use of containers has undoubtedly increased the consistency of deployments, which has made it easier to deploy and run infrastructure in bulk. No system has emerged to orchestrate the resulting explosion of containers quite like Kubernetes, as is evident from its incredible growth, which speaks to how well it solves the problem. The official documentation describes Kubernetes as follows:

Kubernetes is a portable, extensible, open source platform for managing containerized workloads and services that facilitates both declarative configuration and automation. It has a large, rapidly growing ecosystem. Kubernetes services, support, and tools are widely available.

Kubernetes was originally designed for stateless workloads, and that is what it has traditionally done best. Kubernetes has developed a reputation as a “platform for building platforms” in a cloud native way. However, there’s a reasonable argument that a complete cloud native solution has to take data into account. That’s the goal of this book: exploring how we make it possible to build cloud native data solutions on Kubernetes. But first, let’s unpack what “cloud native” means.

What Is Cloud Native Data?

Let’s begin by examining the aspects of cloud native that will help us arrive at a definition of cloud native data. First, consider the definition of cloud native from the Cloud Native Computing Foundation (CNCF):

Cloud native technologies empower organizations to build and run scalable applications in modern, dynamic environments such as public, private, and hybrid clouds. Containers, service meshes, microservices, immutable infrastructure, and declarative APIs exemplify this approach.

These techniques enable loosely coupled systems that are resilient, manageable, and observable. Combined with robust automation, they allow engineers to make high-impact changes frequently and predictably with minimal toil.

Note that this definition describes a goal state, desirable characteristics, and examples of technologies that embody both. Based on this formal definition, we can synthesize the qualities that differentiate a cloud native application from other types of deployments in terms of how it handles data. Let’s take a closer look at these qualities:

Scalability
If a service can produce a unit of work for a unit of resources, adding more resources should increase the amount of work a service can perform. Scalability describes the service’s ability to apply additional resources to produce additional work. Ideally, services should scale infinitely given an infinite amount of compute, network, and storage resources. For data, this means scale without the need for downtime. Legacy systems required a maintenance period while adding new resources, during which all services had to be shut down. With the needs of cloud native applications, downtime is no longer acceptable.
Elasticity

Whereas scalability is about adding resources to meet demand, elasticity is the ability to free those resources when they are no longer needed. The difference between scalability and elasticity is highlighted in Figure 1-2. Elasticity can also be called on-demand infrastructure. In a constrained environment such as a private datacenter, this is critical for sharing limited resources. For cloud infrastructure that charges for every resource used, this is a way to avoid paying for running services you don’t need. When it comes to managing data, this means that we need capabilities to reclaim storage space and optimize our usage—for example, moving older data to less expensive storage tiers. The sketch following this list shows what scaling out and back in can look like in practice.

Figure 1-2. Comparing scalability and elasticity
Self-healing
Bad things happen. When they do, how will your infrastructure respond? Self-healing infrastructure will reroute traffic, reallocate resources, and maintain service levels. With larger and more complex distributed applications being deployed, this is an increasingly important attribute of a cloud native application. This is what keeps you from getting that 3 A.M. wake-up call. For data, this means we need capabilities to detect issues such as missing data or degraded data quality.
Observability
If something fails and you aren’t monitoring it, did it happen? Unfortunately, the answer is yes, and the failures you can’t see are often the worst kind. Distributed applications are highly dynamic, and visibility into every service is critical for maintaining service levels. Interdependencies can create complex failure scenarios, which is why observability is a key part of building cloud native applications. In data systems, the volumes of data that are now commonplace demand efficient ways of monitoring the flow of data and the state of the infrastructure. In most cases, early warnings of issues can help operators avoid costly downtime.
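Scalability and elasticity become tangible when you see them as operations against running infrastructure. The following is a minimal sketch, using the official Kubernetes Python client, of scaling a hypothetical database StatefulSet out to meet demand and back in to release resources; the namespace, name, and replica counts are assumptions, and a production system would drive these changes from observed metrics rather than hardcoded values.

    # Sketch: scale a hypothetical StatefulSet out (scalability) and back in
    # (elasticity) with the official Kubernetes Python client.
    # Assumes a reachable cluster and a StatefulSet named "mydb" in namespace "data".
    from kubernetes import client, config

    def set_replicas(name: str, namespace: str, replicas: int) -> None:
        apps = client.AppsV1Api()
        # Patch only the replica count; the StatefulSet controller reconciles the rest.
        apps.patch_namespaced_stateful_set_scale(
            name=name,
            namespace=namespace,
            body={"spec": {"replicas": replicas}},
        )

    if __name__ == "__main__":
        config.load_kube_config()          # or load_incluster_config() inside a Pod
        set_replicas("mydb", "data", 5)    # scale out to meet demand
        # ... later, when demand subsides ...
        set_replicas("mydb", "data", 3)    # scale back in to release resources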

With all the previous definitions in place, let’s try a definition that expresses these properties:

Cloud native data approaches empower organizations that have adopted the cloud native application methodology to incorporate data holistically, rather than relying on legacy people, processes, and technology, so that data can scale up and down elastically while promoting observability and self-healing. This is exemplified by containerized data, declarative data, data APIs, data meshes, and cloud native data infrastructure (that is, databases, streaming, and analytics technologies that are themselves architected as cloud native applications).

For data infrastructure to keep parity with the rest of our application, we need to incorporate each of these pieces. This includes automation of scale, elasticity, and self-healing. APIs are needed to decouple services and increase developer velocity, as well as to enable you to observe the entire stack of your application and make critical decisions. Taken as a whole, your application and data infrastructure should appear as one unit.

More Infrastructure, More Problems

Whether your infrastructure is in a cloud, on premises, or both (commonly referred to as hybrid), you could spend a lot of time doing manual configuration. Typing things into an editor and doing incredibly detailed configuration work requires deep knowledge of each technology. Over the past 20 years, significant advances have come out of the DevOps community, both in expressing infrastructure as code and in the way we deploy it. This is a critical step in the evolution of modern infrastructure. DevOps has kept us ahead of the scale required for applications, but just barely. Arguably, the same amount of knowledge is needed to fully script a single database server deployment. It’s just that now we can do it a million times over (if needed) with templates and scripts. What has been lacking is a connectedness between the components and a holistic view of the entire application stack. Let’s tackle this problem together. (Foreshadowing: this is a problem that needs to be solved.)

As with any good engineering problem, let’s break it into manageable parts. The first is resource management. Regardless of the many ways we have developed to work at scale, fundamentally, we are trying to manage three things as efficiently as possible: compute, network, and storage, as shown in Figure 1-3. These are the critical resources that every application needs and the fuel that’s burned during growth. Not surprisingly, these are also the resources that carry the monetary cost of a running application. We get rewarded when we use these resources wisely and pay a high price, literally, if we don’t. Anywhere you run your application, these are the most primitive units. When on premises, everything is bought and owned. When using the cloud, we’re renting.

Figure 1-3. Fundamental resources of cloud applications: compute, network, and storage

The second part of the problem is having an entire stack act as a single entity. DevOps has provided many tools to manage individual components, but the connective tissue between them provides the potential for incredible efficiency—similarly to how applications are packaged for the desktop but working at datacenter scales. That potential has launched an entire community around cloud native applications. These applications are similar to what we’ve always deployed. The difference is that modern cloud applications aren’t a single process with business logic. They are a complex coordination of many containerized processes that need to communicate securely and reliably. Storage has to match the current needs of the application, but remain aware of how it contributes to the stability of the application. When we think of deploying stateless applications without data managed in the same control plane, it sounds incomplete because it is. Breaking your application components into different control planes creates more complexity and thus goes against the ideals of cloud native.

Kubernetes Leading the Way

As mentioned before, DevOps automation has kept us on the leading edge of meeting scale needs. Containerization produced a need for much better orchestration, and Kubernetes has answered that need. For operators, describing a complete application stack in a deployment file makes for reproducible and portable infrastructure, because Kubernetes has gone far beyond the simple deployment management popular in the DevOps tool bag. The Kubernetes control plane applies the deployment requirements across the underlying compute, network, and storage to manage the entire application infrastructure lifecycle. The desired state of your application is maintained even when the underlying hardware changes. Instead of deploying VMs, we’re now deploying virtual datacenters as a complete definition, as shown in Figure 1-4.

The rise in popularity of Kubernetes has eclipsed all other container orchestration tools used in DevOps. It has overtaken every other way we deploy infrastructure and shows no signs of slowing down. However, the bulk of early adoption was primarily in stateless services.

Managing data infrastructure at a large scale was a problem well before the move to containers and Kubernetes. Stateful services like databases took a different track parallel to the Kubernetes adoption curve. Many experts advised that Kubernetes was the wrong way to run stateful services and that those workloads should remain outside of Kubernetes. That approach worked until it didn’t, and many of those same experts are now driving the needed changes in Kubernetes to converge the entire stack.

Figure 1-4. Moving from virtual servers to virtual datacenters

So, what are the challenges of stateful services? Why has it been hard to deploy data infrastructure with Kubernetes? Let’s consider each component of our infrastructure.

Managing Compute on Kubernetes

In data infrastructure, counting on Moore’s law has made upgrading a regular event. Moore’s law predicted that computing capacity would double every 18 months. If your requirements double every 18 months, you can keep up by replacing hardware. Eventually, raw compute power started leveling out. Vendors started adding more processors and cores to keep up with Moore’s law, leading to single-server resource sharing with VMs and containers, and enabling us to tap into the vast pools of computing power left stranded in islands of physical servers. Kubernetes expanded the scope of compute resource management by considering the total datacenter as one large resource pool across multiple physical devices.

Sharing compute resources with other services is somewhat taboo in the data world. Data workloads are typically resource intensive, and the potential of one service impacting another (known as the noisy neighbor problem) has led to policies of keeping them isolated from other workloads. This one-size-fits-all approach eliminates the possibility of more significant benefits. First, it assumes that every data service has the same resource requirements. Apache Pulsar brokers can have far fewer requirements than an Apache Spark worker, and neither is similar to a sizable MySQL instance used for online analytical processing (OLAP) reporting. Second, the ability to decouple your underlying hardware from running applications gives operators a lot of undervalued flexibility. Cloud native applications that need scale, elasticity, and self-healing need what Kubernetes can deliver. Data is no exception.
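Resource requests and limits are the primary mechanism Kubernetes offers for keeping neighbors from getting noisy: the scheduler uses the request to place the workload and the kubelet enforces the limit. The following is a minimal sketch using the official Kubernetes Python client; the container name, image, and values are assumptions for illustration, not sizing recommendations.

    # Sketch: declaring compute requests and limits for a hypothetical database
    # container so the scheduler can place it without starving its neighbors.
    from kubernetes import client

    db_container = client.V1Container(
        name="mydb",
        image="example.com/mydb:latest",        # hypothetical image
        resources=client.V1ResourceRequirements(
            requests={"cpu": "2", "memory": "8Gi"},  # reserved for this container
            limits={"cpu": "4", "memory": "8Gi"},    # hard ceiling enforced by the kubelet
        ),
    )

    pod_spec = client.V1PodSpec(containers=[db_container])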

Managing Network on Kubernetes

Building a distributed application, by nature, requires a reliable and secure network. Cloud native applications increase the complexity of adding and subtracting services, making dynamic network configuration a new requirement. Kubernetes manages all of this inside your virtual datacenter automatically. When new services come online, it’s like a virtual network team springs into action: IP addresses are assigned, routes are created, DNS entries are added, the virtual security team ensures that firewall rules are in place, and, when asked, Transport Layer Security (TLS) certificates provide end-to-end encryption.

Data infrastructure tends to be far less dynamic than something like microservices. A fixed IP address with a hostname has been the norm for databases. Analytic systems like Apache Flink are dynamic in their processing but have fixed hardware addressing assignments. Quality of service is typically at the top of the requirements list, and, as a result, the desire for dedicated hardware and dedicated networks has turned administrators away from Kubernetes.

The advantage of data infrastructure running in Kubernetes is less about past requirements and more about what’s needed for the future. Scaling resources dynamically can create a waterfall of dependencies. Automation is the only way to maintain clean and efficient networks, which are the lifeblood of distributed, stateless systems. The future of cloud native applications will include more components and new challenges, such as where applications will run. We can add regulatory compliance and data sovereignty to previous concerns about latency and throughput. The declarative nature of Kubernetes networking makes it a perfect fit for data infrastructure.
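One example of how that declarative networking plays out for data: a headless Service gives each database node a stable DNS name even as Pods move between machines, replacing the fixed IP addresses administrators are used to. Here is a minimal sketch using the Python client; the Service name, namespace, labels, and port are assumptions.

    # Sketch: a headless Service that gives each node of a hypothetical database
    # StatefulSet a stable DNS name (e.g., mydb-0.mydb.data.svc.cluster.local).
    from kubernetes import client, config

    config.load_kube_config()
    core = client.CoreV1Api()

    service = client.V1Service(
        metadata=client.V1ObjectMeta(name="mydb", namespace="data"),
        spec=client.V1ServiceSpec(
            cluster_ip="None",                 # headless: no virtual IP, DNS per Pod
            selector={"app": "mydb"},          # matches the StatefulSet's Pod labels
            ports=[client.V1ServicePort(name="sql", port=5432)],
        ),
    )
    core.create_namespaced_service(namespace="data", body=service)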

Managing Storage on Kubernetes

Any service that provides persistence or analytics over large volumes of data will need the right kind of storage device. Early versions of Kubernetes considered storage a basic commodity part of the stack and assumed that most workloads were ephemeral. For data, this was a huge mismatch—you can’t let your Postgres datafiles get deleted every time a container is moved. Additionally, at the outset, the underlying block storage ranged from high-performance NVMe disks to old 5400 RPM spinning disks, and you could not always be certain what type of hardware you’d get. Thankfully, this has been an essential focus of Kubernetes over the past few years and has significantly improved.

With the addition of features like StorageClasses, it is possible to address specific requirements for performance, capacity, or both. With automation, we can avoid reaching the point where we don’t have enough of either. Avoiding surprises is the domain of capacity management: provisioning the needed capacity up front and growing it when required. When you run out of storage capacity, everything grinds to a halt.
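To make the StorageClass idea concrete, the following minimal sketch requests a volume of a specific class and size with the Python client; the class name "fast-ssd", the namespace, and the size are assumptions that depend on what your cluster actually offers.

    # Sketch: requesting storage of a specific class and size for a data workload.
    # The StorageClass "fast-ssd" is hypothetical; list the real options on your
    # cluster with `kubectl get storageclass`.
    from kubernetes import client, config

    config.load_kube_config()
    core = client.CoreV1Api()

    pvc = client.V1PersistentVolumeClaim(
        metadata=client.V1ObjectMeta(name="mydb-data", namespace="data"),
        spec=client.V1PersistentVolumeClaimSpec(
            access_modes=["ReadWriteOnce"],
            storage_class_name="fast-ssd",
            resources=client.V1ResourceRequirements(requests={"storage": "100Gi"}),
        ),
    )
    core.create_namespaced_persistent_volume_claim(namespace="data", body=pvc)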

Coupling the distributed nature of Kubernetes with data storage opens up more possibilities for self-healing. Automated backups and snapshots keep you ready for potential data loss scenarios. Placing compute and storage together minimizes hardware failure risks and allows automatic recovery to the desired state when the inevitable failure occurs. All of this makes the data storage aspects of Kubernetes much more attractive.

Cloud Native Data Components

Now that we have defined the resources consumed in cloud native applications, let’s clarify the types of data infrastructure that power them. Instead of providing a comprehensive list of every possible product, we’ll break the infrastructure into larger buckets with similar characteristics:

Persistence
This is likely the category you think of first when we talk about data infrastructure. These systems store data and provide access to it through some form of query: relational databases like MySQL and Postgres, and NoSQL systems like Apache Cassandra and MongoDB. They have been the last holdouts to migrate to Kubernetes because of their strict resource needs and high-availability requirements. Databases are usually critical to a running application and central to every other part of the system.
Streaming
The most basic function of streaming is facilitating the high-speed movement of data from one point to another. Streaming systems provide a variety of delivery semantics based on the use case. In some cases, data can be delivered to many clients, or, when strict controls are needed, delivered only once. A further enhancement of streaming is the addition of processing: altering or enhancing data mid-transport. The need for faster insights into data has propelled streaming analytics into mission-critical status, catching up with persistence systems in terms of importance. Examples of streaming systems that move data are Apache Pulsar and Apache Kafka, whereas examples of processing systems are Apache Flink and Apache Storm.
Batch analytics
One of the first problems in big data was analyzing large sets of data to gain insights or to repurpose it into new data. Apache Hadoop was the first large-scale system for batch analytics, and it set the expectations around using large volumes of compute and storage, coordinated to produce the results of complex analytic processes. Typically, these are issued as jobs distributed throughout the cluster, as is common with Spark. The concern with costs can be much more prevalent in these systems because of the sheer volume of resources needed. Orchestration systems help mitigate those costs through intelligent resource allocation.

Looking Forward

There is a compelling future with cloud native data. The path we take between what we have available today and what we can have in the future is up to us: the community of people responsible for data infrastructure. Just as we have always done, we see a new challenge and take it on. There is plenty for everyone to do here, but the result could be pretty amazing and raise the bar yet again.

Rick’s point is specifically about databases, but we can extrapolate his call to action for our data infrastructure running on Kubernetes. Unlike deploying a data application on physical servers, introducing the Kubernetes control plane requires a conversation with the services it runs.

Getting Ready for the Revolution

As engineers who create and run data infrastructure, we have to be ready for coming advancements, both in the way we operate and the mindset we have about the role of data infrastructure. The following sections describe what you can do to be ready for the future of cloud native data running in Kubernetes.

Adopt an SRE Mindset

The role of site reliability engineering (SRE) has grown with the adoption of cloud native methodologies. If we intend our infrastructure to converge, we as data infrastructure engineers must learn new skills and adopt new practices. Let’s begin with the Wikipedia definition of SRE:

Site reliability engineering is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.

Deploying data infrastructure has been primarily concerned with the specific components deployed—the “what.” For example, you may find yourself focused on deploying MySQL at scale or using Spark to analyze large volumes of data. Adopting an SRE mindset means going beyond what you are deploying and focusing more on the how. How will all the pieces work together to meet the application’s goals? A holistic deployment view considers the way each piece will interact, the required access, including security, and the observability of every aspect to ensure that service levels are met.

If your current primary or secondary role is database administrator (DBA), there is no better time to make the transition. The trend on LinkedIn shows a year-over-year decrease in the DBA role and a massive increase for SREs. Engineers who have learned the skills required to run critical database infrastructure have an essential baseline that translates into what’s needed to manage cloud native data. These needs include the following:

  • Availability

  • Latency

  • Change management

  • Emergency response

  • Capacity management

New skills need to be added to this list to adapt to the broader responsibility of the entire application. You may already have some of these skills; they include the following:

CI/CD pipelines
Embrace the big picture of taking code from repository to production. Nothing accelerates application development in an organization more. Continuous integration (CI) builds new code into the application stack and automates all testing to ensure quality. Continuous delivery (CD) takes the fully tested and certified builds and automatically deploys them into production. Used in combination as a pipeline, these practices can drastically increase developer velocity and productivity.
Observability
DevOps practitioners like to make a distinction between the “what” (the actual service you’re deploying) and the “how” (the methodology of deploying that service). Monitoring is something everyone with experience in infrastructure is familiar with. In the “what” part of DevOps, the properties you monitor let you know your services are healthy, and give you the information needed to diagnose problems. Observability expands monitoring into the “how” of your application by considering everything as a whole—for example, tracing the source of latency in a highly distributed application by giving insight into every hop that data takes as it traverses your system.
Knowing the code
When things go bad in a large, distributed application, the cause is not always a process failure. In many cases, the problem could be a bug in the code or a subtle implementation detail. Being responsible for the entire health of the application, you will need to understand the code that is executing in the provided environment. Properly implemented observability will help you find problems, and that includes the software instrumentation. SREs and development teams need to have clear and regular communication, and code is common ground.

Embrace Distributed Computing

Deploying your applications in Kubernetes means embracing all that distributed computing offers. When you are accustomed to single-system thinking, that transition can be hard, mainly because of the shift in expectations and in understanding where problems crop up. For example, with every process contained in a single system, latency is close to zero and isn’t something you have to manage; CPU and memory resources are the primary concern. In the 1990s, Sun Microsystems was a leader in the growing field of distributed computing and published this list of eight common fallacies of distributed computing:

  • The network is reliable.

  • Latency is zero.

  • Bandwidth is infinite.

  • The network is secure.

  • Topology doesn’t change.

  • There is one administrator.

  • Transport cost is zero.

  • The network is homogeneous.

Behind each of these fallacies is surely the story of a developer who made a bad assumption, got an unexpected result, and lost countless hours trying to solve the wrong problem. Embracing distributed methodologies is worth the effort in the long run. They allow us to build large-scale applications and will continue to do so for a long time. The challenge is worth the reward, and for those of us who do this daily, it can be a lot of fun too! Kubernetes applications will test each of these fallacies, given the platform’s inherently distributed nature. When you plan your deployment, consider things such as the cost of transporting data from one place to another or the implications of latency. Doing so will save you a lot of wasted time and redesign.

Principles of Cloud Native Data Infrastructure

As engineering professionals, we seek standards and best practices to build upon. To make data the most “cloud native” it can be, we need to embrace everything Kubernetes offers. A truly cloud native approach means adopting key elements of the Kubernetes design paradigm and building from there. An entire cloud native application that includes data must be one that can run effectively on Kubernetes. Let’s explore a few Kubernetes design principles that point the way.

Principle 1: Leverage compute, network, and storage as commodity APIs

One of the keys to the success of cloud computing is the commoditization of compute, networking, and storage as resources we can provision via simple APIs. Consider this sampling of AWS services:

Compute
We allocate VMs through Amazon Elastic Compute Cloud (EC2) and Auto Scaling groups (ASGs).
Network
We manage traffic using Elastic Load Balancers (ELB), Route 53, and virtual private cloud (VPC) peering.
Storage
We persist data using options such as the Simple Storage Service (S3) for long-term object storage, or Elastic Block Store (EBS) volumes for our compute instances.

Kubernetes offers its own APIs to provide similar services for a world of containerized applications:

Compute
Pods, Deployments, and ReplicaSets manage the scheduling and lifecycle of containers on computing hardware.
Network
Services and Ingress expose a container’s networked interfaces.
Storage
PersistentVolumes (PVs) and StatefulSets enable flexible association of containers to storage.

Kubernetes resources promote the portability of applications across Kubernetes distributions and service providers. What does this mean for databases? They are simply applications that leverage compute, networking, and storage resources to provide the services of data persistence and retrieval:

Compute
A database needs sufficient processing power to process incoming data and queries. Each database node is deployed as a Pod and grouped into StatefulSets, enabling Kubernetes to manage scaling out and scaling in.
Network
A database needs to expose interfaces for data and control. We can use Kubernetes Services and Ingress controllers to expose these interfaces.
Storage
A database uses PersistentVolumes of a specified StorageClass to store and retrieve data.

Thinking of databases in terms of their compute, network, and storage needs removes much of the complexity involved in deployment on Kubernetes.
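Putting those three mappings together, the following is a minimal sketch, using the official Kubernetes Python client, of a hypothetical three-node database expressed as a StatefulSet with per-node volumes; every name, image, port, and size here is an assumption for illustration rather than a recommended configuration.

    # Sketch: a hypothetical three-node database expressed as Kubernetes compute
    # (StatefulSet/Pods), network (a governing Service name), and storage
    # (per-Pod PersistentVolumeClaims from volume_claim_templates).
    from kubernetes import client, config

    labels = {"app": "mydb"}

    statefulset = client.V1StatefulSet(
        metadata=client.V1ObjectMeta(name="mydb", namespace="data"),
        spec=client.V1StatefulSetSpec(
            service_name="mydb",                       # headless Service, as sketched earlier
            replicas=3,
            selector=client.V1LabelSelector(match_labels=labels),
            template=client.V1PodTemplateSpec(
                metadata=client.V1ObjectMeta(labels=labels),
                spec=client.V1PodSpec(
                    containers=[
                        client.V1Container(
                            name="mydb",
                            image="example.com/mydb:latest",   # hypothetical image
                            ports=[client.V1ContainerPort(container_port=5432)],
                            volume_mounts=[
                                client.V1VolumeMount(name="data", mount_path="/var/lib/mydb")
                            ],
                        )
                    ]
                ),
            ),
            volume_claim_templates=[
                client.V1PersistentVolumeClaim(
                    metadata=client.V1ObjectMeta(name="data"),
                    spec=client.V1PersistentVolumeClaimSpec(
                        access_modes=["ReadWriteOnce"],
                        storage_class_name="fast-ssd",     # hypothetical StorageClass
                        resources=client.V1ResourceRequirements(
                            requests={"storage": "100Gi"}
                        ),
                    ),
                )
            ],
        ),
    )

    if __name__ == "__main__":
        config.load_kube_config()
        client.AppsV1Api().create_namespaced_stateful_set(namespace="data", body=statefulset)

The StatefulSet covers compute, the governing Service name ties into networking, and the volume claim templates request storage, which is exactly the decomposition described above.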

Principle 2: Separate the control and data planes

Kubernetes promotes the separation of control and data planes. The Kubernetes API server is the front door of the control plane, providing the interface used by the data plane to request computing resources, while the control plane manages the details of mapping those requests onto an underlying infrastructure-as-a-service (IaaS) platform.

We can apply this same pattern to databases. For example, a database’s data plane consists of ports exposed for clients and, for distributed databases, ports used for communication between database nodes. The control plane includes interfaces provided by the database for administration and metrics collection, as well as tooling that performs operational maintenance tasks. Much of this capability can and should be implemented via the Kubernetes operator pattern. Operators define custom resource definitions (CRDs) and provide control loops that observe the state of those resources, taking actions to move them toward the desired state, which helps extend Kubernetes with domain-specific logic.
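A full operator is well beyond the scope of this chapter, but the core of the pattern, a control loop, is small enough to sketch. The snippet below watches a hypothetical MyDatabase custom resource (its group, version, and plural are invented for illustration) and stubs out the reconcile step a real operator would fill in; production operators are usually built with frameworks such as Kopf, the Operator SDK, or Kubebuilder rather than written this directly.

    # Sketch: the skeleton of an operator's control loop. It watches a
    # hypothetical "MyDatabase" custom resource and calls a reconcile stub.
    # The group/version/plural values are assumptions for illustration.
    from kubernetes import client, config, watch

    GROUP, VERSION, PLURAL = "example.com", "v1", "mydatabases"

    def reconcile(resource: dict) -> None:
        spec = resource.get("spec", {})
        name = resource["metadata"]["name"]
        # A real operator would compare the declared spec with the observed state
        # (StatefulSets, Services, PVCs, ...) and create, patch, or delete
        # resources until they match.
        print(f"reconciling {name}: desired nodes = {spec.get('nodes')}")

    def main() -> None:
        config.load_kube_config()
        api = client.CustomObjectsApi()
        w = watch.Watch()
        # Stream add/modify/delete events for the custom resource cluster-wide.
        for event in w.stream(api.list_cluster_custom_object, GROUP, VERSION, PLURAL):
            if event["type"] in ("ADDED", "MODIFIED"):
                reconcile(event["object"])

    if __name__ == "__main__":
        main()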

Principle 3: Make observability easy

The three pillars of observable systems are logging, metrics, and tracing. Kubernetes provides a great starting point by exposing the logs of each container to third-party log aggregation solutions. Multiple solutions are available for metrics, tracing, and visualization, and we’ll explore several of them in this book.
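As a small example of the logging pillar, the sketch below pulls the most recent log lines from the containers of hypothetical database Pods through the Kubernetes API, the same surface that log aggregation agents build on; the namespace and label selector are assumptions.

    # Sketch: reading recent logs from the containers of hypothetical database
    # Pods, using the same API surface that log aggregation agents build on.
    from kubernetes import client, config

    config.load_kube_config()
    core = client.CoreV1Api()

    pods = core.list_namespaced_pod(namespace="data", label_selector="app=mydb")
    for pod in pods.items:
        for container in pod.spec.containers:
            logs = core.read_namespaced_pod_log(
                name=pod.metadata.name,
                namespace="data",
                container=container.name,
                tail_lines=20,                # only the most recent lines
            )
            print(f"--- {pod.metadata.name}/{container.name} ---")
            print(logs)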

Principle 4: Make the default configuration secure

Kubernetes networking is secure by default: ports must be explicitly exposed in order to be accessed externally to a pod. This sets a valuable precedent for database deployment, forcing us to think carefully about how each control plane and data plane interface will be exposed and which interfaces should be exposed via a Kubernetes Service. Kubernetes also provides facilities for secret management that can be used for sharing encryption keys and configuring administrative accounts.
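For example, database credentials belong in a Secret rather than in a container image or a manifest checked into source control. The following minimal sketch creates a hypothetical credentials Secret with the Python client; the names are invented, and in practice the values would come from a secure source such as an external secrets manager.

    # Sketch: storing hypothetical database credentials in a Kubernetes Secret
    # instead of baking them into images or manifests. The values shown are
    # placeholders; in practice they would come from a secure source.
    import secrets
    from kubernetes import client, config

    config.load_kube_config()
    core = client.CoreV1Api()

    secret = client.V1Secret(
        metadata=client.V1ObjectMeta(name="mydb-credentials", namespace="data"),
        string_data={
            "username": "admin",
            "password": secrets.token_urlsafe(24),   # generated, not hardcoded
        },
    )
    core.create_namespaced_secret(namespace="data", body=secret)
    # Pods can then consume this Secret as environment variables or a mounted volume.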

Principle 5: Prefer declarative configuration

In the Kubernetes declarative approach, you specify the desired state of resources, and controllers manipulate the underlying infrastructure in order to achieve that state. Operators for data infrastructure can manage the details of how to scale up intelligently—for example, deciding how to reallocate shards or partitions when scaling out additional nodes or selecting which nodes to remove to scale down elastically.

The next generation of operators should enable us to specify rules for stored data size, number of transactions per second, or both. Perhaps we’ll be able to specify maximum and minimum cluster sizes, and when to move less frequently used data to object storage. This will allow for more automation and efficiency in our data infrastructure.
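Nothing like this exists as a standard API today, so the sketch below is entirely hypothetical: it imagines the kind of declared policy a future data operator might reconcile against observed metrics, with invented rule names and thresholds, simply to illustrate the shape of the idea.

    # Sketch: entirely hypothetical "policy" rules a future data operator might
    # reconcile against observed metrics. Names and thresholds are illustrative.
    from dataclasses import dataclass

    @dataclass
    class ScalingPolicy:
        min_nodes: int
        max_nodes: int
        max_tps_per_node: int         # transactions per second a node should serve
        max_storage_per_node_gi: int  # stored data a node should hold

    def desired_nodes(policy: ScalingPolicy, observed_tps: float, stored_gi: float) -> int:
        """Pick a node count that satisfies both the throughput and storage rules."""
        for_tps = -(-int(observed_tps) // policy.max_tps_per_node)          # ceiling division
        for_storage = -(-int(stored_gi) // policy.max_storage_per_node_gi)  # ceiling division
        wanted = max(for_tps, for_storage, policy.min_nodes)
        return min(wanted, policy.max_nodes)

    policy = ScalingPolicy(min_nodes=3, max_nodes=12,
                           max_tps_per_node=5000, max_storage_per_node_gi=500)
    print(desired_nodes(policy, observed_tps=23_000, stored_gi=1_800))  # -> 5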

Summary

At this point, we hope you are ready for the exciting journey in the pages ahead. The move to cloud native applications must include data, and to do this, we will leverage Kubernetes to manage both stateless and stateful services. This chapter covered cloud native data infrastructure that can scale elastically and resist downtime due to system failures, and how to begin building these systems. We as engineers must embrace the principles of cloud native infrastructure and, in some cases, learn new skills. Congratulations—you have begun a fantastic journey into the future of building cloud native applications. Turn the page, and let’s go!
