book

Modern Data Protection

Name: Modern Data Protection
Author: W. Curtis Preston
ISBN: 9781492094050

by W. Curtis Preston

April 2021

Intermediate to advanced

384 pages

12h 22m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Foreword
Preface
The Work ContinuesConventions Used in This BookO’Reilly Online LearningHow to Contact UsAcknowledgments
1. Risks to Your Data: Why We Back Up
Human DisastersAccidentsBad CodeMalicious AttacksTerrorismElectronic AttacksRansomwareInternal ThreatsMechanical or System FailurePower DisruptionsThere Is No CloudSystem FailureNatural DisastersFloodsFiresEarthquakesHurricanes, Typhoons, and CyclonesTornadoesSinkholesTakeaways
2. Gathering and Determining Service Levels
What Does Your Organization Do?Build Yourself a FrameworkDocument TemplatesReview/Advisory BoardsCollecting RequirementsWhat Are RPO and RTO?Find the Subject Matter ExpertsSolicit RequirementsReview RequirementsDesign and Build Your SystemDraw up Multiple DesignsReview the DesignsSelect and Build the SystemDocument and Implement the New SystemDefining Operational ResponsibilityOperations Review and DocumentationDocumentation Is GoodRunbooksImplement the New SystemTakeaways
3. Backup and Archive Are Very Different
Before We Get StartedWhat Is Backup?“Copy”“Stored Separately from the Original”“For the Purposes of Restoring”What Is a Restore?How Does a Restore Work?The 3-2-1 RuleWhat Is an Archive?To Serve as a ReferenceStored with Additional MetadataWhat Is a Retrieve?Protecting Backup and Archive DataEncryptionAir GapsImmutabilityTakeaways
4. Backup and Recovery Basics
Recovery TestingBackup LevelsTraditional Full BackupTraditional Incremental BackupDo Backup Levels Matter?MetricsRecovery MetricsCapacity MetricsBackup WindowBackup and Recovery Success and FailureRetentionUsing MetricsBackup and Archive MythsItem- Versus Image-Level BackupsItem-Level BackupImage-Level BackupsFile-Level Recovery from an Image-Level BackupCombining Image- and File-Level BackupsBackup Selection MethodsSelective Inclusion Versus Selective ExclusionTag-Based and Folder-Based InclusionTakeaways
5. Using Disk and Deduplication for Data Protection
DeduplicationWhat Can Dedupe Do?How Dedupe WorksTarget DeduplicationSource DeduplicationTarget Versus Source DeduplicationHybrid DedupeSelecting the Right Dedupe for YouUsing Disk in Your Backup SystemDisk CachingDisk-to-Disk-to-Tape (D2D2T)Disk-to-Disk-to-Disk (D2D2D)Direct-to-Cloud (D2C)Disk-to-Disk-to-Cloud (D2D2C)Recovery ConceptsImage RecoveryFile-Level RecoveryInstant RecoveryChoosing a Recovery TypeTakeaways
6. Traditional Data Sources
Physical ServersStandard BackupBare-Metal BackupBacking Up NASVirtual ServersVM-Level BackupsWhat Is VSS?Specialized Backups for HypervisorsDesktops and LaptopsLaptops as a CacheNormal Desktop and Laptop UsageDesktop and Laptop Backup OptionsMobile DevicesCloud SyncPhysical SyncMobile Device BackupMobile Device Management (MDM)Takeaways
7. Protecting Databases
Database Delivery ModelsTraditional Database SoftwarePlatform-as-a-ServiceServerless DatabasesDatabase ModelsConsistency ModelsTraditional Databases Running in Your DatacenterPaaS and Serverless DatabasesTraditional Database TerminologyInstanceDatabaseTableIndexRowAttributeData FileTablespacePartitionMaster FileTransactionTransaction LogBacking Up Traditionally Delivered DatabasesCold BackupSplit ReplicaHot Backup ModeSnap and SweepDump and SweepStream-to-Backup ProductTransaction Log BackupMaster FileBacking Up PaaS and Serverless DatabasesDump and SweepIntegrated Backup-as-a-ServiceRecovering Traditional DatabasesRecovering Modern DatabasesTakeaways
8. Modern Data Sources
The Public CloudInfrastructure-as-a-Service (IaaS)Platform-as-a-Service (PaaS)Serverless ServicesSoftware-as-a-Service (SaaS)You Need to Protect the CloudHybrid Cloud ConfigurationsNFS/SMB GatewayThe Cloud in a BoxDocker and KubernetesHow Containers Break BackupsDockerfilesDocker ImagesKubernetes etcdPersistent VolumesDatabasesKubernetes: A New PathThe Internet of Things (IoT)Making Backup DecisionsCriticality to the OrganizationConsider the SourceTakeaways

9. Backup and Recovery Software Methods
Is Everything Backup?Backup Methods Supporting a Traditional RestoreMultiplexingTraditional Full and Incremental BackupsFile-Level Incremental ForeverBlock-Level Incremental ForeverSource DeduplicationMethods Supporting Instant RecoveryReplicationContinuous Data Protection (CDP)SnapshotsNear-Continuous Data Protection (Near-CDP)Copy Data ManagementOther Software with Instant RecoveryLeveraging Backups for MoreDeciding on a Backup MethodDoes What You Have Meet Your Needs?Advantages and Disadvantages of Different ApproachesComplete SolutionTakeaways
10. Archive Software Methods
A Deeper Dive into ArchiveRetrieval Versus RestoreTypes of Archive SystemsTraditional Batch ArchiveReal-Time ArchiveHSM-Style ArchiveDeciding on an Archive SystemDo You Need One?RequirementsTakeaways
11. Disaster Recovery Methods
Disaster Recovery Becomes ParamountRansomware Changed EverythingAn Overview of Disaster RecoveryWhat Is in a DR Plan?A Box of Tapes Isn’t a DR PlanA Replicated Dedupe Appliance Isn’t Much BetterIt’s All About the RTABuilding a Recovery SiteRoll Your Own DR SiteRecovery-Site-as-a-ServiceThe Public Cloud Was Born for DRKeeping the DR Site Up to DateCold, Hot, and Warm SitesChoosing Hot, Warm, or ColdRecovery MechanismsSoftware or ServiceCommercial DR SoftwareDR-as-a-ServiceAll-in-One or Best of Breed?Choosing a PlanCreating a DR RunbookRunbook GoalsOverviewTechnology InventoryContact InformationProceduresException Processing with EscalationTakeaways
12. Data Protection Targets
Tape DrivesWhat Tape Is Good AtWhat Tape Is Bad AtHow Did This Happen?Tape Drive TechnologiesOptical MediaIndividual Disk DrivesStandard Disk ArraysObject StorageTarget Deduplication AppliancesVirtual Tape LibrariesNAS AppliancesPublic Cloud StorageChoosing and Using a Backup TargetOptimize the Performance of What You HaveSelect a More Appropriate DeviceTakeaways
13. Commercial Data Protection Challenges
A Brief History of BackupChallenges with Commercial Backup SolutionsSize the Backup SystemMaintain Backup Server OSMaintain Backup SoftwareManage Multiple VendorsSeparate System for DRSeparate System for E-DiscoveryTape-Related ChallengesDisk-Related ChallengesLarge Up-Front Capital PurchasesOverprovisioning Is RequiredDifficult to ScaleDifficulty of Changing Backup ProductsLet Them ExpireUse a ServiceRestore and BackupTakeaways
14. Traditional Data Protection Solutions
Not Naming NamesTraditional Backup SolutionsAdvantages of Traditional BackupChallenges with Traditional BackupAnalysisTarget Deduplication Backup AppliancesAdvantages of Target DedupeChallenges with Target DedupeAnalysisTakeaways
15. Modern Data Protection Solutions
Virtualization-Centric SolutionsAdvantages of Virtualization-Centric SolutionsChallenges of Virtualization-Centric BackupAnalysisHyper-Converged Backup AppliancesAdvantages of Hyper-Converged Backup AppliancesChallenges with HCBAsAnalysisData-Protection-as-a-Service (DPaaS)Advantages of DPaaSChallenges of DPaaSAnalysisFully Managed Service ProvidersAdvantages of Using an MSPChallenges of Using an MSPAnalysisAdapting to the MarketTraditional Backup AppliancesSubscription PricingResponding to the CloudTakeaways
16. Replacing or Upgrading Your Backup System
Which Solution Is Best for You?Your ResponsibilitiesBefore You Do AnythingThis Is Your Backup SystemConsider TCO, Not Just Acquisition CostPicking a SolutionFind Any ShowstoppersPrioritize Ease of UsePrioritize ScalabilityPrioritize Future ProofingTakeaways
Index

Content preview from Modern Data Protection

Chapter 10. Archive Software Methods

Note

This chapter is written by Dan Frith, aka @penguinpunk, an industry veteran from Down Under. Dan’s a fan of my work, but not what I would call a fanboy. (He has enough chutzpah to tell me when he thinks I’m wrong. Must be an Australian thing.) He’s got great field experience, so in addition to being a tech reviewer of the book, I asked him to write this chapter.

An archive is the one data protection system you probably need and most likely don’t have. In Chapter 3, I defined an archive as a separate copy of data stored in a separate location, made to serve as a reference copy, and stored with enough metadata to find the data in question without knowing where it came from. Backup is, on the other hand, a secondary copy of data that you use to recover in the event that your primary copy of the data is affected in some way, whether from corruption, deletion, or some other misfortune.

You most likely have a backup system, but you just as likely do not have an archive system. Most people therefore have no idea what an actual archive system does or why you might want one. Let’s explore that topic a bit.

A Deeper Dive into Archive

Another way I like to define archive is the primary copy of your data that has secondary value (i.e., is no longer primary data). Typically, archive data is no longer current, infrequently accessed, or simply no longer valued (as much) by the users who created it. Not every piece of data needs to be kept in a prominent ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781492094043Errata Page

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design