book

Cloud Native Data Center Networking

Name: Cloud Native Data Center Networking
Author: Dinesh G. Dutt
ISBN: 9781492045601

by Dinesh G. Dutt

November 2019

Intermediate to advanced

483 pages

13h 43m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Preface
AudienceHow This Book Is OrganizedSoftware Used in This BookConventions Used in This BookUsing Code ExamplesO’Reilly Online LearningHow to Contact UsAcknowledgments
1. The Motivations for a New Network Architecture
The Application-Network ShuffleThe Network Design from the Turn of the CenturyThe Charms of BridgingBuilding Scalable Bridging NetworksThe Trouble with the Access-Aggregation-Core Network DesignUnscalabilityComplexityFailure DomainUnpredictabilityInflexibilityLack of AgilityThe Stories Not ToldSummary
2. Clos: Network Topology for a New World
Introducing the Clos TopologyA Deeper Dive into the Clos TopologyUse of Homogeneous EquipmentRouting as the Fundamental Interconnect ModelOversubscription in a Clos TopologyInterconnect Link SpeedsPractical ConstraintsFine-Grained Failure DomainScaling the Clos TopologyComparing the Two Three-Tier ModelsApplication MatchupData Center Build OutImplications of the Clos TopologyRethinking Failures and TroubleshootingCablingSimplified Inventory ManagementNetwork AutomationSome Best Practices for a Clos NetworkUse of Multiple Links Between SwitchesUse of Spines as Only a ConnectorUse of Chassis as a Spine SwitchHost Attach ModelsSummaryReferences
3. Network Disaggregation
What Is Network Disaggregation?Why Is Network Disaggregation Important?Controlling CostsAvoiding Vendor Lock-InStandardization of FeaturesWhat Made Network Disaggregation Possible Now?Difference in Network Operations with DisaggregationPurchase and SupportFirst BootOpen Network Installer EnvironmentHow Does ONIE Work?The Players in Network Disaggregation: HardwarePacket-Switching SiliconODMsCPU ComplexThe Standards BodiesCommon Myths About Network DisaggregationSome Best Practices for Engaging with Network DisaggregationSummaryReferences
4. Network Operating System Choices
Requirements of a Network DeviceThe Rise of Software-Defined Networking and OpenFlowMore Details About SDN and OpenFlowThe Trouble with OpenFlowOVSThe Effect of SDN and OpenFlow on Network DisaggregationNOS Design ModelsLocation of Switch Network StateProgramming the Switching SiliconAPIThe Reasons Behind the Different AnswersUser InterfaceComparing the NOS Models with Cloud Native NOS RequirementsIllustrating the Models with an ExampleWhat Else Is Left for a NOS to Do?SummaryReferences
5. Routing Protocol Choices
Routing OverviewHow Routing Table Lookups WorkHow Routes Are ChosenTypes of Routing Table EntriesRIB and FIBRouting Protocols OverviewDistance Vector Protocols Versus Link-State ProtocolsDistance Vector DissectedLink-State DissectedSummarizing Distance Vector Versus Link-State Route ExchangeComparing Distance Vector and Link-State ProtocolsScaling in Link-State and Distance Vector ProtocolsMultipathing in Distance Vector and Link-State ProtocolsNo News Is Good NewsPropagation Delay in Link-State and Distance Vector ProtocolsMultiprotocol SupportUnnumbered InterfacesRouting Configuration ComplexityRouting Protocols in Clos NetworksLink-State Versus Distance Vector When Links or Nodes FailRoute Summarization in Clos NetworksSecurity and SafeguardsBidirectional Forwarding DetectionRequirements of a Routing Protocol in the Data CenterBasic RequirementsAdvanced RequirementsRare or Futuristic RequirementsChoosing the Routing Protocol for Your NetworkSummaryReferences
6. Network Virtualization
What Is Network Virtualization?Uses of Network Virtualization in the Data CenterForcing Traffic to Take a Certain PathApplications That Require L2 AdjacencyCloudSeparating Switch Management Network from Data TrafficNetwork Virtualization ModelsService Abstraction: L2 or L3Inline Versus Overlay Virtual NetworksNetwork Tunnels: The Fundamental Overlay ConstructBenefits of Network TunnelsThe Drawbacks of Network TunnelsNetwork Virtualization Solutions for the Data CenterVLANVRFVXLANOther Network Virtualization SolutionsPractical Limits on the Number of Virtual NetworksSize of Virtual Network ID in Packet HeaderHardware LimitationsScalability of Control Plane and SoftwareDeployment ModelControl Protocols for Network VirtualizationRelationship of Virtual and Physical Control PlaneThe Centralized Control ModelThe Protocol-Based Control ModelVendor Support for Network VirtualizationMerchant SiliconSoftwareStandardsIllustrating VXLAN Bridging and RoutingVXLAN Bridging Example: H1 to H5VXLAN and Routing: H1 to H6Summarizing VXLAN Bridging and RoutingSummary
7. Container Networking
Introduction to ContainersNamespacesNetwork NamespacesVirtual Ethernet InterfacesContainer Networking: Diving InSingle-Host Container NetworkingMultihost Container NetworkingComparing Different Container Network SolutionsKubernetes NetworkingSummary
8. Multicast Routing
Multicast Routing: OverviewThe Uses of Multicast RoutingProblems to Solve in Multicast RoutingBuilding a Multicast TreeMulticast Routing ProtocolPIM Sparse ModeRendezvous PointBuilding a Multicast Distribution TreeMultiple RPs and MSDPPIM-SM in the Data CenterPIM-SM and UnnumberedSummary
9. Life on the Edge of the Data Center
The ProblemsConnectivity ModelsWhy Connect to the External World?Bandwidth Requirements for External ConnectivityConnecting the Clos Topology to the External WorldRouting at the EdgeServicesHybrid Cloud ConnectivitySummary

10. Network Automation
What Is Network Automation?Who Needs Network Automation?Does Network Automation Mean Learning Programming?Why Is Network Automation Difficult?The Trouble with IP Addresses and InterfacesScaleNetwork Protocol Configuration ComplexityLack of Programmatic AccessTraditional Network OS LimitationsWhat Can Network Developers Do to Help Network Automation?Tools for Network AutomationAutomation Best PracticesAnsible: An OverviewInventoryPlaybooksAd Hoc CommandsStructuring PlaybooksA Typical Automation JourneyGlorified File CopyAutomate the Configuration That Was Not Device SpecificTemplate the Routing and Interface ConfigurationMore Templating and RolesSome Observations from Fellow JourneymenValidating the ConfigurationSingle Source of TruthCommit/Rollback in the Age of AutomationVagrant and Network TestingAutomating VerificationSummaryReferences
11. Network Observability
What Is Observability?The Current State of Network ObservabilityThe Disenchantments of SNMPBox-by-Box Approach to Network ObservabilityWhy Is Observability Difficult with Networking?Observability in Data Center Networks: Special CharacteristicsDecomposing ObservabilityThe Mechanics of TelemetryWhat Do We Gather?How Do We Gather?When Do We Gather?Storing the DataThe Uses for Multiple Data SourcesOf Alerts and DashboardsSummaryReferences
12. Rethinking Network Design
Standard, Simple Building BlocksNetwork DisaggregationFailure: Missing the Forest for the TreesL2 Failure Model Versus L3 Failure ModelSimple Versus Complex FailuresHandling UpgradesThe Pursuit of LessHow the Right Architecture HelpsFeature Set EssentialismConstraints on the Cloud Native Network Design PrinciplesSummary
13. Deploying OSPF
Why OSPF?The Problems to Be AddressedDetermining Link-State Flooding DomainsNumbered Versus Unnumbered OSPFSupport for IPv6Support for VRFsRequirements for Running OSPF on ServersOSPF Route TypesThe Messiness of StubbinessOSPF TimersDissecting an OSPF ConfigurationConfiguration for Leaf-Spine in a Two-Tier Clos Topology: IPv4Configuration for Leaf-Spine in a Two-Tier Clos Topology: IPv6Configuration with Three-Tier Clos Running OSPFConfiguration with Servers Running OSPF: IPv4Summarizing Routes in OSPFOSPF and UpgradesBest PracticesSummary
14. BGP in the Data Center
Basic BGP ConceptsBGP Protocol OverviewBGP PeeringBGP State MachineAutonomous System NumberBGP CapabilitiesBGP Attributes, Communities, Extended CommunitiesBGP Best-Path ComputationSupport for Multiple ProtocolsBGP MessagesAdapting BGP to the Data CentereBGP Versus iBGPeBGP: Flying SoloPrivate ASNsBGP’s ASN Numbering SchemeMultipath SelectionFixing BGP’s Convergence TimeSummary
15. Deploying BGP
Core BGP Configuration ConceptsTraditional Configuration for a Two-Tier Clos Topology: IPv4Peer GroupRouting PolicyRoute Maps: Implementation of Routing PolicyProviding Sane Defaults for the Data CenterBGP Unnumbered: Eliminating Pesky Interface IP AddressesA remote-as by Any NameHow Unnumbered Interfaces Work with BGPFinal Observations on BGP Configuration in FRRUnnumbered BGP Support in Routing StacksSummaryConfiguring IPv6BGP and VRFsPeering with BGP Speakers on the HostBGP Dynamic NeighborsBGP and UpgradesAS_PATH PrependGRACEFUL_SHUTDOWN CommunityMax-MEDBest PracticesSummary
16. EVPN in the Data Center
Why Is EVPN Popular?The Problems a Network Virtualization Control Plane Must AddressWhere Does a VTEP Reside?One Protocol to Rule Them All, Or…?iBGP CharacteristicsSeparate Underlay and Overlay ProtocolseBGP OnlyBGP Constructs to Support Virtual Network RoutesRoute DistinguisherRoute TargetFRR’s use of RD and RTEVPN Route TypesCommunicating Choice of BUM HandlingEVPN and BridgingEVPN Bridging with Ingress ReplicationEVPN Bridging with Routed Multicast UnderlayHandling MAC MovesSupport for Dual-Attached HostsHost-Switch Interconnect ModelVXLAN Model for Dual-Attached HostsSwitch Peering OptionsHandling Link FailuresAvoiding Duplicate Multidestination FramesARP/ND SuppressionEVPN and RoutingCentralized Versus Distributed RoutingSymmetric Versus Asymmetric RoutingRoute AdvertisementsThe Use of VRFsDeploying EVPN in Large NetworksSummary
17. Deploying Network Virtualization
The Configuration ScenariosDevice-Local ConfigurationSingle eBGP SessionOSPF Underlay, iBGP Overlayallowas-in Versus Separate ASNPIM/MSDP ConfigurationEVPN on the HostBest PracticesSummary
18. Validating Network Configuration
Validating the Network StateSystem ValidationCabling ValidationUsing Ansible to Validate CablingInterface Configuration ValidationAutomating Interface Configuration ValidationRouting Configuration ValidationValidating an OSPF ConfigurationValidating a BGP ConfigurationStripping the Private ASNsValidating Network VirtualizationApplication’s Network ValidationData-Plane ValidationSummary
19. Coda
Glossary
Index

Content preview from Cloud Native Data Center Networking

Chapter 11. Network Observability

One of the dirtiest secrets in systems engineering is just how many outages are never really fully explained or understood. Or how many can’t actually be explained or understood given existing telemetry.

Charity Majors

A distributed system is one in which the failure of a computer you didn’t even know existed can render your own computer unusable.

Leslie Lamport

Two distributed systems experts, one a theoretician and the other a practitioner, separated by a generation, make the same observation. Distributed systems are hard to understand, hard to control, and always frustrating when things go wrong. And sandwiched in the middle between the endpoints is the network operator. “Is it the network?” is not too far down the list of universal questions such as, “What is the meaning of life, the universe, and everything?” Sadly, network operators do not even have the humor of a Douglas Adams story to fall back on.

The modern data center with its scale and the ever increasing distributed nature of its applications only makes it more difficult to answer the questions that network operators have been dealing with since the dawn of distributed applications. Observability represents the operator’s latest attempt to respond adequately to the questions. Along with automation, observability has become one of the central pillars of the cloud native data center.

The primary goal of this chapter is to leave you with an understanding of the importance of observability ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781492045595Errata Page

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Cloud Native Data Center Networking

by Dinesh G. Dutt

Chapter 11. Network Observability

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.