Designing Big Data Platforms

by Yusuf Aytas
July 2021
Beginner to intermediate
336 pages
9h 22m
English
Wiley
Appendix A Further Systems and Patterns

Throughout the book, we have touched on many subjects. Some subjects would have been worth covering but did not fit the flow of the book. I have therefore moved them to this appendix to give a rough idea of each. In this part, I discuss Lambda architecture, Apache Cassandra, and Apache Beam.

A.1 Lambda Architecture

Lambda architecture is a deployment model in which organizations complement batch processing with stream processing to solve real‐time big data problems. It arose from the difficulty of serving data in real time (Marz, 2011). Ideally, a system would scan the entire dataset to respond to a query. In practice, this gets tricky because some queries have far too much data to scan, and the data volume can result in unacceptable response times. Moreover, most organizations prefer their services to stay available, so they choose availability over consistency. Choosing availability over consistency results in weaker consistency levels: a read after a write might not return the expected response, and without read repairs, the data can stay corrupted. Human error can also lead to problems: updates to systems pose corruption threats that may be unrecoverable (Figure A.1).

Figure A.1 Lambda architecture.

To address these problems, the Lambda architecture uses an immutable stream of data and ...
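
The idea of complementing batch processing with stream processing over immutable data can be illustrated with a small sketch. The following is only a toy, single-process illustration under stated assumptions, not the book's design: the `Event` schema, the `LambdaSketch` class, and its method names are all hypothetical. Events are appended to a log and never modified, a batch job recomputes a view from the whole log, an incremental view covers events that arrived since the last batch run, and a query merges the two.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """An immutable fact: one page view (hypothetical schema)."""
    timestamp: int
    page: str


class LambdaSketch:
    """Toy page-view counter combining a batch view and an incremental view."""

    def __init__(self):
        self.log = []                # append-only list of immutable events
        self.batch_view = Counter()  # counts recomputed from the full log
        self.speed_view = Counter()  # counts for events since the last batch run

    def append(self, event):
        """Record a new event and update the incremental view right away."""
        self.log.append(event)
        self.speed_view[event.page] += 1

    def run_batch(self):
        """Recompute the batch view from scratch, then clear the incremental view."""
        self.batch_view = Counter(e.page for e in self.log)
        self.speed_view = Counter()

    def query(self, page):
        """Answer a query by merging the batch and incremental views."""
        return self.batch_view[page] + self.speed_view[page]


system = LambdaSketch()
system.append(Event(1, "/home"))
system.append(Event(2, "/home"))
system.run_batch()                # batch view now covers the first two events
system.append(Event(3, "/home"))  # arrives after the batch run
system.append(Event(4, "/docs"))
print(system.query("/home"))      # prints 3
print(system.query("/docs"))      # prints 1
```

In this sketch, queries read small precomputed views instead of scanning the whole log, and because events are never updated in place, a view damaged by a faulty update can be rebuilt by calling `run_batch` again. A real deployment would run the batch and streaming paths on separate systems and would have to coordinate the handoff between them, which this toy ignores.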


ISBN: 9781119690924