Live online training

Fundamentals of Data Architecture

Bridging the Gap between Doer and Visionary

Ted Malaska

If you're an engineer who wants to be a data architect, this course is for you. You'll explore data architecture as a science, including vital topics such as data weight, data modeling, streaming, and interface design. You'll learn about the different types of architects and discover the best model for you. You'll be able to recognize good architecture (and avoid bad architecture). And you'll get a solid understanding of the disciplines that mark the difference between an average architect and a great one.

What you'll learn and how you can apply it

By the end of this online course, you'll understand:

  • The different types of architects
  • Qualities that make a good architect
  • Designing for risk mitigation
  • Interface design
  • Multi-team/department interdependencies
  • Data modeling approaches
  • The difference between building products and building services
  • The importance of benchmarking

And you'll be able to:

  • Break down a large-scale project into executable components
  • Build an effective strategy for evangelism
  • Earn the respect of fellow engineers
  • Influence design and direction without having to force top-down decisions
  • Master the relationship with project management and product management

This training course is for you because...

  • You are a software engineer and want to deepen your understanding of architecture rules to move past just coding.

  • You are a software architect who wants to sharpen your skills.

  • You are a product manager and you want to know what traits to look for when selecting a data architect.


Prerequisites

  • Familiarity with data analytics and data modeling.
  • Some experience with software development and the software life cycle would be helpful.

About your instructor

  • Ted Malaska is the director of enterprise architecture at Capital One. Previously, he worked on the Battle.net team at Blizzard Entertainment; served as a principal solutions architect at Cloudera, where he helped clients succeed with Hadoop and the Hadoop ecosystem; and was a lead architect at the Financial Industry Regulatory Authority (FINRA). He has contributed code to Apache Flume, Apache Avro, Apache YARN, Apache HDFS, Apache Spark, Apache Sqoop, and many more. Ted is the coauthor of Hadoop Application Architectures, a frequent conference speaker, and a blogger on data architectures.


Schedule

The timeframes are only estimates and may vary according to how the class is progressing.

Day One

  • To be a great engineer (10 minutes)
  • Reviewing the 9-box review process (15 minutes)
  • Types of architects: An introduction (15 minutes)
  • Top-down vs bottom-up (15 minutes)
  • Staying one step ahead (15 minutes)
  • Looking for risk and issues (15 minutes)
  • Data modeling (20 minutes)
  • Tricks to dig into an existing system (20 minutes)
  • Dealing with performance problems (20 minutes)
  • Finding a vision (10 minutes)
  • How to make a PowerPoint presentation that is remembered (15 minutes)

Day Two

  • Discovering data value in existing datasets (20 minutes)
  • What can be learned from a salesman (20 minutes)
  • Building out a vision and getting buy-in (20 minutes)
  • Walking and running & the art of PoC (20 minutes)
  • How to deal with conflict (20 minutes)
  • The art of promoting others (10 minutes)
  • Earning the respect of engineers (20 minutes)
  • Earning the respect of a superior (CTO) (20 minutes)
  • How to get a talk at a conference (20 minutes)
  • Types of architects: A reflection (10 minutes)