Fast Data Architectures for Streaming Applications (2nd Edition)

Dean Wampler, Ph.D., VP of Fast Data Engineering, Lightbend, Inc.

Audience: Developers and Architects

Technical level: Beginning-Intermediate

Why have real-time, stream-oriented data systems become so popular, when batch-oriented systems have served Big Data needs for many years? While batch-mode processing isn’t going away, it’s clear that exclusive use of these systems is now a competitive disadvantage.

In this 2nd Edition of the O’Reilly eBook, Dr. Dean Wampler examines the rise of streaming systems–known as Fast Data architectures–for handling time-sensitive problems like detecting fraudulent financial activity as it happens.

Using several open source tools, you’ll explore the characteristics needed to implement real-time, streaming data architectures. You will also learn that while these systems are much harder to build, they represent the state of the art for dealing with mountains of data that require immediate attention.

  • Learn step-by-step how a basic Fast Data architecture works
  • Understand why event logs are the core abstraction for streaming architectures, while message queues are the core integration tool
  • Use methods for analyzing infinite data sets, where you don’t have all the data and never will
  • Take a tour of open source streaming engines, and discover which ones work best for different use cases
  • Get recommendations for making real-world streaming system responsive, resilient, elastic, and message driven
  • Explore two example applications, data ETL and analysis, and predictive analytics in IoT (Internet of Things) for telemetry ingestion and anomaly detection in home automation systems

Grab your copy

About Author(s)

Dean Wampler, Ph.D., VP of Fast Data Engineering, Lightbend, Inc.

Dean Wampler is the Vice President of Fast Data Engineering at Lightbend, where he leads the efforts around streaming data, machine learning, and real-time analytics features of Lightbend Platform. Dean is the author of Programming Scala and Functional Programming for Java Developers and the coauthor of Programming Hive, all from O’Reilly. He is a contributor to several open source projects and he is the co-organizer of several conferences around the world and several user groups in Chicago.