Sessions for the 2024 edition are organized across three days:
1. Apache Beam Overview (July 23)
Sessions on this day will provide an overview of Apache Beam. First, we will focus on understanding what Apache Beam is, how it differs from other tools in the data processing ecosystem, and when it is a good fit for your project or organization. After that, you will learn how to get started with Apache Beam and build your first pipeline.
2. Apache Beam for AI (July 24)
On day 2, you will learn how to use Apache Beam to implement AI pipelines. In the first series of lessons, you will implement a machine learning pipeline all the way from conceptualization to coding and running it in a notebook. We will have an additional session on using Beam to interact with Google Gemini via Google AI Studio.
3. Making the jump from batch to streaming (July 25)
One of the main advantages of Apache Beam is that you use the same programming model to implement both batch and streaming pipelines. In the sessions for this day, we will go over the key concepts that you need to understand to implement streaming pipelines in Beam and walk through a demo. We will then provide an overview of Beam Quest, a learning resource for advanced streaming concepts with Apache Beam.