Beam webinar series April 2021

The webinar series provides hands-on training in solving data analytics use cases with Apache Beam.

The 5-day webinar series is designed to be flexible, so participants can sign up and drop in for the topics that match their interests and needs.

The webinars start on April 7th, 2021. Participation is free!

Curriculum

April 7, 2021

The distributed data processing landscape, and Apache Beam under the hood

  • Time: 4 pm to 9 pm UTC
  • Target skill level: Beginner. Users who have heard about Beam but haven’t deployed a pipeline in production.

We will start with an overview of the current landscape of data processing tools and explain where Beam comes into play. After this, we will go into the details of Beam technology, its components, and its value proposition.

April 8, 2021

Advanced distributed data processing

  • Time: 4 pm to 9 pm UTC
  • Target skill level: Intermediate. Users who are already familiar with the basic components of Beam and know how they apply to distributed data processing systems.

We will cover core concepts of distributed data processing systems and review how to apply them to real business applications.

April 9, 2021

Beam features to scale and productionalize your use case

  • Time: 4 pm to 8 pm UTC
  • Target skill level: Intermediate/Advanced. Users who are familiar with basic Beam deployments and are interested in expanding their use and investments.

We will study the Beam features that make pipelines fully portable, so that your data pipelines are ready to run anywhere and can be shared with stakeholders in your organization, regardless of their preferred programming language.

April 15, 2021

Strategies for performance and cost optimization

  • Time: 5 pm to 9 pm UTC
  • Target skill level: Intermediate. Users who are already deploying standard Beam solutions from templates or without much customization.

We will cover best practices for optimizing the performance and cost of data processing jobs that use Beam on some of the available runners, such as Cloud Dataflow. We’ll be using the Dataflow runner for this session.

April 16, 2021

Best practices for debugging Beam pipelines

  • Time: 4 pm to 7 pm UTC
  • Target skill level: Intermediate/Advanced. Users who have deployed Beam pipelines in production and at scale.

We will review some of the most common issues experienced when using Beam, and learn tips to prevent them by leveraging runner features to monitor and debug Beam jobs. We’ll be using the Dataflow runner for this session.