Live Event

Data in Motion NYC: Messaging, Streaming, and Processing

Location: New York, NY
When: Jun 4, 2026 | 11:00 am ET

Join meshIQ in NYC on June 4 for talks on Apache ActiveMQ®, Beam, and GNN pipelines—plus rooftop networking with data engineering and streaming pros.

Agenda

  • 11:00 AM – 11:20 AM: Arrival & Networking
  • 11:20 AM – 12:00 PM: Data in Motion with Apache ActiveMQ® and Apache Beam | JB Onofré, Principal Software Engineer, Dremio + Director, Apache Foundation
  • 12:00 PM – 1:00 PM: Rooftop Lunch & Networking (on the rooftop if the weather permits)
  • 1:00 PM – 1:40 PM: GraphFlow & Beam: Pythonic, Scalable GNN Pipelines | Yogesh Tewari, Senior Cloud Data Engineer, Google
  • 1:40 PM – 2:00 PM: Final Networking

Abstracts

Data in Motion with Apache ActiveMQ® and Apache Beam

JB Onofré, Principal Software Engineer, Dremio + Director, Apache Foundation

Modern data architectures demand more than batch processing — they require reliable, scalable, and flexible pipelines that can handle data as it moves. This session explores the powerful combination of Apache ActiveMQ, a battle-tested message broker for enterprise messaging, and Apache Beam, a unified programming model for both batch and streaming data processing.

We’ll walk through the fundamentals of integrating ActiveMQ as a durable message source and sink within Beam pipelines, enabling real-time event-driven workflows across distributed systems. Attendees will learn how to build end-to-end pipelines that consume messages from ActiveMQ queues and topics, apply transformations, enrichments, and windowing strategies using Beam’s expressive API, and route results to downstream systems — all with portability across runners like Apache Flink, Apache Spark, and Google Dataflow.

Key topics include:

  • ActiveMQ connectivity patterns in Beam (JMS I/O)
  • Message acknowledgment and exactly-once semantics
  • Schema handling and payload deserialization
  • Scaling strategies for high-throughput messaging workloads
  • Real-world use cases: event sourcing, CDC, and operational data pipelines

Whether you’re modernizing a legacy messaging infrastructure or designing a new streaming architecture from scratch, this talk will give you practical patterns and insights to put data in motion — reliably and at scale.

GraphFlow & Beam: Pythonic, Scalable GNN Pipelines

Yogesh Tewari, Senior Cloud Data Engineer at Google

Learn how GraphFlow, a modular Python toolkit, utilizes Apache Beam to create efficient and scalable data pipelines for Graph Neural Networks (GNNs). We’ll demonstrate how GraphFlow on Beam tackles large-scale graph data challenges, including distributed ingestion from cloud databases, scalable feature normalization, graph sampling, and online model inference.

Cookies preferences

Others

Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.

Necessary

Necessary
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.

Advertisement

Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.

Analytics

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.

Functional

Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.

Performance

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.