Big Data Track Overview

Big data and open source remain closely intertwined, and this year’s track will look at the processes, tools and frameworks making the analysis and management of large datasets possible. This track is absolutely packed year after year! 

*Note, this track takes place on both Monday, October 28 and Tuesday, October 29 in room 302B (third floor) of the Convention Center. 

See who’s speaking and what’s being covered below.

Thanks to the wonderful team at MongoDB for making this track possible.




Sarah Gran's headshot

Open Source Privacy-Preserving Metrics

Sarah Gran, VP of Brand & Donor Development & Brandon Pitman, Engineer and Technical Lead, Divvi Up


Sidhant Kohli's headshot
Juanlu Yu's headshot


Equipping easy-to-use and scalable stream processing technologies on Kubernetes

Sidhant Kohli, Senior Software Engineer & Juanlu Yu, Senior Software Engineer, Intuit

Nyah Macklin's headshot

Understanding Vector Databases

Nyah Macklin, enior Developer Advocate, Couchbase




Machine Learning Pipelines at Scale with Apache Beam

Danny McCormick, Senior Software Engineer, Google