Webinar On Demand

Building a Data Infrastructure for AI/ML

Recorded October 30, 2024

View a Complimentary Live Webinar Sponsored by MinIO

The Open Table Formats (OTFs) designed by Netflix (Apache Iceberg), Uber (Aache Hudi), and Databricks (Delta Lake) have made it possible to build a cloud-native data infrastructure capable of supporting all the datatypes needed for AI/ML. Commonly called a Data Lakehouse, this new platform disaggregates compute from storage and can scale out as capacity requirements change. 

This session will introduce the Data Lakehouse and show how it supports the specialized tooling needed for traditional AI, generative AI, distributed training, and MLOps. This talk concludes with some observations on Nvidia’s GPUs and provides recommendations for adopting GPUs.

Download Slides
Keith Pijanowski

Subject Matter Expert AI/ML, MinIO

Speaker

Keith Pijanowski is MinIO’s subject matter expert for all things AI/ML where he researches and writes about storage requirements for AI and ML workloads. Keith has extensive experience in the software space, most recently as an Enterprise Architect on BNY Mellon’s Distribution Analytics team building data pipelines and analytics solutions. Prior to BNY Mellon, Keith spent more than a decade at Microsoft where he served in a number of different developer evangelism and business roles. He was one of the first members of Microsoft’s Evangelism team when the .NET Framework was first released.