Loading…
Subject: Data Engineering / Architecture and Streaming (DataWeek) clear filter
Wednesday, September 3
 

12:00pm PDT

PRO WORKSHOP (DataWeek): Building a RAG System for Video Search and Analysis
Wednesday September 3, 2025 12:00pm - 12:50pm PDT
Elizabeth Fuentes Leone, AWS, Developer Advocate

This talk addresses the challenge of making video content searchable and analyzable using modern AI techniques. While text and image RAG systems are common, video presents unique challenges due to its multimodal nature combining visual frames and audio content. 
Speakers
avatar for Elizabeth Fuentes Leone

Elizabeth Fuentes Leone

Developer Advocate, AWS
As a Data Analytics and Machine Learning/Artificial Intelligence (ML/AI) Specialist, my mission is to break down complex concepts into easily understandable terms. I strive to develop innovative solutions that tackle real-world challenges effectively. By sharing my knowledge and experience... Read More →
Wednesday September 3, 2025 12:00pm - 12:50pm PDT
DataWeek -- Main Stage

2:00pm PDT

PRO WORKSHOP (DataWeek): Building New Cost-Effective Analytics Platform with Open Source tools for Fanatics
Wednesday September 3, 2025 2:00pm - 2:50pm PDT
Bhanu Cherukumille, Fanatics, Director - Data Engineering

At Fanatics, we previously relied on a mix of in-house, open-source, and commercial tools for Analytics and BI. To streamline and modernize our data ecosystem, we built a unified platform — a "one to rule them all" solution — powered entirely by an open-source stack: Kafka, StarRocks, and Superset. The result? A cost-effective, high-performance, and feature-rich platform that unlocks powerful new capabilities for the business. 
Speakers
avatar for Bhanu Cherukumille

Bhanu Cherukumille

Director - Data Engineering, Fanatics
Specialize in building real-time and self-serve analytics solutions in the cloud. Empowering teams with agile, data-driven insights and transforming complex data into actionable intelligence for innovative organizations
Wednesday September 3, 2025 2:00pm - 2:50pm PDT
DataWeek -- Main Stage
 
Thursday, September 4
 

9:30am PDT

PRO Session (DataWeek): Intelligent Automation of Data Engineering Workflows with LLMs
Thursday September 4, 2025 9:30am - 9:55am PDT
Manohar Sai Jasti, Workday,  Analytics Engineer

I will share how I developed an AI-driven system to transform raw SQL into production-ready dbt models using Large Language Models (LLMs). By combining retrieval-augmented generation techniques with dbt’s semantic framework, I automated SQL refactoring, modularization, testing, and documentation. This approach accelerates data engineering workflows, reduces manual effort, and enables scalable, production-ready analytics pipelines. I will walk through the architecture, challenges faced during scaling, validation strategies for AI-generated SQL, and key lessons learned from deploying this solution in real-world environments. Attendees will gain practical insights into applying LLMs for data workflow automation, improving pipeline quality, and driving faster AI productionization across modern data stacks. 
Speakers
avatar for Manohar Sai Jasti

Manohar Sai Jasti

Analytics Engineer, Workday
Manohar Sai Jasti is an experienced Analytics Engineer specializing in building efficient and scalable data pipelines. With expertise in tools like dbt, Trino, and cloud platforms, he helps organizations turn data into actionable insights. Manohar is passionate about simplifying data... Read More →
Thursday September 4, 2025 9:30am - 9:55am PDT
DataWeek -- Main Stage
 
Friday, September 5
 

10:00am PDT

PRO Session (DataWeek): Data Sovereignty in the Age of AI
Friday September 5, 2025 10:00am - 10:25am PDT
Michel Tricot, Airbyte, Co-founder and CEO

This session will explore the intersection of data sovereignty and artificial intelligence, addressing how organizations can maintain control of their valuable data assets while still leveraging the power of AI. Drawing from extensive experience building open-source data infrastructure solutions, Michel will illuminate the challenges companies face when integrating AI into their data ecosystems without compromising ownership, security, or compliance requirements.

The session targets data leaders, CDOs, and enterprise architects who are navigating the complex landscape of AI adoption while maintaining strict data governance standards. Michel will share practical frameworks for implementing a self-managed data integration strategy that enables AI innovation while preserving first-party data sovereignty—a crucial consideration as regulatory requirements around data protection continue to evolve globally. Attendees will gain actionable insights on building resilient data architectures that support AI initiatives without surrendering control of sensitive information.

This session aligns perfectly with Data Week's focus on "Data Engineering & Governance" and "AI & ML" tracks, offering attendees a unique perspective on balancing innovation with control. Conference participants will benefit from Michel's vision of how open-source data integration infrastructure can serve as the foundation for responsible AI development, empowering organizations to build competitive advantages while maintaining complete sovereignty over their data. The presentation will include real-world examples of companies that have successfully implemented these principles.
Speakers
avatar for Michel Tricot

Michel Tricot

Co-founder and CEO, Airbyte
Michel Tricot is co-founder and CEO of Airbyte, the open data movement platform. The company was started in 2020 with a vision of commoditizing data integration pipelines across all industries and organizations and today has more than 170,000 deployments. Michel has been working in... Read More →
Friday September 5, 2025 10:00am - 10:25am PDT
DataWeek -- Main Stage
 
Wednesday, September 10
 

9:00am PDT

[Virtual] PRO WORKSHOP (DataWeek): Building a RAG System for Video Search and Analysis
Wednesday September 10, 2025 9:00am - 9:50am PDT
Elizabeth Fuentes Leone, AWS, Developer Advocate

This talk addresses the challenge of making video content searchable and analyzable using modern AI techniques. While text and image RAG systems are common, video presents unique challenges due to its multimodal nature combining visual frames and audio content. 
Speakers
avatar for Elizabeth Fuentes Leone

Elizabeth Fuentes Leone

Developer Advocate, AWS
As a Data Analytics and Machine Learning/Artificial Intelligence (ML/AI) Specialist, my mission is to break down complex concepts into easily understandable terms. I strive to develop innovative solutions that tackle real-world challenges effectively. By sharing my knowledge and experience... Read More →
Wednesday September 10, 2025 9:00am - 9:50am PDT
VIRTUAL DataWeek -- Main Stage

11:00am PDT

[Virtual] PRO WORKSHOP (DataWeek): Building New Cost-Effective Analytics Platform with Open Source tools for Fanatics
Wednesday September 10, 2025 11:00am - 11:50am PDT
Bhanu Cherukumille, Fanatics, Director - Data Engineering

At Fanatics, we previously relied on a mix of in-house, open-source, and commercial tools for Analytics and BI. To streamline and modernize our data ecosystem, we built a unified platform — a "one to rule them all" solution — powered entirely by an open-source stack: Kafka, StarRocks, and Superset. The result? A cost-effective, high-performance, and feature-rich platform that unlocks powerful new capabilities for the business. 
Speakers
avatar for Bhanu Cherukumille

Bhanu Cherukumille

Director - Data Engineering, Fanatics
Specialize in building real-time and self-serve analytics solutions in the cloud. Empowering teams with agile, data-driven insights and transforming complex data into actionable intelligence for innovative organizations
Wednesday September 10, 2025 11:00am - 11:50am PDT
VIRTUAL DataWeek -- Main Stage
 
Thursday, September 11
 

9:30am PDT

[Virtual] PRO Session (DataWeek): Intelligent Automation of Data Engineering Workflows with LLMs
Thursday September 11, 2025 9:30am - 9:55am PDT
Manohar Sai Jasti, Workday,  Analytics Engineer

I will share how I developed an AI-driven system to transform raw SQL into production-ready dbt models using Large Language Models (LLMs). By combining retrieval-augmented generation techniques with dbt’s semantic framework, I automated SQL refactoring, modularization, testing, and documentation. This approach accelerates data engineering workflows, reduces manual effort, and enables scalable, production-ready analytics pipelines. I will walk through the architecture, challenges faced during scaling, validation strategies for AI-generated SQL, and key lessons learned from deploying this solution in real-world environments. Attendees will gain practical insights into applying LLMs for data workflow automation, improving pipeline quality, and driving faster AI productionization across modern data stacks. 
Speakers
avatar for Manohar Sai Jasti

Manohar Sai Jasti

Analytics Engineer, Workday
Manohar Sai Jasti is an experienced Analytics Engineer specializing in building efficient and scalable data pipelines. With expertise in tools like dbt, Trino, and cloud platforms, he helps organizations turn data into actionable insights. Manohar is passionate about simplifying data... Read More →
Thursday September 11, 2025 9:30am - 9:55am PDT
VIRTUAL DataWeek -- Main Stage
 
Friday, September 12
 

10:00am PDT

[Virtual] PRO Session (DataWeek): Data Sovereignty in the Age of AI
Friday September 12, 2025 10:00am - 10:25am PDT
Michel Tricot, Airbyte, Co-founder and CEO

This session will explore the intersection of data sovereignty and artificial intelligence, addressing how organizations can maintain control of their valuable data assets while still leveraging the power of AI. Drawing from extensive experience building open-source data infrastructure solutions, Michel will illuminate the challenges companies face when integrating AI into their data ecosystems without compromising ownership, security, or compliance requirements.

The session targets data leaders, CDOs, and enterprise architects who are navigating the complex landscape of AI adoption while maintaining strict data governance standards. Michel will share practical frameworks for implementing a self-managed data integration strategy that enables AI innovation while preserving first-party data sovereignty—a crucial consideration as regulatory requirements around data protection continue to evolve globally. Attendees will gain actionable insights on building resilient data architectures that support AI initiatives without surrendering control of sensitive information.

This session aligns perfectly with Data Week's focus on "Data Engineering & Governance" and "AI & ML" tracks, offering attendees a unique perspective on balancing innovation with control. Conference participants will benefit from Michel's vision of how open-source data integration infrastructure can serve as the foundation for responsible AI development, empowering organizations to build competitive advantages while maintaining complete sovereignty over their data. The presentation will include real-world examples of companies that have successfully implemented these principles.
Speakers
avatar for Michel Tricot

Michel Tricot

Co-founder and CEO, Airbyte
Michel Tricot is co-founder and CEO of Airbyte, the open data movement platform. The company was started in 2020 with a vision of commoditizing data integration pipelines across all industries and organizations and today has more than 170,000 deployments. Michel has been working in... Read More →
Friday September 12, 2025 10:00am - 10:25am PDT
VIRTUAL DataWeek -- Main Stage
 

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.