Loading…
Type: DataWeek clear filter
arrow_back View All Dates
Thursday, September 11
 

9:30am PDT

[Virtual] PRO Session (DataWeek): Intelligent Automation of Data Engineering Workflows with LLMs
Thursday September 11, 2025 9:30am - 9:55am PDT
Manohar Sai Jasti, Workday,  Analytics Engineer

I will share how I developed an AI-driven system to transform raw SQL into production-ready dbt models using Large Language Models (LLMs). By combining retrieval-augmented generation techniques with dbt’s semantic framework, I automated SQL refactoring, modularization, testing, and documentation. This approach accelerates data engineering workflows, reduces manual effort, and enables scalable, production-ready analytics pipelines. I will walk through the architecture, challenges faced during scaling, validation strategies for AI-generated SQL, and key lessons learned from deploying this solution in real-world environments. Attendees will gain practical insights into applying LLMs for data workflow automation, improving pipeline quality, and driving faster AI productionization across modern data stacks. 
Speakers
avatar for Manohar Sai Jasti

Manohar Sai Jasti

Analytics Engineer, Workday
Manohar Sai Jasti is an experienced Analytics Engineer specializing in building efficient and scalable data pipelines. With expertise in tools like dbt, Trino, and cloud platforms, he helps organizations turn data into actionable insights. Manohar is passionate about simplifying data... Read More →
Thursday September 11, 2025 9:30am - 9:55am PDT
VIRTUAL DataWeek -- Main Stage

10:00am PDT

[Virtual] PRO Session (DataWeek): Generative AI Operation Evaluation Framework
Thursday September 11, 2025 10:00am - 10:25am PDT
Cigil Achenkunju, LivePerson, Data and Product Management

How do we know if this gen AI investment is moving the needle? It is a question heard almost daily across finance, healthcare, and retail. And honestly, it is the right question to ask. On top of that, should we continue to invest in AI at the same rate or optimize? How can a stakeholder show that solution usage has a positive or maybe negative impact on their operation?
Survey results indicate that up to 85% of AI initiatives eventually fail to deliver their promises. Organizations using gen AI want to understand the impact of such solutions clearly. Can you blame them? So, let’s define a strategic decision-making framework that broadly answers these business questions in an operational setting that balances the benefits of business value and AI integration.

An analytical framework for operations measurement 2S/2E: Think of it as four pillars used together to tell a complete story of your gen AI-enabled operation's health: Each pillar reveals a different facet of your performance, and I'll show you exactly how to measure them. These pillars offer valuable insights to measure your operations. What makes this framework powerful is its systematic approach and adaptability.
Speakers
avatar for Cigil Achenkunju

Cigil Achenkunju

Data and Product Management, LivePerson
A leader in advanced data analytics and a strategic advisor, Cigil has a robust background in data science and product management. With extensive experience across various organizations, Cigil has helped companies transform data into actionable insights that drive business success... Read More →
Thursday September 11, 2025 10:00am - 10:25am PDT
VIRTUAL DataWeek -- Main Stage

10:30am PDT

[Virtual] OPEN Session (DataWeek): Hiring for AI Success: Why Your First Hire Should Be a Data Engineer
Thursday September 11, 2025 10:30am - 10:55am PDT
Brenna Buuck, MinIO, Developer Evangelist

AI initiatives are at the top of every organization’s priority list, yet many fail before they even begin—not because of poor models, but because of poor data foundations. While hiring an AI/ML engineer may seem like the logical first step, success depends on a different approach: hiring a data engineer first.

In this session, I'll explore why data infrastructure is the true bottleneck in AI adoption and how the right data engineering expertise ensures AI models perform at scale. Drawing on real-world experience, I’ll walk through the hiring missteps organizations often make and how to avoid costly mistakes when building AI initiatives from the ground up.
Speakers
avatar for Brenna Buuck

Brenna Buuck

Developer Evangelist, MinIO
Brenna Buuck is the subject matter expert at MinIO for databases and datalakes. A data engineer turned developer evangelist, she is passionate about coding, data, and learning. She endeavors to inspire and educate other developers about the latest tools and technologies with the goal... Read More →
Thursday September 11, 2025 10:30am - 10:55am PDT
VIRTUAL DataWeek -- Main Stage
  DataWeek

11:30am PDT

[Virtual] OPEN Session (DataWeek): Strategies for Image Dataset Curation from High-Volume Industrial IoT data
Thursday September 11, 2025 11:30am - 11:55am PDT
Apurva Godghase, Brambles, Senior Computer Vision Engineer

In Industrial IoT for Supply chain, and logistics, massive amounts of data is generated by edge devices that capture data continuously. For embedded vision systems, managing the sheer volume of images and metadata can be challenging. Selecting a diverse subset of high-quality data is crucial for effective modeling and analysis. This work outlines a comprehensive method for selecting relevant images from an extensive dataset to build a high-quality image database for building and monitoring computer vision and machine learning models. This systematic approach not only enhances the efficiency of data management in industrial IoT applications but also improves the generalizability and accuracy of Computer Vision learning models. 
Speakers
avatar for Apurva Godghase

Apurva Godghase

Senior Computer Vision Engineer, Brambles
Apurva is a Senior Computer Vision Engineer at Brambles, with over seven years of R&D experience across diverse industrial domains. At Brambles, she specializes in designing and deploying cutting-edge machine learning and computer vision IoT prototypes to enhance supply chain efficiencies... Read More →
Thursday September 11, 2025 11:30am - 11:55am PDT
VIRTUAL DataWeek -- Main Stage

1:00pm PDT

[Virtual] PRO Session (DataWeek): AI-Driven Innovation: Scalable Data Architectures
Thursday September 11, 2025 1:00pm - 1:25pm PDT
Pritam Roy, Capgemini, Sr. Manager

As enterprises embrace AI for scalable automation, predictive analytics, and real-time decision intelligence, the need for robust data architectures and machine learning frameworks has never been greater. This session, led by Pritam Roy, a seasoned AI and data engineering leader, will explore how to design and implement scalable AI-powered data solutions that optimize business operations, cloud efficiency, and enterprise intelligence. 
Speakers
avatar for Pritam Roy

Pritam Roy

Sr. Manager, Capgemini
Pritam Roy is a seasoned AI and data engineering leader, specializing in enterprise-scale AI solutions, cloud computing, and machine learning-driven business transformation. With over 20 years of experience, he has played a pivotal role in AI innovation, predictive analytics, and... Read More →
Thursday September 11, 2025 1:00pm - 1:25pm PDT
VIRTUAL DataWeek -- Main Stage
  DataWeek

1:30pm PDT

[Virtual] PRO Session (DataWeek): Integrating Data Governance into Cyber Risk Management
Thursday September 11, 2025 1:30pm - 1:55pm PDT
Nandini Singh, Google, Sr. TPM

This session is designed for cybersecurity professionals, data governance leaders, and IT managers seeking to strengthen their organization's cybersecurity posture through effective data governance practices. Attendees will leave with actionable insights and strategies to enhance their organization's resilience against cyber threats.

Drawing upon my experience of working at the Office of Cybersecurity Resilience at Google, I will share lessons learned from integrating data governance into cyber risk management, with a focus on evaluating metric quality levels (introducing the concept of Metric Bill of Materials) and developing a continuous improvement and adaptation roadmap.
Speakers
avatar for Nandini Singh

Nandini Singh

Sr. TPM, Google
Nandini Singh is a seasoned professional in the fields of data modeling, analytics, and cybersecurity technologies, with a robust career that spans over a decade. She currently serves as a Senior Technical Program Manager at Google, where she leads initiatives on product, platform... Read More →
Thursday September 11, 2025 1:30pm - 1:55pm PDT
VIRTUAL DataWeek -- Main Stage

3:00pm PDT

[Virtual] OPEN Session (DataWeek): AI Leadership in Data Strategy: Transforming Large-Scale Data Systems for Business Growth
Thursday September 11, 2025 3:00pm - 3:25pm PDT
Vijay Panwar, Panasonic Avionics Corporation, Senior Software Engineer

As organizations progressively depend on data to foster innovation, the significance of leadership in shaping and executing AI-driven strategies becomes crucial. In this session, I will present insights gained from over 12 years of experience spearheading transformative initiatives incorporating AI into extensive data systems. The discussion will emphasize strategic frameworks for the adoption of AI, the alignment of technological advancements with business goals, and the development of scalable data ecosystems. By referencing real-world examples, including my involvement in managing and optimizing terabyte-scale data, I will demonstrate how AI can transform backend systems, enhance workflows, and provide tangible value. 
Speakers
avatar for Vijay Panwar

Vijay Panwar

Senior Software Engineer, Panasonic Avionics Corportion
I am, an accomplished IT professional with a decade of experience, possess expertise in a wide array of technologies, including Python, SQL Server, MySQL, PHP, Web services, REST API, and more. I have a proven track record of contributing to the field, having published two research... Read More →
Thursday September 11, 2025 3:00pm - 3:25pm PDT
VIRTUAL DataWeek -- Main Stage
  DataWeek

3:30pm PDT

[Virtual] OPEN Session (DataWeek): Balancing Velocity with Academic Rigor When Building with LLMs
Thursday September 11, 2025 3:30pm - 3:55pm PDT
Lauren Peate, Multitudes, CEO & founder

We’re all building AI features now. But building with LLMs brings its own challenges – namely: How can we use cutting-edge practices, weave in AI ethics, and consider the cost of different models without blowing past delivery dates. Not to mention making sure that the features we build will be stable, reliable and maintainable in the future.

We recently built our first LLM feature, to show the quality of feedback given in code reviews. In 1 month, we did a literature review, consultation with academic experts, data labelling, model experimentation, a cost assessment, and finally, all the ML engineering to launch it into production. The outcome: <1% extreme misclassification and zero hallucinations. In this talk, we’ll share our approach to building LLM features – how we partnered with academia (without being delayed by their timelines), what tooling we used, and how we made the cost and money tradeoffs to keep business stakeholders happy. I’ll also speak to how we built this into our microservices architecture, including how we used tools to generate structured outputs from LLMs on top of AWS’s Bedrock API to have parseable responses from a range of models.

You'll walk away with practical strategies for leading your own teams through AI implementations, identifying ethical issues early, addressing them efficiently, and still delivering on time and on budget.
Speakers
avatar for Lauren Peate

Lauren Peate

CEO & founder, Multitudes
Lauren Peate is the CEO and founder of Multitudes, which helps engineering teams improve delivery sustainably. She’s focused her career on using data to support people, including as the founder of Ally Skills NZ, a consultancy helping global tech companies improve team performance... Read More →
Thursday September 11, 2025 3:30pm - 3:55pm PDT
VIRTUAL DataWeek -- Main Stage
 

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -