With AI models consuming more data than ever, ensuring the integrity and traceability of that data is critical. This talk focuses on how to build trust into your data pipelines, using concepts like SBOMs (Software Bills of Materials) for datasets, audit trails, and metadata tagging to make data consumption safer and more transparent. I’ll also touch on how this ties into emerging compliance frameworks and how we’ve approached it in practice.
International Red Hat Women in Open Source Awardee | Mozilla Open Leader 2019 | Strong supporter of diversity in open source | Google Venkat Scholarship winner | Speaker