11-30, 15:15–15:45 (Europe/Amsterdam), Ernst-Curie
Royal FloraHolland is the biggest flower and plant auction in the world. We sell products on behalf of the growers. Every day we receive new supply and we want to ensure quality standards. In our talk we will present a project that aimed to revolutionize the quality control process within our organization. With a substantial inflow of supply, far surpassing the quality control team's capacity, the traditional approach of random checking proved impractical. Clearly, a smarter solution was needed. We will showcase how we developed and implemented a multi-model system, which utilizes a dynamic risk threshold for optimal selection of supply items. This is an informative, yet engaging talk with practical examples of techniques and infrastructure used.
The challenge we faced was in determining when a supply item is faulty and should be sent for quality control inspection. With a daily influx of around 15,000 supply items and a quality control team capable of inspecting only 200 items due to extensive manual labor requirements. Random checking was not a viable option, and we sought to leverage data science to preemptively identify the most likely faulty items.
What makes this project particularly captivating is the delicate balance it strikes between forecasting the expected volume of supply items and predicting the risk scores for each item. This synergy between volume forecasting and risk prediction enabled us to establish a dynamic risk threshold, a pivotal factor in determining whether a supply item should undergo quality control inspection at that moment in time. The dynamic nature of this threshold accommodated the ever-changing landscape of supply volumes, ensuring that it adapted to meet the evolving needs of our organization.
We will first introduce the problem and business requirements, then explain our solution, and finally show an overview of techniques and frameworks used (Delta Unity Catalog tables, Airflow, PySpark, MLflow model registry, HyperOpt model optimization, Databricks serving endpoint).
The key takeaways for our audience:
Understand the importance and method of dynamically adjusting risk thresholds in quality control.
Learn the benefits of using a multi modal approach in supply chain management systems.
Expected background knowledge:
Attendees should have a basic understanding of predictive modeling and data science concepts. Familiarity with supply chain management or quality control processes is beneficial but not mandatory.
Audience:
This talk is designed for data- scientists/engineers, supply chain professionals, quality control teams, and anyone interested in the application of data science in real-world challenges.
Previous knowledge expected
Data Scientist at Royal FloraHolland
Machine Learning Engineer at Xebia Data.