1161553083.20251024-122340

This webinar will bring together insights from ODI projects, and combines them with input from experts in the field of open data and AI. We will explore the AI readiness of open data, unpack key projects that use open data for AI, and investigate the opportunities open data can bring to the AI ecosystem, and how to realise them.

Background

Data is the foundation of AI. Poor quality data drives up costs and can lead to hidden problems for AI, especially in complex fields like healthcare. Meanwhile, biased data negatively affects the performance of AI models, and uninspected evaluation datasets can lead to false positives or overestimates of model accuracy.

At the ODI we’ve been thinking about what it means for data to be AI ready. Earlier in 2025 developed a framework to define the distinct aspects of what makes data ready to be used for AI. The framework explores the specific criteria for data publishers to follow, aiming to offer actionable recommendations for data, metadata, and infrastructure design. Building on recent work to test the framework in the public sector setting, we have expanded the framework to consider governance design as well.

Open data and AI

The relationship between open data and AI is full of potential, but involves significant complexities. Open data has played a key role in the recent explosion of AI innovation, and enables more transparent and accountable governance. However, there are also significant concerns about the use of open data for training AI, and the lack of reference or attribution to open datasets.

Despite significant amounts of open data being published online, it is not all necessarily AI ready. Just as during the first wave of open data, simply making open datasets available online is not enough for them to be used to support AI development. Ensuring datasets have high-quality metadata and data infrastructure, as well as robust data governance, is essential for AI innovation.

In 2025 we explored the landscape of open data use, developing the Open Data Use Case Observatory as part of our final year partnership with Microsoft. The observatory includes case studies from a broad range of subject matters, modalities and geographies. The case studies focus on the intricacies of each dataset and how they enabled the datasets to be used for impact, whether that came from use in the training of AI models or in other technological, data-led interventions. In a second phase of work, we have applied our framework of AI readiness to a subset of these case studies, providing some initial insight into the landscape of open data and AI.