UK Government Plans to Use National Data for AI Development
Tech Beetle briefing GB

UK Government Plans to Use National Data for AI Development

Essential brief

UK Government Plans to Use National Data for AI Development

Key facts

The UK government plans to license data from national institutions like the Met Office and National Archives for AI development.
Projects will explore using public data to assist local agencies and small businesses with practical needs such as weather-related decisions and legal guidance.
The government aims to balance AI innovation with copyright protections amid industry concerns about the use of protected works.
A national data library and a cultural data library are key components of the government's AI action plan to provide state-controlled datasets.
The creative content exchange marketplace will facilitate large-scale licensing of digitized cultural assets for AI and other uses.

Highlights

The UK government plans to license data from national institutions like the Met Office and National Archives for AI development.
Projects will explore using public data to assist local agencies and small businesses with practical needs such as weather-related decisions and legal guidance.
The government aims to balance AI innovation with copyright protections amid industry concerns about the use of protected works.
A national data library and a cultural data library are key components of the government's AI action plan to provide state-controlled datasets.

The UK government is advancing plans to integrate nationally owned data from institutions like the Met Office and the National Archives into artificial intelligence (AI) systems. This initiative aims to leverage public sector data to enhance AI tools that can assist various sectors, including local agencies and small businesses. For example, researchers are being funded to explore how Met Office weather data could optimize decisions such as when councils should purchase road grit. Another project will investigate how legal documents from the National Archives might support small and medium-sized enterprises by providing accessible legal guidance.

These efforts are part of a broader government strategy to license content from national institutions such as the National History Museum and the National Library of Scotland. Ian Murray, the minister for digital government and data, emphasized that making authoritative legal data AI-ready could help business owners quickly navigate complex regulations, freeing up their time and boosting confidence to invest and grow. This approach reflects a push to make public sector data more usable and beneficial for citizens and businesses alike.

Data is a critical input for AI models underpinning tools like ChatGPT. However, the use of copyrighted material in AI training has sparked controversy in the UK, with creative industries opposing government proposals that would allow AI companies to use protected works without explicit permission unless artists opt out. In response to criticism, the government announced a review aiming to reset these plans. Despite this, the government remains committed to utilizing nationally owned data to drive AI development, as outlined in its AI action plan launched the previous year.

The AI action plan includes creating a national data library to provide state-controlled datasets to researchers and companies. Health data managed by the NHS has been identified as a potential component, though privacy, ethics, and data protection remain paramount considerations. Additionally, the plan proposes a cultural data library as a commercial venture, potentially containing copyright-cleared content from institutions such as the BBC, British Library, Natural History Museum, and National Archives.

To facilitate this, the government is expanding its "creative content exchange," a marketplace launched in June designed to enable the buying, selling, licensing, and access to digitized cultural and creative assets at scale. A pilot scheme will explore how data from various institutions—including the Imperial War Museums, Royal Botanic Gardens Kew, Science Museum Group, and Victoria and Albert Museum—can be utilized by consumers, technology companies, and AI developers while respecting creators' rights. The pilot platform is expected to launch in the summer, marking a significant step toward integrating public data into AI innovation.

This initiative reflects the UK government's dual focus on fostering AI innovation and safeguarding intellectual property rights. By making authoritative public data accessible for AI development, the government hopes to stimulate economic growth, support small businesses, and maintain ethical standards in data use. The success of these efforts could position the UK as a leader in responsible AI deployment using publicly owned resources.