UK Government Plans to Use National Data for AI Development
Essential brief
UK Government Plans to Use National Data for AI Development
Key facts
Highlights
The UK government is advancing plans to integrate nationally owned data from institutions like the Met Office and the National Archives into artificial intelligence (AI) systems. This initiative aims to leverage public sector data to enhance AI tools that can assist various sectors, including local agencies and small businesses. For example, researchers are being funded to explore how Met Office weather data could optimize decisions such as when councils should purchase road grit. Another project will investigate how legal documents from the National Archives might support small and medium-sized enterprises by providing accessible legal guidance.
These efforts are part of a broader government strategy to license content from national institutions such as the National History Museum and the National Library of Scotland. Ian Murray, the minister for digital government and data, emphasized that making authoritative legal data AI-ready could help business owners quickly navigate complex regulations, freeing up their time and boosting confidence to invest and grow. This approach reflects a push to make public sector data more usable and beneficial for citizens and businesses alike.
Data is a critical input for AI models underpinning tools like ChatGPT. However, the use of copyrighted material in AI training has sparked controversy in the UK, with creative industries opposing government proposals that would allow AI companies to use protected works without explicit permission unless artists opt out. In response to criticism, the government announced a review aiming to reset these plans. Despite this, the government remains committed to utilizing nationally owned data to drive AI development, as outlined in its AI action plan launched the previous year.
The AI action plan includes creating a national data library to provide state-controlled datasets to researchers and companies. Health data managed by the NHS has been identified as a potential component, though privacy, ethics, and data protection remain paramount considerations. Additionally, the plan proposes a cultural data library as a commercial venture, potentially containing copyright-cleared content from institutions such as the BBC, British Library, Natural History Museum, and National Archives.
To facilitate this, the government is expanding its "creative content exchange," a marketplace launched in June designed to enable the buying, selling, licensing, and access to digitized cultural and creative assets at scale. A pilot scheme will explore how data from various institutions—including the Imperial War Museums, Royal Botanic Gardens Kew, Science Museum Group, and Victoria and Albert Museum—can be utilized by consumers, technology companies, and AI developers while respecting creators' rights. The pilot platform is expected to launch in the summer, marking a significant step toward integrating public data into AI innovation.
This initiative reflects the UK government's dual focus on fostering AI innovation and safeguarding intellectual property rights. By making authoritative public data accessible for AI development, the government hopes to stimulate economic growth, support small businesses, and maintain ethical standards in data use. The success of these efforts could position the UK as a leader in responsible AI deployment using publicly owned resources.