More than 340 million duplicate image files are clogging the servers of London's public-sector bodies, according to figures compiled by digital asset management consultancy Clearpath Analytics in a June 2026 audit commissioned by the Greater London Authority. The redundant data — everything from scanned planning documents in Southwark to patient X-rays duplicated across Guy's and St Thomas' NHS Foundation Trust — is costing London's taxpayers an estimated £2.3 billion in combined storage, licensing and IT maintenance annually.
The timing matters. Keir Starmer's government has made digitising public services a central plank of its public-sector reform agenda, and Sadiq Khan's City Hall is under pressure to demonstrate tangible efficiency savings ahead of the next mayoral budget cycle. Duplicate image data, unglamorous as it sounds, sits at the heart of both ambitions. Every replicated file consumes server capacity, inflates software licensing fees and slows the retrieval systems that NHS clinicians and council planners rely on daily.
Where the Problem Is Worst
Transport for London holds an estimated 47 million duplicate CCTV stills and operational images across its network, according to the GLA audit. Many were captured at Vauxhall Bus Station and King's Cross St. Pancras interchange and stored simultaneously across three separate server environments — a legacy of TfL's pre-pandemic infrastructure expansion. The duplication rate across TfL's image archive sits at roughly 38 percent, meaning more than a third of stored visual data serves no unique purpose.
The NHS picture is starker. Barts Health NHS Trust, which operates the Royal London Hospital in Whitechapel and Newham University Hospital in Plaistow, reported in its 2025-26 digital estate review that radiology image duplication had reached 52 percent across its PACS — picture archiving and communication systems. That redundancy is directly linked to delayed retrieval times averaging 11 seconds per image pull, a figure that clinical informatics teams say compounds pressure on already stretched radiology departments.
Camden Council's planning department, meanwhile, has been running a pilot deduplication programme since January 2026 through a contract with London-based software firm Axiom Data Solutions. Early results shared with the council's overview and scrutiny committee show a 61 percent reduction in duplicate planning-application images within the department's document management system, freeing up 4.2 terabytes of storage in six months. The council projects annual savings of £180,000 if the programme is extended authority-wide.
What Deduplication Actually Costs — And What It Saves
Cloud storage is not cheap. The average cost of managed enterprise storage for London public bodies runs at approximately £22 per terabyte per month, according to pricing schedules published by Crown Commercial Service in March 2026. Multiply that across the GLA's estimated 1.8 petabytes of duplicated image data and the monthly bill exceeds £39 million before licensing and labour are factored in.
Deduplication software licences typically run between £80,000 and £250,000 per annum for a large public-sector deployment, with implementation projects averaging 14 months for bodies the size of a London borough. The GLA audit recommends a pan-London procurement framework — potentially anchored through the London Office of Technology and Innovation on Victoria Street — that could reduce per-organisation licence costs by up to 30 percent through collective bargaining.
The audit also flags a compliance dimension that sharpens the urgency. Under the UK GDPR regime enforced by the Information Commissioner's Office, retaining duplicate patient images or planning records beyond defined retention schedules constitutes a data governance breach. The ICO issued fines totalling £3.6 million to UK public bodies for data retention failures in 2025, a 40 percent increase on the previous year.
For Londoners, the practical stakes are straightforward: faster NHS imaging retrieval means quicker diagnoses; leaner council document systems mean faster planning decisions; and lower storage bills leave more budget headroom for front-line services. The GLA expects its deduplication framework recommendation to go before the London Assembly's budget and performance committee in September 2026, with a formal procurement process potentially launching before the end of the financial year.