London's public sector is drowning in duplicate digital images. Across the capital's borough councils, NHS trusts, and Transport for London's sprawling infrastructure network, IT auditors and records managers have identified duplicate image files as one of the fastest-growing sources of wasted storage expenditure — a problem that has quietly ballooned alongside the shift to remote working and cloud-based document systems accelerated after 2020.
The timing matters. With Keir Starmer's government pushing digital reform across Whitehall and Sadiq Khan's City Hall under pressure to cut operational costs ahead of the 2027 budget cycle, the mundane question of how many times a JPEG or a scanned PDF exists on a council server has become a genuine fiscal and administrative concern. Planning applications alone — which under the government's revised National Planning Policy Framework now flood London boroughs at record volume — generate thousands of image attachments per week, many uploaded repeatedly by applicants, officers, and third-party consultants.
The Scale of the Problem in London
At the Greater London Authority's data centre operations, storage audits conducted as part of ongoing cloud migration work have found that duplicate and near-duplicate files can account for anywhere between 20 and 40 percent of total unstructured data holdings — a range consistent with figures published by the Jisc-backed Digital Preservation Coalition in its 2024 guidance for public sector organisations. For a large NHS trust such as King's College Hospital NHS Foundation Trust, which handles imaging data from its Denmark Hill site in Southwark as well as satellite facilities, the financial implication is significant: enterprise-grade cloud storage billed at roughly £18 to £25 per terabyte per month means that even a single redundant terabyte retained unnecessarily over a year costs upwards of £300 before staff time is factored in.
Tower Hamlets Council, which processed more than 4,200 planning applications in 2024-25 according to its published authority monitoring report, has been among the boroughs trialling automated deduplication software as part of a broader document management overhaul. The London Borough of Hackney, still rebuilding its digital infrastructure following the ransomware attack that struck its systems in October 2020, has similarly made storage hygiene a priority, with officers noting internally that the post-attack rebuild revealed just how extensively duplicated image assets had been spread across legacy systems. Neither council has published a full cost figure for the deduplication exercise.
Why Deduplication Is Now a Policy Priority
The push to fix this is not purely about tidiness. Under the government's Data (Use and Access) Act 2025, public bodies face new obligations around data accuracy and minimisation — requirements that create a direct legal incentive to eliminate redundant records, including image files. For planning departments along the Thamesside development corridor, where schemes from Battersea Power Station to Silvertown Quays generate enormous volumes of architectural renders, site photographs, and scanned heritage documents, the administrative burden is acute.
Deduplication software vendors — including firms with London office presences such as Veritas Technologies, which operates from offices near Fenchurch Street in the City — argue that automated tools can reduce storage footprints by 30 to 60 percent within three to six months of deployment. Independent assessments from the National Audit Office have previously noted, in reports covering central government digital programmes, that storage waste is consistently underestimated at the procurement stage.
For Londoners, the practical consequence of unresolved duplicate image problems is slower public-facing services. Planning portal queries that should resolve in seconds stall when databases are bloated. NHS imaging systems that retrieve X-rays and MRI scans for clinicians at sites like University College Hospital on Euston Road can slow perceptibly when storage environments are poorly maintained.
Boroughs and trusts that have not yet begun formal deduplication audits are being advised by the Local Government Association to treat storage rationalisation as a prerequisite for any new cloud contract renewal — a step that, if taken across all 32 London boroughs simultaneously, could in aggregate free up tens of thousands of terabytes and redirect millions in annual storage spend toward frontline services.