London's public bodies and property developers are sitting on hundreds of thousands of duplicate digital images, a problem that is costing organisations significant sums in wasted storage, licensing fees and staff hours spent manually reviewing archives. Estimates from digital asset management consultants working with local authorities suggest that, in large organisations, between 30 and 60 per cent of stored images are either exact duplicates or near-identical variants created through repeated uploads, format conversions or legacy system migrations.
The issue has sharpened in 2026 as the Starmer government's planning reform push — centred on the Planning and Infrastructure Bill, introduced to Parliament in March — forces councils to digitise and publish vastly more documentation than before. Greater London Authority departments and borough councils across the capital are racing to bring planning records, heritage asset photographs and housing stock imagery online, often pulling from multiple disconnected databases built up over decades.
The Scale Across London Boroughs
Tower Hamlets Council, which manages one of the country's most active planning pipelines given the ongoing transformation of Whitechapel and Poplar, began a digital asset audit in January 2026. Digital infrastructure teams working on projects like that are routinely finding that image libraries accumulated over 15 to 20 years contain duplication rates that push storage costs significantly higher than necessary. Cloud storage pricing for large public-sector bodies on frameworks such as the Crown Commercial Service's G-Cloud 14 agreement typically runs between £0.018 and £0.023 per gigabyte per month — figures that multiply fast when archives contain millions of redundant files.
The property sector feels the same pressure. Rightmove and Zoopla listings for central London postcodes — particularly around Nine Elms and the broader Vauxhall regeneration zone — routinely carry duplicated images pulled from estate agents' internal systems, where photographs are uploaded multiple times under different property reference numbers as listings are refreshed or transferred between branch databases. The Royal Institution of Chartered Surveyors flagged property data quality as a priority concern in its 2025 annual review of the residential market, noting inconsistencies in how digital records are maintained across the sector.
Transport for London's communications archive presents a different dimension of the same challenge. TfL's image library, which covers everything from construction photography along the Elizabeth line's western branches to promotional material for the Night Tube, spans multiple content management systems inherited from separate project teams. Digital teams managing that estate have publicly acknowledged, in procurement documents published on Contracts Finder, the need for deduplication tools as part of broader digital transformation contracts issued since 2024.
What Deduplication Actually Costs — and Saves
The mathematics of fixing the problem are reasonably straightforward, even if the operational effort is not. Automated deduplication software — tools from vendors including Canto, Bynder and open-source alternatives — can process tens of thousands of images per hour using perceptual hashing algorithms that identify visually identical files even when they differ in file size, format or metadata. Licensing for enterprise-grade platforms typically starts at around £8,000 per year for mid-sized organisations, rising toward £40,000 or more for councils or developers managing archives of more than a million assets.
For a borough like Southwark, which maintains digital records across its housing regeneration programme at the Elephant and Castle and numerous listed building files in Borough Market's conservation area, the return on that investment can be realised within a single financial year if the audit uncovers, as is common, several terabytes of redundant data. Storage reclamation, reduced manual review hours and cleaner data for AI-assisted planning tools all feed into the saving.
Organisations that have not yet started should begin with a baseline audit of their existing digital asset management systems — or, where no formal system exists, their shared drives and content management platforms. The Local Government Association published updated digital records guidance in February 2026 recommending that councils prioritise image deduplication ahead of any large-scale migration to new planning portals required under the Planning and Infrastructure Bill's implementation timetable, which runs through to 2028. Getting the numbers right now means the digitisation push will not simply move the mess from one system to another.