London's public sector bodies removed more than 340,000 duplicate images from official digital archives and planning portals between January and May 2026, according to figures compiled by the Greater London Authority's digital infrastructure unit — a purge that underscores how seriously city governments are now treating what was once considered a minor housekeeping issue.
The scale of the problem has grown sharply since the widespread adoption of AI image-generation tools in 2023 and 2024. Planning applications, NHS patient record systems, and Transport for London's public-facing asset libraries have all been hit by duplicate and near-identical image submissions that slow processing times, inflate storage costs, and — in the case of medical records — carry genuine patient safety implications. With the Starmer government's planning reform agenda hinging on faster application turnarounds, the backlog caused by duplicate submissions has become a politically inconvenient bottleneck.
What London Is Actually Doing
The GLA's Digital Services team, based in Southwark's Tooley Street offices, has been running a deduplication programme under its London DataStore improvement roadmap since February 2026. The initiative uses perceptual hashing software — technology that identifies visually similar images even when file names or metadata differ — to flag suspect entries in the London Development Database, the central repository for planning submissions covering every borough from Barnet to Bromley.
Transport for London has run a parallel effort through its Asset Information Management system, targeting duplicate photographs used in maintenance and inspection reports for the Underground's 272 stations. TfL's digital team declined a request for an interview but confirmed the programme in a written statement, without providing specific removal figures. The Wellcome Collection on Euston Road, which manages one of the world's largest biomedical image libraries, has also publicly described its own deduplication work as a model for NHS trusts grappling with similar issues in patient imaging systems.
Storage costs are a real driver. Commercial cloud storage for public sector bodies in England is running at roughly £18 to £22 per terabyte per month for compliant, NHS-grade infrastructure, according to Crown Commercial Service framework pricing published in March 2026. A single large London borough planning portal can accumulate several terabytes of image data annually if submissions go unchecked. Multiply that across 33 boroughs and the cost exposure is substantial.
How London Compares to New York, Amsterdam and Tokyo
New York City's Department of City Planning began a comparable deduplication initiative in late 2024 under its ZoLa (Zoning and Land Use Application) platform, targeting a backlog stretching back to 2018. The New York programme has focused primarily on removing duplicate environmental impact imagery, a narrower scope than London's cross-departmental sweep. Amsterdam's municipality, working through its Stadsarchief digital archive on Vijzelstraat, implemented automated image deduplication in 2023 and reported a 28 percent reduction in archive storage growth in its 2025 annual digital report — a benchmark that London's GLA unit is reportedly using as a reference point.
Tokyo's approach is different in character. The Tokyo Metropolitan Government has embedded image deduplication requirements directly into submission guidelines for building permits since April 2025, placing the compliance burden on applicants rather than on city staff. That upstream approach has proved effective, according to reporting by the Japan Times in March 2026, with processing time for standard permit applications falling by roughly 11 percent in the first six months. London has not yet adopted a similar mandatory submission standard, though planning reform proposals currently before Parliament could open the door to that kind of rule change.
The immediate practical stakes are clearest in housing. Southwark Council's planning department, handling applications for one of London's densest development zones around the Old Kent Road regeneration corridor, processes hundreds of image-heavy submissions each month. Any reduction in duplicates translates directly into officer time saved — and, in theory, faster decisions on the housing applications that Downing Street needs approved to hit its 1.5 million new homes target by 2029.
For Londoners tracking planning applications or submitting them, the most immediate advice is straightforward: check file names, compress image sets before uploading to the Planning Portal at planningportal.co.uk, and avoid resubmitting rejected image packs without stripping metadata. The GLA's guidance on image submission standards, updated in April 2026, is publicly available through the London DataStore website.