Skip to main content
The Daily London

London news, every day

News

London's Archives Are Full of Duplicate Images. Officials and Experts Say the Fix Is Long Overdue.

From the City of London Corporation to Southwark Council, the push to clean up duplicated digital image records is drawing urgent calls from archivists, planners and open-data advocates.

Share

By London News Desk · Published 5 July 2026, 4:45 am

4 min read

Updated 3 h ago· 5 July 2026, 2:02 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily London is independently owned and covers London news free from advertiser or sponsor influence. Read our editorial standards →

London's Archives Are Full of Duplicate Images. Officials and Experts Say the Fix Is Long Overdue.
Photo: Andrews, Charles McLean, 1863-1943 Davenport, Frances G. (Frances Gardiner), 1870-1927 / Public domain (Wikimedia Commons)

Tens of thousands of duplicate images are clogging the digital archives of London's public bodies, slowing planning applications, inflating storage costs and, in some cases, sending housing officers to the wrong property. The problem is old, but the pressure to fix it is new — and the people responsible for it are finally talking.

The issue has sharpened this year as the Starmer government's planning reform agenda forces local authorities across Greater London to accelerate the processing of housing applications. Councils under pressure to hit new housebuilding targets cannot afford systems where a single streetscape photograph of, say, Bermondsey Street in Southwark exists in seventeen separate folders across three different departments. The duplication wastes server space, yes — but more critically, it creates version-control failures that delay decisions.

What Officials Are Saying

At the Greater London Authority, data governance teams have been working since January 2026 on what they have described internally as a digital asset rationalisation programme. The GLA manages spatial and photographic records for the London Plan, the statutory document governing development across all 33 boroughs, and duplicate imagery within that system has been identified as a material problem for planning inspectors. No official figure has yet been published for the number of duplicates involved, but the programme targets completion before the end of the 2026-27 financial year.

Southwark Council — which oversees some of the most active regeneration zones in the capital, including the Elephant and Castle redevelopment corridor and the Old Kent Road Opportunity Area — has gone further, commissioning a third-party audit of its image libraries in March 2026. Council officers told a scrutiny committee meeting that duplicate records had been found across planning, housing and communications systems, with some image files replicated more than 40 times. The council did not put a cost figure on the problem at that meeting, but noted that cloud storage expenditure had risen sharply over the preceding two years.

The City of London Corporation, which maintains detailed photographic and mapping records of the Square Mile for heritage and planning purposes, has pointed to the problem in its Digital Strategy 2025-2028, published last autumn. The document flags that without automated deduplication tools, staff time spent manually resolving conflicting image records runs into hundreds of hours annually.

Experts: The Technology Exists, the Will Has Been Slow

Digital preservation specialists at the Bishopsgate Institute, which holds significant photographic and documentary collections relating to London's East End, have argued publicly that local authorities are behind the curve. The tools for automated hash-based duplicate detection — software that compares file fingerprints rather than filenames — have been commercially available and affordable since at least 2019. The barrier, specialists say, is not technical. It is organisational: procurement cycles, data-sharing agreements between departments, and a lack of dedicated archival staff.

University College London's Department of Information Studies, based in Gower Street in Bloomsbury, has been running a postgraduate module on digital preservation since 2021 that specifically addresses institutional duplication as a governance failure. Course materials available online describe the problem as endemic to public-sector bodies that expanded digital storage rapidly during the pandemic without establishing metadata standards.

The costs are not trivial. Cloud storage pricing for public-sector contracts — benchmarked through the Crown Commercial Service framework — has increased since 2022, making the financial argument for deduplication stronger than it was three years ago. For a mid-sized London borough storing tens of terabytes of unrationalised image data, the annual saving from a clean-up exercise can run to five or six figures.

Transport for London, which holds one of the largest operational image databases in Europe — covering CCTV footage, engineering survey photography and station imagery across 270 tube stations — declined to comment on how it manages duplicate records within that system.

For councils, the practical next step is straightforward in principle if not in practice: establish a single source of truth for image assets, apply consistent metadata standards at the point of upload, and run deduplication software quarterly rather than retrospectively. The Local Government Association has been developing guidance on exactly this, with a framework document expected before September 2026. Authorities that wait for it may find themselves behind those that have already started.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily London

Covering news in London. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to London news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily London and accept our Privacy Policy. Unsubscribe anytime.

Before you go

Get the London brief

The day's London news in a 2-minute read. Free, weekday mornings.

No spam. Unsubscribe anytime.