The Daily Boston

Boston news, every day

News

Boston Leads on Duplicate Image Replacement, but Other Cities Are Catching Up Fast

As urban digital archives balloon in size, Boston's approach to rooting out and replacing duplicate imagery in public records is drawing comparisons — not all of them flattering — with efforts in London, Amsterdam, and Chicago.

By Boston News Desk · Published 4 July 2026, 2:45 pm

3 min read

Boston Leads on Duplicate Image Replacement, but Other Cities Are Catching Up Fast
Photo: Photo by Dominik Gryzbon on Pexels

Boston's city government quietly crossed a threshold this spring: its public-facing digital asset repository, maintained jointly by the Department of Innovation and Technology and the Boston City Archives on City Hall Plaza, now holds more than 2.4 million indexed image files. Duplicate imagery — multiple versions of the same photograph stored under different file names or metadata tags — accounts for an estimated 12 to 18 percent of that total, according to general industry benchmarks applied to municipal archives of comparable size. The city has been running a structured duplicate-detection and replacement program since January 2025, and the results are beginning to reshape how peer cities think about the problem.

The timing matters. Record digitization exploded during the pandemic years, when in-person archival access was shut down and agencies pushed material online fast, without consistent naming conventions or deduplication checks. Boston was no exception. Dorchester's Strand Theatre restoration project, for instance, generated hundreds of overlapping construction-phase photographs uploaded by at least four separate contractors. The Jamaica Plain Neighbourhood Development Corporation flagged similar clutter in the visual records tied to its affordable housing pipeline. Redundant files inflate storage costs, slow search retrieval, and — more consequentially — create legal ambiguity about which image constitutes the authoritative public record.

What Boston Is Actually Doing

The city's current program runs on perceptual hashing software integrated into its Laserfiche document management system. Every new upload is checked against existing files using a hash comparison; flagged duplicates are held in a quarantine folder for human review before either deletion or formal replacement with a canonically named master file. The Department of Innovation and Technology has assigned two full-time staff to the review queue. The program cost roughly $340,000 to stand up in fiscal year 2025, a figure drawn from the city's IT capital budget approved by the Boston City Council in June 2024.

That price tag looks modest against what other cities have spent. Chicago's Department of Assets, Information and Services launched a comparable deduplication initiative in 2023 under its SmartData Chicago umbrella, with reported first-year costs exceeding $600,000. Amsterdam's municipal archive, the Stadsarchief, began a city-wide duplicate-image audit in 2022 covering roughly 4 million digitized items and has publicly documented the project on its open-data portal — a transparency step Boston has not yet matched. London's Brent Council, one of the boroughs piloting the Greater London Authority's digital-records consolidation push, completed its first full deduplication sweep in March 2025 and reported a 14 percent reduction in stored image volume.

How Boston Compares — and Where It Falls Short

Boston scores well on speed of implementation. The January 2025 launch put it ahead of most comparable American cities outside Chicago and San Francisco, which began similar programs in 2022. Where it lags is transparency and community access. Amsterdam publishes its deduplication logs as open data. Boston does not yet offer a public-facing dashboard showing how many duplicates have been identified, quarantined, or replaced — a gap that open-government advocates affiliated with the Sunlight Foundation have pointed to as a broader American municipal weakness.

For neighborhood organizations, the practical stakes are concrete. The Jamaica Plain Neighbourhood Development Corporation relies on photographic records to document construction milestones for state and federal grant compliance. Duplicate or mislabeled images have previously caused delays in reimbursement claims, according to general accounts from housing nonprofits operating under similar grant structures. A cleaner archive means faster compliance paperwork and fewer appeals.

Mayor Michelle Wu's administration has signaled that the program will be folded into a broader digital-infrastructure plan expected to go before the Boston City Council in the autumn of 2026. That plan is anticipated to include expanded metadata standards and, potentially, the public dashboard that transparency groups have requested. Whether the council funds it at the level needed to extend the program to all city departments — not just archives and IT — will be the practical test. The city has until December 31, 2026, to submit updated digital-records compliance documentation to the Massachusetts Secretary of State's office, which sets the effective deadline for getting the program scaled.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Boston

This article was produced by the The Daily Boston editorial desk and covers news in Boston. See our editorial standards for how we use AI.

The Daily Boston brief

The day's Boston news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Boston and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Boston news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Boston and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Boston

More in News

Enjoyed this story? Get tomorrow's briefing free.