The Daily Boston

Boston news, every day

News

How Boston's Public Records Ended Up Flooded With Duplicate Images — and Why That's Finally Changing

Years of siloed city departments, competing software systems, and rapid digitization left Boston's municipal archives riddled with redundant files, but a coordinated cleanup effort is now underway.

By Boston News Desk · Published 4 July 2026, 2:44 pm

3 min read

How Boston's Public Records Ended Up Flooded With Duplicate Images — and Why That's Finally Changing
Photo: Photo by Mohan Nannapaneni on Pexels

Boston city officials have spent the better part of a decade accumulating a digital archive problem that nobody wanted to own. Thousands of duplicate photographs, scanned permit documents, and planning images have piled up across at least a dozen municipal databases, slowing government workflows and, in some cases, causing confusion about which version of a record is authoritative. A formal duplicate-image replacement initiative, coordinated through the Mayor's Office of New Urban Mechanics and the city's Department of Innovation and Technology, launched in earnest in early 2026 and is now roughly halfway through its first audit phase.

The timing matters. Mayor Michelle Wu's administration has leaned hard on digital transparency as a pillar of its progressive governance pitch — publishing planning data for the BPDA's Jamaica Plain and Dorchester rezoning corridors, and rolling out open-data dashboards tied to the MBTA bus-priority work along Washington Street. When those dashboards serve up duplicate or mismatched images, the credibility of the whole transparency push takes a hit. That political reality accelerated a cleanup effort that had been stuck in committee since at least 2022.

The roots of the problem go back further. When the city's Inspectional Services Department began scanning paper permit files around 2015, it used a different document management platform than the one the Boston Planning & Development Agency was running on Tremont Street. Neither system talked cleanly to the other. By the time a unified open-records portal was proposed — a process that stretched through the Walsh and early Wu administrations — individual departments had independently uploaded images to at least three separate cloud repositories. Aerial photographs of the Seaport, elevation drawings from Roxbury variance hearings, and construction-site images from East Boston waterfront projects all exist in multiple versions across systems, with no single record flagged as the master copy.

The Audit That Exposed the Scope

A 2025 internal assessment, cited in budget documents attached to the city's fiscal year 2026 technology appropriation, found more than 40,000 image files across city systems that shared identical or near-identical metadata. The Department of Innovation and Technology estimated that roughly 18 percent of those files were true duplicates — same image, different filename, different storage location. The remainder were near-duplicates: cropped versions, watermarked copies, or images re-exported at different resolutions during software migrations. The FY2026 technology budget allocated $1.4 million toward the deduplication and archive standardization project, according to publicly posted budget line items from City Hall on Cambridge Street.

The practical consequences showed up most visibly in the BPDA's project-review pages, where community members in neighborhoods like Dorchester's Neponset corridor and along the Hyde Park Avenue development stretch sometimes encountered outdated site renderings sitting alongside current ones, with no clear date hierarchy. Residents attending community meetings at the Mattapan branch library and at Roxbury's Dudley Square Municipal Building reported confusion about which set of images reflected the approved designs.

What the Cleanup Looks Like in Practice

The current phase involves a combination of automated hashing tools — software that generates a unique fingerprint for each image and flags matches — and manual review by staff in the city's archives unit. Priority has been given to files tied to active development projects, particularly those in the BPDA pipeline for South Boston and the Newmarket Square area in Roxbury. Once a master image is designated, redundant copies are not deleted immediately; they are moved to a quarantine folder and held for 90 days before permanent removal, a safeguard built in after an earlier, smaller cleanup in 2023 accidentally retired an image that turned out to be the only scan of a 1970s zoning variance document.

Officials have set a target of completing the first full audit by October 2026, ahead of the city's next budget cycle. Departments that have not yet migrated to the centralized Boston.gov document portal — including parts of the Public Works Department — are scheduled to begin that transition in the third quarter of this year. For residents tracking development projects in their neighborhoods, the practical upshot is straightforward: the BPDA's online project pages should begin displaying single, clearly dated image sets by late fall, making it easier to verify exactly what was approved and when.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Boston

This article was produced by the The Daily Boston editorial desk and covers news in Boston. See our editorial standards for how we use AI.

The Daily Boston brief

The day's Boston news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Boston and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Boston news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Boston and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Boston

More in News

Enjoyed this story? Get tomorrow's briefing free.