Boston's city government is midway through a two-year overhaul of its digital records infrastructure, with duplicate image files identified as one of the costliest and most persistent problems inside the Archives & Records Management division on City Hall Plaza. The cleanup, which began in January 2025 under a contract with Cambridge-based software firm Ametros Digital, has already flagged more than 1.4 million redundant image files across planning, permitting, and public health databases — a figure that surprised even veteran archivists who had long suspected the problem was serious.
The timing is not accidental. Mayor Michelle Wu's administration has pushed hard on open-data commitments, and bloated, duplicate-laden databases make those commitments harder to fulfill. When the Inspectional Services Department processes a permit application in Dorchester or Jamaica Plain, staff can pull up four nearly identical scanned documents for a single property — each stored separately, each eating server space, each requiring manual cross-referencing. That friction costs time and, ultimately, taxpayer money.
What Boston Is Actually Doing
The Ametros contract covers deduplication work across three city departments: Inspectional Services, the Boston Planning Department on Atlantic Avenue, and the Office of Vital Records at 1 City Hall Square. The contract value has not been disclosed in full under procurement rules, but budget documents released in March 2026 show the city allocated $2.3 million for digital records modernization in fiscal year 2026. Deduplication is listed as the single largest line item within that allocation.
The Boston Public Library's Digital Repository at the Copley Square branch is running a parallel effort. Its Special Collections unit has been working since late 2024 to remove duplicate scans of historical photographs — many of them images of the Big Dig construction era and earlier South Boston waterfront development — using open-source tools maintained by Harvard's Library Innovation Lab in Cambridge. Librarians say that project has reduced redundant image storage by roughly 30 percent in the repository's public-facing portal since it launched.
Boston is not alone in wrestling with this, but it is further along than several peer cities. London's Government Digital Service flagged duplicate image records as a formal priority only in its 2025-26 annual plan, published last October, with most active remediation expected to begin in late 2026 across Greater London Authority departments. Singapore's Smart Nation initiative, which has been benchmarked globally for digital governance, completed a comparable deduplication sweep of its Urban Redevelopment Authority's property image database in 2024 — a project that took 18 months and covered roughly 4 million files.
The Practical Stakes
The comparison with Singapore matters because that city-state moved early, standardized its image metadata requirements across agencies in 2021, and mandated a single file-naming protocol before ingestion. Boston never did that. The result is a patchwork of legacy formats — TIFFs from the early 2000s sitting alongside JPEGs and PDFs, many uploaded without consistent naming conventions — that makes automated deduplication harder and more expensive.
Housing advocates in Jamaica Plain have raised a specific concern. Several community land trust applications submitted to the Boston Planning Department in 2024 and early 2025 were delayed, in part, because staff could not quickly verify which site photographs in the system were current and which were years-old duplicates of earlier proposals for the same parcels. The Planning Department has not publicly confirmed the delays were caused by the image problem, but internal records obtained through a public records request in April 2026 reference "document verification bottlenecks" in multiple Jamaica Plain files.
The city aims to complete the first phase of deduplication — covering Inspectional Services and Vital Records — by October 2026. Phase two, bringing in the Planning Department's full archive, is scheduled for the first quarter of 2027. Whether that timeline holds depends partly on how the Ametros contract performs during the summer audit period, which runs through September.
For residents and developers dealing with the city on a day-to-day basis, the most immediate practical step is straightforward: when submitting permit applications or public records requests, attach clearly labeled, single high-resolution images rather than multiple similar versions of the same document. Inspectional Services staff say that small habit alone reduces the chance of a file getting flagged for manual review by their deduplication filters.