A quiet but consequential problem has been spreading through Boston's digital archives: duplicate images — identical or near-identical files used to fill gaps in scanned public records — are being silently swapped in databases without adequate disclosure, and the people responsible for maintaining the integrity of those records say the practice has gotten out of hand.
The issue surfaced publicly in recent weeks after staff at the Boston City Archives on Boylston Street flagged that several property deed records in the city's online registry system contained placeholder images recycled from unrelated documents. The concern is not just cosmetic. When a duplicate image stands in for a missing original — without a clear notation — it can create legal ambiguity for property owners, researchers, and attorneys trying to verify title chains in neighborhoods like Dorchester and Jamaica Plain, where housing development activity has accelerated sharply under Mayor Michelle Wu's affordable housing push.
Why This Matters Right Now
Boston is in the middle of one of its most ambitious periods of digitization. The city's Digital Equity and Engagement Initiative, launched in 2024, set a goal of making 90 percent of public records accessible online by the end of fiscal year 2027. That aggressive timeline has meant large volumes of historical documents — some fragile, some partially damaged — have been scanned and uploaded quickly. Preservation specialists say the speed creates pressure to fill gaps with whatever image is available rather than flag a record as incomplete.
The Massachusetts Secretary of State's office, which oversees public records law statewide, has requirements around record integrity, but those rules were written well before AI-assisted image processing became standard in municipal workflows. Digital preservation professionals say the legal framework has not caught up with what scanning vendors are actually doing when they process batches of thousands of documents at a time.
At Northeastern University's NULab for Texts, Maps, and Networks on Huntington Avenue, researchers who work closely with Boston's historical collections have been vocal about the risk. The lab's work on digitized neighborhood surveys and urban renewal maps for the South End and Lower Roxbury means they regularly cross-reference city records with physical originals — and they have flagged discrepancies to city staff on multiple occasions in the past 18 months.
What Officials and Experts Are Saying
Boston Public Library's Digital Repository Services team, based at the Central Library in Copley Square, has been working on metadata standards that would require any substituted image to carry an explicit tag identifying it as a duplicate or placeholder. Librarians involved in that effort say the technical fix is straightforward; the harder problem is getting scanning contractors and city IT procurement staff to require it in their contracts going forward.
On the civic technology side, Code for Boston — the volunteer brigade that meets regularly in Cambridge and has worked on open data projects tied to city permitting — has identified duplicate image issues in at least two publicly accessible city datasets since January 2026. Members of the group have submitted formal comments to the city's Office of New Urban Mechanics asking for audit provisions to be built into the next round of digitization contracts.
The MBTA, which maintains its own archive of station condition photographs used for capital planning, confirmed earlier this year that it had reviewed its image management protocols following questions from transit advocacy groups about whether inspection photos were being accurately catalogued. The agency said it had added internal review steps but did not specify a timeline for a full audit.
City Councilor offices covering Districts 7 and 4 — covering Roxbury and Dorchester respectively — have received constituent inquiries about records accuracy in connection with ongoing housing disputes on streets including Geneva Avenue and Blue Hill Avenue, where title clarity is critical to development approvals.
For residents, the immediate practical step is to request a physical copy of any document from the City Archives if an online version shows a watermark, repeated imagery, or metadata that does not match the document date. The Archives office accepts requests by email and in person, and staff are required under state law to provide certified copies within ten business days. For property matters specifically, the Suffolk County Registry of Deeds on McCormack Way maintains its own parallel digital index and may hold cleaner scans of the same records.