Yesterday we deployed two new features to help you evaluate results from Gemini 3 (and eventually other models) against human-transcribed or human-corrected text. First, we’ve developed a comparison screen that shows the differences between an AI-generated page transcription and human-created ground truth. Next, we calculate statistics, again comparing the AI draft against the human […]
Recent Posts from FromThePage

Introducing Gemini 3.0 Support in FromThePage
When Ben sent me Mark Humphries’ report on testing a new, unreleased Gemini model, I got scared. And excited. Mark is a historian and digital humanist who’s gone deep on analyzing AI tools for textual transcription. He understands the dangers of “seductive plausibility” in LLM outputs. He knows what researchers and historians need from archival text. […]

Is That Transcription Really Human?
Last month, Denyse Allen asked this question on the Genealogy and AI Facebook group: If volunteers use AI to transcribe documents, is that OK? I have strong opinions, but want to explain them. First off, the institutions running big crowdsourcing projects have staff who can automate sending all of their documents to AI engines for transcription. As a […]

AI and Crowdsourcing are Overturning Archival Workflows
March 2025 We were talking recently to Paige Roberts, the lone archivist at the Phillips Academy, and she said something interesting: "I just acquired a new collection. I'm kind of weird, I don't do processing, I just digitize it and throw it up on FromThePage, and boom, people transcribe it. If it's from 1790 you […]

We Got Clobbered by Bots
A behind-the-scenes look at how an invisible attack nearly brought down FromThePage, and what we did next. A Four-Hour Outage, Bots, and a Hard Lesson Learned: Last Thursday, June 26th, FromThePage experienced more than a dozen outages totalling four hours. In the days leading up to this, we also saw a noticeable slowdown in performance. This wasn’t […]

Bot Traffic, AI Training, and Infrastructure Strain
This month’s update is a bit more technical than usual, but we think it’s important. It offers a behind-the-scenes look at how bot traffic—often related to AI training—impacts platforms like FromThePage and the digital infrastructure of cultural heritage institutions more broadly. Over the past year, FromThePage has experienced three major service outages caused by waves […]

