• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar

FromThePage Blog

Crowdsourcing, transcription and indexing for libraries and archives

  • Home
  • Interviews
  • crowdsourcing
  • how-to
  • Back to FromThePage
  • Collections
Home » hackathon

hackathon

An Interview with Dr. Camille Westmont of Sewanee: The University of the South

June 14, 2021 By Bethany Radcliff

Dr. Camille Westmont, Visiting Professor of History at the University of the South, kindly spoke with Sara Brumfield about the Convict Leasing Project, and her experience using FromThePage. First, tell us about your documents. The documents we are working to transcribe are the prison records from the Lone Rock Stockade. The Lone Rock Stockade was a private prison built by the … [Read more...] about An Interview with Dr. Camille Westmont of Sewanee: The University of the South

Detecting Handwriting in OCR Text

February 25, 2013 By Ben Brumfield

This is my fourth and final post about the iDigBio Augmenting OCR Hackathon.  Prior posts covered the hackathon itself, my presentation on preliminary results, and my results improving the OCR on entomology specimens.  The other participants are  slowly adding their results to the hackathon wiki, which I recommend checking back with (their efforts were much more … [Read more...] about Detecting Handwriting in OCR Text

Results of the "Ocrocrop" Approach to Improving OCR

February 15, 2013 By Ben Brumfield

This project attempted to improve the quality of OCR applied to difficult entomology images[*] by cropping labels from the images to run through OCR separately. In order to identify labels on the image to crop, an initial, 'naive' pass of OCR was made over the whole image, generating both A) a set of rectangles on the image defined as word bounding boxes by the OCR engine, … [Read more...] about Results of the "Ocrocrop" Approach to Improving OCR

iDigBio Augmenting OCR Hackathon

February 15, 2013 By Ben Brumfield

I spent the last three days at the iDigBio Augmenting OCR Hackathon working alongside mycologists, botanists, entomologists, herbarium managers, and bioinformaticians to explore ways to improve parsing of digitized specimen labels.  While I'm pleased with the results of my own contribution, I'd like to take a minute to talk about the hackathon process itself before I post … [Read more...] about iDigBio Augmenting OCR Hackathon

Improving OCR Inputs from OCR Outputs?

February 14, 2013 By Ben Brumfield

This is a transcript of my talk at the iDigBio Augmenting OCR Hackathon, presenting preliminary results of my efforts before the event. For my preliminary work, I tried to improve the inputs to our OCR process through looking at the outputs of a naive OCR. One of the first things that we can do to improve the quality of our inputs to OCR is to not feed them handwriting.  To … [Read more...] about Improving OCR Inputs from OCR Outputs?

Primary Sidebar

What’s Trending on The FromThePage Blog

  • Archives as an Antidote for ChatGPT
  • How Do I Read Old Handwriting?
  • Spreadsheet Transcription in FromThePage
  • Classifying the Mistakes We Make When We Transcribe
  • An Interview with Dr. Camille Westmont of Sewanee:…
  • 10 Ways to Host a Great Transcribathon

Recent Client Interviews

An Interview with NC State University Libraries

An Interview with Richard Gilreath of the Texas State Library and Archives Commission

An Interview with Julanne Neal of the Queensland State Archives

An Interview with Andrea Meyer of East Hampton Public Library

An Interview with Keith Mitchell of The National Archives (UK)

Read More

artificial intelligence crowdsourcing features fromthepage projects handwriting history iiif indexing Indianapolis Indianapolis Children's Museum interview Jennifer Noffze machine learning metadata ocr paleography podcast Ryan White spreadsheet transcription transcription transcription software
Privacy Policy | Terms & Conditions | About Us | Contact Us

Copyright © 2023 · FromThePage.com