• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar

FromThePage Blog

about crowdsourcing, manuscript transcription, digital humanities and digital documentary editions

  • Home
  • Project Profiles
  • Interviews with Clients
  • Collections
  • Back to FromThePage

Uploading existing transcriptions or OCR with Page Images

It is possible to import existing transcripts in the zip file upload.
First, create a folder with image files in it.  Then make sure that each image file has a .txt file containing the transcript of that page, following the same name conventions as the image, like in this example:
envelope.jpg
envelope.txt
page_001.jpg
page_001.txt
page_002.jpg
page_003.txt
postmark.JPG
postmark.txt
Not all image files need corresponding text files, but the filenames do need to be identical (except for the extension) when there are text files with transcripts.
Create a metadata.yml file if you wish, and place it in the same folder.
Then zip up the folder (along with other folders, if you want), and upload it to the Start a Project screen.  Make sure to check the “OCR” box.
The folders should be converted into a FromThePage work with the contents of the text files set as the raw OCR text.  You’ll probably want to convert the work into a manuscript transcription work from an OCR correction project so that the nomenclature is changed appropriately.

Primary Sidebar

What’s Trending on The FromThePage Blog

  • How to Learn to Read Shorthand
  • Learn to Decipher Old Handwriting with Online and…
  • The Decade in Crowdsourcing Transcription
  • Day of DH 2015
  • 2018 Paleography Courses
  • Project Profile: Howitt and Fison Papers

Recent Client Interviews

An Interview with Olivia Carlisle of the State Archives of North Carolina

An Interview with Paige Roberts of Phillips Academy Archives & Special Collections

An Interview with Riley Bogran of the Sandy Spring Museum

An Interview with Sara David of the Nantucket Historical Association

An Interview with Daniel Hartwig & Hannah Scates Kettler of the Iowa State University Library

Privacy Policy | Terms & Conditions | About Us | Contact Us

Copyright © 2021 · FromThePage.com