• Skip to main content
  • Skip to secondary menu
  • Skip to primary sidebar

FromThePage Blog

Crowdsourcing, transcription and indexing for libraries and archives

  • Home
  • Interviews
  • crowdsourcing
  • how-to
  • Back to FromThePage
  • Collections

Image Upload File Guidelines

How are the files inside of zip folders treated when uploaded? Zip files may contain folders containing images, PDFs, or folders containing PDFs. Each folder will be treated as a different work, so do not mix files that belong to different works in the same folder. Each PDF will be treated as its own work, so do not split PDF files among more than one work. For example, a zip file with 3 images, 2 PDFs, and 1 folder containing 5 more images would create 4 works: the top level images in one, each PDF in their own work, and a last work containing the 5 images from the folder.

What are the acceptable file formats and file sizes for uploads?

  • Acceptable file formats: PDF, PNG, GIF, JPG
  • Acceptable file sizes: File sizes are limited by the upload timeout, so we recommend files that are less than 100 MB per file uploaded (either individually or in a zip file). FromThePage will resize each image file down to 1 MB.

What are the page image guidelines?

  • PNG, GIF and JPG files are all acceptable.
  • Images should be oriented so that they are right side up.
  • Images should be split down the spine, so that only one page is visible on an image. (See the autosplit tool if you need to split two-page images.)
  • Images should be named so that an alphabetical sort will result in the correct page order. Note, this may require "zero-padding" for page numbers - page_09.jpg, page_10.jpg will sort correctly, but page_9.jpg, page_10.jpg will not.

Does FromThePage resize page images?

We downsize images to "appropriate for transcription" access copies.  We use jpeg compression to reduce image files to 1 MB in size. That said, we don't reduce more than 90%, so some files end up larger than 1 MB. We encourage you to upload high resolution images, but we do high compression, which is appropriate for manuscripts because what you most need is to tell the difference between text and background.

Primary Sidebar

What’s Trending on The FromThePage Blog

  • Archives as an Antidote for ChatGPT
  • An Interview with Michael Lapides of the New Bedford…
  • How Do I Read Old Handwriting?
  • An Interview with Dr. Camille Westmont of Sewanee:…
  • Learn to Decipher Old Handwriting with Online and…
  • Spreadsheet Transcription in FromThePage

Recent Client Interviews

An Interview with Michael Lapides of the New Bedford Whaling Museum

An Interview with NC State University Libraries

An Interview with Richard Gilreath of the Texas State Library and Archives Commission

An Interview with Julanne Neal of the Queensland State Archives

An Interview with Andrea Meyer of East Hampton Public Library

Read More

artificial intelligence crowdsourcing features fromthepage projects handwriting history iiif indexing Indianapolis Indianapolis Children's Museum interview Jennifer Noffze machine learning metadata ocr paleography podcast Ryan White spreadsheet transcription transcription transcription software
Privacy Policy | Terms & Conditions | About Us | Contact Us

Copyright © 2023 · FromThePage.com