415 Library, MC-522
1408 W. Gregory
Urbana, IL 61801
(217) 244-2062
Email: digicc [at] library.illinois.edu
This section describes best practices for digitization of print based original documents including photographs, manuscripts, maps, and text. These guidelines draw heavily on previously published standards and best practices developed by standards agencies and peer institutions, particularly those of the California Digital Library. Best practices for newspaper digitization are covered in Section 5. Best practices for intellectual property rights (IPR) issues, which should be investigated before scanning materials and making them publicly accessible, are covered in Section 15.
_________________________________________________________________
At a minimum, digitization of library materials should result in the creation of a master (archival) image and at least one access derivative for web display. Depending on the format of the material and its anticipated use and display, other files, such as thumbnails, PDFs, and OCR text files, may also be produced. Files should be named in accordance to the best practices for file naming in Section xx of this document. As they are produced, files should be saved to a workspace that resides on a Library server that is backed up nightly.
The source files for all other digital files and ensure the long-term usability of the digital information. A digital master file may serve as a surrogate for the original, may completely replace originals, or may be used as security against possible loss of originals due to disaster, theft and/or deterioration. Images are captured at a quality high enough to serve these potential uses via scanning or digital photography, depending on the attributes of the original. The digital master file should represent as accurately as possible the visual information in the original object. In general, decisions about image capture should err towards the highest quality. Files should use color rather than grayscale when color is an integral attribute of the original, and any compression applied to the file should be lossless. Accuracy and consistency in tone and color reproduction through appropriate use of scanner or camera controls is the goal;
Derived from master files and are used for presentation and transmission over networks. These images should be of good quality, but because their spatial resolution (measured in pixels per inch) is lower, the file size is smaller. Some minor post-scan adjustments to optimize image quality and to bring all images to a common rendition are acceptable. Such adjustments include the use of appropriate image processing tools to achieve final color balance and tone distribution and to sharpen scanned images to match appearance of the originals.
Very small files used in databases or web pages. Clicking on the thumbnail image will pull up the larger original image, which can be viewed and downloaded.
Are very small files used in databases or web pages. Clicking on the thumbnail image will pull up the larger original image, which can be viewed and downloaded.
The imaging quality requirements for master digital images are given below. These requirements should be viewed as the minimum necessary to create quality digital images and may be exceeded when warranted and when storage space permits. Associate technical metadata should be saved to the header file.
File format:
Pixel array:
Resolution and bit depth:
or
File format:
Pixel array:
Resolution and bit-depth:
File format:
Pixel array:
Resolution and bit depth:
File format:
TIFF or lossless JPEG2000
Pixel array:
4000 pixels across long dimension of image area, excluding mounts and borders
Resolution:
Adjust the scan resolution to meet pixel array specifications, based on the format of the original object - approximately 2800 PPI for 35mm originals and ranging down to approximately 800 PPI for originals approaching 4 x 5 in.
Bit depth:
8-bit grayscale mode for black-and-white, can be produced from a 16-bit grayscale file
or
24-bit RGB mode for color and monochrome (e.g., collodion wet-plate negative, pyro developed negatives, stained negatives, etc.), can be produced from a 48-bit RGB file.
File format:
TIFF or lossless JPEG2000
Pixel array:
6000 pixels across long dimension of image area, excluding mounts and borders.
Resolution:
Adjust the scan resolution to meet pixel array specifications, based on the format of the original object - approximately 1200 PPI for 4 x 5 in. originals and ranging down to approximately 600 PPI for 8 x 10 in. originals.
Bit depth:
8-bit grayscale mode for black-and-white, can be produced from a 16-bit grayscale file.
or
24-bit RGB mode for color and monochrome (e.g., collodion wet-plate negative, pyro developed negatives, stained negatives, etc.,), can be produced from a 48-bit RGB file.
File format:
TIFF or lossless JPEG2000
Pixel array:
8000 pixels across long dimension of image area, excluding mounts and borders.
Resolution:
Adjust the scan resolution to meet pixel array specifications, based on the format of the original object - approximately 800 PPI for 8 x 10 in. originals and ranging down to produce the desired size file from larger originals.
Bit depth:
8-bit grayscale mode for black-and-white, can be produced from a 16-bit grayscale file.
or
24-bit RGB mode for color and monochrome (e.g., collodion wet-plate negative, pyro developed negatives, stained negatives, etc.,), can be produced from a 48-bit RGB file.
File format:
TIFF or lossless JPEG2000
Pixel array:
4000 pixels across long dimension of image area, excluding mounts and borders
Resolution:
Adjust the scan resolution to meet pixel array specifications, based on the format of the original object - approximately 400 PPI for 8x10 in. originals and ranging up to the appropriate resolution to produce the desired size file from smaller originals, approximately 570 PPI for 5x7 in. and 800 PPI for 4 x 5in. or 3.5x5 in. originals.
Bit depth:
8-bit grayscale mode for black-and-white, can be produced from a 16-bit grayscale file.
or
24-bit RGB mode for color and monochrome (e.g., albumen prints or other historic print processes), can be produced from a 48-bit RGB file.
File format:
TIFF or lossless JPEG2000
Pixel array:
6000 pixels across long dimension of image area, excluding mounts and borders.
Resolution:
Adjust the scan resolution to meet pixel array specifications, based on the format of the original object - approximately 600 PPI for originals approximately 8x10 in. and ranging down to approximately 430 PPI for 11x14 in. originals.
Bit depth:
8-bit grayscale mode for black-and-white, can be produced from a 16-bit grayscale file.
or
24-bit RGB mode for color and monochrome (e.g., albumen prints or other historic print processes), can be produced from a 48-bit RGB file.
File format:
Pixel array:
Resolution:
Bit depth:
or
NOTE: If scans of aerial photography will be used for oversized reproduction, follow the scanning recommendations for the next largest format (e.g., if your original is 70 mm wide, follow the speicifications for 127 mm wide roll film to achieve 8,000 pixels across long dimensions).
File format:
TIFF or lossless JPEG2000
Pixel array:
6000 pixels across long dimension of image area, excluding mounts and borders.
Resolution:
Adjust the scan resolution to meet pixel array specifications, based on the format of the original object - approximately 2700 PPI for 70mm originals and ranging down to the appropriate resolution to produce the desired size file from larger originals.
Bit depth:
8-bit grayscale mode for black-and-white, can be produced from a 16-bit grayscale file
or
24-bit RGB mode for color and monochrome (stained negatives,) can be produced from a 48 bit RGB file.
File format:
TIFF or lossless JPEG2000
Pixel array:
8000 pixels across long dimension of image area, excluding mounts and borders.
Resolution:
Bit depth:
or
File format:
Pixel array:
Resolution:
Bit depth:
-or-
NOTE: If scans of aerial photography will be used for oversized reproduction, follow the scanning recommendations for the next largest format (e.g., if your original is 8x10 in., follow the specifications for formats larger than 8x10 in. to achieve 6000 pixels across long dimensions.
File format:
Pixel array:
Resolution:
Bit depth:
-or-
File format:
Pixel array:
Resolution:
Bit depth:
-or-
File format:
Pixel array:
Resolution:
Bit depth:
-or-
File format:
Pixel array:
Resolution and bit depth:
NOTE: In creating access images, scanned images should have Unsharp Mask applied to them in Photoshop. The following settings are recommended:
File format:
GIF (adaptive/perceptual palette, diffusion/noise dither).
Pixel array:
GIF images should fit within a boundary of 150-200 pixels across each dimension (200 pixels preferred).
Resolution and bit depth:
GIF images should be 4-bit grayscale, 8-bit color: 72 PPI.
4.0 Best Practices for Newspaper Digitization