This document is available in two formats: this web page (for browsing content) and PDF (comparable to original document formatting). To view the PDF you will need Acrobat Reader, which may be downloaded from the Adobe site.

U.S. Department of Justice, Antitrust Division

IMAGE DETAILS & LOAD FILE SPECIFICATIONS


Image Details
 
Image Files
Group IV Single-Page TIFFs
  Filenames cannot have embedded spaces
  Images for a document must be in one folder
  Number of image files should be limited to 5,000 per folder
  Files should be named <PageID>. TIF
 

Ex. DOJ-005.TIF



Summation Image Load file (.dii) Specifications
Bold indicates a constant. Italics indicate a variable.
   
@Fulltext DOC
Indicates that there is a Text file attached to the records
@T IMAGETAG
Required: Unique identifier for document
@D @I TIFF Path

Required: Directory location designation

Image Files
Required: listing or iteration of files
 
IMAGETAG
Identical to the Begdoc#
TIFFPath
Path to Image Files
Image Files
Individual or Iterated listing of Tiff filenames
 

Ex. as Iteration: DOJ-00{3-6}.TIF

 
Note: 8 character file name limitation for DII file
Note: The Fulltext line is written once at the top of the DII file.


Metadata Load File Delimeters
Field separator
Vertical Pipe (ASCII 124)
Field encapsulate
Carat (ASCII 094)
Return value in data

Tilde (ASCII 126)

Multi-value field
Semi-colon (ASCII 059)
Dates format
MM/DD/YYYY
Note:
Hard returns at the End of Record ONLY


Searchable Text File Specifications
A single Text file per document
The name of the Text file should equal the first page's Bates of the document, with a TXT extension
There must be a carriage return and line feed in the first 80 characters of text
Text files should include page breaks that correspond to the “pagination” of the image files.
All soft and hard returns in the native electronic document or image file should be replicated as a Carriage Return Line Feed in the text file

Two options for producing Searchable Text:

   1) Place text files under a "FULLTEXT" folder and provide an OCR Control List file

   2) Place text files with the corresponding images so as to load through a DII file