A dossier is an entry in the ECA Corpus; for example, a special report. It consists of a unique dossier number, additional metadata, miscellaneous files such as a CSS, and typically several full-text files in different languages and formats. Each dossier is stored in a dossier folder named after the dossier number, following the Portable Web Package specification. All dossiers are stored in the Repository. Illustration:
index.html # file - dossier index
en # directory - English
main # directory - main section; if only one file exists (e.g. "main.pdf"), a directory may not be necessary
doc1.pdf # file
doc2.pdf # file
doc1.html # file
doc2.html # file
reply # directory - replies section
doc3.pdf # file
doc3.html # file
fr # directory - French
... # other language directories
misc # directory - miscellaneous
metadata.txt # dossier metadata as key-value, example
ecac.css # CSS
The dossier root folder contains the file index.html
and several directories, mostly linguistic, such as en.
This structure may vary depending on the entry
type;
further work is required.
Directories with two characters in the dossier root directory are
language directories
(e.g. en, fr, de).
They may contain one or more additional levels of directories for organisation,
such as main and reply.
Lower levels may address aspects such as file formats.
At present the Repository is an empty skeleton directory structure
ready to receive full-text files in all languages and formats.