Dossier

A dossier is an entry in the ECA Corpus, for example a special report. It consists of a unique dossier number, additional metadata, miscellaneous files such as a CSS stylesheet, and typically several full-text files in different languages and formats. Each dossier is stored in a dossier folder named after its dossier number, following the Portable Web Package specification. All dossiers are stored in the Repository. Illustration:


index.html                      # file - dossier index 
en                              # directory - English
  main                          # directory - main section; if only one file exists (e.g. "main.pdf"), a directory may not be necessary
    doc1.pdf                    # file
    doc2.pdf                    # file
    doc1.html                   # file
    doc2.html                   # file
  reply                         # directory - replies section
    doc3.pdf                    # file
    doc3.html                   # file
fr                              # directory - French
...                             # other language directories
misc                            # directory - miscellaneous
  metadata.txt                  # file - dossier metadata as key-value pairs (example)
  ecac.css                      # file - CSS stylesheet

The dossier root folder contains the file index.html and several directories, most of them language directories such as en. This structure may vary depending on the entry type; further work is required. Directories in the dossier root whose names are two characters long are language directories (e.g. en, fr, de). They may contain one or more additional levels of directories for organisation, such as main and reply. Lower levels may address aspects such as file formats. At present, the Repository is an empty skeleton directory structure, ready to receive full-text files in all languages and formats.
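
The layout rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Repository's actual tooling: the function names (build_skeleton, language_dirs, files_by_section), the default languages and sections, and the empty placeholder index.html are all hypothetical. It creates an empty skeleton for one dossier, then applies the two-character rule to find the language directories and groups the files of one language by section.

```python
from pathlib import Path


def build_skeleton(root, languages=("en", "fr"), sections=("main", "reply")):
    """Create an empty dossier skeleton (hypothetical helper).

    The language and section names are assumptions taken from the
    illustration; the real set may vary by entry type.
    """
    root = Path(root)
    for lang in languages:
        for section in sections:
            (root / lang / section).mkdir(parents=True, exist_ok=True)
    (root / "misc").mkdir(parents=True, exist_ok=True)
    (root / "index.html").touch()  # empty placeholder for the dossier index
    return root


def language_dirs(dossier_root):
    """Return the names of two-character directories in the dossier root."""
    return sorted(
        p.name
        for p in Path(dossier_root).iterdir()
        if p.is_dir() and len(p.name) == 2
    )


def files_by_section(dossier_root, lang):
    """Map each section directory (e.g. main, reply) to its file names.

    Files stored directly in the language directory, with no section
    directory, are grouped under the empty string.
    """
    lang_dir = Path(dossier_root) / lang
    sections = {}
    for p in sorted(lang_dir.rglob("*")):
        if p.is_file():
            rel = p.relative_to(lang_dir)
            section = rel.parts[0] if len(rel.parts) > 1 else ""
            sections.setdefault(section, []).append(rel.name)
    return sections
```

Because misc has four characters, language_dirs does not report it as a language directory. The rule is purely length-based here; checking names against a list of known language codes would be stricter, but the text above does not require it.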