Skip to content

Terminology

Definition of terminology

There are two definition files to set up.

  • The terminology definition file (config_terms.txt) serving to describe all terminology used to define the metadata of a dataset.
  • The terminology documentation file (config_doc.txt) serving to documente all terminology definitions.

Each time there is a change in these two definition files, it is necessary to convert them so that they are taken into account by the application.

Terminology is the set of terms used to define the metadata of a dataset. A single file (web/conf/config_terms.txt) contains all the terminology. The input and search interfaces (e.g screenshot) are completely generated from this definition file, thus defining i) each of the fields, their input type (checkbox, dropbox, textbox, ...) and ii) the associated controlled vocabulary (ontology and thesaurus by autocompletion, drop-down list according to a list of fixed terms).

The metadata schema proposed by defaut is mainly established according to the DDI (Data Documentation Initiative) schema that also corresponds to that adopted by the Dataverse software.

Terminology is organised in several sections. By default 6 sections are proposed, but you can redefine them as you wish:

  • DEFINITION : Section for describing shortly the dataset.
  • STATUS : Section for defining the status of the dataset, associated rights, dates, etc.
  • MANAGEMENT : Section for assigning names of people or organizations who participated in the production of data and according to the type of participation.
  • DESCRIPTORS : Section for defining elements characterizing the data themselves and certain experimental conditions for obtaining them.
  • OTHER : Section for entering miscellious information (protocols, comments, issues, ...)
  • RESOURCES : Section for defining metadata about all the resources you want, i.e both external (links) and internal (data files on the storage space) resources. This section does not require any configuration a priori, it is added de facto.

For each section, fields are then defined. These fields can be defined according to the way they will be entered via the web interface. There are 6 different types of input: check boxes (checkbox), drop lists (dropbox), single-line text boxes (textbox), single-line text boxes with an additional box for multiple selection from a catalog of terms (multiselect), date picker (datebox) and multi-line text boxes (areabox).

For two types (checkbox and dropbox), it is possible to define the values to be selected (predefined terms).


Structure of the Terminology definition file (TSV)

The file must have 9 columns with headers defined as follows:

  • column 1 - Field : shortname of the fields
  • column 2 - Section : shortname ot the sections
  • column 3 - Required : indicates if the field is mandatory ('Y') or not ('N')
  • column 4 - Search : indicates if the field can be used as a criterion search ('Y') or not ('N')
  • column 5 - Shortview : indicates with ordered numbers if the field serves for the overview table after the search (empty by default)
  • column 6 - Type : indicates the way they will be entered via the web interface (possible values are: textbox, dropbox, checkbox, multiselect, datebox and areabox).
  • column 7 - Features : dependings on the Type value, one can specifiy some specific features. If several features, they must be separated by a comma.
    • open=0 or open=1 (checkbox) : indicates if the selection is opened or not. See Vocabulary.
    • autocomplete=entity (textbox, checkbox & multiselect) : The entity.js file must be present under web/cvlist/entity/ if the entity is a dictionary otherwise it must be present under web/js/autocomplete. See Vocabulary.
    • ws=web_service name (multiselect) : indicates the web service name, e.g VO, ols, agroportal or bioportal. Attribute available only if the API call for the vocabulary is based on the Twitter's Typeahead library. In this case, the autocomplete attribute must be equal to typeahead. See Vocabulary.
    • onto=ontology list/thesaurus name (multiselect) : indicates the ontology list, e.g. for a web service based on OntoPortal or the thesaurus name, e.g. for a web service based on SKOSMOS. In case of an ontology list, the separator (:) must a colon with no space (eg. OBI:EFO:PO:ENVO). Attribute available only if the API call for the vocabulary is based on the Twitter's Typeahead library. In this case, the autocomplete attribute must be equal to typeahead. See Vocabulary.
    • width=NNNpx (textbox, dropbox, datebox) : allows you to specify the width of the box. Usefull if you want put several fields in the same line. See note 1 below.
    • row=NN and cols=NN (areabox) : allows you to specify the row and column size of the textarea.
  • column 8 - Label : Labels corresponding to the fields that will appear in the web interface
  • column 9 - Predefined terms : for fields defined with a type equal to checkbox or dropbox, one can give a list of terms separated by a comma.

  • Notes

    • the fields will be displayed in the same order as in the file and by section. So if you want to specify several textboxes with particular sizes so that they are on the same line, they should belong to the same section and follow each other in the file in the same order.
    • the title and description fields are mandatory but not necessarily in the same section.

Below an example of Terminology definition file (TSV)


Example of Maggot JSON file generated based on the same definition file


Structure of the Terminology documentation file (TSV)

The documentation definition file is used to have online help for each field (small icon placed next to each label on the form). So it should only be modified when a field is added or deleted, or moved to another section. This file will be used then to generate the online metadata documentation according to the figure below (See Configuration to find out how to carry out this transformation).

The file must have 3 columns with headers defined as follows:

  • column 1 - Type : The type of the element, namely 'section', 'field' or 'option'. An 'option' type must correspond to each of the options for a field corresponding to a drop-down list.
  • column 2 - Name : Name of the element. The names of the sections, variables and drop-down options must be exactly the same as those specified in the terminology definition file.
  • column 3 - Description : The description corresponding to the element, serving as much as possible to give indications on the information to be selected or entered, in order to remove possible ambiguities.

Below an example of Terminology documentation file (TSV)


Same example as above converted to HTML format using Markdown format