Standards and Formats

Basic Principles

CLARIN adheres to the following principles:

  • Open standards are preferred over proprietary standards
  • Formats and protocols should be:
    • Well-documented
    • Verifiable
    • Proven (already in use in practice)
  • Text-based formats are (where possible) preferred over binary formats
  • When an analogue signal is digitised, using either no compression or lossless compression is recommended

Learning More

Relevant Formats

Several CLARIN centres have published information on which formats they recommend for depositing language research data.

Relevant Standards

The CLARIN Standards Information System provides information on standards used in CLARIN and on formats accepted for data deposition at particular centres.