Basic Principles
CLARIN adheres to the following principles:
- Open standards are preferred over proprietary standards
- Formats and protocols should be:
  - Well-documented
  - Verifiable
  - Proven (being used in practice)
- Text-based formats are (where possible) preferred over binary formats
- When an analogue signal is digitised, either no compression or lossless compression is recommended
Learning More
- Ongoing work by the CLARIN Standards Committee
- FAQ about recommended formats and standards
- Document: Standards for LRT
Relevant Formats
Several CLARIN centres have published information on which formats they recommend for depositing language research data:
- ACDH/ARCHE
- BAS
- BBAW
- CLARIN-DK
- CLARIN:EL
- CLARIN.SI
- COCOON
- DANS
- HZSK
- IDS
- ILC4CLARIN
- LAC
- LINDAT-CLARIAH/CZ
- ORTOLANG
- SAW
- TALAR (EKUT)
- TalkBank
- TROLLing
- TLA (MPI-PL)
- UdS
Relevant Standards
The CLARIN Standards Information System provides information on standards used in CLARIN and on formats accepted for data deposition at particular centres.