Media encoding
Documents related to media encoding
Media encoding and annotation
1. General distinctions / terminology
1.1. Spoken language vs. Speech vs. Phonetic vs. Multimodal corpora vs. Sign Language Corpora
1.2. Media encoding vs. Media annotation
1.3. File formats vs. Transcription systems/conventions
1.4. Transcription vs. Annotation vs. Metadata
2. Media encoding
2.1. Audio encoding
2.1.1. Uncompressed (linear) formats: PCM (WAV, AIFF), AU, PhonDat1, PhonDat2, NIST SPHERE
2.1.2. Compressed formats: MP3, Ogg (audio), FLAC, WMA
2.2. Video encoding
2.2.1. Container formats
2.2.2. Codecs
3. Media annotation
3.1. Tools and tool formats
ANVIL, CLAN/CHAT, ELAN, EXMARaLDA, FOLKER, Praat, (FLEx)
TASX Annotator, Transcriber, WinPitch, MacVisSTA, AG Toolkit, XTrans, EMU
Transana, F4, SACODEYL Transcriptor
Multitool, HIAT-DOS, syncWriter, MediaTagger
3.2. Other (“generic”) formats
TEI transcriptions of speech
Annotation Graphs / ATLAS Interchange Format / Multimodal Exchange
BAS Partitur Format
3.3. Interoperability of tools and formats
3.4. Transcription conventions / Transcription systems
Phonetic: IPA / SAMPA, ToBI
Orthographic: HIAT, GAT, CHAT, ICOR, DT, CA
3.5. Commonly used combinations of formats and conventions
CLAN + CHAT
CLAN + CA
EXMARaLDA + HIAT
FOLKER + GAT
(ICOR + TEI?)
3.6. Other topics
Pronunciation lexica (should maybe go into “Lexica and Terminology Standards”)
SMIL
Annotation
Metadata
4. Summary / Recommendations