Media encoding

Documents related to media encoding


Media encoding and annotation

1. General distinctions / terminology

1.1. Spoken language vs. Speech vs. Phonetic vs. Multimodal corpora vs. Sign Language Corpora
1.2. Media encoding vs. Media annotation
1.3. File formats vs. Transcription systems/conventions
1.4. Transcription vs. Annotation vs. Metadata


2. Media encoding

2.1. Audio encoding
2.1.1. Uncompressed (Linear formats): PCM (WAV, AIF), AU, PhonDat1, PhonDat2, NIST Sphere
        2.1.2. Compressed Formats: MP3, OGG (Audio), Flac, WMA
   
2.2. Video encoding
        2.2.1. Container formats
        2.2.2. Codecs


3. Media annotation

3.1. Tools and tool formats
ANVIL, CLAN/CHAT, ELAN, EXMARaLDA, FOLKER, Praat, (FLEX)
        TASX Annotator, Transcriber, WinPitch, MacVissta, AG Toolkit, XTrans, EMU
        Transana, F4, SACODEYL Transcriptor
        Multitool, HIAT-DOS, syncWriter. MediaTagger

    3.2. Other (“generic”) formats
        TEI transcriptions of speech
Annotation Graphs / Atlas Interchange Format / Multimodal Exchange
BAS Partitur Format

    3.3. Interoperability of tools and formats

    3.4. Transcription conventions / Transcription systems
        Phonetic: IPA / SAMPA, ToBi
        Orthographic: HIAT, GAT, CHAT, ICOR, DT, CA

    3.5. Commonly used combinations of formats and conventions
        CLAN + CHAT
        CLAN + CA
        EXMARaLDA + HIAT
        FOLKER + GAT
        (ICOR + TEI?)
       
    3.6. Other topics
Pronunciation lexica (should maybe go into “Lexica and Terminology Standards”)
SMIL
Annotation
Metadata


4. Summary / Recommendations

Accessibility
Public