International Workshop: Speech corpora, text corpora: sharing practices



The constitution of language resources represents one of the major productions for researchers and laboratories in social sciences and humanities (SSH) in many years. In the field of language sciences, the exploitation and analysis of linguistic data have made a significant leap forward with the contribution of corpus linguistics and computer processes. Firstly, it proves vital in uniting established and emerging scientific practices, relatively divergent according to different fields (phonetics/phonology, lexicon, morphosyntax, semantics, text linguistics and discourse linguistics) and communities. Secondly, it is a matter of making the benefits and methods of tool-assisted linguistic analysis accessible to all SSH researchers, who base their works on the analysis of linguistic data, whether text, speech or multimodal productions. 

Goal of the workshop

The goal of this symposium is to create a dialogue between these different communities of researchers, whether they be producers or users of resources and/or tools. The methodological approaches that associate different modes of exploitation and analysis of speech and text corpora will be particularly welcomed. 

Invited speakers

- Bente Maegaard (CLARIN , Centre for Language Technologie at University of Copenhagen)
- Stefan Evert (FAU Erlangen-NĂĽrnberg)
- Damon Mayaffre (BCL UMR 7320, Université Nice Sophia Antipolis),
- Christophe Parisse (Modyco UMR 7114, Université Paris Ouest Nanterre).

Read more about the workshop and the Call for Papers



Université Paul-Valéry
Room 002 “Caryatides”
Saint Charles 2 Site,