TALEP : Traitement Automatique du Langage Ecrit et Parlé

Mots-clés

traitement automatique des langues, apprentissage automatique, annotations linguistiques, traitement de données multimodales, analyse syntaxique, réseaux de neurones, expressions polylexicales, corpus

Responsables
Alexis NASR / Carlos RAMISCH

Membres

ABROUGUI Rim  Doctorant
AKANI Adueni adjoba eunice  Doctorant
BECERRA BONACHE Leonor  Enseignant-Chercheur / Chercheur
BECHET Frederic  Enseignant-Chercheur / Chercheur
BLOUIN Baptiste  Doctorant
BROCHIER Robin  Post-Docs / ATER / Ingenieurs
DARY Franck  Doctorant
DEULOFEU Jose  Enseignant-Chercheur / Chercheur
FAVRE Benoit  Enseignant-Chercheur / Chercheur
FECHINO Marion  Doctorant
FOURTASSI Abdellah  Enseignant-Chercheur / Chercheur
MONTELLA Sebastien  Doctorant
NASR Alexis  Enseignant-Chercheur / Chercheur
NIKOLAUS Mitja  Doctorant
RAMISCH Carlos  Enseignant-Chercheur / Chercheur
ROLBERT Monique  Enseignant-Chercheur / Chercheur
SALIN Emmanuelle  Doctorant
SCHOLIVET Manon  Doctorant
STEFANINI Marie-Helene  Enseignant-Chercheur / Chercheur
ZOCK Michael  Enseignant-Chercheur / Chercheur

Site web

https://talep.lis-lab.fr

Objectifs scientifiques

Les travaux de l’équipe portent sur de nombreux aspects du Traitement Automatique des Langues (TAL). Plus précisément, l’équipe:

  • Développe des modèles numériques et symboliques pour le TAL
  • Implémente ces modèles dans des outils
  • Evalue ces outils à l’aide de benchmarks reconnus par la communauté ou de campagnes d’évaluations
  • Met en oeuvre ces outils dans des applications développées dans le cadre de projets divers
  • Développe des ressources spécifiques lorsque ces dernières sont inexistantes

Les activités de l’équipe TALEP visent à trouver un bon équilibre entre la linguistique et l’informatique en proposant des analyses linguistiques précises des phénomènes rencontrés et de développer des modèles de traitement efficaces. Une des particularités de l’équipe TALEP est de s’intéresser à des productions linguistiques variées. Cette variété concerne la langue (français, anglais, arabe …), le mode de production (oral ou écrit), le niveau (planifié, spontané, normé, déviant …) ou encore le contexte de production (monologue ou dialogue, monomodale ou multimodale). L’équipe crée des outils génériques de TAL, en particulier la suite d’outils multilingue MACAON qui permet de réaliser des traitements linguistique standards et le logiciel MWETOOLKIT, qui extrait automatiquement des séquences de tokens pouvant constituer des expressions polylexicales à partir de corpus. Tous ces logiciels sont distribués sous licence libre. L’équipe TALEP accorde une grande importance aux aspects méthodologiques de l’évaluation des outils de TAL.  Ces évaluations peuvent être menées dans des contextes « écologiques », auprès d’utilisateurs finaux ou dans le cadre de campagnes d’évaluation scientifiques, nationales ou internationales. TALEP est l’acronyme de Traitement Automatique du Langage Ecrit et Parlé.

Publications récentes de l’équipe



71 documents

Articles dans une revue

  • Simone Fuscone, Benoit Favre, Laurent Prevot. Reproducibility in speech rate convergence experiments. Language Resources and Evaluation, Springer Verlag, 2021, ⟨10.1007/s10579-021-09528-6⟩. ⟨hal-03126983⟩
  • Alexis Nasr, Franck Dary, Frédéric Bechet, Benoit Favre. Annotation syntaxique automatique de la partie orale du CÉFC. Langages, Armand Colin (Larousse jusqu'en 2003), 2020. ⟨hal-02973242⟩
  • Sebastien Delecraz, Leonor Becerra-Bonache, Benoit Favre, Alexis Nasr, Frédéric Bechet. Multimodal Machine Learning for Natural Language Processing: Disambiguating Prepositional Phrase Attachments with Images. Neural Processing Letters, Springer Verlag, 2020, ⟨10.1007/s11063-020-10314-8⟩. ⟨hal-02973244⟩
  • Marie Candito, Mathieu Constant, Carlos Ramisch, Agata Savary, Bruno Guillaume, et al.. A French corpus annotated for multiword expressions and named entities. Journal of Language Modelling, Institute of Computer Science, Polish Academy of Sciences, Poland, 2020, 8 (2), pp.415-479. ⟨10.15398/jlm.v8i2.265⟩. ⟨hal-03016721⟩
  • Michael Zock. AI at the Crossroads of NLP and Neurosciences. Journal of Cognitive Science, Institute for Cognitive Science, Seoul National University, 2020, 21 (1), pp.1-14. ⟨hal-03168883⟩
  • Michael Zock, Chris Biemann. Comparison of Different Lexical Resources With Respect to the Tip-of-the-Tongue Problem. Journal of Cognitive Science, Institute for Cognitive Science, Seoul National University, 2020, 21 (2), pp.193-252. ⟨10.17791/jcs.2020.21.2.193⟩. ⟨hal-03168850⟩
  • Beatriz Sanchez Cardenas, Carlos Ramisch. Eliciting specialized frames from corpora using argument-structure extraction techniques. Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication , John Benjamins Publishing, 2019, 25 (1), pp.1-31. ⟨10.1075/term.00026.san⟩. ⟨hal-02318280⟩
  • Agata Savary, Silvio Cordeiro, Timm Lichte, Carlos Ramisch, Uxoa Iñurrieta, et al.. Literal Occurrences of Multiword Expressions: Rare Birds That Cause a Stir. The Prague Bulletin of Mathematical Linguistics, 2019, ⟨10.2478/pralin-2019-0001⟩. ⟨hal-02106263⟩
  • Silvio Cordeiro, Aline Villavicencio, Marco Idiart, Carlos Ramisch. Unsupervised Compositionality Prediction of Nominal Compounds. Computational Linguistics, Massachusetts Institute of Technology Press (MIT Press), 2019, 45 (1), pp.1-57. ⟨10.1162/coli_a_00341⟩. ⟨hal-02318196⟩
  • Mathieu Constant, Gülşen Eryiğit, Johanna Monti, Lonneke van der Plas, Carlos Ramisch, et al.. Multiword Expression Processing: A Survey. Computational Linguistics, Massachusetts Institute of Technology Press (MIT Press), 2017, 43 (4), pp.837-892. ⟨10.1162/COLI_a_00302⟩. ⟨halshs-01665254⟩

Communications dans un congrès

  • Léo Bouscarrat, Antoine Bonnefoy, Cécile Capponi, Carlos Ramisch. AMU-EURANOVA at CASE 2021 Task 1: Assessing the stability of multilingual BERT. Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021), Aug 2021, Online, Unknown Region. ⟨hal-03255722⟩
  • Caroline Pasquer, Agata Savary, Carlos Ramisch, Jean-Yves Antoine. Verbal Multiword Expression Identification: Do We Need a Sledgehammer to Crack a Nut?. The 28th International Conference on Computational Linguistics (COLING-20), Dec 2020, Barcelona, Spain. ⟨hal-03013636⟩
  • Cindy Aloui, Carlos Ramisch, Alexis Nasr, Lucie Barque. SLICE: Supersense-based Lightweight Interpretable Contextual Embeddings. The 28th International Conference on Computational Linguistics (COLING 2020), Dec 2020, Barcelona (on line), Spain. ⟨hal-03017741⟩
  • Gabriel Marzinotto, Delphine Charlet, Géraldine Damnati, Frederic Bechet. Analyse automatique en cadres sémantiques pour l'apprentissage de modèles de compréhension de texte. 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles, Jun 2020, Nancy, France. pp.288-295. ⟨hal-02784778v3⟩
  • Léo Bouscarrat, Antoine Bonnefoy, Cécile Capponi, Carlos Ramisch. Multilingual enrichment of disease biomedical ontologies. Proceedings of the LREC 2020 Workshop on Multilingual Biomedical Text Processing (MultilingualBIO 2020), May 2020, Marseille, France. pp.21-28. ⟨hal-02531140⟩
  • Delphine Charlet, Géraldine Damnati, Frédéric Bechet, Gabriel Marzinotto, Johannes Heinecke. Cross-lingual and cross-domain evaluation of Machine Reading Comprehension with Squad and CALOR-Quest corpora. LREC 2020, May 2020, MARSEILLE, France. pp.5491-5497. ⟨hal-02973245⟩
  • Laurianne Sitbon, Benoit Favre, Margot Brereton, Stewart Koplick. Engaging the Abilities of Participants with Intellectual Disability in IIR Research. ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR), Mar 2020, Vancouver, Canada. ⟨hal-02470823⟩
  • Stewart Koplick, Laurianne Sitbon, Benoit Favre, Jinglan Zhang, Andrew Bayor, et al.. A Framework for Information Accessibility in Large Video Repositories. ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR), Mar 2020, Vancouver, Canada. ⟨10.1145/3343413.3378003⟩. ⟨hal-02470812⟩
  • Filip Bircanin, Laurianne Sitbon, Benoit Favre, Margot Brereton. Designing an IIR Research Apparatus with Users with Severe Intellectual Disability. ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR), Mar 2020, Vancouver, Canada. pp.412-416, ⟨10.1145/3343413.3378008⟩. ⟨hal-02470797⟩
  • Carlos Ramisch, Agata Savary, Bruno Guillaume, Jakub Waszczuk, Marie Candito, et al.. Edition 1.2 of the PARSEME Shared Task on Semi-supervised Identification of Verbal Multiword Expressions. Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020), 2020, Barcelona, Spain. ⟨hal-03014927⟩
  • Caroline Pasquer, Agata Savary, Carlos Ramisch, Jean-Yves Antoine. Seen2Unseen at PARSEME Shared Task 2020: All Roads do not Lead to Unseen Verb-Noun VMWEs. Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LZX 2020), 2020, Barcelona, Spain. ⟨hal-03014867⟩
  • Frédéric Béchet, Cindy Aloui, Delphine Charlet, Geraldine Damnati, Johannes Heinecke, et al.. CALOR-QUEST : generating a training corpus for Machine Reading Comprehension models from shallow semantic annotations. MRQA: Machine Reading for Question Answering - Workshop at EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing, Nov 2019, Hong Kong, China. ⟨hal-02317018⟩
  • Gabriel Marzinotto, Geraldine Damnati, Frédéric Béchet. Adapting a FrameNet Semantic Parser for Spoken Language Understanding Using Adversarial Learning. Interspeech 2019, Sep 2019, Graz, Austria. pp.799-803, ⟨10.21437/Interspeech.2019-2732⟩. ⟨hal-02298417⟩
  • Frédéric Béchet, Christian Raymond. Benchmarking benchmarks: introducing new automatic indicators for benchmarking Spoken Language Understanding corpora. InterSpeech, Sep 2019, Graz, Austria. ⟨hal-02270633⟩
  • Nicolas Zampieri, Carlos Ramisch, Geraldine Damnati. The Impact of Word Representations on Sequential Neural MWE Identification. Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019), Aug 2019, Florence, Italy. pp.169 - 175, ⟨10.18653/v1/W19-5121⟩. ⟨hal-02318287⟩
  • Agata Savary, Silvio Ricardo Cordeiro, Carlos Ramisch. Without lexicons, multiword expression identification will never fly: A position statement. Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019), Aug 2019, Florence, Italy. pp.79 - 91, ⟨10.18653/v1/W19-5110⟩. ⟨hal-02318241⟩
  • Manon Scholivet. Méthodes de représentation de la langue pour l'analyse syntaxique multilingue. TALN-RECITAL 2019- PFIA 2019, Jul 2019, Toulouse, France. pp.577-590. ⟨hal-02611213⟩
  • Frédéric Bechet, Cindy Aloui, Delphine Charlet, Geraldine Damnati, Johannes Heinecke, et al.. CALOR-QUEST : un corpus d'entraînement et d'évaluation pour la compréhension automatique de textes. TALN 2019, Jul 2019, Toulouse, France. pp.185-194. ⟨hal-02377119⟩
  • Sebastien Delecraz, Leonor Becerra-Bonache, Alexis Nasr, Frédéric Bechet, Benoit Favre. Visual Disambiguation of Preprositional Phrase Attachments : Multimodal Machine Learning for Syntactic Analysis Correction. IWANN: International Work-Conference on Artificial Neural Networks, Jun 2019, Gran Canaria, Spain. ⟨10.1007/978-3-030-20521-8_52⟩. ⟨hal-02465051⟩
  • Gabriel Marzinotto, Johannes Heinecke, Geraldine Damnati. MaskParse@Deskin at SemEval-2019 Task 1: Cross-lingual UCCA Semantic Parsing using Recursive Masked Sequence Tagging. Proceedings of the Thirteenth International Workshop on Semantic Evaluation, Jun 2019, Minneapolis, United States. ⟨hal-02298429⟩
  • Gabriel Marzinotto, Geraldine Damnati, Frédéric Béchet, Benoit Favre. Robust Semantic Parsing with Adversarial Learning for Domain Generalization. Proceedings of the 2019 Conference of the North, Jun 2019, Minneapolis - Minnesota, France. pp.166-173, ⟨10.18653/v1/N19-2021⟩. ⟨hal-02298402⟩
  • Manon Scholivet, Franck Dary, Alexis Nasr, Benoit Favre, Carlos Ramisch. Typological Features for Multilingual Delexicalised Dependency Parsing. 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jun 2019, Minneapolis, United States. pp.3919-3930, ⟨10.18653/v1/N19-1393⟩. ⟨hal-02278897⟩
  • Jeremy Auguste, Delphine Charlet, Geraldine Damnati, Frédéric Béchet, Benoit Favre. CAN WE PREDICT SELF-REPORTED CUSTOMER SATISFACTION FROM INTERACTIONS?. 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2019, Brighton, United Kingdom. ⟨10.1109/ICASSP.2019.8683896⟩. ⟨hal-02439687⟩
  • Jeremy Auguste, Frédéric Béchet, Geraldine Damnati, Delphine Charlet. Skip Act Vectors: integrating dialogue context into sentence embeddings. Tenth International Workshop on Spoken Dialogue Systems Technology, Apr 2019, Syracuse, Italy. ⟨hal-02125259⟩
  • Johanna Monti, Silvio Cordeiro, Carlos Ramisch, Federico Sangati, Agata Savary, et al.. Advances in Multiword Expression Identification for the Italian language: The PARSEME shared task edition 1.1. Fifth Italian Conference on Computational Linguistics (CLiC-it 2018), Dec 2018, Torino, Italy. ⟨hal-02152557⟩
  • Frédéric Béchet, Christian Raymond. Is ATIS too shallow to go deeper for benchmarking Spoken Language Understanding models?. InterSpeech 2018, Sep 2018, Hyderabad, India. pp.1-5. ⟨hal-01835425⟩
  • Carlos Ramisch, Silvio Cordeiro, Agata Savary, Veronika Vincze, Verginica Mititelu, et al.. Edition 1.1 of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions. Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018), Aug 2018, Santa Fe, United States. pp.222 - 240. ⟨hal-01865575⟩
  • Caroline Pasquer, Agata Savary, Carlos Ramisch, Jean-Yves Antoine. If you've seen some, you've seen them all: Identifying variants of multiword expressions. COLING, Aug 2018, Santa Fe, United States. ⟨hal-01866345⟩
  • Caroline Pasquer, Carlos Ramisch, Agata Savary, Jean-Yves Antoine. VarIDE at PARSEME Shared Task 2018: Are Variants Really as Alike as Two Peas in a Pod?. COLING Workshop on Linguistic Annotation, Multiword Expressions and Constructions, Aug 2018, Santa Fe, United States. ⟨hal-01866364⟩
  • Caroline Pasquer, Agata Savary, Jean-Yves Antoine, Carlos Ramisch. Towards a Variability Measure for Multiword Expressions. Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2018) - Short papers, Jun 2018, New Orleans, United States. ⟨hal-01802238⟩
  • Gabriel Marzinotto, Frédéric Béchet, Géraldine Damnati, Alexis Nasr. Sources of Complexity in Semantic Frame Parsing for Information Extraction. International FrameNet Workshop 2018, May 2018, Miyazaki, Japan. ⟨hal-01731385v2⟩
  • Simone Fuscone, Benoit Favre, Laurent Prevot. Replicating Speech Rate Convergence Experiments on the Switchboard Corpus. Workshop on Replicability and Reproducibility of Research Results in Science and Technology of Language, May 2018, Miyazaki, Japan. ⟨hal-01807796⟩
  • Gabriel Marzinotto, Jeremy Auguste, Frederic Bechet, Géraldine Damnati, Alexis Nasr. Semantic Frame Parsing for Information Extraction : the CALOR corpus. LREC2018, May 2018, Miyazaki, Japan. ⟨hal-01959187⟩
  • Géraldine Damnati, Jeremy Auguste, Alexis Nasr, Delphine Charlet, Johannes Heinecke, et al.. Handling Normalization Issues for Part-of-Speech Tagging of Online Conversational Text. Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 2018, Miyazaki, Japan. ⟨hal-01943391⟩
  • Jeremy Auguste, Delphine Charlet, Géraldine Damnati, Benoit Favre, Frédéric Bechet. Évaluation automatique de la satisfaction client à partir de conversations de type "chat" par réseaux de neurones récurrents avec mécanisme d’attention. 25e conférence sur le Traitement Automatique des Langues Naturelles (TALN), 2018, Rennes, France. ⟨hal-01943265⟩
  • Robin Perrotin, Alexis Nasr, Jeremy Auguste. Annotation en Actes de Dialogue pour les Conversations d’Assistance en Ligne. 25e conférence sur le Traitement Automatique des Langues Naturelles (TALN), 2018, Rennes, France. ⟨hal-01943345⟩
  • Sebastien Delecraz, Alexis Nasr, Frédéric Béchet, Benoit Favre. Correcting prepositional phrase attachments using multimodal corpora. The 15th International Conference on Parsing Technologies, Sep 2017, Pise, Italy. ⟨hal-01693292⟩
  • Frédéric Béchet, Géraldine Damnati, Johannes Heinecke, Gabriel Marzinotto, Alexis Nasr. CALOR-Frame : un corpus de textes encyclopédiques annoté en cadres sémantiques. ACor4French – Les corpus annotés du français - Atelier TALN, Jun 2017, Orléans, France. ⟨hal-01780348⟩
  • Agata Savary, Carlos Ramisch, Silvio Cordeiro, Federico Sangati, Veronika Vincze, et al.. The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions. MWE 2017 - Proceedings of the 13th Workshop on Multiword Expressions, Apr 2017, Valencia, Spain. pp.31 - 47. ⟨hal-01504624⟩
  • Sylvain Kahane, Henri-José Deulofeu, Kim Gerdes, Alexis Nasr, André Valli. Annotation micro- et macrosyntaxique manuelle et automatique de français parlé. Journée Floral, 2017, Orléans, France. ⟨halshs-01740461⟩
  • Gabriel Marzinotto, Géraldine Damnati, Frederic Bechet. Analyse automatique FrameNet : une étude sur un corpus français de textes encyclopédiques. TALN 2017, 2017, Orléans, France. ⟨hal-01959260⟩
  • Natalie Vargas, Carlos Ramisch, Helena Caseli. Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment. Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), 2017, Valencia, Spain. pp.91 - 96. ⟨hal-01795904⟩
  • Manon Scholivet, Carlos Ramisch. Identification of Ambiguous Multiword Expressions Using Sequence Models and Lexical Resources. Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), 2017, Valencia, Spain. pp.167 - 175. ⟨hal-01795903⟩
  • Rodrigo Wilkens, Leonardo Zilio, Silvio Cordeiro, Felipe Paula, Carlos Ramisch, et al.. LexSubNC: a Dataset of Lexical Substitution for Nominal Compounds. Proceedings of the 12th International Conference on Computational Semantics (IWCS 2017) - Short papers, 2017, Montpellier, France. ⟨hal-01795956⟩
  • Carlos Ramisch. Putting the Horses before the Cart: Identifying Multiword Expressions before Translation. Computational and Corpus-Based Phraseology - Second International Conference, Europhras 2017, London, UK, November 13-14, 2017, Proceedings, 2017, London, United Kingdom. pp.69 - 84, ⟨10.1007/978-3-319-69805-2_6⟩. ⟨hal-01795985⟩
  • Jeremy Auguste, Arnaud Rey, Benoit Favre. Evaluation of word embeddings against cognitive processes: primed reaction times in lexical decision and naming tasks. Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, 2017, Copenhagen, Denmark. pp.21 - 26. ⟨hal-01773220⟩
  • Ahmed Hamdi, Alexis Nasr, Nizar Habash, Núria Gala. POS-tagging of Tunisian Dialect Using Standard Arabic Resources and Tools. Workshop on Arabic Natural Language Processing, Jul 2015, Beijing, China. pp.59 - 68, ⟨10.18653/v1/W15-3207⟩. ⟨hal-01464860⟩
  • Michael Zock, Ruslan Mitkov. How to ask a foreigner questions without knowing his language ? Proposal for a conceptual interface to communicate thought. Natural Language Processing Pacific RIM Symposium, 1991, Singapore, Singapore. ⟨hal-03175829⟩

Posters

  • Jeremy Auguste, Delphine Charlet, Geraldine Damnati, Frédéric Béchet, Benoit Favre. Can we predict self-reported customer satisfaction from interactions ?. International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. ⟨hal-02134252⟩

Chapitres d'ouvrage

  • Carlos Ramisch. Computational phraseology discovery in corpora with the MWETOOLKIT. Gloria Corpas Pastor; Jean-Pierre Colson. Computational Phraseology, 24, John Benjamins, pp.111-134, 2020, IVITRA Research in Linguistics and Literature, 9789027205353. ⟨10.1075/ivitra.24.06ram⟩. ⟨hal-02739265⟩
  • Mathieu Constant, Gülşen Eryiğit, Carlos Ramisch, Michael Rosner, Gerold Schneider. Statistical MWE-aware parsing. Yannick Parmentier; Jakub Waszczuk. Representation and parsing of multiword expressions: Current trends, 3, Language Science Press, pp.147-182, 2019, Phraseology and Multiword Expressons, ⟨10.5281/zenodo.2579043⟩. ⟨hal-02318231⟩
  • Reinhard Rapp, V. Xu, Michael Zock, S. Sharoff, R. Forsyth, et al.. New Areas of Application of Comparable Corpora. Using Comparable Corpora for Under-Resourced Areas of Machine Translation. Theory and Applications of Natural Language Processing, In press. ⟨hal-02079213⟩
  • Michael Zock, J. Bateman. Natural Language Generation. Mitkov, Ruslan. Handbook of Computational Linguistics (2nd edition), In press. ⟨hal-02079245⟩
  • Michael Zock. Eureka! A Simple Solution to the Complex ‘Tip-of-the-Tongue’-Problem. Bastardas-Boada, A.; Massip Bonet, A; Bel-Enguix, G. Complexity Applications in Language and Communication, pp.251-272, 2019, ⟨10.1007/978-3-030-04598-2_14⟩. ⟨hal-02079168⟩
  • Agata Savary, Marie Candito, Verginica Barbu Mititelu, Eduard Bejček, Fabienne Cap, et al.. PARSEME multilingual corpus of verbal multiword expressions. Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop, 2018. ⟨hal-01917174⟩

Directions d'ouvrage, Proceedings

  • Michael Zock, Alessandro Lenci, Emmanuele Chersoni, Enrico Santus. Cognitive Aspects of the Lexicon (COGALEX-VI): Proceedings of the Workshop, December 12, 2020, Barcelona, Spain (Online). 2020. ⟨hal-03168880⟩
  • Stella Markanotonatou, Carlos Ramisch, Agata Savary, Veronika Vincze. Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop. Language Science Press, 2018. ⟨hal-01917075⟩
  • Stella Markantonatou, Carlos Ramisch, Agata Savary, Veronika Vincze. Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017). France. 2017. ⟨hal-01624624⟩

Autres publications

  • Joakim Nivre, Mitchell Abrams, Željko Agić, Lars Ahrenberg, Lene Antonsen, et al.. Universal Dependencies 2.2. 2018. ⟨hal-01930733⟩

Pré-publications, Documents de travail

  • Caroline Pasquer, Agata Savary, Jean-Yves Antoine, Carlos Ramisch, Nicolas Labroche, et al.. To Be or Not To Be a Verbal Multiword Expression: A Quest for Discriminating Features. 2020. ⟨hal-02905874⟩

Habilitations à diriger des recherches

  • Benoit Favre. Contextual language understanding Thoughts on Machine Learning in Natural Language Processing. Computation and Language [cs.CL]. Aix-Marseille Universite, 2019. ⟨tel-02470185⟩