Publications

2025

  1. From perception to production: how acoustic invariance facilitates articulatory learning in a self-supervised vocal imitation model
    Marvin Lavechin and Thomas Hueber
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing 2025
  2. Simulating Early Phonetic and Word Learning Without Linguistic Categories
    Marvin Lavechin, Maureen Seyssel, Hadrien Titeux, and 4 more authors
    Developmental Science 2025
  3. Simulating prenatal language exposure in computational models: An exploration study
    María Andrea Cruz Blandón, Nayeli Gonzalez-Gomez, Marvin Lavechin, and 1 more author
    Cognition 2025

2024

  1. Modeling the initial state of early phonetic learning in infants
    Maxime Poli, Thomas Schatz, Emmanuel Dupoux, and 1 more author
    Language Development Research 2024
  2. Establishing the reliability of metrics extracted from long-form recordings using LENA and the ACLEW pipeline
    Alejandrina Cristia, Lucas Gautheron, Zixing Zhang, and 8 more authors
    Behavior Research Methods 2024
  3. Decode, move and speak! Self-supervised learning of speech units, gestures, and sound relationships using vocal imitation
    Marc-Antoine Georges, Marvin Lavechin, Jean-Luc Schwartz, and 1 more author
    Computational Linguistics 2024
  4. Modeling early phonetic acquisition from child-centered audio data
    Marvin Lavechin, Maureen Seyssel, Marianne Métais, and 5 more authors
    Cognition 2024

2023

  1. Measuring language development from child-centered recordings
    Yaya Sy, William Havard, Marvin Lavechin, and 2 more authors
    In Interspeech 2023
  2. BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models
    Marvin Lavechin, Yaya Sy, Hadrien Titeux, and 5 more authors
    Interspeech 2023
  3. Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation
    Marvin Lavechin, Marianne Métais, Hadrien Titeux, and 7 more authors
    ASRU 2023
  4. Realistic and broad-scope learning simulations: first results and challenges
    Maureen Seyssel, Marvin Lavechin, and Emmanuel Dupoux
    Journal of Child Language 2023
  5. ProsAudit, a prosodic benchmark for self-supervised speech models
    Maureen Seyssel, Marvin Lavechin, Hadrien Titeux, and 6 more authors
    Interspeech 2023

2022

  1. Probing phoneme, language and speaker information in unsupervised speech representations
    Maureen Seyssel, Marvin Lavechin, Yossi Adi, and 2 more authors
    Interspeech 2022
  2. Reverse engineering language acquisition with child-centered long-form recordings
    Marvin Lavechin, Maureen Seyssel, Lucas Gautheron, and 2 more authors
    Annual Review of Linguistics 2022

2021

  1. A thorough evaluation of the Language Environment Analysis (LENA) system
    Alejandrina Cristia, Marvin Lavechin, Camila Scaff, and 5 more authors
    Behavior Research Methods 2021
  2. ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings
    Okko Räsänen, Shreyas Seshadri, Marvin Lavechin, and 2 more authors
    Behavior Research Methods 2021
  3. ZR-2021VG: Zero-Resource Speech Challenge, Visually-Grounded Language Modelling track
    Afra Alishahi, Grzegorz Chrupała, Alejandrina Cristià, and 5 more authors
    arXiv preprint arXiv:2107.06546 2021

2020

  1. An open-source voice type classifier for child-centered daylong recordings
    Marvin Lavechin, Ruben Bousbib, Hervé Bredin, and 2 more authors
    Interspeech 2020
  2. Pyannote.audio: neural building blocks for speaker diarization
    Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, and 7 more authors
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
  3. End-to-end domain-adversarial voice activity detection
    Marvin Lavechin, Marie-Philippe Gill, Ruben Bousbib, and 2 more authors
    Interspeech 2020
  4. Speaker detection in the wild: Lessons learned from JSALT 2019
    Paola García, Jesus Villalba, Hervé Bredin, and 8 more authors
    Odyssey 2020
  5. Longform recordings: Opportunities and challenges
    Lucas Gautheron, Marvin Lavechin, Rachid Riad, and 2 more authors
    In LIFT 2020-2èmes journées scientifiques du Groupement de Recherche "Linguistique informatique, formelle et de terrain" 2020