Publications

2025

  1. Simulating Early Phonetic and Word Learning Without Linguistic Categories
    Marvin Lavechin, Maureen Seyssel, Hadrien Titeux, and 4 more authors
    Developmental Science 2025
  2. Simulating prenatal language exposure in computational models: An exploration study
    María Andrea Cruz Blandón, Nayeli Gonzalez-Gomez, Marvin Lavechin, and 1 more author
    Cognition 2025

2024

  1. Modeling the initial state of early phonetic learning in infants
    Maxime Poli, Thomas Schatz, Emmanuel Dupoux, and 1 more author
    Language Development Research 2024
  2. Establishing the reliability of metrics extracted from long-form recordings using LENA and the ACLEW pipeline
    Alejandrina Cristia, Lucas Gautheron, Zixing Zhang, and 8 more authors
    Behavior Research Methods 2024
  3. Decode, move and speak! Self-supervised learning of speech units, gestures, and sound relationships using vocal imitation
    Marc-Antoine Georges, Marvin Lavechin, Jean-Luc Schwartz, and 1 more author
    Computational Linguistics 2024
  4. Modeling early phonetic acquisition from child-centered audio data
    Marvin Lavechin, Maureen Seyssel, Marianne Métais, and 5 more authors
    Cognition 2024

2023

  1. Measuring language development from child-centered recordings
    Yaya Sy, William Havard, Marvin Lavechin, and 2 more authors
    Interspeech 2023
  2. BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models
    Marvin Lavechin, Yaya Sy, Hadrien Titeux, and 5 more authors
    Interspeech 2023
  3. Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation
    Marvin Lavechin, Marianne Métais, Hadrien Titeux, and 7 more authors
    ASRU 2023
  4. Realistic and broad-scope learning simulations: first results and challenges
    Maureen Seyssel, Marvin Lavechin, and Emmanuel Dupoux
    Journal of Child Language 2023
  5. ProsAudit, a prosodic benchmark for self-supervised speech models
    Maureen Seyssel, Marvin Lavechin, Hadrien Titeux, and 6 more authors
    Interspeech 2023

2022

  1. Reverse engineering language acquisition with child-centered long-form recordings
    Marvin Lavechin, Maureen Seyssel, Lucas Gautheron, and 2 more authors
    Annual Review of Linguistics 2022
  2. Probing phoneme, language and speaker information in unsupervised speech representations
    Maureen Seyssel, Marvin Lavechin, Yossi Adi, and 2 more authors
    Interspeech 2022

2021

  1. A thorough evaluation of the Language Environment Analysis (LENA) system
    Alejandrina Cristia, Marvin Lavechin, Camila Scaff, and 5 more authors
    Behavior Research Methods 2021
  2. ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings
    Okko Räsänen, Shreyas Seshadri, Marvin Lavechin, and 2 more authors
    Behavior Research Methods 2021
  3. ZR-2021VG: Zero-Resource Speech Challenge, Visually-Grounded Language Modelling track
    Afra Alishahi, Grzegorz Chrupała, Alejandrina Cristia, and 5 more authors
    arXiv preprint arXiv:2107.06546 2021

2020

  1. An open-source voice type classifier for child-centered daylong recordings
    Marvin Lavechin, Ruben Bousbib, Hervé Bredin, and 2 more authors
    Interspeech 2020
  2. Pyannote.audio: neural building blocks for speaker diarization
    Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, and 7 more authors
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
  3. End-to-end domain-adversarial voice activity detection
    Marvin Lavechin, Marie-Philippe Gill, Ruben Bousbib, and 2 more authors
    Interspeech 2020
  4. Speaker detection in the wild: Lessons learned from JSALT 2019
    Paola García, Jesus Villalba, Hervé Bredin, and 8 more authors
    Odyssey 2020
  5. Longform recordings: Opportunities and challenges
    Lucas Gautheron, Marvin Lavechin, Rachid Riad, and 2 more authors
    LIFT 2020: 2èmes journées scientifiques du Groupement de Recherche "Linguistique informatique, formelle et de terrain" 2020