Kalvin Chang 張郁騰 /ʈ͡ʂɑŋ˦ y˥˨ tʰəŋ˨˦/

Speech processing for language variation using historical linguistics.

prof_pic.jpg

Gates Hillman Complex

Language Technologies Institute

Carnegie Mellon University

I am a speech researcher who aims to build support for non-standard, low-resource language varieties. I have a track record of publication in top NLP and speech conferences, with a portfolio of 7 co-first authored publications across ASR, NLP, and computational linguistics. My work in both computational historical linguistics and low-resource speech recognition uniquely positions me to pursue my research agenda, which takes the unconventional approach of applying insights from historical linguistics to boost low-resource speech recognition.

I am currently a Visiting Scholar in Shinji Watanabe and David Mortensen’s labs at Carnegie Mellon, leading two teams working on speech in-context learning for low-resource dialects and on language-universal phone recognition. I graduated with a Master’s of Language Technologies (Rank 1) and a BS in Computer Science (with University Honors) from CMU.

news

Feb 13, 2025 Accepted to the Toyota Technical Institute at Chicago’s CS PhD program, the University of Cambridge’s Engineering and Computation, Cognition, and Language PhD programs, the University of Edinburgh’s PhD in Informatics program, the University of Waterloo’s CS PhD program, and UC Berkeley’s CS PhD program.
Jan 31, 2025 Awarded a Gates Cambridge Scholarship as one of 35 / 600 US applicants.
Dec 08, 2024 Selected to attend the inaugural SDAIA Winter School on multi-modal LLMs as a Researcher to work on ASR for code-switching.
Oct 17, 2024 Presented four posters at the SANE 2024 Workshop [1] [2] [3] [4] .
Sep 10, 2024 Won Honorable Mention at the Interspeech 2024 Responsible Speech Foundation Models Special Session for “Self-supervised Speech Representations Still Struggle with African American Vernacular English” (Chang* et al., 2024).
Aug 19, 2024 Returned to CMU LTI as a Visiting Scholar in WAVLab and ChangeLingLab, advised by Professors Shinji Watanabe and David Mortensen.

selected publications

  1. Self-supervised Speech Representations Still Struggle with African American Vernacular English
    Kalvin Chang*, Yi-Hui Chou*, Jiatong Shi, and 4 more authors
    Interspeech, 2024
  2. Evaluating Self-supervised Speech Models on a Taiwanese Hokkien Corpus
    Yi-Hui Chou*, Kalvin Chang*, Meng-Ju Wu, and 8 more authors
    In 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2023
  3. Transformed Protoform Reconstruction
    Young Min Kim*, Kalvin Chang*, Chenxuan Cui, and 1 more author
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Jul 2023
  4. Automating Sound Change Prediction for Phylogenetic Inference: A Tukanoan Case Study
    Kalvin Chang*, Nathaniel Robinson*, Anna Cai*, and 3 more authors
    In Proceedings of the 4th Workshop on Computational Approaches to Historical Language Change, Dec 2023
  5. WikiHan: A New Comparative Dataset for Chinese Languages
    Kalvin Chang, Chenxuan Cui, Youngmin Kim, and 1 more author
    In Proceedings of the 29th International Conference on Computational Linguistics, Oct 2022
  6. Phonotactic Complexity across Dialects
    Ryan Soh-Eun Shim*, Kalvin Chang*, and David R. Mortensen
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
  7. PWESuite: Phonetic Word Embeddings and Tasks They Facilitate
    Vilém Zouhar*, Kalvin Chang*, Chenxuan Cui, and 4 more authors
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024