Jaeyeon Kim

Hi everyone! I’m Jaeyeon Kim, a PhD student at the Language Technologies Institute, Carnegie Mellon University, co-advised by Professor Carlos Busso and Professor Shinji Watanabe.

My primary research interest is developing artificial intelligence that understands and interacts with the world in a human-like manner by integrating multiple modalities, particularly by bridging linguistic and visual knowledge with auditory information.

selected publications

  1. Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span
    Heeseung Yun, Joonil Na, Jaeyeon Kim, Calvin Murdock, and Gunhee Kim
    In NeurIPS (Spotlight), 2025
  2. WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations
    Jaeyeon Kim, Heeseung Yun, Sang Hoon Woo, Chao-Han Huck Yang, and Gunhee Kim
    arXiv preprint arXiv:2508.20976, 2025
  3. ViSAGe: Video-to-Spatial Audio Generation
    Jaeyeon Kim, Heeseung Yun, and Gunhee Kim
    In ICLR, 2025
  4. Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations
    Jaeyeon Kim, Injune Hwang, and Kyogu Lee
    In ICASSP, 2024
  5. EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning
    Jaeyeon Kim, Jaeyoon Jung, Jinjoo Lee, and Sang Hoon Woo
    In ICASSP, 2024