Jaeyeon Kim

Hi everyone! I’m Jaeyeon Kim, an incoming PhD student at the Language Technologies Institute, Carnegie Mellon University, starting in Fall 2025. I am currently a research intern at the Vision and Learning Lab at Seoul National University, advised by Professor Gunhee Kim, where I focus on audio-multimodal learning.
My primary research interests lie in developing artificial intelligence that understands and interacts with the world in a human-like manner by integrating multiple modalities — particularly by bridging linguistic and visual knowledge with auditory information.
selected publications
- EnCLAP++: Analyzing the EnCLAP Framework for Optimizing Automated Audio Captioning Performance. In DCASE2024 Workshop, 2024
- Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations. In ICASSP, 2024