Research Projects

My research integrates computational linguistics and machine learning to build human-centered AI systems. The projects below combine domain expertise in linguistics, cognitive science, and speech processing with modern ML/AI techniques.

Emotion-Aware Conversational AI for Dementia Care

Role: AI/ML Lead | Project funded by the UCOP Noyce Initiative

Overview

I lead development of a clinically validated AI pipeline for managing dementia-related agitation through therapeutic conversational interactions. The project builds the first AI system to combine real-time emotion recognition with expert-designed therapeutic response generation, serving the 55+ million people worldwide living with dementia.

Technical Contributions

Linguistics & Cognitive Science Integration

Technologies: Python, PyTorch, Transformers, OpenAI Whisper, wav2vec2, Azure Cognitive Services, Streamlit, HuggingFace, SentenceTransformers
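
To make the pipeline concrete, here is a minimal sketch of its recognition stage built from off-the-shelf HuggingFace pipelines: Whisper transcribes a turn of patient speech and a public wav2vec2 emotion checkpoint classifies it, so both signals can condition the therapeutic response. The model names, file path, and analyze_turn helper are illustrative assumptions, not the project's actual code.

```python
# Sketch of the recognition stage: transcribe one audio turn with Whisper,
# classify its emotion with wav2vec2, and return both for response planning.
from transformers import pipeline

asr = pipeline("automatic-speech-recognition", model="openai/whisper-small")
emotion = pipeline("audio-classification", model="superb/wav2vec2-base-superb-er")

def analyze_turn(wav_path: str) -> dict:
    """Return the transcript and top emotion label for a single audio turn."""
    text = asr(wav_path)["text"]
    scores = emotion(wav_path)                     # list of {"label", "score"}
    top = max(scores, key=lambda s: s["score"])
    return {"text": text, "emotion": top["label"], "confidence": top["score"]}

turn = analyze_turn("patient_turn.wav")            # hypothetical input file
print(f"[{turn['emotion']} @ {turn['confidence']:.2f}] {turn['text']}")
```

In the full system, the transcript and emotion label would feed a response generator constrained by expert-designed therapeutic strategies; this sketch covers only the perception half.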

Understanding Emotion in Discourse: From Recognition to Generation-Informed Insights

Role: Principal Investigator | TMLR manuscript in preparation

Overview

A systematic analysis of emotion recognition in conversation (ERC) that moves beyond "black-box" accuracy toward understanding how models actually work. The approach achieves state-of-the-art text-only performance on IEMOCAP using strictly causal context (past utterances only), surpassing prior methods that exploit future utterances.

Technical Contributions

Linguistics & Cognitive Science Integration

Technologies: Python, PyTorch, CUDA, Transformers, RoBERTa, LSTM, Scikit-learn, HuggingFace Accelerate, SenticNet
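
The central design choice here is strict causality: when classifying utterance t, the model sees only utterances up to and including t, never later turns. A minimal sketch of that context construction follows, assuming a RoBERTa encoder and a 6-way IEMOCAP label set; the context window size and base checkpoint are illustrative, not the paper's configuration.

```python
# Sketch of strictly causal context for ERC: only past utterances are
# concatenated with the current one before classification.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tok = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForSequenceClassification.from_pretrained("roberta-base", num_labels=6)

def causal_input(dialogue: list[str], t: int, window: int = 4) -> str:
    """Join up to `window` strictly earlier utterances with utterance t."""
    past = dialogue[max(0, t - window):t]          # never touches dialogue[t+1:]
    return tok.sep_token.join(past + [dialogue[t]])

dialogue = ["How was your day?", "Honestly, pretty rough.", "Oh no, what happened?"]
enc = tok(causal_input(dialogue, t=1), return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**enc).logits                   # emotion logits for turn t
print(logits.argmax(-1).item())
```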

Additional Projects

LLM Alignment for Empathetic Responses

Investigating Direct Preference Optimization (DPO) for training language models to generate empathetic responses in healthcare contexts. Developing annotation frameworks for preference data collection.

Technologies: Python, Transformers, DPO, Preference Learning
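
The DPO objective itself is compact: it pushes the policy's log-probability margin for the preferred (empathetic) response above the dispreferred one, measured relative to a frozen reference model. A minimal sketch of the loss on one preference pair; beta and the toy log-probabilities are illustrative.

```python
# DPO loss: -log sigmoid(beta * [(log pi(y_w|x) - log ref(y_w|x))
#                              - (log pi(y_l|x) - log ref(y_l|x))])
import torch
import torch.nn.functional as F

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Each argument is the summed token log-prob of a response."""
    margin = (pi_chosen - ref_chosen) - (pi_rejected - ref_rejected)
    return -F.logsigmoid(beta * margin).mean()

# Toy pair: the policy already slightly prefers the chosen response.
loss = dpo_loss(torch.tensor([-12.0]), torch.tensor([-15.0]),
                torch.tensor([-13.0]), torch.tensor([-14.0]))
print(loss.item())
```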

Multimodal Team Communication Analysis

Contributed to MultiCAT, a comprehensive annotation framework for multimodal team communication, published at NAACL 2025. Developed annotation schemas for verbal and non-verbal communication patterns in collaborative settings.

Technologies: Multimodal annotation, Inter-rater reliability analysis
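
A typical reliability check behind schemas like this is chance-corrected agreement between two annotators on the same items. A minimal sketch using scikit-learn's Cohen's kappa; the labels below are invented for illustration.

```python
# Cohen's kappa: agreement between two annotators, corrected for chance.
from sklearn.metrics import cohen_kappa_score

annotator_a = ["nod", "gaze", "nod", "gesture", "gaze", "nod"]
annotator_b = ["nod", "gaze", "gaze", "gesture", "gaze", "nod"]

kappa = cohen_kappa_score(annotator_a, annotator_b)
print(f"Cohen's kappa: {kappa:.2f}")   # 1.0 = perfect agreement, 0 = chance
```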

Big Data Phonetics: Korean Stop Hyperarticulation

Applied automated acoustic analysis to 100,000+ tokens from Korean broadcast speech, demonstrating how speakers hyperarticulate phonetic cues in lexically confusable contexts. This work bridges corpus linguistics with speech technology.

Technologies: Python, Praat scripting, Forced alignment, Statistical modeling
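
To show what automated acoustic analysis looks like at this scale, here is a minimal sketch that reads one forced-aligned TextGrid with parselmouth (a Python interface to Praat) and extracts the duration of every stop-phone interval as a first-pass articulation measure. The file name, tier index, and phone labels are illustrative assumptions; the real pipeline would batch this over the whole corpus.

```python
# Pull stop-interval durations from a forced-aligned TextGrid via parselmouth.
import parselmouth
from parselmouth.praat import call

# Lenis / aspirated / tense stop labels; the actual set depends on the aligner.
KOREAN_STOPS = {"p", "ph", "pp", "t", "th", "tt", "k", "kh", "kk"}

tg = parselmouth.read("broadcast_utterance.TextGrid")   # hypothetical file
n = int(call(tg, "Get number of intervals", 1))         # tier 1 = phones (assumed)

for i in range(1, n + 1):
    label = call(tg, "Get label of interval", 1, i)
    if label in KOREAN_STOPS:
        start = call(tg, "Get start time of interval", 1, i)
        end = call(tg, "Get end time of interval", 1, i)
        print(f"{label}\t{(end - start) * 1000:.1f} ms")   # duration in ms
```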

Technical Skills

Programming Languages

Python, R, C++, Bash

ML/AI Frameworks

PyTorch, TensorFlow, HuggingFace Transformers, Scikit-learn

NLP & Speech

LLMs (GPT-4, Claude), Whisper, wav2vec2, BERT/RoBERTa, Praat

Computer Vision

MoveNet, OpenCV, Pose Estimation

Cloud & Hardware

NVIDIA GPUs (Academic Grant), Saturn Cloud, Azure Cognitive Services

Research Methods

Experimental design, Statistical analysis, Inter-rater reliability (IRR), Clinical validation