Yunwoo Lee | Academic Homepage

About Me

I’m a Master’s student in Computational Linguistics at the University of Washington in Seattle. I’m broadly interested in computational linguistics and NLP, especially in how linguistic structure and meaning can be modeled and used to build better language technologies.

More recently, I’ve become particularly interested in trustworthy LLMs: how we can evaluate and improve models’ reliability in real-world settings, with a focus on hallucination, fairness, and socio-cultural bias.

I previously worked at Korea Electronics Technology Institute(KETI), NCSOFT, and SK Telecom. Those experiences made me care about how language models behave in the real world, not just how accurate they are, but how reliable and fair they can be.

I hold a B.A. in Linguistics & Cognitive Science and a double major in Language Science in Artificial Intelligence from Hankuk University of Foreign Studies, where I was advised by Prof. Jeesun Nam.

News

Sep 2025 Started M.S. in Computational Linguistics at University of Washington.

Education

University of Washington | 2025–Present

Master of Science in Computational Linguistics

Hankuk University of Foreign Studies | 2018–2024

Bachelor of Arts in Linguistics & Cognitive Science
Bachelor of Language Science in Artificial Intelligence (Double Major)
Leave of Absence - Military Service (Feb 2019 - Sep 2020)

Work Experience

Trustworthiness Benchmarks for Korean LLMs

Researcher @ KETI | Sep 2024 - Jun 2025

I worked on building evaluation resources for Korean LLMs with an emphasis on linguistically grounded reliability: how models maintain grounded claims, interpret indirect meaning, and respond appropriately under pragmatic pressure. Rather than scoring only “correct/incorrect,” I focused on identifying discourse-level failure patterns and the linguistic triggers behind them.

Discourse Evaluation Pragmatics Failure-Mode Analysis

AI Red Teaming & Safety Protocols

Language AI Researcher @ NCSOFT | Mar 2024 - Sep 2024

I supported safety-focused evaluation for conversational AI by building test cases that treat risk as a linguistic and interactional phenomenon. In addition to single-turn risk detection, I evaluated context-aware harmfulness in multi-turn settings where intent becomes clearer through discourse trajectories.

Semantic Risk Signals Context-Aware Safety Conversation Analysis

Persona-based Chatbot "Haru"

Linguistic Annotator @ SK Telecom | Feb 2023 - Jun 2023

I contributed to a persona-based chatbot by focusing on two linguistically central requirements for Korean dialogue: (1) robustness to ambiguity in colloquial, non-canonical user inputs, and (2) sociolinguistic coverage across honorific/register variation.

Ambiguity Taxonomy Text Normalization Sociolinguistic Variation

Research Experience

Corpus Study for Sentiment Analysis & Chatbot NLU

Undergraduate Research Intern @ DICORA Lab, HUFS | Jan 2022 - Sep 2022

At DICORA Lab, I worked on corpus-driven projects where the main goal was to turn raw text into usable linguistic resources. I was involved in data construction, guideline iteration, and quality control across domains, including finance and dialogue NLU.

Corpus Linguistics Annotation Design NLU Data Curation

Projects

Agentic Pipelines for English-to-Chinese Dialogue Summarization

Course Project, LING 573: NLP Systems and Applications, University of Washington | 2026

I worked on a team project that evaluated English-to-Chinese dialogue summarization on XSAMSum. We compared fine-tuned baselines with zero-shot agentic pipelines using local open-weight LLMs, and analyzed whether semantic intermediate representations could improve interpretability.

Cross-Lingual Summarization Agentic Pipelines Open-Weight LLMs LLM Evaluation Error Analysis

Linguistic Pattern Analysis for Financial Sentiment

HUFS Linguistics Graduation Thesis Project | Sep 2022 - Dec 2022

This thesis explored why sentiment in financial news cannot be captured by word lists alone. I modeled sentiment compositionally by focusing on how attribute nouns, such as interest rates and costs, interact with directional predicates, such as rise and fall, where polarity shifts depending on the semantics of the attribute.

Lexical Semantics Rule-Based NLP Sentiment Analysis

Relevant Coursework

Computational Linguistics & NLP

LING 473: Basics for Computational Linguistics
LING 566: Introduction to Syntax for Computational Linguistics
LING 570: Shallow Processing Techniques for NLP
LING 571: Deep Processing Techniques for NLP
LING 572: Advanced Statistical Methods in NLP
LING 573: NLP Systems and Applications
LING 575: Data Matters (Topics in NLP)
Machine Learning for Language Analysis
Big Data and Sentiment Analysis
Language Information Processing
Natural Language Data
Corpus Analysis and Dictionary
Intro to Linguistics and Language Technology
Computer and Linguistics

Linguistics & Cognitive Science

Syntactic Analysis
Phonetics
Pragmatics
Language Typology
Language and Logic
Introduction to Linguistics
Forensic Linguistics
Grammar in Korean as Foreign Language
Introduction to Cognitive Science
Cognitive Psychology
Neurolinguistics
Linguistics and Psychological Experiments
Language and Human Beings

Computer Science & Mathematics

CSE 373: Data Structures and Algorithms
Language and Database
Programming Languages and Laboratory
Introduction to Programming Languages
Essential Programming for Linguistics
Programming for Language Analysis
Probability and Statistics
Computer Mathematics
Statistics for Language Analysis