Sanghyuk Chun (He/Him)

Sanghyuk Chun

Ph.D. Student (2025-Present)
Princeton University
sanghyukc [at] princeton.edu
Google Scholar | Github | Twitter | CV

I am a Ph.D. student at Princeton University advised by Professor Olga Russakovsky. My current research interest lies on the domains of machine learning, multi-modal learning (e.g., vision-language, language-audio, and audio-visual), trustworthy ML, and computer vision.

Prior to Princeton, I served as a lead research scientist at ML Research team in NAVER AI Lab (from 2018 to 2025). I held a position as a research engineer at KAKAO Corp from 2016 to 2018, where my work focused on recommendation systems and machine learning applications.

News

_8/2025 : Serving as an area chair at ICLR 2026
_8/2025 : Serving as an area chair at AISTATS 2026
_8/2025 : 1 paper ^[VoxStudio] is accepted at ICCV 2025 Gen4AVC workshop.
_7/2025 : Starting a new chapter in life at Princeton University 👨‍🎓, with the great honor of being awarded the Upton Fellowship — a short blog post about my decision: [link]
_6/2025 : 1 paper ^[RTD] is accepted at ICCV 2025.
_6/2025 : Being nominated as CVPR 2025 outstanding reviewers (711/12,593 = 5.6%) [link].
_5/2025 : Giving a talk at Sogang University (topic: Multiplicity in Multimodal Learning) [slide]
_4/2025 : 1 paper ^{[ReadabilityEmergence]} is accepted at 2nd Workshop on Emergent Visual Abilities and Limits of Foundation Models at CVPR 2025.
_4/2025 : Giving a talk at Yonsei University (topic: Towards Reliable and Efficient Multimodal AI) [slide]
_3/2025 : 1 paper ^[LongProLIP] is accepted at ICLR 2025 Workshop on Quantify Uncertainty and Hallucination in Foundation Models: The Next Frontier in Reliable AI.
_2/2025 : Serving as an area chair at NeurIPS 2025
_1/2025 : 1 paper ^[ProLIP] is accepted at ICLR 2025.
_1/2025: Reaching a research milestone of 10,000 citations at Google Scholar!

See older news

12/2024 : Giving a talk at POSTECH AI Day (topic: Probabilistic Language-Image Pre-training) [slide]
12/2024 : Serving as an area chair at ICML 2025
12/2024 : 1 paper ^[ReWaS] is accepted at AAAI 2025.
11/2024 : 1 paper ^{[FairDRO extension]} is accepted at Neural Networks.
10/2024 : 1 paper ^[ReWaS] is accepted at NeurIPS 2024 Workshopon Video-Language Models.
10/2024 : Serving as an area chair at AISTATS 2025
_9/2024 : 1 paper ^[CKD] is accepted at NeurIPS 2024 D&B track.
_9/2024 : Giving a talk at SKKU (topic: "Realistic challenges and limitations of AI") [slide]
_8/2024 : RoCOCO^[RoCOCO] is accepted as ECCV 2024 Synthetic Data for Computer Vision Workshop and selected as Oral presentation!
_8/2024 : Giving a talk at HUST AI Summer School on "Generative AI" (topic: "CompoDiff") [slide]
_8/2024 : Serving as an area chair at ICLR 2025
_8/2024 : HYPE^[HYPE] is selected as Oral presentation at this ECCV!
_7/2024 : 1 paper ^[CompoDiff] is accepted at TMLR.
_7/2024 : 3 papers ^[HYPE]^[SAT]^[LUT] are accepted at ECCV 2024.
_4/2024 : 1 paper ^[CompoDiff] is accepted at CVPR 2024 SynData4CV Workshop.
_4/2024 : Serving as an area chair at NeurIPS 2024 Datasets and Benchmarks Track.
_3/2024 : Serving as an area chair at NeurIPS 2024.
_3/2024 : 1 paper ^[RegionVLP] is accepted at NAACL 2024 main track.
_3/2024 : Giving a talk at UNIST (topic: "Probabilistic Image-Text Representations") [slide]
_2/2024 : 1 paper ^[LinCIR] is accepted at CVPR 2024.
_2/2024 : Giving a talk at IEIE AI Signal Processing Society Winter School (topic: "Probabilistic Image-Text Representations") [slide]
_1/2024 : 2 papers ^[PCME++]^{[AD Correctness]} are accepted at ICLR 2024. One paper^[PCME++] is my sole authored paper 🤗, and one paper^{[AD Correctness]} is selected as spotlight! (Top-5% paper)
12/2023 : Giving a talk at Dankook University (topic: "Probabilistic Image-Text Representations") [slide]
12/2013: We finally released SynthTriplets18 dataset!
11/2013: Being nominated as NeurIPS 2023 top reviewers (10%) [link].
_9/2023 : Giving a talk at HUST AI Summer School on "Modern Machine Learning: Foundations and Applications" (topic: "Probabilistic Image-Text Representations") [slide]
_9/2023 : 1 paper ^{[PCME++ short]} is accepted at the non-archival track of ICCV 2023 Workshop on Closing The Loop Between Vision And Language (CLVL).
_8/2023 : Giving a talk at Yonsei University (topic: "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion") [slide]
_7/2023 : 1 paper ^[SeiT] is accepted at ICCV 2023.
_7/2023 : Serving as a TMLR Action Editor.
_6/2023 : Being nominated as a TMLR Expert Reviewer [link].
_6/2023 : Giving a talk at Sogang University (topic: "Probabilistic Image-Text Representations") [slide]
_5/2023 : Serving as an area chair at NeurIPS 2023 Datasets and Benchmarks Track.
_4/2023 : We released "Graphit: A Unified Framework for Diverse Image Editing Tasks" [GitHub] ^[Graphit], The technical report will be released soon!
_4/2023 : 1 paper ^{[3D-Pseudo-Gts]} is accepted at CVPR 2023 Workshop on Computer Vision for Mixed Reality (CV4MR).
_1/2023 : 1 paper ^[FairDRO] is accepted at ICLR 2023.
_9/2022 : Giving a talk at Sogang University (topic: "ECCV Caption") [slide]
_9/2022 : 1 paper ^{[MSDA theorem]} is accepted at NeurIPS 2022.
_8/2022 : Starting a new chapter in life with Song Park 🤵❤️👰.
_7/2022 : 1 paper ^{[LF-Font journal]} is accepted at IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI).
_7/2022 : 2 papers ^{[ECCV Caption]} ^[MIRO] are accepted at ECCV 2022.
_7/2022 : Giving a talk at UNIST AIGS (topic: "Towards Reliable Machine Learning: Challenges, Examples, Solutions") [slide]
_6/2022 : Giving a tutorial on "Shortcut learning in Machine Learning: Challenges, Analysis, Solutions" at FAccT 2022. [ tutorial homepage | slide | video ]
_5/2022 : Receiving an outstanding reviewer award at CVPR 2022 [link].
_5/2022 : 1 paper ^[DCC] is accepted at ICML 2022.
_4/2022 : 1 paper ^{[WSOL Eval journal]} is accepted at IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI).
_4/2022 : Organizing ICLR 2022 ML in Korea Social
_3/2022 : Giving guest lectures at KAIST and SNU (topic: "Towards Reliable Machine Learning") [slide]
_3/2022 : Co-organizing FAccT 2022 Translation/Dialogue Tutorial: "Shortcut learning in Machine Learning: Challenges, Analysis, Solutions" (slides, videos and web pages will be released soon)
_3/2022 : 1 paper ^[CGL] is accepted at CVPR 2022.
_2/2022 : Giving a talk at POSTECH AI Research (PAIR) ML Winter Seminar 2022 (topic: "Shortcut learning in Machine Learning: Challenges, Examples, Solutions") [slide]
_1/2022 : 2 papers ^[ViDT] ^[WCST-ML] are accepted at ICLR 2022.
12/2021 : Co-hosting NeurIPS'21 workshop on ImageNet: Past, Present, and Future with 400+ attendees!
12/2021 : Giving a talk at University of Seoul (topic: "Realistic challenges and limitations of AI") [slide]
11/2021 : Giving a talk at NAVER and NAVER Labs Europe (topic: Mitigating dataset biases in Real-world ML applications) [slide]
11/2021 : Giving a guest lecture at UNIST (topic: Limits and Challenges in Deep Learning Optimizers) [slide]
10/2021 : Releasing an unified few-shot font generation framework! [code]
_9/2021 : 2 papers ^[SWAD] ^[NHA] are accepted at NeurIPS 2021.
_8/2021: Reaching a research milestone of 1,000 citations at Google Scholar and Semantic Scholar!
_7/2021 : Co-organizing the NeurIPS Workshop on ImageNet: Past, Present, and Future! [webpage]
_7/2021 : 2 papers ^[MX-Font] ^[PiT] are accepted at ICCV 2021.
_7/2021 : Giving a talk at Computer Vision Centre (CVC), UAB (topic: PCME and AdamP) [info] [slide]
_6/2021 : Giving a talk at KSIAM 2021 (topic: AdamP). [slide]
_6/2021 : Giving a guest lecture at Seoul National University (topic: few-shot font generation) .[slide]
_5/2021 : Receiving an outstanding reviewer award at CVPR 2021 [link].
_4/2021 : 1 paper ^[LF-Font] is accepted at CVPR 2021 workshop (also appeared at AAAI).
_3/2021 : 2 papers ^[PCME] ^[ReLabel] are accepted at CVPR 2021.
_1/2021 : 1 paper ^[AdamP] is accepted at ICLR 2021.
12/2020 : 1 paper ^[LF-Font] is accepted at AAAI 2021.
_7/2020 : 1 paper ^[DM-Font] is accepted at ECCV 2020.
_6/2020 : Receiving the best paper runner-up award at AICCW CVPR 2020 ^{[DM-Font WS]}.
_6/2020 : Receiving an outstanding reviewer award at CVPR 2020 [link].
_6/2020 : Giving a talk at CVPR 2020 NAVER interative session.
_6/2020 : 1 paper ^[ReBias] is accepted at ICML 2020.
_4/2020 : 1 paper ^{[DM-Font short]} is accepted at CVPR 2020 workshop.
_2/2020 : 1 paper ^[wsoleval] is accepted at CVPR 2020.
_1/2020 : 1 paper ^[HCNN] is accepted at ICASSP 2020.
10/2019 : 1 paper ^{[HCNN short]} is accpeted at ISMIR late break demo.
10/2019 : Working at Naver Labs Europe as a visiting researcher (Oct - Dec 2019)
_7/2019 : 2 papers ^{[CutMix] [WCT2]} are accepted at ICCV 2019 (1 oral presentation).
_6/2019 : Giving a talk at ICML 2019 Expo workshop.
_5/2019 : 2 papers ^{[MTSA] [RegEval]} are accepted at ICML 2019 workshops (1 oral presentation).
_5/2019 : Giving a talk at ICLR 2019 Expo talk.
_3/2019 : 1 paper ^[PRM] is accepted at ICLR 2019 workshop.

Current Service Appointments

Area Chair - ICLR 2026 AISTATS 2026 NeurIPS 2025 ICML 2025 ICLR 2025 AISTATS 2025
Action Editor - TMLR
Reviewer - ICLR Blog track 2025 CVPR 2025 (outstanding reviewer) ICCV 2025

Research

I am interested in machine learning and its reliability. My research philosophy aims to (1) scalability and (2) theoretical or conceptual soundness. Recently, I have focused on multi-modal learning (e.g., vision-language and audio-visual), but I am open to exploring a broader range of topics. More details about my research can be found in my research statement (as of December 2023). Representative works that reflect my research approach and significant contributions are highlighted, with the most significant works emphasized in orange.

(C: peer-reviewed conference, W: peer-reviewed workshop, A: arxiv preprint, O: others)
(^❋authors contributed equally)
See also at my Google Scholar.

2025

Mitigating Cross-Image Information Leakage in LVLMs for Multi-Image Tasks.
- Yeji Park, Minyoung Lee, Sanghyuk Chun, Junsuk Choe
- preprint. paper | code | bibtex
Seeing What You Say: Expressive Image Generation from Speech.
- Jiyoung Lee, Song Park, Sanghyuk Chun, Soo-Whan Chung
- ICCV 2025 Workshop on Generative AI for Audio-Visual Content Creation (Gen4AVC). paper | bibtex
An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval.
- Jaeseok Byun^❋, Seokhyeon Jeong^❋, Wonjae Kim, Sanghyuk Chun^†, Taesup Moon^†
- ICCV 2025. paper | bibtex
Multiplicity is an Inevitable and Inherent Challenge in Multimodal Learning.
- Sanghyuk Chun
- preprint. paper | bibtex
Emergence of Text Readability in Vision Language Models.
- Jaeyoo Park, Sanghyuk Chun, Wonjae Kim, Sangdoo Yun, Bohyung Han
- CVPR 2025 EVAL-FoMo Workshop. paper | bibtex
LongProLIP: A Probabilistic Vision-Language Model with Long Context Text.
- Sanghyuk Chun, Sangdoo Yun
- ICLR 2025 QUESTION Workshop. paper | code | pre-trained models 🤗 | slide | bibtex
DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias.
- Song Park^❋, Sanghyuk Chun^❋, Byeongho Heo, Dongyoon Han
- preprint. paper | bibtex
Probabilistic Language-Image Pre-Training.
- Sanghyuk Chun, Wonjae Kim, Song Park, Sangdoo Yun
- ICLR 2025. paper | code | pre-trained models 🤗 | slide | bibtex
Read, Watch and Scream! Sound Generation from Text and Video.
- Yujin Jeong, Yunji Kim, Sanghyuk Chun, Jiyoung Lee
- AAAI 2025 | NeurIPS 2024 Workshop on Video-Language Models. paper | code | project page | bibtex
FairDRO: Group Fairness Regularization via Classwise Robust Optimization.
- Taeeon Park, Sangwon Jung, Sanghyuk Chun, Taesup Moon
- Neural Networks. paper | code | bibtex

2024

Do Counterfactually Fair Image Classifiers Satisfy Group Fairness? -- A Theoretical and Empirical Study.
- Sangwon Jung^❋, Sumin Yu^❋, Sanghyuk Chun^†, Taesup Moon^†
- NeurIPS 2024 D&B. paper | code and dataset | bibtex
HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts. Oral presentation
- Wonjae Kim, Sanghyuk Chun, Taekyung Kim, Dongyoon Han, Sangdoo Yun
- ECCV 2024. paper | code | bibtex
Similarity of Neural Architectures using Adversarial Attack Transferability.
- Jaehui Hwang, Dongyoon Han, Byeongho Heo, Song Park, Sanghyuk Chun^❋, Jong-Seok Lee^❋
- ECCV 2024. paper | code | bibtex
Learning with Unmasked Tokens Drives Stronger Vision Learners.
- Taekyung Kim^❋, Sanghyuk Chun, Byeongho Heo, Dongyoon Han^❋
- ECCV 2024. paper | code | bibtex
RoCOCO: Robust Benchmark of MS-COCO to Stress-test Robustness of Image-Text Matching Models. Oral presentation
- Seulki Park, Daeho Um, Hajung Yoon, Sanghyuk Chun, Sangdoo Yun
- ECCV 2024 Synthetic Data for Computer Vision Workshop (SyntheticData4CV 2024). paper | code | bibtex
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion.
- Geonmo Gu^❋, Sanghyuk Chun^❋, Wonjae Kim, HeeJae Jun, Yoohoon Kang, Sangdoo Yun
- TMLR. CVPR 2024 SynData4CV Workshop. paper | code | SynthTriplets18M dataset | demo 🤗 | slide | bibtex
Toward Interactive Regional Understanding in Vision-Large Language Models
- Jungbeom Lee, Sanghyuk Chun^❋, Sangdoo Yun^❋
- NAACL 2024. paper | code | bibtex
Language-only Efficient Training of Zero-shot Composed Image Retrieval.
- Geonmo Gu^❋, Sanghyuk Chun^❋, Wonjae Kim, Yoohoon Kang, Sangdoo Yun
- CVPR 2024. paper | code | bibtex
What Does Automatic Differentiation Compute for Neural Networks? Spotlight presentation
- Sejun Park^❋, Sanghyuk Chun^❋, Wonyeol Lee
- ICLR 2024. paper | code | bibtex
Improved Probabilistic Image-Text Representations.
- Sanghyuk Chun
- ICLR 2024. ICCV CLVL 2023. paper | code | project page | slide | bibtex

2023

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage.
- Song Park^❋, Sanghyuk Chun^❋, Byeongho Heo, Wonjae Kim, Sangdoo Yun
- ICCV 2023. paper | code | slide | bibtex
Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild.
- Gyeongsik Moon, Hongsuk Choi, Sanghyuk Chun, Jiyoung Lee, Sangdoo Yun
- CVPR 2023 Workshop on Computer Vision for Mixed Reality (CV4MR). paper | code | bibtex
Re-weighting based Group Fairness Regularization via Classwise Robust Optimization.
- Sangwon Jung^❋, Taeeon Park^❋, Sanghyuk Chun, Taesup Moon
- ICLR 2023. paper | code | bibtex

2022

Group Generalized Mean Pooling for Vision Transformer.
- Byungsoo Ko, Han-Gyu Kim, Byeongho Heo, Sangdoo Yun, Sanghyuk Chun, Geonmo Gu, Wonjae Kim
- preprint. paper | bibtex
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective.
- Chanwoo Park^❋, Sangdoo Yun^❋, Sanghyuk Chun
- NeurIPS 2022. paper | code | bibtex
ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO.
- Sanghyuk Chun, Wonjae Kim, Song Park, Minsuk Chang, Seong Joon Oh
- ECCV 2022. paper | code | pypi | slide (short talk) | slide (long talk) | bibtex
Domain Generalization by Mutual-Information Regularization with Pre-trained Models.
- Junbum Cha, Kyungjae Lee, Sungrae Park, Sanghyuk Chun
- ECCV 2022. paper | code | bibtex
Few-shot Font Generation with Weakly Supervised Localized Representations.
- Song Park^❋, Sanghyuk Chun^❋, Junbum Cha, Bado Lee, Hyunjung Shim
- IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 2022. (IF:24.314)
- PAMI. paper | code (old) | code (new) | project page | bibtex
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets.
- Junsuk Choe^❋, Seong Joon Oh^❋, Sanghyuk Chun, Seungho Lee, Zeynep Akata, Hyunjung Shim
- IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 2022. (IF:24.314)
- PAMI. paper | code and dataset | bibtex
An Extendable, Efficient and Effective Transformer-based Object Detector.
- Hwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, Ming-Hsuan Yang
- preprint. paper | code | bibtex
Dataset Condensation with Contrastive Signals.
- Saehyung Lee, Sanghyuk Chun, Sangwon Jung, Sangdoo Yun, Sungroh Yoon
- ICML 2022. paper | bibtex
Learning Fair Classifiers with Partially Annotated Group Labels.
- Sangwon Jung, Sanghyuk Chun^❋, Taesup Moon^❋
- CVPR 2022. paper | code | bibtex
ViDT: An Efficient and Effective Fully Transformer-based Object Detector.
- Hwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, Ming-Hsuan Yang
- ICLR 2022. paper | code | bibtex
Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective.
- Luca Scimeca^❋, Seong Joon Oh^❋, Sanghyuk Chun, Michael Poli, Sangdoo Yun
- ICLR 2022. paper | bibtex

2021

SWAD: Domain Generalization by Seeking Flat Minima.
- Junbum Cha, Sanghyuk Chun^❋, Kyungjae Lee^❋, Han-Cheol Cho, Seunghyun Park, Yunsung Lee, Sungrae Park
- NeurIPS 2021. paper | code | bibtex
Neural Hybrid Automata: Learning Dynamics with Multiple Modes and Stochastic Transitions.
- Michael Poli^❋, Stefano Massaroli^❋, Luca Scimeca, Seong Joon Oh, Sanghyuk Chun, Atsushi Yamashita, Hajime Asama, Jinkyoo Park, Animesh Garg
- NeurIPS 2021. paper | bibtex
StyleAugment: Learning Texture De-biased Representations by Style Augmentation without Pre-defined Textures.
- Sanghyuk Chun, Song Park
- preprint. paper | bibtex
Rethinking Spatial Dimensions of Vision Transformers.
- Byeongho Heo, Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Junsuk Choe, Seong Joon Oh
- ICCV 2021. paper | code | tweet | bibtex
Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts.
- Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim
- ICCV 2021. paper | code | bibtex
Probabilistic Embeddings for Cross-Modal Retrieval.
- Sanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, Diane Larlus
- CVPR 2021. paper | code | video | slide (short talk) | slide (long talk) | bibtex
Re-labeling ImageNet: from Single to Multi-Labels, from Global to Localized Labels.
- Sangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Junsuk Choe, Sanghyuk Chun
- CVPR 2021. paper | code | bibtex
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights.
- Byeongho Heo^❋, Sanghyuk Chun^❋, Seong Joon Oh, Dongyoon Han, Sangdoo Yun, Gyuwan Kim, Youngjung Uh, Jung-Woo Ha
- ICLR 2021. paper | code | project page | pypi | slide | bibtex
Few-shot Font Generation with Localized Style Representations and Factorization.
- Song Park^❋, Sanghyuk Chun^❋, Junbum Cha, Bado Lee, Hyunjung Shim
- AAAI 2021. CVPR Workshop 2021. paper | code | project page | bibtex

2020

Few-shot Compositional Font Generation with Dual Memory.
- Junbum Cha, Sanghyuk Chun, Gayoung Lee, Bado Lee, Seonghyeon Kim, Hwalsuk Lee
- ECCV 2020. paper | code | video | bibtex
Learning De-biased Representations with Biased Representations.
- Hyojin Bahng, Sanghyuk Chun, Sangdoo Yun, Jaegul Choo, Seong Joon Oh
- ICML 2020. paper | code | tweet | video | slide | bibtex
Toward High-quality Few-shot Font Generation with Dual Memory. Oral presentation The best paper runner-up award
- Junbum Cha, Sanghyuk Chun, Gayoung Lee, Bado Lee, Seonghyeon Kim, Hwalsuk Lee
- CVPR Workshop 2020. paper | bibtex
Evaluating Weakly Supervised Object Localization Methods Right.
- Junsuk Choe^❋, Seong Joon Oh^❋, Seongho Lee, Sanghyuk Chun, Zeynep Akata, Hyunjung Shim
- CVPR 2020. paper | code and dataset | tweet | slide | video on CVPR | video on ECCV tutorial | bibtex
Data-driven Harmonic Filters for Audio Representation Learning.
- Minz Won, Sanghyuk Chun, Oriol Nieto, Xavier Serra
- ICASSP 2020. paper | code and pretrained models | video | bibtex

2019

Neural Approximation of Auto-Regressive Process through Confidence Guided Sampling.
- YoungJoon Yoo, Sanghyuk Chun, Sangdoo Yun, Jung-Woo Ha, Jaejun Yoo
- preprint. paper | bibtex
Toward Interpretable Music Tagging with Self-attention.
- Minz Won, Sanghyuk Chun, Xavier Serra
- preprint. paper | code and pretrained models | bibtex
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features. Oral presentation
- Sangdoo Yun, Dongyoon Han, Seong Joon Oh, Sanghyuk Chun, Junsuk Choe, Youngjoon Yoo
- ICCV 2019. paper | code and pretrained models | blog | bibtex
Photorealistic Style Transfer via Wavelet Transforms.
- Jaejun Yoo^❋, Youngjung Uh^❋, Sanghyuk Chun^❋, Byungkyu Kang, Jung-Woo Ha
- ICCV 2019. paper | code and model weights | video | blog | bibtex
Automatic Music Tagging with Harmonic CNN.
- Minz Won, Sanghyuk Chun, Oriol Nieto, Xavier Serra
- ISMIR LBD 2019. paper | code | bibtex
An Empirical Evaluation on Robustness and Uncertainty of Regularization methods.
- Sanghyuk Chun, Seong Joon Oh, Sangdoo Yun, Dongyoon Han, Junsuk Choe, Youngjoon Yoo
- ICML Workshop 2019. paper | bibtex
Visualizing and Understanding Self-attention based Music Tagging. Oral presentation
- Minz Won, Sanghyuk Chun, Xavier Serra
- ICML Workshop 2019. paper | code | talk video | bibtex
Where To Be Adversarial Perturbations Added? Investigating and Manipulating Pixel Robustness Using Input Gradients.
- Jisung Hwang^❋, Younghoon Kim^❋, Sanghyuk Chun^❋, Jaejun Yoo, Ji-Hoon Kim, Dongyoon Han
- ICLR Workshop 2019. paper | bibtex

~ 2018

Multi-Domain Processing via Hybrid Denoising Networks for Speech Enhancement.
- Jang-Hyun Kim^❋, Jaejun Yoo^❋, Sanghyuk Chun, Adrian Kim, Jung-Woo Ha
- preprint. paper | project page | bibtex
A Study on Intelligent Personalized Push Notification with User History.
- Hyunjong Lee, Youngin Jo, Sanghyuk Chun, Kwangseob Kim
- Big Data 2017. paper | bibtex
Scalable Iterative Algorithm for Robust Subspace Clustering: Convergence and Initialization.
- Master's Thesis, Korea Advanced Institute of Science and Technology, 2016 (advised by Jinwoo Shin) paper | code

Academic Activities

Professional Service

Journal Action Editor:
- Transactions on Machine Learning Research (TMLR)

Conference Area Chair:
- ICLR 2025-2026
- AISTATS 2025-2026
- NeurIPS 2024-2025
- NeurIPS Dataset and Benchmark (D&B) track 2023-2024
- ICML 2025

Tutorial / Workshop / Social Organizer:
- FAccT 2022 Translation/Dialogue Tutorial: "Shortcut learning in Machine Learning: Challenges, Analysis, Solutions"
  - Co-organized by Sanghyuk Chun, Kyungwoo Song, Yonghan Jung
  - homepage | slide | video
- NeurIPS 2021 Workshop on ImageNet: Past, Present, and Future
  - Co-organized by Zeynep Akata, Lucas Beyer, Sanghyuk Chun, Almut Sophia Koepke, Diane Larlus, Seong Joon Oh, Rafael Sampaio de Rezende, Sangdoo Yun, Xiaohua Zhai

ICLR 2021 Social: ML in Korea, technical chair and session chair
ICLR 2022 Social: ML in Korea, a main organizer

Reviewer:
- Conference: ICML 2021-2024, NeurIPS 2020-2023 (NeurIPS 2023 Top reviewer), ICLR 2021-2024, AAAI 2021, CVPR 2020-2025 (CVPR 2020, 2021, 2022, 2025 Outstanding reviewer), ICCV/ECCV 2021-2025, WACV 2021, ACCV 2020, CHI 2023
- Journal: Transactions on Machine Learning Research (TMLR) (TMLR 2023 Expert Reviewer), Transactions on Pattern Analysis and Machine Intelligence (TPAMI), International Journal of Computer Vision (IJCV), IEEE Transactions on Image Processing (TIP)

Honors and Awards

Talks

"Multiplicity in Multimodal Learning: From Ambiguity to Understanding", Sogang University (2025) [slide]
"Towards Reliable and Efficient Multimodal AI", Yonsei University (2025) [slide]
"Probabilistic Language-Image Pre-training", POSTECH AI Day (2024) [slide]
"Realistic challenges and limitations of AI", SKKU (2024). [slide]
"CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion", HUST AI Summer School on "Generative AI" (2024). [slide]
"Font Generation", Dankook University (2024).
"Probabilistic Image-Text Representations", IEIE AI Signal Processing Society Winter School and UNIST AIGS Seminar (2024). [slide]

See older talks

"Probabilistic Image-Text Representation", HUST AI Summer School on "Modern Machine Learning: Foundations and Applications" and Dankook University (2023). [slide]
"CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion", Yonsei University (2023). [slide]
"Probabilistic Image-Text Representations", Sogang University (2023). [slide]
"ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO", NAVER and Sogang University (2022). [slide]
"Towards Reliable Machine Learning: Challenges, Examples, Solutions", UNIST AIGS (2022). [slide]
"Tutorial on Shortcut learning in Machine Learning: Challenges, Analysis, Solutions" at FAccT 2022. [ tutorial homepage | slide | video ]
"Towards Reliable Machine Learning", KAIST and SNU (2022). [slide]
"Shortcut learning in Machine Learning: Challenges, Examples, Solutions", POSTECH AI Research (PAIR) ML Winter Seminar 2022. [slide]
"Realistic challenges and limitations of AI", University of Seoul (2021). [slide]
"Mitigating dataset biases in Real-world ML applications", NAVER and NAVER Labs Europe (2021). [slide]
"Limits and Challenges in Deep Learning Optimizers", UNIST (2021). [slide]
"Towards better cross-modal learning by Probabilistic embedding and AdamP optimizer", UAB CVC (2021). [info] [slide]
"AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights", KSIAM (2021). [slide]
"Towards Few-shot Font Generation", Seoul University and NAVER (2021). [slide]
"Probabilistic Embeddings for Cross-modal Retrieval", SNU AI Institute (AIIS) Retreat. [slide]
"Learning De-biased Representations with Biased Representations", NAVER (2020). [slide]
"Reliable Machine Learning in NAVER AI", Yonsei University (2020). [slide]
"Toward Reliable Machine Learning", omnious and nota (2020). [slide]
"Reliable Machine Learning", NAVER CVPR 2020 sponser event. [program] [slide] [video]
"Neural Architectures for Music Representation Learning", NAVER (2020). [slide]
"Learning generalizable representations with CutMix and ReBias", NAVER Labs Europe (2019).
"An empirical evaluation on the generalizability of regularization methods", ICML 2019 Expo Talk: Recent Work on Machine Learning at NAVER. [slide]
"Recent works on deep learning robustness in Clova AI", ICLR 2019 Expo Talk: Representation Learning to Rich AI Services in NAVER and LINE.
"Recommendation system in the real world", Deepest Summer School 2018. [slide]

Teaching

Princeton AI4All Instructor, 2025
KAIST Introduction to Big Data, TA, 2015
KAIST EE (Electrical Engineering) Laboratory, TA, 2014

Code and Data

Datasets

CUB v2 and OpenImages 30k [C5] [J2]: These datasets were designed and collected for the WSOL evaluation project. We newly collected bird images for CUB v2 (200 classes) and re-organize OpenImages V5 for OpenImages 30k (100 classes)
Biased MNIST, 9-Class ImageNet and ImageNet cluster labels [C6]: We proposed these datasets for measuring how ML models can be generalized to bias shift. Biased MNIST is a synthetic dataset based on MNIST, while each image has background colors which highly correlates with the labels (controllable). ImageNet-9 contains 9 super-classes (dog, cat, frog, turtle, bird, monkey, fish, crab, insect) and 57k training samples. We also proposes "unbiased accuracy" by using cluster labels (K=9), which empicially matches to Shell, Grass, Close-up, Eye, Human, Sand and Mammal.
CUB Caption [C11]: CUB was not majorly used for image-text matching (ITM) retrieval. However, while doing the PCME project, we proposed to use the CUB dataset as a ITM benchmark for measuring the impact of many-to-many correspondence; if an image and a description are in the same class, then we treat it as "positive" otherwise "negative". We carefully devide the training-validation (150 classes) and test splits (50 classes) following Xian et al. 2017.
ECCV Caption [C21]: Although CUB Caption can evaluate the impact of many-to-many correspondence in ITM, this dataset is still "synthetic". For a more practical usage, we proposed the ECCV Caption benchmark, by correcting the false negatives (FNs) in the MS-COCO Caption dataset. By machine and human annotators, we collected x8.47 positive images and x3.58 positive captions compared to the original COCO Caption.
RoCOCO [W9]: Is an ITM model robust to a malicious manipulation on captions or images? Here, we investigated the impact of altered captions (same concept, different concept, rand voca attack, and dangerous word) and mixed images. Even state-of-the-art ITM models showed substantial performance degradations on our benchmark.

Softwares

Few-shot Font Generation Benchmark.
- Song Park, Sanghyuk Chun
- open source. code
Graphit: A Unified Framework for Diverse Image Editing Tasks.
- Geonmo Gu, Sanghyuk Chun, Wonjae Kim, HeeJae Jun, Sangdoo Yun, Yoohoon Kang
- open source. code | demo

You can install my softwares [C9] [C21] via pip by following commands:

pip install adamp
pip install eccv_caption

Check the following HuggingFace hub links for the official weights [C25] [C34]:

PCME++ [C25]: https://huggingface.co/collections/SanghyukChun/pcme-6652e86eb6fc2144ec26bf09
ProLIP [C34]: https://huggingface.co/collections/SanghyukChun/prolip-6712595dfc87fd8597350291

Industry Experience

NAVER AI Lab (2018 ~ 2025)

Hangul Handwriting Font Generation
Distributed at 2019 Hangul's day (한글날), [Full font list]
- Hangul (Korean alphabet, 한글) originally consists of only 24 sub-letters (ㄱ, ㅋ, ㄴ, ㄷ, ㅌ, ㅁ, ㅂ, ㅍ, ㄹ, ㅅ, ㅈ, ㅊ, ㅇ, ㅎ, ㅡ, ㅣ, ㅗ, ㅏ, ㅜ, ㅓ, ㅛ, ㅑ, ㅠ, ㅕ), but by combining them, there exist 11,172 valid characters in Hangul. For example, "한" is a combination of ㅎ, ㅏ, and ㄴ, and "쐰" is a combination of ㅅ, ㅅ, ㅗ, ㅣ, and ㄴ. It makes generating a new Hangul font be very expensive and time-consuming. Meanwhile, since 2008, Naver has distributed Korean fonts for free (named Nanum fonts, 나눔 글꼴).
- In 2019, we developed a technology for fully-personalized Hangul generation only with 152 characters. We opened an event page where users can submit their own handwriting. The full generated font list can be found in [this link]. Details for the generation technique used for the service was presented in Deview 2019 [Link].
- This work was also extended to the few-shot generation based on the compositionality. See the papers in AI for Content Creation Workshop (AICCW) at CVPR 2020 (short paper) [Link], ECCV 2020 (full paper) [Link], AAAI 2021 [Link], ICCV 2021 [Link], and journal extension [Link].
- [BONUS] You can play with my handwriting here
Example emoji from LINE sticker shop.
Emoji Recommendation (LINE Timeline)
Deployed in Jan. 2019
- LINE is a major messenger player in east asia (Japan, Taiwan, Thailand, Indonesia, and Korea). In the application, users can buy and use numerous emoijs a.k.a. LINE Sticker.
- In this project, we recommended emojis to users based on their profile picture (cross-domain recommendation).
- I developed and researched the entire pipeline of the cross-domain recommendation system and operation tools.
Mentees / Short-term post-doctoral collaborators / Internship students
Topics: Reliable ML Vision-Language Modality-specific tasks Generative models Other topics
- _ Junwon Lee (KAIST, 2025) -- AVL representation learning
- _ Sehyun Kwon (Seoul National University, 2024) -- VL representation learning
- _ Jaeyoo Park (Seoul National University, 2024) -- VL representation learning
- _ Jungin Park (Visiting researcher, 2024) -- VL representation learning
- _ Yujin Jeong (Korea University, 2024) [C33/W10] -- AVL representation learning
- _ _ Heesun Bae (KAIST, 2023) -- VL representation learning under noisy environment
- _ Jungbeom Lee (Visiting researcher, 2023) [C28] -- VL representation learning
- _ Eunji Kim (Seoul National University, 2022) -- XAI + Probabilistic Machine (the internship project is published at ICML 2023 [paper])
- _ Jaehui Hwang (Yonsei University, 2022) [C30] -- Adversarial robustness and XAI
- _ Chanwoo Park (Seoul National University, 2021-2022) [C22] -- Deep learning theory
- _ Gyeongsik Moon (Visiting researcher, 2022) [W7] -- Semi-supervised learning for 3D Human Mesh Estimation
- _ Hongsuk Choi (Visiting researcher, 2022) [W7] -- Semi-supervised learning for 3D Human Mesh Estimation
- _ _ Seulki Park (Seoul National University, 2022) [W9] -- VL robustness benchmark
- _ Saehyung Lee (Seoul National University, 2021-2022) [C19] -- Data condensation
- _ Sangwon Jung (Seoul National University, 2021-2023) [C18] [C19] -- Fairness with not enough group labels, group fairness
- _ Luca Scimeca (A short-term post-doctoral collaborator, 2021) [C16] [C14] -- Understanding shortcut learning phenomenon in feature space
- _ Michael Poli (KAIST, 2021) [C14] [C16] -- Neural hybrid automata
- _ Hyemi Kim (KAIST, 2021) -- Test-time training for robust prediction
- _ Jun Seo (KAIST, 2021) -- Self-supervised learning
- _ Song Park (Yonsei University, 2020-2021) [C8/W6] [C12] [A4] [J2] -- Few-shot font generation
- _ Hyojin Bahng (Korea University, 2019) [C6] -- De-biasing
- _ Junsuk Choe (Yonsei University, 2019) [C5] [J1] -- Reliable evaluation for WSOL
- _ Naman Goyal (IIT RPR, 2019) -- Robust representation against shift
- _ Minz Won (Music Technology Group, Universitat Pompeu Fabra, 2018-2019) [W2] [W4] [A2] [C4] -- Audio representation learning
- _ Byungkyu Kang (Yonsei University, 2018) [C2] -- Image-to-image translation and style transfer
- _ Jang-Hyun Kim (Seoul National University, 2018) [A1] -- Audio representation learning
- _ Jisung Hwang (University of Chicago, 2018) [W1] -- Adversarial robustness
- _ Younghoon Kim (Seoul National University, 2018) [W1] -- Adversarial robustness

Kakao Advanced Recommendation Technology (ART) team (2016 ~ 2018)

Recommender Systems (Kakao services)
Feb. 2016 - Feb. 2018
- I developed and maintained a large-scale real-time recommender system (Toros [PyCon Talk] [AI Report]) for various services in Daum and Kakao. I mainly worked with content-based representation modeling (for textual, visual, and musical data), collaborative filtering modeling, user embedding, user clustering, and ranking system based on Multi-armed Bandit.
- Textual domain: Daum News similar article recommendation, Brunch (blog service) similar post recommendation, Daum Cafe (community service) hit item recommendation.
- Visual domain: Daum Webtoon and Kakao Page similar item recommendation, video recommendation for a news article (cross-domain recommendation).
- Audio domain: music recommendation for Kakao Mini (smart speaker), Melon and Kakao Music.
- Online to offline: Kakao Hairshop style recommendation.
System overview.
Personalized Push Notification with User History (Daum, Kakao Page)
Deployed in 2017
- The mobile push service (or alert system) is widely-used in mobile applications to attain a high user retention rate. However, a freqeunt push notification makes a user feel fatigue, resulting on the application removal. Usually, the push notification system is a rule-based system, and managed by human labor. In this project, we researched and developed a personalized push notification system based on user activity and interests. The system has been applied to Daum an Kakao Page mobile applications. More details are in our paper.
Large-Scale Item Categorization in e-Commerce (Daum Shopping)
Deployed in 2017
- An accurate categorization helps users to search desired items in e-Commerce based on the category, e.g., clothes / shoes / sneakers. However, the categorization is usually performed based on rule-based systems or human labor, which leads to low coverage of categorized items. Even the automatic item categorization is difficult due to its web-scale data size, the highly unbalanced annotation distribution, and noisy labels. I developed a large-scale item categorization system for Daum Shopping based on a deep network, from the operation tool to the categorization API.

Internship

Research internship (Naver Labs)
Aug. 2015 - Dec. 2015
- During the internship, I implemented batch normalization (BN) to AlexNet, Inception v2 and VGG on ImageNet using Caffe. I also researched batch normalization for sequential models, e.g., RNN using Lua Torch.
Software engineer (IUM-SOCIUS)
Jun. 2012 - Jan. 2013
- I worked as web developer at IUM-SOCIUS. During the internship, I developed and maintained internal batch services (JAVA spring batch), internal statistics service (Python Flask, MongoDB), internal admin tools (Python Django, MySQL), and main service systems (JAVA spring, Ruby on Rails, MariaDB).

Education and Career

Ph.D. (2025.09 - Now), Computer Science, Princeton University
M.S. (2014.03 - 2016.02), School of Electrical Engineering, KAIST
B.S. (2009.03 - 2014.02), School of Electrical Engineering and School of Management Science (double major), KAIST

Lead research scientist at NAVER AI Lab (Feb. 2018 - July. 2025)

Leader of ML Research @NAVER AI Lab (Feb. 2022 - July. 2025)
Tech leader at NAVER AI Lab (Oct. 2020 - Jan. 2022)
Visiting researcher at Naver Labs Europe (Oct. 2019 - Dec. 2019)
Research scientist at NAVER CLOVA (Feb. 2018 - Sep. 2020)

Research engineer at advanced recommendation team (ART) in Kakao (Feb. 2016 - Feb. 2018)
Research internship at Naver Labs (Aug. 2015 - Dec. 2015)
Master's degree in Electrical Engineering from KAIST (advisor: Jinwoo Shin) (Mar. 2014 - Feb. 2016)
Undergraduate researcher at Algorithmic Intelligence Lab in KAIST (Fall 2013)
Undergraduate researcher at Networked and Distributed Computing System Lab in KAIST (Summer 2013)

Participated as an undergraduate researcher for FloSIS: A Highly Scalable Network Flow Capture System for Fast Retrieval and Storage Efficiency (presented at USENIC ATC 2015) (I developed the index system described in Section 4.)

Software engineering internship at IUM-SOCIUS (Jun. 2012 - Jan. 2013)