Semantic-aware Wasserstein Policy Regularization for Large Language Model Alignment
- Category
- Machine Learning
- Conference Name
- International Conference on Learning Representations (ICLR 2026)
- Presentation Date
- April 23-27, 2026
- City
- Rio de Janeiro
- Country
- Brazil
- File
- [ICLR26] Wasserstein_Policy_Regularization-camera_ready-ver-2.pdf (1.4M), 4 downloads, DATE: 2026-02-02 14:38:46
Byeonghu Na, Hyungho Na, Yeongmin Kim, Suhyeon Jo, HeeSun Bae, Mina Kang, and Il-Chul Moon, Semantic-aware Wasserstein Policy Regularization for Large Language Model Alignment, The Fourteenth International Conference on Learning Representations (ICLR 2026), Rio de Janeiro, Brazil, April 23-27, 2026.