COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences

Publication
The 14th International Conference on Learning Representations (ICLR)
Yang Cai
Professor
