Eric Mitchell
Updated bibtex entry for NeurIPS.
Reference implementation for DPO (Direct Preference Optimization)
Python
main