Сортировать по
Язык: Все
Топик: rlhf
Unify Efficient Fine-tuning of 100+ LLMs
llmgptlanguage-modelagentbaichuanchatglmfine-tuninggenerative-aiinstruction-tuninglarge-language-modelsllamaloramistralmixture-of-expertspeftqloraquantizationqwenrlhftransformers- Python
01Обновлено 8 месяцев назад
Robust recipes to align language models with human and AI preferences
- Python
01Обновлено 8 месяцев назад
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
- Jupyter Notebook
01Обновлено 7 месяцев назад