[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
llama chatgpt llama2 chatbot gpt-4 instruction-tuning vision-language-model visual-language-learning foundation-models llama-2 llava multi-modality multimodal
- Python
Updated 7 months ago
The "Open Source Models with Hugging Face" course teaches you to use open-source models from the Hugging Face Hub for tasks in NLP, audio, image, and multimodal domains.
nlp nlp-machine-learning image audio-processing multimodal hugging-face gradio audio cloud-deployment cloud-deployment-model gradio-python-llm hugging-face-api hugging-face-hub hugging-face-instructor-embeddings hugging-face-transformers image-processing multimodal-deep-learning multimodal-learning open-source-models transformers-library
- Jupyter Notebook
Updated 7 months ago
Framework for processing and filtering datasets
- Python
Updated 4 months ago
OmniFusion — a multimodal model to communicate using text and images
- Python
Updated 2 months ago