(ICML 2024) TrustLLM: Trustworthiness in Large Language Models
llm, ai, nlp, large-language-models, natural-language-processing, benchmark, evaluation, toolkit, pypi-package, dataset, trustworthy-ai, trustworthy-machine-learning - Python
02 Updated 5 months ago
Efficient Retrieval Augmentation and Generation Framework
llm, transformers, nlp, question-answering, generative-ai, information-retrieval, semantic-search, diffusion, summarization, benchmark, colbert, knowledge-graph, multi-modal, sentence-transformers - Python
01 Updated 7 months ago
Rule-based token and sentence segmentation for the Russian language
- Python
01 Updated 7 months ago
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
python, hacktoberfest, nlp, deep-learning, question-answering, milvus, image-recognition, image-search, image-classification, audio-search, benchmark-testing, unstructured-data - Jupyter Notebook
01 Updated 6 months ago
Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
- Jupyter Notebook
01 Updated 6 months ago
Superfast AI decision making and intelligent processing of multi-modal data.
- Python
01 Updated 5 months ago
Lightweight demos for finetuning of instruct LLMs. Powered by transformers/accelerate and open-source datasets.
- Jupyter Notebook
01 Updated 7 months ago
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
- Rust
01 Updated 8 months ago
12 Weeks, 24 Lessons, AI for All!
- Jupyter Notebook
01 Updated 7 months ago
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
python, ai, machine-learning, transformers, language-model, nlp, pytorch, bert, chatgpt, generative-ai, gpt-3, large-language-models, information-retrieval, question-answering, semantic-search, squad, summarization - Python
01 Updated 8 months ago
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
llm, machine-learning, nlp, natural-language-processing, deep-learning, information-retrieval, langchain, donut, ml, ocr, pdf, pdf-to-json, pdf-to-text, preprocessing, data-pipelines, document-image-analysis, document-image-processing, document-parser, document-parsing, docx - HTML
01 Updated 8 months ago
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
python, hacktoberfest, machine-learning, pytorch, language-model, natural-language-processing, nlp, nlp-library, pretrained-models, pytorch-transformers, seq2seq, speech-recognition, tensorflow, transformer, bert, deep-learning, flax, jax, language-models, model-hub - Python
01 Updated 8 months ago
A curated list of resources dedicated to open source GitHub repositories related to ChatGPT
- Markdown
01 Updated 6 months ago
Language modeling and instruction tuning for Russian
- Jupyter Notebook
01 Updated 7 months ago
NLTK Source
- Python
01 Updated 7 months ago
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
hacktoberfest, machine-learning, pytorch, natural-language-processing, nlp, computer-vision, datasets, pandas, deep-learning, tensorflow, numpy, speech - Python
01 Updated 8 months ago
Must-read papers on prompt-based tuning for pre-trained language models.
01 Updated 6 months ago
An Open-Source Package for Textual Adversarial Attack.
- Python
01 Updated 6 months ago
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
- Python
00 Updated 6 months ago
Must-read Papers on Textual Adversarial Attack and Defense
- Python
00 Updated 6 months ago
On Transferability of Prompt Tuning for Natural Language Processing
nlp, pytorch, prompt, parameter-efficient-learning, pretrained-models, pretrained-language-model, parameter-efficient-tuning, pretrained-language-models, prompt-tuning, transfer-learning - Python
00 Updated 6 months ago
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
- Python
00 Updated 6 months ago
An open-source online reverse dictionary.
- JavaScript
00 Updated 6 months ago
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
python, llm, machine-learning, nlp, large-language-models, transformers, language-model, rag, embeddings, vector-database, semantic-search, retrieval-augmented-generation, information-retrieval, vector-search, neural-search, search-engine, search, vector-search-engine, sentence-embeddings, txtai - Python
00 Updated 5 months ago
Solves basic Russian NLP tasks, API for lower level Natasha projects
- Python
00 Updated 4 months ago
Rule-based facts extraction for Russian language
- Python
00 Updated 4 months ago
Deep Learning based NLP modeling for Russian language
- Python
00 Updated 4 months ago
Rule-based token and sentence segmentation for the Russian language
- Python
00 Updated 4 months ago
Comparing quality and performance of NLP systems for Russian language
- Python
00 Updated 4 months ago
Large silver-standard Russian corpus with NER, morphology and syntax markup
- Python
00 Updated 4 months ago