LLM-FineTuning-Large-Language-Models

About this script

A Python script that quantizes GPT-style models with GPTQ, using the Hugging Face 'transformers' library.

Usage

Install all dependencies

pip install -r requirements.txt

Run the script


python Quantize_with_HF_transformers.py --model_id 'mistralai/Mistral-7B-v0.1' --bits 4 --dataset 'wikitext2' --group_size 32 --device_map 'auto'
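The script's source is not reproduced in this README, so the following is only a minimal sketch of how the flags in the command above could be parsed with `argparse`. The function name `build_parser` and every default except `device_map` are assumptions made for illustration; the real script may differ.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags used in the README command."""
    parser = argparse.ArgumentParser(description="Quantize a causal LM with GPTQ")
    parser.add_argument("--model_id", required=True,
                        help="Model id on the Hugging Face Hub or a local path")
    # Supported bit widths are taken from the Parameters section of this README.
    parser.add_argument("--bits", type=int, default=4, choices=[2, 3, 4, 8])
    # Assumed default; the README only lists the accepted dataset names.
    parser.add_argument("--dataset", default="wikitext2")
    # 128 is the value the README recommends; -1 means per-column quantization.
    parser.add_argument("--group_size", type=int, default=128)
    # "auto" is the default stated in the README.
    parser.add_argument("--device_map", default="auto")
    return parser


# Parse the same arguments as the example command, without touching sys.argv.
example_args = build_parser().parse_args([
    "--model_id", "mistralai/Mistral-7B-v0.1",
    "--bits", "4",
    "--dataset", "wikitext2",
    "--group_size", "32",
    "--device_map", "auto",
])
print(example_args)
```

Restricting `--bits` with `choices` makes an unsupported value fail at the command line rather than partway through model loading.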

Features

GPTQ quantization at several precisions, most commonly 8-bit and 4-bit (see bits below for all supported values).

Parameters:

model_id: The model ID on the Hugging Face Hub, or a path to a local directory.
bits: The number of bits to quantize to. Supported values are 2, 3, 4, and 8.
dataset: The calibration dataset used for quantization. You can provide your own dataset as a list of strings, or use one of the datasets from the GPTQ paper: 'wikitext2', 'c4', 'c4-new', 'ptb', 'ptb-new'.
group_size: The group size to use for quantization. The recommended value is 128; -1 uses per-column quantization.
device_map: Device mapping used when loading the model, for example 'auto', 'cpu', or 'cuda:0'. Defaults to 'auto'.
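To give a feel for what bits and group_size control, here is a small pure-Python illustration. It uses plain round-to-nearest quantization on a flat list of weights, which is a deliberate simplification and not the GPTQ algorithm itself (GPTQ additionally compensates for rounding error using calibration data). The function names are hypothetical.

```python
def quantize_groupwise(weights, bits=4, group_size=4):
    """Round-to-nearest quantization with one (scale, offset) pair per group.

    group_size=-1 treats the whole list as a single group, analogous to
    per-column quantization.
    """
    if group_size == -1:
        group_size = len(weights)
    qmax = 2 ** bits - 1  # largest integer code, e.g. 15 for 4 bits
    codes, params = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        lo, hi = min(group), max(group)
        # Fall back to 1.0 when all values in the group are identical.
        scale = (hi - lo) / qmax or 1.0
        codes.extend(round((w - lo) / scale) for w in group)
        params.append((scale, lo))
    return codes, params, group_size


def dequantize_groupwise(codes, params, group_size):
    """Map integer codes back to approximate float weights."""
    restored = []
    for i, code in enumerate(codes):
        scale, lo = params[i // group_size]
        restored.append(lo + code * scale)
    return restored


weights = [0.0, 0.1, 0.2, 0.3, 10.0, 20.0, 30.0, 40.0]
for gs in (4, -1):
    codes, params, used = quantize_groupwise(weights, bits=4, group_size=gs)
    restored = dequantize_groupwise(codes, params, used)
    worst = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"group_size={gs}: {len(params)} group(s), max error {worst:.4f}")
```

Smaller groups track the local range of the weights more closely, at the cost of storing more scale and offset values. This is the trade-off behind choosing a value such as 32 or 128.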

For all GPTQConfig parameters, see the official documentation:

https://huggingface.co/docs/transformers/main_classes/quantization#transformers.GPTQConfig
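As a rough guide to how the parameters above reach `GPTQConfig`, the sketch below shows one way to quantize and save a model with `transformers`. It is an assumption about the general approach, not a copy of this repository's script. The helper names `gptq_kwargs` and `quantize_and_save` and the `output_dir` argument are hypothetical. Running the quantization step needs a GPU and the GPTQ backend packages (for example `optimum` together with `auto-gptq`) in addition to `transformers`.

```python
SUPPORTED_BITS = (2, 3, 4, 8)  # values listed in the Parameters section


def gptq_kwargs(bits, dataset, group_size):
    """Validate the README parameters and collect them for GPTQConfig."""
    if bits not in SUPPORTED_BITS:
        raise ValueError(f"bits must be one of {SUPPORTED_BITS}, got {bits}")
    if group_size != -1 and group_size <= 0:
        raise ValueError("group_size must be positive, or -1 for per-column")
    return {"bits": bits, "dataset": dataset, "group_size": group_size}


def quantize_and_save(model_id, output_dir, bits=4, dataset="wikitext2",
                      group_size=128, device_map="auto"):
    """Quantize a causal LM with GPTQ and write it to output_dir."""
    # Imported lazily so that the helper above works without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # The tokenizer is needed to tokenize the calibration dataset.
    config = GPTQConfig(tokenizer=tokenizer,
                        **gptq_kwargs(bits, dataset, group_size))
    # Passing a GPTQConfig here triggers calibration and quantization on load.
    model = AutoModelForCausalLM.from_pretrained(
        model_id, quantization_config=config, device_map=device_map)
    model.save_pretrained(output_dir)
    tokenizer.save_pretrained(output_dir)
    return model


print(gptq_kwargs(4, "wikitext2", 32))
```

A call such as `quantize_and_save("mistralai/Mistral-7B-v0.1", "mistral-7b-gptq-4bit", group_size=32)` would correspond to the example command in the Usage section; the output directory name is only a placeholder.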

License

Apache License 2.0
