Tensors and Dynamic neural networks in Python with strong GPU acceleration
- Python
01Обновлено 8 месяцев назад
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
machine-learningpytorchdeep-learninginferencemixture-of-expertsgputrillion-parameterszerobillion-parameterscompressiondata-parallelismmodel-parallelismpipeline-parallelism- Python
01Обновлено 7 месяцев назад
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.
- Python
01Обновлено 7 месяцев назад
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
- Python
01Обновлено 5 месяцев назад
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
machine-learningdeep-learningllm-servinggpudata-scienceml-platformdistributed-trainingcloud-computingcloud-managementmulticloudhyperparameter-tuningcost-managementcost-optimizationfinopsjob-queuejob-schedulerllm-trainingml-infrastructurespot-instancestpu- Python
00Обновлено 7 месяцев назад
Tensors and Dynamic neural networks in Python with strong GPU acceleration
- Python
00Обновлено 2 месяца назад