Stars
Turning PySpark Into a Universal DataFrame API
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to large…
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
SGLang is a fast serving framework for large language models and vision language models.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
LLMPerf is a library for validating and benchmarking LLMs
A machine learning compiler for GPUs, CPUs, and ML accelerators
A comprehensive guide to building RAG-based LLM applications for production.
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
LightSeq: A High Performance Library for Sequence Processing and Generation
FauxPilot - an open-source alternative to GitHub Copilot server
Open Source version of SigOpt API, performing hyperparameter optimization and visualization
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Reference implementations of MLPerf™ training benchmarks
Fast & Simple repository for pre-training and fine-tuning T5-style models
An open-source ML pipeline development platform
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large scales
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
🦜🔗 Build context-aware reasoning applications
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment