Skip to content
View richardliaw's full-sized avatar

Highlights

  • Pro

Block or report richardliaw

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Turning PySpark Into a Universal DataFrame API

Python 281 8 Updated Sep 21, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,531 502 Updated Sep 20, 2024

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to large…

Python 267 24 Updated Jul 29, 2024

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Python 1,141 73 Updated Sep 21, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,209 369 Updated Sep 22, 2024

Benchmarking LLM Inference Speeds

Python 12 Updated Sep 1, 2024

Prompt engineering for developers

Python 668 23 Updated Feb 13, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,234 913 Updated Sep 18, 2024

LLMPerf is a library for validating and benchmarking LLMs

Python 580 93 Updated Aug 21, 2024

Kernel Tuner

Python 276 48 Updated Sep 19, 2024

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,585 403 Updated Sep 22, 2024

A comprehensive guide to building RAG-based LLM applications for production.

Jupyter Notebook 1,676 219 Updated Aug 2, 2024

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,524 93 Updated Feb 16, 2024
Jupyter Notebook 27 Updated Jul 26, 2023

Numbers every LLM developer should know

4,053 137 Updated Jan 16, 2024

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,171 328 Updated May 16, 2023

FauxPilot - an open-source alternative to GitHub Copilot server

Python 14,546 617 Updated Apr 9, 2024

Open Source version of SigOpt API, performing hyperparameter optimization and visualization

Python 37 5 Updated Sep 20, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 8,538 699 Updated Sep 20, 2024
Jupyter Notebook 56 2 Updated Mar 4, 2022

Reference implementations of MLPerf™ training benchmarks

Python 1,599 554 Updated Aug 14, 2024

Fast & Simple repository for pre-training and fine-tuning T5-style models

Python 957 70 Updated Aug 21, 2024

An open-source ML pipeline development platform

Python 969 58 Updated Sep 10, 2024

Bagua Speeds up PyTorch

Python 872 83 Updated Aug 1, 2024

Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large scales

C++ 61 3 Updated Mar 21, 2022

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,908 1,546 Updated Sep 20, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,490 1,208 Updated Aug 21, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 92,561 14,819 Updated Sep 22, 2024

Parallelformers: An Efficient Model Parallelization Toolkit for Deployment

Python 776 61 Updated Apr 24, 2023
Next