richardliaw

Follow

Richard Liaw richardliaw

Follow

@anyscale - feel free to reach out

231 followers · 72 following

Achievements

Achievements

Highlights

Pro

Stars

eakmanrq / sqlframe

Turning PySpark Into a Universal DataFrame API

Python 281 8 Updated Sep 21, 2024

pytorch-labs / gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,531 502 Updated Sep 20, 2024

tysam-code / hlb-gpt

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to large…

Python 267 24 Updated Jul 29, 2024

Lightning-AI / lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Python 1,141 73 Updated Sep 21, 2024

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 5,209 369 Updated Sep 22, 2024

cipher982 / llm-benchmarks

Benchmarking LLM Inference Speeds

Python 12 Updated Sep 1, 2024

ray-project / llmperf-leaderboard

418 13 Updated Jan 10, 2024

Tanuki / tanuki.py

Prompt engineering for developers

Python 668 23 Updated Feb 13, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,234 913 Updated Sep 18, 2024

ray-project / llmperf

LLMPerf is a library for validating and benchmarking LLMs

Python 580 93 Updated Aug 21, 2024

KernelTuner / kernel_tuner

Kernel Tuner

Python 276 48 Updated Sep 19, 2024

openxla / xla

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,585 403 Updated Sep 22, 2024

ray-project / llm-applications

A comprehensive guide to building RAG-based LLM applications for production.

Jupyter Notebook 1,676 219 Updated Aug 2, 2024

ELS-RD / kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,524 93 Updated Feb 16, 2024

anyscale / Made-With-ML

Jupyter Notebook 27 Updated Jul 26, 2023

ray-project / llm-numbers

Numbers every LLM developer should know

4,053 137 Updated Jan 16, 2024

bytedance / lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,171 328 Updated May 16, 2023

fauxpilot / fauxpilot

FauxPilot - an open-source alternative to GitHub Copilot server

Python 14,546 617 Updated Apr 9, 2024

sigopt / sigopt-server

Open Source version of SigOpt API, performing hyperparameter optimization and visualization

Python 37 5 Updated Sep 20, 2024

Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 8,538 699 Updated Sep 20, 2024

sholtodouglas / scalingExperiments

Jupyter Notebook 56 2 Updated Mar 4, 2022

mlcommons / training

Reference implementations of MLPerf™ training benchmarks

Python 1,599 554 Updated Aug 14, 2024

PiotrNawrot / nanoT5

Fast & Simple repository for pre-training and fine-tuning T5-style models

Python 957 70 Updated Aug 21, 2024

sematic-ai / sematic

An open-source ML pipeline development platform

Python 969 58 Updated Sep 10, 2024

BaguaSys / bagua

Bagua Speeds up PyTorch

Python 872 83 Updated Aug 1, 2024

facebookresearch / fairring

Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large scales

C++ 61 3 Updated Mar 21, 2022

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,908 1,546 Updated Sep 20, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,490 1,208 Updated Aug 21, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 92,561 14,819 Updated Sep 22, 2024

tunib-ai / parallelformers

Parallelformers: An Efficient Model Parallelization Toolkit for Deployment

Python 776 61 Updated Apr 24, 2023