Stars
Flash-Muon: An Efficient Implementation of Muon Optimizer
Frontier Models playing the board game Diplomacy.
Community-contributed instructions, prompts, and configurations to help you make the most of GitHub Copilot.
Simple, Elegant, Typed Argument Parsing with argparse
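A minimal sketch of the dataclass-to-argparse pattern this library implements, written here with only the standard library (the `TrainConfig` fields are hypothetical, not the library's API):

```python
import argparse
from dataclasses import dataclass, fields

@dataclass
class TrainConfig:
    lr: float = 3e-4   # learning rate (placeholder field)
    epochs: int = 10   # number of training passes (placeholder field)

# Build an argparse parser from the dataclass's typed fields.
parser = argparse.ArgumentParser()
for f in fields(TrainConfig):
    parser.add_argument(f"--{f.name}", type=f.type, default=f.default)

config = TrainConfig(**vars(parser.parse_args()))
print(config)  # e.g. TrainConfig(lr=0.0003, epochs=10)
```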
Typer, build great CLIs. Easy to code. Based on Python type hints.
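For reference, the canonical Typer hello-world: a typed Python function becomes a CLI, with `--help`, argument parsing, and defaults derived from the signature:

```python
import typer

def main(name: str, count: int = 1) -> None:
    """Greet NAME, COUNT times."""
    for _ in range(count):
        typer.echo(f"Hello {name}")

if __name__ == "__main__":
    typer.run(main)  # e.g. python hello.py Alice --count 3
```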
A fusion of a linear layer and a cross-entropy loss, written for PyTorch in Triton.
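The point of the fusion is memory: the final-layer logits of an LLM have shape `[N, vocab]` and can dwarf everything else. A plain-PyTorch reference showing the saving the Triton kernel makes automatic (function name and chunk size are illustrative, not the repo's API):

```python
import torch
import torch.nn.functional as F

def chunked_linear_cross_entropy(x, weight, targets, chunk=1024):
    """Compute cross_entropy(x @ weight.T, targets) without ever
    materializing the full [N, vocab] logits tensor, by processing
    the N rows in chunks. x: [N, d], weight: [vocab, d], targets: [N]."""
    losses = []
    for i in range(0, x.shape[0], chunk):
        logits = x[i:i + chunk] @ weight.T  # only [chunk, vocab] at a time
        losses.append(F.cross_entropy(logits, targets[i:i + chunk],
                                      reduction="sum"))
    return torch.stack(losses).sum() / x.shape[0]
```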
A collection of memory efficient attention operators implemented in the Triton language.
FlagGems is an operator library for large language models implemented in the Triton Language.
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
A collection of GPT system prompts and various prompt injection/leaking knowledge.
Automatically create Faiss k-NN indices with optimal similarity search parameters.
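What this library automates is the choice of index family and parameters from the data size and memory budget. For contrast, a hand-rolled exact Faiss index (dimensions and data are placeholders):

```python
import faiss
import numpy as np

d = 128
xb = np.random.rand(10_000, d).astype("float32")  # database vectors
xq = np.random.rand(5, d).astype("float32")       # query vectors

# A plain exact inner-product index; the library would instead pick
# among Flat/IVF/HNSW variants and tune their parameters for you.
index = faiss.IndexFlatIP(d)
index.add(xb)
scores, ids = index.search(xq, 10)  # top-10 neighbours per query
```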
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
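A sketch of the high-level flow as shown in the project README (the documents, index name, and query here are placeholders; the method names are taken from the README and may change across versions):

```python
from ragatouille import RAGPretrainedModel

# Load a pretrained ColBERT checkpoint, index a small collection,
# then run late-interaction retrieval over it.
RAG = RAGPretrainedModel.from_pretrained("colbert-ir/colbertv2.0")
RAG.index(collection=["First document ...", "Second document ..."],
          index_name="demo")
results = RAG.search(query="what does the first document say?", k=3)
```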
An experiment in using Tangent to autodiff Triton.
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Machine Learning Engineering Open Book
Command-line sampling profiler for macOS, Linux, and Windows
An inference performance optimization framework for Hugging Face Diffusers on NVIDIA GPUs (https://wavespeed.ai/).
🛋 The AI and Generative Art platform for everyone
Multipack distributed sampler for fast padding-free training of LLMs
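The idea behind multipack is to batch by total token count rather than sequence count, so batches carry no padding. A sketch of one way to do the grouping (greedy first-fit over sequence lengths; illustrative, not the repo's exact algorithm):

```python
def pack_greedy(lengths, capacity):
    """Group sequence indices into 'packs' whose total token count
    fits the batch capacity, longest sequences first, so each pack
    can be concatenated and trained on without padding tokens."""
    packs, loads = [], []
    for i in sorted(range(len(lengths)), key=lambda j: -lengths[j]):
        for p, load in enumerate(loads):
            if load + lengths[i] <= capacity:
                packs[p].append(i)
                loads[p] += lengths[i]
                break
        else:  # no existing pack has room: open a new one
            packs.append([i])
            loads.append(lengths[i])
    return packs
```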
Accessible large language models via k-bit quantization for PyTorch.
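A minimal sketch of the drop-in 8-bit optimizer usage (the model and hyperparameters are placeholders; assumes a CUDA build of bitsandbytes):

```python
import torch
import bitsandbytes as bnb

model = torch.nn.Linear(1024, 1024).cuda()
# 8-bit Adam: optimizer states are stored quantized to 8 bits,
# cutting optimizer memory several-fold versus fp32 Adam.
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-3)

loss = model(torch.randn(8, 1024, device="cuda")).pow(2).mean()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```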
A guidance language for controlling large language models.