Flash-Muon: An Efficient Implementation of Muon Optimizer

Python 212 13 Updated Jun 15, 2025

A Quirky Assortment of CuTe Kernels

Python 677 61 Updated Nov 21, 2025

Frontier Models playing the board game Diplomacy.

Python 604 86 Updated Nov 18, 2025

Community-contributed instructions, prompts, and configurations to help you make the most of GitHub Copilot.

JavaScript 13,190 1,554 Updated Nov 30, 2025

Simple, Elegant, Typed Argument Parsing with argparse

Python 515 58 Updated Jun 3, 2025

Typer, build great CLIs. Easy to code. Based on Python type hints.

Python 18,389 813 Updated Nov 28, 2025

A fusion of a linear layer and a cross-entropy loss, written for PyTorch in Triton.

Python 73 6 Updated Aug 2, 2024

A collection of memory efficient attention operators implemented in the Triton language.

Python 283 19 Updated Jun 5, 2024

Supercharge Your Model Training

Python 5,442 458 Updated Nov 12, 2025

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 775 162 Updated Dec 1, 2025

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 584 31 Updated Aug 12, 2025

A collection of GPT system prompts and various prompt injection/leaking knowledge.

HTML 9,905 1,387 Updated Nov 27, 2025

PyTorch native post-training library

Python 5,604 683 Updated Nov 24, 2025

Per directory history for Bash

Shell 22 Updated Oct 4, 2025

A monitor of resources

C++ 28,771 864 Updated Nov 26, 2025

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Python 880 80 Updated Nov 4, 2025

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,778 261 Updated May 17, 2025

An experiment in using Tangent to autodiff Triton

Python 80 2 Updated Jan 22, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,890 213 Updated Mar 8, 2024

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 6,162 566 Updated Aug 22, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,060 2,658 Updated Nov 3, 2025

Machine Learning Engineering Open Book

Python 15,881 978 Updated Nov 21, 2025

The Art of Debugging

Python 1,153 57 Updated Nov 20, 2025

Command-line sampling profiler for macOS, Linux, and Windows

Rust 3,590 80 Updated Nov 24, 2025

https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,291 87 Updated Mar 27, 2025

🛋 The AI and Generative Art platform for everyone

TypeScript 792 57 Updated Jul 16, 2025

Multipack distributed sampler for fast padding-free training of LLMs

Python 202 16 Updated Aug 10, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 7,788 795 Updated Nov 26, 2025

A guidance language for controlling large language models.

Jupyter Notebook 20,967 1,125 Updated Nov 20, 2025