Hello, I'm Sebastian Raschka, PhD
I am an LLM Research Engineer with over a decade of experience in artificial intelligence. My work bridges academia and industry, including roles as senior engineer at Lightning AI and as a statistics professor at the University of Wisconsin-Madison.
I am also the author of Build a Large Language Model (From Scratch).
My expertise lies in LLM research and the development of high-performance AI systems, with a deep focus on practical, code-driven implementations. (For my most up-to-date CV details, please visit my LinkedIn profile.)
Recent Notes and Blog Entries
Recommendations for Getting the Most Out of a Technical Book
Nov 12, 2025
This short article compiles a few notes I previously shared when readers ask how to get the most out of my building large language model from scratch books. I follow a similar approach when I read ...
Beyond Standard LLMs
Nov 4, 2025
After I shared my Big LLM Architecture Comparison a few months ago, which focused on the main transformer-based LLMs, I received a lot of questions with respect to what I think about alternative ap...
DGX Spark and Mac Mini for Local PyTorch Development
Oct 29, 2025
The DGX Spark for local LLM inferencing and fine-tuning was a pretty popular discussion topic recently. I got to play with one myself, primarily working with and on LLMs in PyTorch, and collected s...
Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)
Oct 5, 2025
Multiple-Choice Benchmarks, Verifiers, Leaderboards, and LLM Judges with Code Examples
Understanding and Implementing Qwen3 From Scratch
Sep 6, 2025
Previously, I compared the most notable open-weight architectures of 2025 in The Big LLM Architecture Comparison. Then, I zoomed in and discussed the various architecture components in From GPT-2 t...