Sebastian Raschka

Recommendations for Getting the Most Out of a Technical Book

Nov 12, 2025

This short article compiles a few notes I previously shared when readers asked how to get the most out of my books on building large language models from scratch. I follow a similar approach when I read ...

Beyond Standard LLMs

Nov 4, 2025

After I shared my Big LLM Architecture Comparison a few months ago, which focused on the main transformer-based LLMs, I received a lot of questions about what I think of alternative ap...

DGX Spark and Mac Mini for Local PyTorch Development

Oct 29, 2025

The DGX Spark for local LLM inference and fine-tuning was a popular discussion topic recently. I got to play with one myself, primarily working with and on LLMs in PyTorch, and collected s...

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)

Oct 5, 2025

Multiple-Choice Benchmarks, Verifiers, Leaderboards, and LLM Judges with Code Examples

Understanding and Implementing Qwen3 From Scratch

Sep 6, 2025

Previously, I compared the most notable open-weight architectures of 2025 in The Big LLM Architecture Comparison. Then, I zoomed in and discussed the various architecture components in From GPT-2 t...

Hello, I'm Sebastian Raschka, PhD
