I could train a 1B-A200m model on an iPhone 17 Pro at ~650 tokens/sec. It will take 360 days on 20B tokens of training data and use 156KW of electricity which cost $51.
The phone will fry of course, so I wrote algorithms to run inference on your phone rather. We named it after a plant that survives in resource-constrained environments, Cactus.
can run similar model on your Grandma’s Pixel 6a at 36 tokens/second
while only draining 10% battery per hour of continuous inference.
I had a verbal offer from Nvidia, my dream company, but passed on to build this. Cactus now has 3.8k GitHub stars and have completed 6m inference tasks in production.
We raised funding from YCombinator, FCVC, Oxford University, 62 tech CTOs/VP Eng, our fellow YC batchmates, a couple DeepMind engineers and Transpose (Garry Tan's brother).
- 2025-XX: Cactus (YC S25) - Founder & CTO (tiny inference engine for phones and wearables).
- 2024-25: Deep Render - AI Research Engineer (realtime video models that run on phone GPU/NPU).
- 2021-24: Wisdm - ML Software Engineer (distributed perception AI for Maxar Defence satelite views).
- 2019-21: MSc + Open-source activities (JAX/NanoDl, Torch/SuperLazyAutograd, CUDARepo, etc.).
- 2018-19: Google GADS Scholarship Programme with Andela (pre-MSc), around systems design.
- 2017-18: National Youth service, posted to software engineering after bootcamp, mostly ARM.
- 2012-16: Started uni at 15y, covered EECS, data structures, algorithms, maths, physics.
- Wrote Math & CS For ML (with codes).
- Gave this lecture to a small ML group in Nigeria, on optimising large-scale ML in JAX.
- Co-host this monthly dinner for AI researchers, engineers and founders in London.
- Kevin Murphy (DeepMind Principal), Thomas Wolf (HuggingFace Co-foubder), Daniel Holtz (Mid Journey Founder), Steve Messina (IBM CTO) followed back on X.
- After CUDARepo, Nvidia reached out, I did 7 technical rounds, got a verbal offer, back-and-forth over YOE/pay, then I got YC.
- Did MSc at QMUL, just to work with Prof Matt Purver (Ex-Stanford Researcher on CALO), did my project/thesis with his team.
- Did BEng under Prof Onyema Uzoamaka (Rumoured first Nigerian CS grad from MIT), he taught computing archs off-head!






