Andrej Karpathy Releases nanochat, a Minimal ChatGPT Clone – Analytics India Magazine


OpenAI co-founder and Eureka Labs founder Andrej Karpathy has released nanochat, an open-source project that provides a full-stack training and inference pipeline for a simple ChatGPT-style model. The repository follows his earlier project, nanoGPT, which covered only pretraining.
In a post on X, Karpathy said, “You boot up a cloud GPU box, run a single script and in as little as 4 hours later you can talk to your own LLM in a ChatGPT-like web UI.”
The repo consists of about 8,000 lines of code and covers the entire pipeline. It includes tokeniser training in Rust and pretraining a Transformer LLM on FineWeb. The pipeline also handles mid-training on user-assistant conversations and multiple-choice questions, supervised fine-tuning (SFT), and optional reinforcement learning (RL) with GRPO. Finally, it supports efficient inference with KV caching.
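To illustrate the last step, KV caching, here is a minimal single-head sketch in plain Python. It is not taken from the nanochat code: the `project` helper is a hypothetical stand-in for learned projection matrices, and the token vectors are made up. It only shows the general idea that keys and values for earlier tokens can be stored once and reused, rather than recomputed at every decoding step.

```python
import math

def attend(query, keys, values):
    """Single-head scaled dot-product attention for one query vector."""
    scale = 1.0 / math.sqrt(len(query))
    scores = [scale * sum(q * k for q, k in zip(query, key)) for key in keys]
    peak = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) / total
            for i in range(dim)]

def project(x, scale):
    """Hypothetical stand-in for a learned linear projection."""
    return [scale * xi for xi in x]

class KVCache:
    """Stores the keys and values of every token processed so far."""
    def __init__(self):
        self.keys = []
        self.values = []

    def step(self, x):
        # Only the new token's key and value are computed at each step.
        self.keys.append(project(x, 0.5))
        self.values.append(project(x, 2.0))
        return attend(project(x, 1.0), self.keys, self.values)

def decode_without_cache(tokens):
    """Recomputes keys and values for the whole prefix at every step."""
    outputs = []
    for t in range(len(tokens)):
        prefix = tokens[: t + 1]
        keys = [project(x, 0.5) for x in prefix]
        values = [project(x, 2.0) for x in prefix]
        outputs.append(attend(project(tokens[t], 1.0), keys, values))
    return outputs

def decode_with_cache(tokens):
    """Produces the same outputs while projecting each token only once."""
    cache = KVCache()
    return [cache.step(x) for x in tokens]

# Made-up token embeddings, for illustration only.
TOKENS = [[1.0, 0.0], [0.5, -1.0], [0.25, 2.0]]
```

Both functions return the same attention outputs. The cached version projects each token once, while the uncached version repeats that work for the entire prefix at every step.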
Users can interact with the model through a command-line interface or a web UI, and the system generates a markdown report summarising performance.
Karpathy explained that the models can be trained at different scales depending on time and cost. A small ChatGPT clone can be trained for around $100 in roughly 4 hours on an 8×H100 GPU node, which is enough for basic conversation.
Training for about 12 hours enables the model to surpass GPT-2 on the CORE metric. Scaling up to approximately $1,000, or around 42 hours of training, produces a model that is more coherent and capable of solving simple math and coding problems, as well as answering multiple-choice questions.
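As a rough sanity check on these figures, the sketch below derives the hourly node price implied by the $100 and 4-hour numbers, then applies it to the longer runs. It assumes the price per hour stays constant across tiers, which the article does not state. The function names are illustrative.

```python
NODE_GPUS = 8          # 8×H100 node, as quoted
BASE_COST_USD = 100.0  # quoted cost of the smallest run
BASE_HOURS = 4.0       # quoted duration of the smallest run

def implied_node_rate(cost_usd, hours):
    """Hourly price of the whole node implied by a quoted cost and duration."""
    return cost_usd / hours

def estimated_cost(hours, rate_per_hour):
    """Cost of a run, assuming the same hourly rate applies throughout."""
    return hours * rate_per_hour

rate = implied_node_rate(BASE_COST_USD, BASE_HOURS)  # 25.0 USD per node-hour
per_gpu = rate / NODE_GPUS                           # 3.125 USD per GPU-hour

for hours in (4, 12, 42):
    print(f"{hours:>2} h -> ~${estimated_cost(hours, rate):,.0f}")
```

Under that assumption the 42-hour run comes to about $1,050, in line with the "approximately $1,000" quoted, and the 12-hour run would cost about $300.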
“My goal is to get the full ‘strong baseline’ stack into one cohesive, minimal, readable, hackable, maximally forkable repo. nanochat will be the capstone project of LLM101n (which is still being developed),” Karpathy said. LLM101n is an undergraduate-level class at Eureka Labs that will guide students through the process of building their own AI model. Karpathy also added that the project could grow into a research harness or benchmark, similar to nanoGPT.
