Overview
Nano GPT is a simplified, educational implementation of the GPT (Generative Pre-trained Transformer) architecture, designed to help learners understand the core components and functioning of large language models.
Role in this knowledge base
Serves as a practical project within Andrej Karpathy’s ‘Neural Networks: Zero to Hero’ series, which teaches users how to build language models from scratch.
Key facts
- Trained on a dataset of Shakespeare’s writing in Andrej Karpathy’s series.
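
To make the Shakespeare dataset concrete, here is a minimal sketch of how raw text might be turned into next-token training examples. It assumes character-level tokenization (one token per character), and the sample string, the helper names (`encode`, `decode`, `make_examples`), and the `block_size` value are illustrative choices rather than details taken from this note.

```python
# Hypothetical sample; the real dataset is a much larger Shakespeare text file.
text = "First Citizen:\nBefore we proceed any further, hear me speak."

# Vocabulary: every distinct character in the text, in sorted order.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}  # character -> integer id
itos = {i: ch for ch, i in stoi.items()}      # integer id -> character


def encode(s):
    """Map a string to a list of integer token ids."""
    return [stoi[c] for c in s]


def decode(ids):
    """Map a list of integer token ids back to a string."""
    return "".join(itos[i] for i in ids)


def make_examples(ids, block_size):
    """Pair each context window with the same window shifted by one token."""
    return [
        (ids[i:i + block_size], ids[i + 1:i + block_size + 1])
        for i in range(len(ids) - block_size)
    ]


data = encode(text)
examples = make_examples(data, block_size=8)
x, y = examples[0]
print(repr(decode(x)), "->", repr(decode(y)))
```

Each target sequence is the input shifted one position to the right, so a model trained on these pairs learns to predict the next character at every position in the context window.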