Overview

Nano GPT is a simplified, educational implementation of the GPT (Generative Pre-trained Transformer) architecture, designed to help learners understand the core components and functioning of large language models.

Role in this knowledge base

Serves as a practical project within Andrew Ng’s ‘Zero to Hero’ series to teach users how to build language models.

Key facts

  • Built using Shakespeare’s writing as a dataset in Andrew Ng’s series.

Sources