ibrahimmkhalid/llm-from-scratch

title: LLM From Scratch
emoji: 🧠
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.30.0
app_file: app.py
pinned: false

LLM From Scratch

Ibrahim Khalid

The hosted project is available on HuggingFace

The purpose of this project is to build a simple large language model from scratch.

This repo follows the guide at https://www.youtube.com/watch?v=UU1WVnMk4E8

In this repo:

  • ./shakespeare.txt - Sample text used for training a smaller-scale model
  • ./bigram_testing.sync.ipynb - Notebook where I test a basic bigram model
  • ./gpt_shakespeare.sync.ipynb - Notebook implementing a simple GPT model trained on the complete works of Shakespeare
  • ./gpt_openwebtext.sync.ipynb - Notebook implementing a GPT model trained on the OpenWebText corpus
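
For readers new to the idea behind the bigram notebook, here is a minimal sketch of a character-level bigram model. It is not the code from the notebooks, which presumably use PyTorch given the requirements files. This version uses only the standard library and plain counts instead of a learned embedding table, and the names `train_bigram` and `generate` are made up for illustration.

```python
import random
from collections import Counter, defaultdict


def train_bigram(text):
    """Count how often each character follows each other character."""
    counts = defaultdict(Counter)
    for current, following in zip(text, text[1:]):
        counts[current][following] += 1
    return counts


def generate(counts, start, length, seed=0):
    """Sample up to `length` further characters, one at a time, from the counts."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    out = [start]
    for _ in range(length):
        followers = counts.get(out[-1])
        if not followers:  # the last character was never followed by anything
            break
        chars = list(followers)
        weights = [followers[c] for c in chars]
        # Pick the next character in proportion to how often it was seen.
        out.append(rng.choices(chars, weights=weights, k=1)[0])
    return "".join(out)


if __name__ == "__main__":
    sample = "to be, or not to be, that is the question"
    model = train_bigram(sample)
    print(generate(model, start="t", length=40))
```

Each next character depends only on the one before it, which is why the output looks locally plausible but loses coherence quickly. A GPT-style model replaces that single-character context with attention over a longer window.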

Prepare environment

pip install -r ./requirements-base.txt  
pip install -r ./requirements-pytorch.txt

On GitHub

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

