MLSys Class LLM Introduction
MLSys Class LLM Introduction
MLSys Class LLM Introduction
Language Models
Eve Fleisig & Kayo Yin
CS 294-162
August 28, 2023
Language Modeling
Unsupervised objective
Supervised objective
Prefixes & Prompting
Few- & Zero-Shot Learning
Few- & Zero-Shot Learning
Few- & Zero-Shot Learning
Few- & Zero-Shot Learning
Scaling
Data Compute
Scaling Data
Common Crawl dataset: introduced with T5; still in use
GPT-3 Training Data:
Scaling Data & Compute