We implement a multilayer perceptron (MLP) character-level language model. In this video we also introduce many basics of machine learning (e.g. model training, learning rate tuning, hyperparameters, evaluation, train/dev/test splits, under/overfitting, etc.).
Links:
- makemore on github:
- jupyter notebook I built in this video:
- collab notebook (new)!!!:
- Bengio et al. 2003 MLP language model paper (pdf):
Useful links:
- PyTorch internals ref
Exercises:
- E01: Tune the hyperparameters of the training to beat my best validation loss of 2.2
- E02: I was not careful with the intialization of the network in this video. (1) What is the loss you’d get if the predicted probabilities at initialization were perfectly uniform? What loss do we achieve? (2) Can you tune the initialization to get a starting loss that is much more similar to (1)?
- E03: Read the Bengio et al 2003 paper (link above), implement and try any idea from the paper. Did it work?
Chapters:
00:00:00 intro
00:01:48 Bengio et al. 2003 (MLP language model) paper walkthrough
00:09:03 (re-)building our training dataset
00:12:19 implementing the embedding lookup table
00:18:35 implementing the hidden layer internals of : storage, views
00:29:15 implementing the output layer
00:29:53 implementing the negative log likelihood loss
00:32:17 summary of the full network
00:32:49 introducing and why
00:37:56 implementing the training loop, overfitting one batch
00:41:25 training on the full dataset, minibatches
00:45:40 finding a good initial learning rate
00:53:20 splitting up the dataset into train/val/test splits and why
01:00:49 experiment: larger hidden layer
01:05:27 visualizing the character embeddings
01:07:16 experiment: larger embedding size
01:11:46 summary of our final code, conclusion
01:13:24 sampling from the model
01:14:55 google collab (new!!) notebook advertisement
15 views
5598
1993
1 day ago 00:21:51 1
МЕДВЕЖИЙ ФАРШ • ЖОСКИЙ СМЕРТЕЛЬНЫЙ УРАГАННЫЙ 3D БИЛД ЯРОСТИ ОТЦА БОГА МАТЕРИ • THE WITCHER 3
2 weeks ago 00:16:00 1
Вихревой танец смерти или самый сильный билд на атаку в Ведьмак 3: Кровь и Вино в Новой Игре +
2 weeks ago 03:21:07 1
Doctor Kim Jong-Un Inspects! - Pirated TV from North Korea
2 weeks ago 00:07:17 1
Maserati Driver Shot After Police Pursuit and Gun Battle With LAPD Officers
3 weeks ago 00:08:24 1
Learn Chinese for Beginners | Nationalities in Chinese | Country Names in Chinese
3 weeks ago 00:13:37 1
Driver San Francisco - Oct. 2008 Prototype Comparison (City & Landmarks)
3 weeks ago 01:07:09 1
Grim Dawn ПАТЧ 1.2 - Билд Охотник на магов (Мистик + Инквизитор) Луч Альбрехта. Урон Хаосом. Кастер
3 weeks ago 00:19:31 1
Winter Survival Shelter - Sleeping Outside in -25° Weather
4 weeks ago 00:01:16 1
Andor Season 2 - Teaser Trailer | Star Wars & Disney+ | Diego Luna & Ben Mendelsohn (2025)
1 month ago 00:07:41 1
ChatGPT Ethereum Arbitrage Bot: Earn $1,000 Daily in Passive Income
1 month ago 00:34:18 1
Why Saudi Arabia is Building a $1 Trillion City in the Desert
1 month ago 00:01:02 1
Killing Floor 3 - Game Awards Trailer
1 month ago 00:11:30 1
Микец 2 день “Мидраш рассказывает“ По книге рава Моше Вейсмана. (р. Давид Мануйлов).
1 month ago 00:14:55 1
These 1 Exalt Boots BREAK The Game. Corpsewade Magic Find Infernalist.
1 month ago 01:01:42 1
ЭНЕРГИЯ БАЛИ || ЙОГА СРЕДНИЙ УРОВЕНЬ
1 month ago 00:06:03 72
The Witcher 4 - Official Reveal Trailer | The Game Awards 2024
2 months ago 00:05:35 1
Best AliExpress Alternatives for Dropshipping: The Game-Changing Tactics You Need! 🗣
2 months ago 00:07:15 1
30-06 at 3000+ Yards
2 months ago 00:01:29 1
Тентовые здания МСБИЛДИНГС
2 months ago 00:03:58 1
The Shira Choir x The Kiffness - Im Hashem Lo Yivneh Bayis (Psalm 127 DANCE REMIX)
3 months ago 00:06:07 1
PATRIARKH - WIERSZALIN IV (feat. Eliza Sacharczuk) (Official Video) | Napalm Records
3 months ago 00:00:54 1
Elon Musk takes son Lil X to meet Tayyip Erdogan
3 months ago 00:04:30 1
Two Storey House | Tiny Home ( 3X6 Meters )
3 months ago 00:11:44 1
Apple CEO’s High Stake Visit To China For Apology & Request To Market Share