Pretrained Transformers as Universal Computation Engines (Machine Learning Research Paper Explained)
#universalcomputation #pretrainedtransformers #finetuning
Large-scale pre-training and subsequent fine-tuning is a common recipe for success with transformer models in machine learning. However, most such transfer learning is done when a model is pre-trained on the same or a very similar modality to the final task to be solved. This paper demonstrates that transformers can be fine-tuned to completely different modalities, such as from language to vision. Moreover, they demonstrate that this can be done by freezing all attention layers, tuning less than .1% of all parameters. The paper further claims that language modeling is a superior pre-training task for such cross-domain transfer. The paper goes through various ablation studies to make its point.
OUTLINE:
0:00 - Intro & Overview
2:00 - Frozen Pretrained Transformers
4:50 - Evaluated Tasks
10:05 - The Importance of Training LayerNorm
17:10 - Modality Transfer
25:10 - Network Architecture Ablation
26:10 - Evaluation of the Attention Mask
27:20 - Are FPTs
17 views
13
5
3 months ago 02:42:27 1
Дмитрий Нестерук «Разработка с использованием искусственного интеллекта»
5 months ago 00:44:02 1
O mínimo que você precisa saber sobre IA pra sobreviver ao Hype
6 months ago 01:56:20 1
Let’s build GPT: from scratch, in code, spelled out.
7 months ago 00:34:34 1
chatGPT помогает писать код
7 months ago 00:04:16 1
Exploring the Wild West Through AI: 84 Epic Digital Artworks. A must-see!
9 months ago 00:21:58 2
“ПАСХА ИЛИ ПЕСАХ? КАКАЯ РОЛЬ МОИСЕЯ?
10 months ago 00:16:00 1
Заработок в интернете на ChatGPT и Canva в 2024
10 months ago 00:34:12 1
Модель ChatGPT. Как она делает то, что делает? Часть 1.
11 months ago 00:17:58 1
AI superpowered networks? (NVIDIA and Cisco join forces)
11 months ago 00:09:09 1
Нейросеть Sora которая ГЕНЕРИРУЕТ ВИДЕО от OpenAI
1 year ago 00:08:04 1
DOBB-E: 6D General AI Robot Breakthrough (109 TASKS, 5620 TRAJECTORIES, 1,500,000 FRAMES)
1 year ago 00:15:01 1
So How Does ChatGPT really work? Behind the screen!
1 year ago 00:03:11 1
Как зарегистрироваться в чате GPT за 5 минут в России. Самая простая регистрация. Chat GPT от Openai
1 year ago 00:17:21 1
Actual Objects Presents: Voice To Skull
1 year ago 00:05:07 2
How to use Stable Diffusion XL with low VRAM ComfyUI | Generate amazing images with Automatic 1111🌟
1 year ago 00:13:36 1
Что вы думаете о машинном/искусственном интеллекте?
1 year ago 00:02:33 1
Чат GPT-Как искусственный интеллект становится нашим личным помощником.
1 year ago 00:00:44 1
3DGPT - your 3D printing friend & collaborator!
1 year ago 00:12:36 1
Оптимизация для поисковых систем с помощью чата GPT🤖
1 year ago 05:43:41 47
Create a Large Language Model from Scratch with Python – Tutorial
1 year ago 00:21:05 2
ChatGPT. Секретное оружие писателей: Как GPT стал незаменимым инструментом для авторов
1 year ago 00:04:38 1
Расширение для ChatGPT «editGPT» – делайте Ваши тексты более эффективными!
1 year ago 00:03:33 1
7 Free Alternatives to Midjourney for AI Generated Images - Earn $5000 per Month
1 year ago 00:01:17 1
@AiNNGpT Искусственный Интеллект Artificial intelligence #ai Нейросеть #neuralnetworks NN #gpt