Researchers from the University of Washington, Columbia University, Open AI, the Allen Institute of Artificial Intelligence, and Toyota Research have teamed up to present a new method for fine-tuning these pre-trained models such as GPT-3, BERT, DALL-E, EfficientNet, or CLIP for application specific datasets. The key insight is that as you fine-tune these models, you gain in-distribution accuracy, but sacrifice the zero-shot flexibility, or out-of-distribution generalization, of these pre-trained “foundation” models. The authors present Weight-Space Ensembling, where you take a linear interpolation between the weights of the zero-shot and fine-tuned model to make new inference. This achieves a balance between in and out of distribution accuracy. The authors connect this to Linear Mode Connectivity to explain why it works compared to random weight-space ensembles, which do not work. This is another very interesting study on the Generalization capability of Deep Neural Networks. This includes solving problems o
1 view
6
2
3 years ago 00:31:00 8
Robust Fine-Tuning of Zero-Shot Models
2 years ago 00:02:57 1
Pnut & Jelly - Fit Or Fine
3 years ago 00:13:36 12
Out-of-Distribution Robustness in Deep Learning
1 year ago 00:00:55 1
TANK 300 Off Road Tuning #tank300
4 years ago 00:11:47 13
Mixer Fundamentals: Painting
10 months ago 00:29:18 1
Isometric Object : Hole Aluminium Bracket
1 year ago 00:02:04 1
Sena 50S Motorcycle Communication Bluetooth Headset
2 years ago 00:05:17 1
Sheku Kanneh-Mason Gets His 400-Year-Old Amati Cello Serviced By Legendary Luthier Florian Leonhard
8 years ago 00:06:22 1
Flowblade 1.4 video editor on Ubuntu linux
9 months ago 00:05:28 1
Everything Mark Wahlberg Eats In a Day | Eat Like | Men’s Health
3 months ago 00:03:32 12
Robot Motion Diffusion Model: Motion Generation for Robotic Characters
1 year ago 00:43:52 13
ComfyUI: Image to Line Art Workflow Tutorial
9 months ago 00:08:21 1
CQV-SWR-508 VSWR & Power Meter Review
9 months ago 00:20:01 3
Bronze Fittings for the “Serpent in The Blade“ Viking Sword
1 year ago 02:36:00 1
Spring Security, demystified by Daniel Garnier Moiroux
1 year ago 00:12:41 1
How To Install Code Llama Locally - 7B, 13B, & 34B Models! (LLAMA 2’s NEW Coding LLM)
9 months ago 00:03:00 1
Learning Vision-Based Bipedal Locomotion for Challenging Terrain
1 year ago 00:03:35 33
Scaler EQ | The World’s First Truly Musical EQ
3 years ago 00:13:59 1
The Mustang GT Fastback: The Ultimate Driving Machine