Visual question answering & reasoning over vision & language: Beyond limits of statistical learning?
Advances in deep learning keep producing impressive results at the junction of computer vision and natural language processing. The task of visual question answering (VQA), once considered incredibly ambitious, is now commonly used to benchmark multimodal models. Despite apparent progress, however, I will argue that some capabilities required for a general solution to VQA, such as strong out-of-distribution generalization, are beyond the reach of prevailing practices in machine learning. I will discuss how causal reasoning helps in formalizing the limits of classical, correlation-based learning. We will use a new layer of understanding of existing techniques to identify what information is missing from typical datasets, where else to find it, and how to test our models for the behaviors we really care about.
Speaker: Damien Teney, Idiap Research Institute in Switzerland
MSR Deep Learning team:
1 view
10
1
2 months ago 00:13:54 2
Ancient Temple Shows Cell Phone & Wrist Watch? Built with Psychic Powers?
3 months ago 00:50:41 1
Tchaikovsky - The Nutcracker (Vol. 1 - Classical Music For Christmas)
4 months ago 00:04:14 1
The Hunter - Bloodborne (4K UHD 2024)
4 months ago 02:44:47 1
Best of Debussy - Classical Music Gems
4 months ago 02:57:29 2
Classical Halloween - Essential Classical Music
4 months ago 01:47:09 1
Best of Richard Wagner - Classical Music Gems
4 months ago 00:02:47 1
MMV - KEAN DYSSO x Sinny - LTE
4 months ago 00:03:10 1
Kiwi!
4 months ago 10:10:10 1
T H A L A S S A \\ Hauntingly Beautiful Fantasy Music for Gaming | Relaxing | Studying | Sleeping
4 months ago 00:29:35 1
FunOS | A Balance of Features and Functionality With Low-resource Usage
4 months ago 00:05:06 1
NEW MR OLYMPIA 2024 - MONSTER WHO MADE HISTORY - SAMSON DAUDA MOTIVATION
4 months ago 00:02:03 1
Arcadia. Future Utopia.
4 months ago 09:49:52 1
4K Scenic Autumn Drive in New England | Connecticut Fall Foliage (Left Side, SloMo + Relaxing Music)
4 months ago 00:15:43 1
10 Years of DarkstepWarrior: Magnetude
4 months ago 00:04:21 1
The Monk’s Walk
4 months ago 01:09:33 1
Best of Felix Mendelssohn - A Classical Music Showcase
4 months ago 02:37:06 1
Best of Dvořák - Essential Classical Music
4 months ago 00:17:43 1
ASMR Role Play | Ear Exam with Breathy Whispers “GOOD“ “OKAY“ and “HMM“ Deep in your Ears
4 months ago 00:09:53 3
Pink Floyd - Sheep (2024 AI Visuals)
4 months ago 00:05:09 1
Pink Floyd - Have A Cigar (2024 AI Visuals)
4 months ago 00:04:50 1
Falling In Reverse - “Last Resort (Reimagined)“
4 months ago 00:11:41 1
Create Studio Pro Review ✅ Create Studio Pro Demo And 🎁 Create Studio Pro Bonus 🎁👇
4 months ago 00:25:16 1
Full 4k Video: Millions Of Wolves Are Exterminated By Farmers And Hunters, What Happens Next ?
5 months ago 00:01:57 1
“POUR 939” Some minds are already filled. Animation by Patrick Smith