SORA Video To Video Is Literally Mind Blowing - 12 HD Demos - Changes Industry Forever For Real
I have combined all 12 Video To Video #SORA demos released by #OpenAI into 1 video, together with the prompts used and amazing background music. You won’t believe how Video to Video will change the entire movie, animation, and social media industries forever. The results are simply astonishing.
Our Discord Channel ⤵️
Our Patreon With Amazing AI Scripts & Tutorials ⤵️
Prompts Of Each Demo Video (Public Post) ⤵️
Official Site ⤵️
[AI video generation] An explanation of Sora’s core technologies
Sora’s technical configuration
Although no paper has been published, OpenAI has published a page explaining the underlying technologies, so I will refer to that page.
If you would like to see the original text, please click here
Overall structure
Sora is said to consist of the following technical elements.
Turning visual data into patches
Video compression network
Spacetime latent patches
Scaling transformers for video generation
Variable durations, resolutions, aspect ratios
Sampling flexibility
Improved framing and composition
Language understanding
To summarize very simply, there are three main elements:
A technology that compresses video data into a latent space and then converts it into “spatiotemporal latent patches” that a Transformer can use as tokens
A Transformer-based video diffusion model
Dataset creation through high-accuracy video captioning, using the re-captioning technique from DALL·E 3
Looking at it this way, it doesn’t seem like they are using any particularly new technology.
It is the “level up and win by brute force” approach. You can clearly see that level (money and compute resources) matters more than clever tricks.
Turning visual data into patches
First, let’s look at how to create a “spatiotemporal latent patch.”
As preprocessing for creating spatiotemporal latent patches, the input video is compressed into a latent space.
I think it is mostly correct to regard this as the counterpart of the VAE used in image generation.
(In fact, since the VAE paper is cited, I think it is safe to assume that it is simply a VAE.)
This greatly reduces the amount of computation, and Sora is trained in this compressed latent space.
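To get a feel for how much compression into a latent space can save, here is a small back-of-the-envelope sketch. The downsampling factors, latent channel count, and clip size are all hypothetical values chosen for illustration; OpenAI has not published the actual figures for Sora, only that the video is compressed both temporally and spatially.

```python
import math

def latent_shape(frames, height, width, t_factor, s_factor, latent_channels):
    """Shape of the latent video after temporal and spatial downsampling."""
    return (frames // t_factor, height // s_factor, width // s_factor, latent_channels)

# Hypothetical clip: 64 RGB frames at 512x512 pixels.
pixel_shape = (64, 512, 512, 3)

# Hypothetical compression: 4x in time, 8x in each spatial axis, 4 latent channels.
latent = latent_shape(64, 512, 512, t_factor=4, s_factor=8, latent_channels=4)

# Ratio of raw values to latent values the diffusion model has to process.
ratio = math.prod(pixel_shape) / math.prod(latent)
print(latent, ratio)  # (16, 64, 64, 4) 192.0
```

Under these assumed factors the model handles about 1/192 of the raw values, which is where the savings in computation come from.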
In image generation, training begins immediately after encoding with the VAE, but Sora includes one more conversion step that creates what are called spatiotemporal latent patches.
These seem to correspond to text tokens in an LLM.
An image is worth 16x16 words: Transformers for image recognition at scale.
The patching method divides the image into patches based on position (patching) and flattens each patch into a one-dimensional vector (flattening).
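The patching and flattening step can be sketched in a few lines of NumPy. This is a minimal illustration under assumed sizes (a 224x224 RGB image and 16x16 patches); the function name `patchify` is hypothetical, and the learned linear projection and position embeddings that ViT applies afterwards are omitted.

```python
import numpy as np

def patchify(image: np.ndarray, patch_size: int) -> np.ndarray:
    """Split an (H, W, C) image into flattened, non-overlapping patches.

    Returns an array of shape (num_patches, patch_size * patch_size * C).
    """
    height, width, channels = image.shape
    if height % patch_size or width % patch_size:
        raise ValueError("height and width must be multiples of patch_size")
    rows, cols = height // patch_size, width // patch_size
    # (rows, P, cols, P, C) -> (rows, cols, P, P, C): group pixels by patch position.
    grid = image.reshape(rows, patch_size, cols, patch_size, channels)
    grid = grid.transpose(0, 2, 1, 3, 4)
    # Flatten each patch into one vector; patches are ordered row by row.
    return grid.reshape(rows * cols, patch_size * patch_size * channels)

image = np.arange(224 * 224 * 3, dtype=np.float32).reshape(224, 224, 3)
tokens = patchify(image, patch_size=16)
print(tokens.shape)  # (196, 768)
```

Each of the 196 rows plays the role of one "word" in the sequence fed to the Transformer.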
ViViT: A video vision transformer.
There are two patching methods proposed here:
As in ViT, patch each frame based on position and concatenate the patches in frame order (figure 2)
Treat the input video as a three-dimensional volume, extract blocks (tubes) of t (number of frames) x h (patch height) x w (patch width), and flatten each block into one dimension
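The second method (tube extraction) can be sketched as follows. This is an illustrative NumPy version, not code from the paper; the function name `extract_tubelets` and the video size are assumptions, and the learned projection applied to each flattened tube is left out.

```python
import numpy as np

def extract_tubelets(video: np.ndarray, t: int, h: int, w: int) -> np.ndarray:
    """Split a (T, H, W, C) video into flattened, non-overlapping t x h x w tubes.

    Returns an array of shape (num_tubes, t * h * w * C).
    """
    frames, height, width, channels = video.shape
    if frames % t or height % h or width % w:
        raise ValueError("video dimensions must be multiples of the tube size")
    nt, nh, nw = frames // t, height // h, width // w
    # Separate each axis into (block index, offset inside the block).
    blocks = video.reshape(nt, t, nh, h, nw, w, channels)
    # Bring the block indices to the front: (nt, nh, nw, t, h, w, C).
    blocks = blocks.transpose(0, 2, 4, 1, 3, 5, 6)
    # One flattened vector per tube, ordered by time, then row, then column.
    return blocks.reshape(nt * nh * nw, t * h * w * channels)

video = np.zeros((16, 64, 64, 3), dtype=np.float32)  # hypothetical 16-frame clip
tubelets = extract_tubelets(video, t=2, h=16, w=16)
print(tubelets.shape)  # (128, 1536)
```

With t=1 the same function reduces to the first method: every frame is patched independently and the patches are concatenated in frame order.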
Masked autoencoders are scalable vision learners.
This paper is not about a patching method but about learning efficiently from patched images.
Effective as pre-training for ViT
Mask a portion of the patch tokens, input the remaining ones, and solve the task of reconstructing the masked patches
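The masking step can be sketched as below. This is a simplified illustration rather than the paper's implementation: the function name `random_masking` is hypothetical, the tokens are random stand-ins for real patch embeddings, and the encoder, decoder, and reconstruction loss are omitted. The 75% mask ratio is used here as an example value.

```python
import numpy as np

def random_masking(tokens: np.ndarray, mask_ratio: float, rng: np.random.Generator):
    """Randomly hide a fraction of the patch tokens.

    Returns (visible_tokens, keep_indices, mask), where mask[i] is True
    when token i is hidden and has to be reconstructed.
    """
    num_tokens = tokens.shape[0]
    num_keep = int(num_tokens * (1 - mask_ratio))
    # Shuffle the token indices and keep the first num_keep of them.
    keep = np.sort(rng.permutation(num_tokens)[:num_keep])
    mask = np.ones(num_tokens, dtype=bool)
    mask[keep] = False
    return tokens[keep], keep, mask

rng = np.random.default_rng(0)
tokens = rng.standard_normal((196, 768))  # stand-in for 196 patch embeddings
visible, keep, mask = random_masking(tokens, mask_ratio=0.75, rng=rng)
print(visible.shape, int(mask.sum()))  # (49, 768) 147
```

Only the visible tokens are passed through the encoder, which is what makes this kind of pre-training efficient; the model is then trained to restore the patches where `mask` is True.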
Patch n’Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution.
A paper that makes it possible to freely vary the resolution and aspect ratio of the input data
It takes advantage of the fact that ViT accepts input sequences of variable length and packs multiple sequences together, so that inputs of any resolution or aspect ratio can be fed in.
Using this technology, Sora can be trained on videos and images of varying resolutions, durations, and aspect ratios, and the size of the generated video can be controlled at inference time.
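The idea of packing sequences of different lengths can be sketched as follows. This is a simplified illustration, not NaViT's actual code: the first-fit greedy packer, the function names, and the example resolutions are assumptions chosen for clarity. The block-diagonal attention mask keeps tokens from different inputs in the same pack from attending to each other.

```python
import numpy as np

def pack_sequences(lengths, max_len):
    """Greedy first-fit: put each example into the first pack that has room."""
    packs, used = [], []  # example indices per pack, tokens used per pack
    for idx, n in enumerate(lengths):
        if n > max_len:
            raise ValueError("example is longer than max_len")
        for p in range(len(packs)):
            if used[p] + n <= max_len:
                packs[p].append(idx)
                used[p] += n
                break
        else:
            # No existing pack had room, so start a new one.
            packs.append([idx])
            used.append(n)
    return packs

def attention_mask(pack, lengths, max_len):
    """Block-diagonal mask: True where a query token may attend to a key token."""
    mask = np.zeros((max_len, max_len), dtype=bool)
    start = 0
    for idx in pack:
        end = start + lengths[idx]
        mask[start:end, start:end] = True
        start = end
    return mask  # padding positions stay False

# Token counts with 16x16 patches for hypothetical inputs of size
# 224x224 -> 196, 128x256 -> 128, 64x64 -> 16, 256x256 -> 256.
lengths = [196, 128, 16, 256]
packs = pack_sequences(lengths, max_len=256)
print(packs)  # [[0, 2], [1], [3]]
mask = attention_mask(packs[0], lengths, max_len=256)
```

Because every input only contributes as many tokens as its own resolution requires, no resizing or cropping to a fixed shape is needed before training.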
Song: Unknown Brain - MATAFAKA (feat. Marvin Divine) [NCS Release]
Music provided by NoCopyrightSounds
Free Download/Stream:
Watch:
Song: Warriyo - Mortals (feat. Laura Brehm) [NCS Release]
Music provided by NoCopyrightSounds
Free Download/Stream:
Watch:
Song: Egzod, Maestro Chives, Neoni - Royalty [NCS Release]
Music provided by NoCopyrightSounds
Free Download/Stream:
Watch: