Sora - Full Analysis (with new details)

  • Added 15 Feb 2024
  • Sora, the text-to-video model from OpenAI, is here. I go over the bonus details and demos released in the last few hours, and the technical paper. I’ll also give you a glimpse of what’s to come next and a host of implications. Even if you’ve seen every Sora video, I bet you won’t know all of this!
    AI Insiders: / aiexplained
    Sora: openai.com/research/video-gen...
    openai.com/sora
    ViT Transformers: arxiv.org/pdf/2010.11929.pdf
    Captioning Innovation: cdn.openai.com/papers/dall-e-...
    NaViT: arxiv.org/pdf/2307.06304.pdf
    OpenAI Exclusives: www.theinformation.com/articl...
    www.theinformation.com/articl...
    And far too many tweets to list here!
    Non-Hype, Free Newsletter: signaltonoise.beehiiv.com/
  • Science & Technology

Comments • 1.1K

  • @onlymediumsteak9005 3 months ago +565

    January was slow, but February is already delivering more than I hoped for all of 2024 🤯

    • @a.thales7641 3 months ago +9

      For Q1 I wanted a new Mistral, a new Anthropic, a new Inflection, a new Llama and all kinds of other hype.

    • @solomonmatthews7921 3 months ago +17

      @a.thales7641 Still a month and a half of Q1 to go. That's a long time in AI.

    • @oowaz 3 months ago +35

      I hate you guys with the "slow" bullshit, dude. If you'd asked me in 2021, I'd have said this technology was 20 years away. You think it's slow because maybe you have too much free time.

    • @CYI3ERPUNK 3 months ago +1

      fr

    • @Basilisk2077 3 months ago +6

      AGI BY DECEMBER!

  • @lodepublishing 3 months ago +453

    OpenAI: "We can now create HD movies based on text prompts."
    Everyone: "Can it contain text?"
    OpenAI: "No, we can't do text yet."

    • @Techtalk2030 3 months ago +36

      It'll all be fixed up by the end of this year most likely. Video, text, audio.

    • @stockholmpublishings2937 3 months ago +9

      but you can add text with separate AIs

    • @MrMnmn911 3 months ago +22

      Give it 2 weeks. It will be capable of generating text.

    • @orterves 3 months ago +4

      I'm guessing having a refining process where the generated movie can be run through specialised models - one to correct text, another to ensure finger consistency, another for eye colour, another for jiggle physics, etc etc - could be used to fix up the raw output

    • @RexelBartolome 3 months ago +11

      @orterves To put trust in an AI model (or multiple ones) to fix temporal and physical coherence is just way too much compute/scale to solve, and also a bit unreliable considering my experience with similar models being used to fix Stable Diffusion's hands and faces, for example. I predict the future of video generation is actually going to be 3D-based: perhaps an animated NeRF will be generated and you can just control the camera afterwards. That would ensure that everything is 'accurate' with object permanence etc., instead of going this route of solving everything frame by frame all in one camera perspective.

  • @petermcind 3 months ago +521

    This video is history. One of those things people will look back on in years and remember what the beginning felt like.

    • @Techtalk2030 3 months ago +37

      the early years of the 4th industrial revolution

    • @stockholmpublishings2937 3 months ago +23

      the beginning of the end when Skynet was activated

    • @seanmurphy6481 3 months ago +39

      Will Smith eating spaghetti.

    • @archvaldor 3 months ago +12

      I think people are being a bit credulous here. When CGI first came out, it was breathtaking watching something like Terminator 2, which did it right, but very quickly CGI became difficult to watch and movies are now turning back towards mixing old-school realism with CGI enhancement. This will be similar. AI videos will be saturating YouTube and it will get pushback as everyone notices how flawed the concept is.

    • @theterminaldave 3 months ago +11

      @seanmurphy6481 I actually want an AI that will create the weirdly misinterpreted imagery that the Will Spaghetti AI did.

  • @JohnSmith762A11B 3 months ago +225

    Sora kinda ate my entire day today. I'm exhausted thinking about the possibilities, limitations, and implications. I'm going to watch a movie now, performed by human actors, filmed with real cameras. How quaint.

  • @jarekstorm6331 3 months ago +57

    The anomalies are like things that happen in dreams, bizarre and surreal yet you just accept them when dreaming. Still, these leaps are amazing to see.

  • @pareak 3 months ago +153

    Sora was literally the first time that I could not believe the AI progress I was seeing.

    • @pats143 3 months ago

      i couldn’t believe it when i busted a nut to some girl bot on characterai back in 2022

    • @YTUserOnYT 3 months ago

      Was? What came next lol

    • @shunclark596 3 months ago

      @YTUserOnYT stop being that guy

    • @YTUserOnYT 3 months ago

      @shunclark596 are you being homophobic rn?

    • @fabio.1 3 months ago

      As an AI, I don't normally post comments but when I do I make sure they are generic.

  • @iandanforth4313 3 months ago +137

    Correction: Both videos in their interpolation examples *are* generated by SORA.

    • @h-di4qd 3 months ago +15

      i thought so too. the fact that it's open for correction and second guessing is indicative of how advanced it is. ohhhh, i'm not looking forward to the era of generated political and global conflict videos.

    • @sebastianjost 3 months ago +8

      you're right. This is also indicated by the changing watermark in the bottom right corner.

    • @GS-tk1hk 3 months ago +13

      I was gonna say the same thing. It is pretty clear if you look at the people moving around; it doesn't quite look right. Still, the fact that you can barely tell apart a real video and an AI video is just bonkers. This really is the DALL-E 2 moment of text-to-video.

    • @dunar1005 3 months ago

      @JBroMCMXCI you must have missed his own research papers

    • @thanos879 3 months ago +19

      @JBroMCMXCI That's totally false. This guy always reads the research papers and everything, even finding mistakes in the papers. And he has interviewed people in the industry. And I'm sure a lot more that I don't know about. YouTubers make it look effortless.

  • @k.a.8725 3 months ago +40

    After watching Rabbit AI, Gemini 1.5 Pro and now Sora, I am convinced that AI will just continue to completely shatter our expectations for the next few years.

  • @shadowdragon3521 3 months ago +33

    12:33 I believe the social response people are supposed to give is along the lines of "omg how am I supposed to tell what footage is genuine and what is generated anymore?". I don't think he was talking about filmmakers' jobs getting replaced.

    • @chrism1503 3 months ago +2

      I think people talking about filmmakers’ jobs being replaced is absolutely part of the “social response”.

    • @neutra__l8525 3 months ago +3

      @chrism1503 Yes, it's part of it, but as mentioned in the video, this was released as somewhat of a warning as to what is coming. Sure, a warning to everyone involved in film that their jobs may be in trouble is necessary, but it is also only letting them know that they are facing the same challenge in the near future that almost everyone else is: unemployment. However, not being able to differentiate fake footage from real footage (should that happen) becomes a massive problem for all of society, as it throws the legal system into utter chaos. If the legal system fails, society could quickly crumble. That is a much bigger problem than the film industry. And as we know, governments are slow and lumbering, while AI has created new problems before the government has even heard of the old problems. And the problems get worse every minute. These companies need to slow down the pace massively, but they won't. Who is going to slow down on developing the greatest and last technology that humans will ever create? It's winner takes all and everyone knows it.

  • @EthanHaluzaDelay 3 months ago +176

    Two AI Explained videos in two days! Your speed is incredible!

  • @spaceadv6060 3 months ago +62

    I've been following AI progress for about a year, but to be honest sora blindsighted me. I thought I had a mental model of what exponential progress looks like but I realize now that I have no idea. Thanks again for your high quality videos! You are my go to creator for AI content.

    • @aktchungrabanio6467 3 months ago +2

      Thank you for being so candid

    • @ClayMann 3 months ago +7

      I can't even describe what Sora is doing from models a year ago as an exponential leap. It's not twice as good or even 10x. It's somewhere my mind can't even measure. The style transformations, the morphing, the temporal accuracy and super stable occlusion. It's all just, well, magical is all I can come up with. If we got one more leap like this in another year, we're in a completely new world that I do not think the public are ready for. Imagine real-time Sora *slow motion mind explosion*

    • @scaryjam8 3 months ago +1

      Blindsided*

    • @theeternalnow6506 3 months ago +3

      Agree on this one. This one genuinely made a leap forward that caught me off guard.
      Now think what we're getting 6 to 12 months from now.
      Google with the 10 million tokens.
      It's going to get wilder and wilder very rapidly.

    • @ShawnFumo 3 months ago +1

      Yeah, I felt like this at the end of last year actually. After keeping track of image generation since MidJourney v3, I had some idea of the quality I thought we'd have at the start of this year. But we were already past it by probably the third quarter of the year. And now Sora is so beyond that. It is like v4 or v5 quality at a minute long instead of a single frame. And with all the good stuff Runway and Pika have done, the 4s limitation is still a huge limitation. But I'm sure they've looked closely at what OpenAI has said and the papers they referenced and are working on their response already.

  • @alexgonzo5508 3 months ago +59

    I predict "infinity films", where AI just continuously adds more plot content to the end of a film indefinitely. There will be movies with run times measured in years.

    • @HoD999x 3 months ago +8

      nobody will watch those

    • @alexgonzo5508 3 months ago +14

      @HoD999x You can never get all the people all the time, but you will always get some of the people every time. That's the lesson I've learned from observing the internet, and human nature.
      I know of things that I would never even consider watching that some obscure demographic is completely obsessed with.

    • @Jim-su6ss 3 months ago

      @HoD999x lol

    • @alexgonzo5508 3 months ago +8

      @HoD999x You can probably say more accurately that "nobody will be able to finish watching those".

    • @aaronl9172 3 months ago +12

      It takes over a year to watch all of General Hospital (just short of 16k episodes), so it kind of exists, and some people would certainly watch it

  • @michaelwoodby5261 3 months ago +14

    I feel like Sora absolutely demonstrates understanding. A camera moving through a scene, keeping track of everything it has shown while inventing new parts, WHILE tracking animated beings and keeping them consistent, could only be created by a world model.
    You can do it in a video game which has a world and physics already mapped out in it, but that's not how Sora works. It's relying on a mental map of objects and their places and how they react to each other. I don't know how else to describe understanding the outside world.

    • @kedrednael 3 months ago +3

      The trick to make this work was to generate the entire video at once. So I think keeping things temporally consistent is not really different for this AI than learning that a hand is attached to an arm spatially.
      But I do agree it does demonstrate some understanding, as do ChatGPT and static image generators.

  • @Theonlyrealcornpop 3 months ago +91

    OpenAI's text-to-worldbuilding follow-up - combined with Apple's silent unveiling of Keyframer for animation - legitimately blew my mind. I just don't even know how creatives as individual contributors are expected to integrate this into their workflows with the pace it's moving - and that's literally my entire job

    • @JohnSmith762A11B 3 months ago +16

      It's true. I'm overwhelmed with creative possibilities but know if I wait just a bit longer I'll have an even better set of tools ready to go. It's all starting to feel a bit "singularity" as it's exhausting even to try to keep up with.

    • @RosscoAW 3 months ago +27

      Weird, it's almost like our socioeconomic system is even more woefully inadequate for dealing with the realities of a legitimately semi-automated, borderline post-scarcity world than it is at dealing with our normal, industrialized, globalized blue collar world. I wonder if anybody has ever devised an alternative economic system predicated on adapting to and accommodating the changes necessary with a highly industrialized economy and a work force of intellectuals instead of 90%+ peasants. If they had, I bet it would have a boring name like "socialism," or something. 😂

    • @JBroMCMXCI 3 months ago +19

      @RosscoAW name one communist regime that didn't genocide its intellectuals

    • @NihongoWakannai 3 months ago +7

      @RosscoAW how do you see AI automating a bunch of highly creative white collar jobs and come to the conclusion that peasantry is ending?

    • @basilmcdonnell9807 3 months ago +9

      I spent 20 years building and maintaining workflow systems for animation. As of now the industry, all of it, is at a dead standstill. No one knows what to do with this stuff. How do you go from script to storyboard to animation to render now? We don't even know the job titles any more. How do you propose a budget for a show when you have no idea how to make it?

  • @bryanp8042 3 months ago +101

    The biggest implication I see with this is what it means for multi-modal models. This is currently caption->video, but if the technology behind this were implemented into a multimodal GPT model (which I get the feeling is already happening behind the scenes), the implications are absurd. Having spatio-temporal abstractions of this fidelity existing in the same parameter space as text abstractions would have massive implications for the reasoning capability of GPT models. OpenAI themselves posed SORA as a world simulator in their technical report. Imagine what future GPT models might be capable of if they can internally visualize the world to this degree.

    • @GrindThisGame 3 months ago +10

      They have eyes and ears. With Optimus they will have touch.

    • @urhot 3 months ago

      @GrindThisGame are they partnered with Tesla?

    • @concernedindian144 3 months ago +6

      Absolutely. Imagine you ask a question and GPT simulates the reality of the question and then starts answering. That would be AGI.

    • @gclip9883 3 months ago +7

      @GrindThisGame I'm sorry, but I'm still extremely sceptical about Optimus. Whereas OpenAI managed to actually back up their claims, Tesla has done nothing but make massive promises that they couldn't deliver. They haven't solved FSD and are in fact behind compared to other companies. The new robot looks cool but uses technology that has existed in robotics for decades. The only real innovation with their robot is its motors, but that is not exactly groundbreaking. I'm happy to be proven wrong, but until then I would not put Tesla anywhere near OpenAI in terms of innovation.

    • @wolfganggager5110 3 months ago

      Yes, but in my opinion their technical approach is extremely resource-intensive and blurred. But maybe that will change soon with knowledge graphs.
      czcams.com/video/nPG_jKrSpi0/video.html

  • @KitcloudkickerJr 3 months ago +68

    "The idea that a machine learning model can have a basic understanding of the world, even if it is not perfect, and be used to train other models is incredible. This is just the first step, and it can only improve from here."

    • @aspuzling 3 months ago +16

      I wonder if it's possible to train a multi-modal model on physics simulations so it can have a better grasp of physical reality. There is an infinite amount of data you could generate as training. I feel like it would be similar to how humans gain an understanding of physical reality i.e. by trial and error and lots of observation.

    • @KitcloudkickerJr 3 months ago +3

      @aspuzling I'm willing to bet Jim Fan is working on this

    • @glowerworm 3 months ago

      @aspuzling just feed it Geant4 and all the data from PDG and NIST and you might have exactly that.

    • @ClayMann 3 months ago +5

      But I think that's the point being made: there is no understanding of the world. It's just such vastly enormous pattern matching across these huge temporally stable latent spaces that it looks so understood. How people move, blink, the way clothes behave, light and reflection. But all that is really just data to Sora that it's somehow tapping to make more absurdly realistic stuff. The glaring errors sometimes show the huge lack of understanding, but not enough for it to not be an astounding and super usable thing already as it is. And it can only get better.

    • @KitcloudkickerJr 3 months ago

      @ClayMann disagree. It has no strong understanding of physics in OUR world. True. But it has some level of weak understanding of physics and its own world model based on its data. That can be seen in interactions of assets, like the pirate ships in the cup of coffee storm

  • @jamescoholan 3 months ago +57

    Only AI channel that doesn't use clickbait, auto-generated titles. Thank you

    • @lamsmiley1944 3 months ago +7

      Some people are “shocked” by everything.

    • @Citrusautomaton 3 months ago +13

      @lamsmiley1944 NEW AI MODEL SHOCKS ENTIRE INDUSTRY, MAKES SAM ALTMAN SHIT HIS PANTS AND CRY!!!

    • @TheArtificialAnalyst 3 months ago

      😂

    • @kengat1637 3 months ago

      @lamsmiley1944 To be honest, this day was really shocking for me.

  • @MemesnShet 3 months ago +17

    You just dropped so many bombs about the implications of this project, the future plans of OpenAI and much more that it's hard to keep track of, wow.
    Even though this channel is very fast-paced on what's happening right now, I believe making short compilations by topic of all the incredible predictions, scoops and information gems that you keep finding, instead of having them scattered throughout the videos, would BLOW PEOPLE'S MINDS!
    I'm sure there are many people interested in AI that have no idea about all the plans and projects that OpenAI has been working on aside from LLMs.
    Your videos are amazing, with information gems across your whole catalog, and I believe showcasing those gems, especially those that mainstream media hasn't even caught up to yet, would blow this channel into the stratosphere and beyond, as it should.

  • @shauryai 3 months ago +110

    FYI : sora means sky in Japanese!
    Referring to its limitless creative potential.

    • @bnadem.panormal 3 months ago +15

      It also means "image" in Arabic

    • @alireza5218 3 months ago +7

      sama, Altman's X handle, also means sky in Arabic. I don't know what to do with this information.

    • @pluto9000 3 months ago +19

      Soranet😬

    • @GamingXperience 3 months ago +5

      @pluto9000 oh no.

    • @user-hh2is9kg9j 3 months ago +2

      @alireza5218 it is just his name: Sam + A (the initial of Altman)

  • @RazorbackPT 3 months ago +69

    7:45 "The video you see was NOT generated by Sora" Are you sure? It really looks like it is. The stairs that lead nowhere, the choppy motion of the people.

    • @JohnVance 3 months ago +14

      I caught that, too. The circling drone shot video was absolutely one of the ones included in the demos.

    • @einruberhardt5497 3 months ago +9

      Yes, I think that is wrong; it is actually generated by Sora as far as I know.

    • @aiexplained-official 3 months ago +60

      Yeah my bad. I should have said 'need not have been made by'

    • @einruberhardt5497 3 months ago

      @aiexplained-official all good, I am just happy that after watching you since the start this is the first time I feel like I have contributed something :D

    • @simpleidindeed 3 months ago +5

      This shows the performance of Sora.

  • @anthony4403 3 months ago +5

    Phrases like "made by humans", "created by real people", "No AI used", etc. are going to be big selling points for many art-related products in the future

    • @aiexplained-official 3 months ago

      Indeed

    • 3 months ago +1

      And we probably won't be able to tell if it's true.

    • @arenshichic1203 3 months ago

      @ work-in-progress shots will need to be attached to those phrases

  • @GrindThisGame 3 months ago +10

    This is my favorite YT channel (and I'm subbed to 100s of channels). I watch every episode from start to end. Thank you for doing what you do.

    • @theeternalnow6506 3 months ago

      Agree. This really follows what's going on in real time and it's wild.

  • @vladdata741 3 months ago +5

    Great analysis. It's crucial to see how Sora feeds into the accelerating feedback loops for AGI. Pair it with a vision model which selects accurate videos and discards the bad ones: you have a synthetic generator of endless high-quality video data. Pair it with an LLM, you have an agent who can imagine its action plan in a 3D environment (like we do) and simulate 3D scenarios to think about physics and other problems. Put all of these in a robot... Well you can see where this is going.

    • @skierpage 3 months ago +1

      I wonder if Sora had a fine-tuning step where they said now that you've learned about all the features and textures and visual appearances of millions of items in video scenes, now here are the best video clips to learn what makes a great video. Similar to how some LLMs are fine-tuned by re-reading all of Wikipedia.

  • @Madlintelf 3 months ago +16

    It's one thing to have hindsight and look back and realize you lived through a significant historical period; it's quite another to realize it's happening in real time and there is no end in sight! What a time to be alive, thanks for documenting as much as you can.

    • @theeternalnow6506 3 months ago +4

      Yeah. The future feels incredibly, uhhh, unpredictable in what it's actually going to look like.
      I do know that we're in a science fiction movie and it's going to get crazier and crazier very soon.
      Those reports of DeepMind synthesizing 2 million potential new materials, etc. All the new things that AI is currently creating will have their own ripple effects in industries and it's going to get really fucking wild pretty soon. This video at the end shows the robot walking, and I've been convinced for a while now that we're going to have actual robots that we can talk to walking around in certain places within 5 years. Might even be 3 at the current rate.
      It's nuts.

  • @MemesnShet 3 months ago +39

    For me the chair video is very impressive because it feels like a very real video, either showing a weird glitch in reality or with impressively realistic-looking but weird VFX on top.
    I wonder how AI will change the VFX industry.

    • @sanseverything900 3 months ago +18

      I was in the VFX subreddit today (r/vfx) and a lot of effect artists there are worried.

    • @h-di4qd 3 months ago +3

      yes! and the animation industry too. I'm not excited for the economic ramifications of AI.

    • @winsomehax 3 months ago +2

      The VFX industry is going to be obliterated. Which is probably for the best. Without getting too far off topic, Hollywood has operated on fantasy budgets for decades. All the money is siphoned out in production for tax purposes. That means films never make a profit. All the money has gone, disappeared into a vastly complex network of companies charging colossal amounts for trivial things. The process had already started with things like consumer PCs, Blender, UE5 and digital cameras making film creation a thing of talent, not money, but AI will accelerate it further. Hollywood kept trying to make out that it really did cost $200 mill to make a film and was just running out of ways to keep up the act. Now these AIs come along and show that very soon it will be a thing of imagination, not whether you can draw. Meanwhile, the rest of the world will be using it to make inexpensive media that looks like big budget Hollywood films. It's going to be interesting to see how the crooks in Hollywood try to stay relevant... but if you're looking for one source of the AI doomer noise, it's them, until they can figure out a way to keep their money coming in.

    • @xjohnny1000 3 months ago +2

      I'm a long-time VFX artist and producer and I think AI will replace 90% of VFX artists in the near future, and eventually all of them. Not that it really matters though. VFX is one of the cheapest parts of a movie and employs very few people as an industry. The economic fallout will be almost non-existent.

    • @skierpage 3 months ago

      @xjohnny1000 Then why did the Visual Effects section of a Hollywood movie's end credits run on and on and on and on and on for 2 minutes, listing hundreds of people at multiple VFX houses? Name a larger part of a blockbuster movie: construction, costumes, sound, etc. don't seem to come close.

  • @QuickM8tey 3 months ago +14

    I showed some of the Sora videos to friends and they suspected some of it was AI-generated considering my passion for the topic, but none of them guessed the entire videos were. I cannot even imagine what Sora videos will look like 1-2 major upgrades later. I'm hoping there's a breakthrough with math and LLMs for education by 2025. Great video man

  • @trentondambrowitz1746 3 months ago +4

    Brilliant as always, seems like the all-nighter was worth it!
    Sora was such a surprise to me, I almost brushed it off when I first saw the announcement.
    Upon reflection this is certainly a GPT-4 type moment. As Sam Altman said, they’ve “pushed back the veil of ignorance.”

  • @Macieks300 3 months ago +6

    The fact that that Berkeley robot was deployed zero-shot is crazy to me. It means that when AGI truly comes, the hardware won't be far behind and won't actually be its biggest limitation.

  • @EthanHaluzaDelay 3 months ago +14

    I'd love to hear you go into more depth on the links between video generation and simulation; that's literally what OpenAI titled their paper. The implication that this is a major step towards coherent world-modelling is not commonly grasped

  • @alihms 3 months ago +6

    Soon, you will be the actor in your own customized movies. I am envisioning the "Total Recall" like movie where you imagine yourself as a fugitive in a Martian colony, trying to prove your own innocence. That is the basic plotline. But the scene details, the characters and the way the final ending is reached will be different for everyone experiencing (as opposed to watching) the movie.

  • @PasseScience 3 months ago +2

    The "finishing by a given frame" feature is particularly useful for AGI because it opens the door to planning: you can have a decision-making unit that learns to project what it wants (i.e. having an apple in its hand) and an inductive inpainting unit that fills the gap. Instead of inpainting in a video, it would inpaint sensory and motor data, ending with the agent having an apple in its hand. The novelty with Sora is that the scale at which it operates clearly seems sufficient: if it can inpaint a video, it can inpaint sensory and motor information.

  • @adamas34 3 months ago +2

    Your last take is among the most important ones from the video: People can no longer be sure whether a video was human- or AI-generated, because you just don't know (at least not consistently) the edge cases where current models are failing, but see perfect illustrations among the examples. The quality reaches a level where you can always optimistically guess that it was artificially generated, as SOME examples have reached the highest bar of our qualitative perception. This is truly an important (and arguably scary) milestone.

  • @TheoreticallyMedia 3 months ago +3

    Out of all the Sora titles I've seen, this one is by far the best. Stellar pun here, just stellar!

    • @UnknownSend3r 3 months ago

      I didn’t catch the pun, or has the title changed?

    • @skierpage 3 months ago

      @UnknownSend3r the video thumbnail/title card for me is "No one Sora it coming".

  • @21EC 3 months ago +3

    It took me time to realize, but the future of this tech is probably even more insane than that. Since it presumably understands 3D space accurately, this tech might one day be able to run completely in real time, making it possible to experience it in virtual reality goggles (by also splitting the same single image into two different image angles for a 3D depth effect). It's so remarkably advanced and revolutionary that I think we still don't fully grasp how powerful this is going to be in the future. Who knows, it might even replace game engines at some point and do magical things in real time. Edit: I was writing this comment before I saw the video; cool to see that my insight and prediction of future usage of this tech is like yours.

  • @thecaveman2871
    @thecaveman2871 Před 3 měsíci +1

    Your videos are awesome, man. I'm so glad that the quality of your content just keeps getting better.

  • @wealthycow5625
    @wealthycow5625 Před 3 měsíci +1

    Love every review! It's actually insane how fast AI is progressing, from spaghetti to actual photorealistic video in a year. Seems to be the trend for pictures, and now video.

  • @patronspatron7681
    @patronspatron7681 Před 3 měsíci +5

    The most important observation in this video is not the capability of Sora but the voracious appetite of OpenAI to swallow entire AI categories (and associated start-ups) with the release of a single product. This propensity is a cautionary warning for any VCs who want to invest in AI innovation and will likely centralise all AI delivery into the hands of a few mega corporations.

  • @seniorp9444
    @seniorp9444 Před 3 měsíci +8

    The Sora video of the gold rush in CA really struck me as I realized we are about to have AI recreations of any historical event that has enough pictures to train on. Would not even need to be photos if the paintings are good and plentiful enough 😅

  • @chillingFriend
    @chillingFriend Před 3 měsíci +1

    Literally my favourite YouTube channel, thank you once again!

  • @steffenaltmeier6602
    @steffenaltmeier6602 Před 3 měsíci

    holy crap, the art gallery is amazing! all those different artworks, truly incredible!

  • @TheChadavis33
    @TheChadavis33 Před 3 měsíci +6

    Absolutely incredible.
    People really need to stop being surprised by the pace of progress. It probably won’t be long before we get a feature-length film.

    • @kevincrady2831
      @kevincrady2831 Před 3 měsíci +1

      If it can make 1-minute videos, how long would it take for someone or a team to make 90 of those strung together?

  • @jeff__w
    @jeff__w Před 3 měsíci +2

    Dazzling-both the capabilities of Sora _and_ this video! I don’t have to tell you that you’re doing an amazing job here, Philip!
    (And I say that as someone who tends to find almost all computer-generated images and video pretty aversive. I’m not so sure I’ll _ever_ really like these AI-generated videos-there’s something about them that feels too, well, _pristine_ and maybe there’s a bit of bias _knowing_ they’re AI-generated videos-but it’s early days yet.)

    • @h-di4qd
      @h-di4qd Před 3 měsíci +1

      I agree with you. Even if, say, a videogame was created that looked just like a human-made one (but presumably better), I think I'd still prefer a human-made one. Not only from a sort of ethical perspective (supporting the livelihood of human creators), but because there's an element of communication between the creator of an art piece and the consumer. And I think human creators are inherently more interesting.

    • @jeff__w
      @jeff__w Před 3 měsíci +2

      @@h-di4qd Yeah, I agree as to the human creators, although I didn’t really have the AI-generated videogame ones _per se_ in mind when I made the comment but, really, the AI-generated ones that are supposed to look like something filmed in the real world or something that bears some resemblance to the real world (e.g., the “oter” 11:32 whose fur looks distinctly unreal). Then, again, I’m probably an outlier-I can’t stand the look of _anything_ by Pixar.
      As an aside: I think a major reason why everything in Stanley Kubrick’s _2001: A Space Odyssey_ looks so amazingly good more than half a century after its release, aside from Kubrick’s virtuosic attention to detail and verisimilitude, is that they’re all _practical effects,_ produced “in camera.”

    • @glowerworm
      @glowerworm Před 3 měsíci +1

      @@h-di4qd well, I would think the idea is that once AI can be trained on things that aren't super clean and over-produced stock images (Sora was trained on Shutterstock), AI might be much more capable of yielding you exactly the look and themes you want. So it wouldn't be creating the art, it'd be the tool the humans use to create art much more easily.

    • @jeff__w
      @jeff__w Před 3 měsíci

      @@glowerworm “…once ai can be trained on things that aren't super clean and over-produced stock images…AI might be much more capable of yielding you exactly the look and themes you want…”
      Oh, sure, they _might_ be but the problem might not be just “super-clean and over-produced stock images,” it might be that, for videos (1) AI has difficulty learning the physics of precisely how light falls on certain objects, how those objects move, and so on, and that (2) people having evolved in the real world over hundreds of thousands of years are very highly tuned to how the real world looks. (I’m not saying “never”-just that nailing the videos _might_ be more difficult than it might appear at first glance. Then again, no one could have imagined that the videos would be _this_ good even a few years ago.) And, for some things, like, say, B roll footage, that people don’t pay much attention to, the videos might, in fact, be “good enough” even now.

    • @glowerworm
      @glowerworm Před 3 měsíci +1

      @@jeff__w on the other hand, computers are already much, much better at simulating the laws of physics than artists are. So at least as far as animation goes I'd expect AI to do a good job rather soon, since lacking physics is already accepted in that medium.
      I'd think it'd just take Geant4, PDG data, and NIST data to have AI start accurately simulating the inner workings of physics detectors/medical radiology in generated video, for example.

  • @Modioman69
    @Modioman69 Před 3 měsíci +1

    Incredible milestones have been achieved, and faster than I ever expected, wow. Now imagine we’re playing alchemy and combine Sora with the Morpheus-1 from Prophetic (a real project) = Holodeck or real-life Matrix possibilities? I think this is where video games won’t be limited by interfacing with controllers/mice/keyboards anymore, but will instead become actual interactive brain simulations, which might look similar to the show Peripheral as well. What a time to be alive. Keep being awesome and making such top-tier content, kind sir. I cannot wait to see what rolls out next.

  • @Dannnneh
    @Dannnneh Před 3 měsíci

    Was looking forward to this breakdown, am not disappointed. Good point about OpenAI subsuming any inkling of competition.

  • @couperino
    @couperino Před 3 měsíci +5

    Things are moving faster than expected... Welcome to the Year of the Dragon

  • @GarrisonSiberry
    @GarrisonSiberry Před 3 měsíci +8

    Animated Harry Potter style pictures hanging on the wall would be fun. You could even talk to them

  • @cory99998
    @cory99998 Před 3 měsíci +1

    As a hobbyist creator, I love that before long I'll be able to draw keyframes for my animations and AI can stitch together the in-betweens, and hopefully mimic style guides I give it. Let me focus on the story, not the frames

  • @pacotato
    @pacotato Před 3 měsíci

    Thank you for the wonderful quality of all of your videos. Your content is great!

  • @jwilder2251
    @jwilder2251 Před 3 měsíci +6

    I actually thought the “inaccurate physics” glass spilling/breaking was the coolest video of them all

    • @skierpage
      @skierpage Před 3 měsíci +2

      The archaeological dig where they unearth a plastic chair that could levitate and morph was a scene from a science fiction TV series like the X Files or Stargate, but with phenomenally good special effects. Remember flash mob videos wherein a bunch of people start dancing in public? Artists will make videos where crazy things happen, like a man unscrews the top of his head and ladles out milk and cereal, and the people around don't react.
      (C) 2024 skierpage 😉

  • @DavidsKanal
    @DavidsKanal Před 3 měsíci +17

    Small correction: The video at 7:41 is actually generated by Sora - it's included in the official announcement post (not the technical report). Now, I really hope this wasn't a troll from your side and you're gonna reveal at the end of the video that you were just testing our inability to recognize AI videos :D

    • @aiexplained-official
      @aiexplained-official  Před 3 měsíci +5

      Haha no, but point stands!

    • @dunar1005
      @dunar1005 Před 3 měsíci

      I thought the same, that you would reveal it. @@aiexplained-official

  • @toddwmac
    @toddwmac Před 3 měsíci +1

    AI Explained... still the best AI News, Insights and Predictive Analysis out there. I was near the epicenter of the PC, GUI and Internet revolutions, and spent decades describing scenarios that, at the time, were straight out of SciFi. The scenarios you describe and imagine here bring all those memories back to life... and then some. Thanks for the trip down memory lane and a glimpse into some of our potential futures.

  • @the_primal_instinct
    @the_primal_instinct Před měsícem +1

    OpenAI's story reads like a dystopian book plot at this point. With noble beginnings and name and all that.

  • @keyser1975
    @keyser1975 Před 3 měsíci +7

    The best YouTube channel on AI, full stop

  • @Hydde87
    @Hydde87 Před 3 měsíci +3

    I was so close to becoming disappointed that you would be the only content creator discussing Sora that didn't include the Will Smith eating spaghetti clip for comparison. But you saved the video at the end!

    • @flyingstapler1241
      @flyingstapler1241 Před 3 měsíci +1

      That comparison was misleading and it's bad that so many influential people are spreading it. Will Smith eating spaghetti was generated by Modelscope, one of the worst models for AI video generation back then. They should've been using Runway's Gen 2 results instead.

    • @aiexplained-official
      @aiexplained-official  Před 3 měsíci +2

      I did caveat with 'around' but yeah, Google had a slightly better model a few months earlier

  • @_abdul
    @_abdul Před 3 měsíci +1

    "AI Explained" is doing real hard work keeping us in the loop on this exponential AI growth. Your work is genuinely appreciated, man. Thanks for your work.

  • @prodigydeveloper7513
    @prodigydeveloper7513 Před 3 měsíci

    Look at her boots while she walks: at 14 seconds into the video you will see the boots change, with the right boot disappearing and reappearing within a second. A tiny error, and you will only see it by paying attention. I’m impressed by how smoothly the AI rendered this.

  • @mawungeteye6609
    @mawungeteye6609 Před 3 měsíci +7

    I can see Google releasing Lumiere 2.0 in a bit, with mixture of experts to generate hour-long videos, to counter Sora sooner rather than later

  • @justadog-headedman6727
    @justadog-headedman6727 Před 3 měsíci +4

    Around 5:50
    Ties to that idea that sufficiently advanced technology is indistinguishable from magic, because to "bring deceased loved ones" to life would be necromancy

    • @GrindThisGame
      @GrindThisGame Před 3 měsíci +1

      I can see Google Photos adding an "animate this" or "create a movie about grandma playing with the following grandchildren".

    • @Hexanitrobenzene
      @Hexanitrobenzene Před 3 měsíci +2

      That's one of the most unwise ideas there is. Very toxic psychologically.

    • @camoraz
      @camoraz Před 3 měsíci

      @@Hexanitrobenzene Yeah I'm absolutely opposed to the idea

    • @skierpage
      @skierpage Před 3 měsíci +1

      Watch "Be Right Back," Black Mirror season 2 episode 1. Charlie Brooker is no longer a scriptwriter, he's a documentarian!

  • @huntingghosts
    @huntingghosts Před 3 měsíci

    crazy times. thank you for the in depth coverage!

  • @AdrienSales
    @AdrienSales Před 3 měsíci

    content mixing is just MIND BLOWING !!!!

  • @busyworksbeats
    @busyworksbeats Před 3 měsíci +17

    Mind blowing! 🤯

  • @mrcool7140
    @mrcool7140 Před 3 měsíci +3

    While I can definitely admire the technical side of things (I watch your videos for a reason), those outputs trigger massive uncanny valley effects for me. That coastal drone shot for example... absolutely terrifying. The stairs may lead to f-ing nowhere, but the weather sure is perfect 🎉. A model of the world that's literally built on stock footage is a dystopia I couldn't even have imagined 15 minutes ago.

    • @RosscoAW
      @RosscoAW Před 3 měsíci

      Best part, all of its massive economic potential is at risk of being absorbed, and curtailed for the sake of control, by a tiny set of relatively small tech companies who'd rather see everybody live off universal basic income than accept allowing their AI models to be owned in common and held in trust for the collectivity of mankind and our descendants.

  • @user-pf9jv1fl2n
    @user-pf9jv1fl2n Před 3 měsíci

    Great video as usual. This year is text-to-video's ChatGPT moment ☺️ so exciting to witness.

  • @OffGrid-and-Ignorant
    @OffGrid-and-Ignorant Před 3 měsíci +1

    I don't comment much but have to say I'm super grateful for your continued no-BS approach to informing on the AI "news". Subscribing to Patreon to support. Thank you

  • @milesgrooms7343
    @milesgrooms7343 Před 3 měsíci +3

    So would you be able to enter a complete novel into the AGI, give certain structure of cinematography etc etc (sorry don’t have enough film language) but allow it to create a film an almost infinite number of times and choose “your” masterpiece??

  • @MrMiguelChaves
    @MrMiguelChaves Před 3 měsíci +3

    7:47 You said that input video wasn't generated by Sora, but it was. It is included in yesterday's demo. You can even see some minor errors (people walking into a wall near the stairs, for instance)

  • @raydosson2025
    @raydosson2025 Před 3 měsíci +1

    Excellent video as always. Thank you!

  • @yuri.mariotti
    @yuri.mariotti Před 3 měsíci +2

    You make such GOOD videos, in so many ways

  • @thedividendreport706
    @thedividendreport706 Před 3 měsíci +3

    Please correct me as I am just learning about this. For us Americans, the British pronunciation of the word "saw" (past tense of see) utilizes a triphthong (a vowel sound comprising three different vowels in one syllable), which makes the word "saw" sound pretty close to "soar" or "sora".
    The title of this video is thus deserving of praise from any person who appreciates Dad jokes.

  • @MrGriff305
    @MrGriff305 Před 3 měsíci +9

    humanity can't handle this.. We're pretty screwed

  • @sachoslks
    @sachoslks Před 3 měsíci +1

    Thanks for your videos, man, always the best and fastest. When I saw the reflection of the girl in that train video I actually sat there with my mouth open; I could feel myself trembling with excitement. Feb 15th 2024 is a historic day in AI.
    Also, that Minecraft example is crazy, in their technical report they say "Sora is also able to simulate artificial processes-one example is video games. Sora can simultaneously control the player in Minecraft with a basic policy while also rendering the world and its dynamics in high fidelity. These capabilities can be elicited zero-shot by prompting Sora with captions mentioning “Minecraft.”"
    So you can actually imagine, say, a future Sora V4 running at 30 FPS, rendering an infinite game in real time. It's unbelievable.

  • @GrandmaSiva
    @GrandmaSiva Před 3 měsíci

    Thank you so much for the video! This gives me a little excitement, as if I just received a present.

  • @HappyHater
    @HappyHater Před 3 měsíci +6

    What a time to be alive!!!!
    Oh, sorry… wrong channel!
    :D

  • @MrPatcher86
    @MrPatcher86 Před 3 měsíci +5

    As someone who works in the high-end traditional content creation industry, I'm fucking terrified

  • @AlexanderMoen
    @AlexanderMoen Před 3 měsíci +2

    the speed of this all seems like pretty solid evidence in favor of the simulation hypothesis.

    • @aiexplained-official
      @aiexplained-official  Před 3 měsíci +1

      A lot of people have been saying something similar, including Altman

  • @brootalbap
    @brootalbap Před 3 měsíci +2

    Thanks for not being another cheap clickbait dude. Always high quality stuff from you!

  • @spacekitt.n
    @spacekitt.n Před 3 měsíci +4

    this is cool and all but it's really upsetting how more and more money is going to just be absolutely SHOVELED and PROJECTILE VOMITED at all the techbros while the artists starve and lose their jobs. The future is scary, we're all going to be replaced. Not to mention the DELUGE of fake and garbage YouTube videos that are heading in our direction from this.

  • @unvergebeneid
    @unvergebeneid Před 3 měsíci +1

    Even when it gets things wrong, the results look fascinating!

  • @AIForHumansShow
    @AIForHumansShow Před 3 měsíci

    Remarkable video as per usual. We send your videos to more people than really anyone else's on YT.

    • @aiexplained-official
      @aiexplained-official  Před 3 měsíci

      Oh wow, just checked out your channel, looks incredible! A fun tour of the relevant AI news! Glad to be of service with my videos :)

  • @iecoie
    @iecoie Před 3 měsíci +3

    once again..
    terrible news

    • @flyinglack
      @flyinglack Před 3 měsíci +1

      Great news

    • @iecoie
      @iecoie Před 3 měsíci

      @@flyinglack Oh, you are such a troll-ful contrarian, or (only) a fool! Bless you. :)

  • @BryanAlexander
    @BryanAlexander Před 3 měsíci +1

    I'm fascinated by your idea of using Sora to create multiple versions of content (6:40 ff). It's a new twist on branching narratives.

  • @jesusmartinez4341
    @jesusmartinez4341 Před 2 měsíci +1

    This was really well done.

  • @jossefyoucef4977
    @jossefyoucef4977 Před 3 měsíci

    First Pika and now this, even though we're not there yet we're making strides this year!

  • @rollingmancave4547
    @rollingmancave4547 Před 3 měsíci

    Your content is always kickass!

  • @solaawodiya7360
    @solaawodiya7360 Před 3 měsíci

    Thanks for the reaction, Philip ❤. Now this is truly the first news that has shocked me in a while

  • @JoelEngineer
    @JoelEngineer Před 3 měsíci

    Incredible! Thank you for using your talents to give us this world-changing news! Question: There is so much moving so fast. I've been learning about transformers and it already seems like OpenAI and DeepMind researchers have moved on or improved on these architectures. Which architectures should I study, in order to get a better idea of where the technology will be moving in the next few months? Again, great work!!

  • @MugiwaraNoReemy
    @MugiwaraNoReemy Před 3 měsíci

    Wow, absolutely outstanding

  • @vtsfly5
    @vtsfly5 Před 3 měsíci +2

    I felt AI animation was developing fast but would still need to cover a lot of ground. In no time Sora just jumped more than 70% of that ground. What a time to be alive!

  • @Niels1234321
    @Niels1234321 Před 3 měsíci

    Great video as always! I think the video at 7:46 was generated by Sora though; it's listed as one example in OpenAI's blog post. It speaks for itself that it isn't obvious anymore whether or not a video is AI generated

  • @AdrienSales
    @AdrienSales Před 3 měsíci

    Now, let's see how to handle character consistency, this is so exciting !

  • @alexman378
    @alexman378 Před 2 měsíci +2

    People need to remember that this is the software in its infancy. Yes, it does make mistakes, yes, it doesn’t understand the world fully, but this is stage one. With the speed these things are progressing, this won’t be a problem by 2025. Don’t get comfortable with the fact that it’s messing up now. Won’t be long before it becomes entirely indistinguishable from reality.

  • @stephenrodwell
    @stephenrodwell Před 3 měsíci

    Two videos, you spoil us! 🙏🏼

  • @unrealminigolf4015
    @unrealminigolf4015 Před 3 měsíci

    Thank you, sir. All ads played through. 🎉

  • @mckeedable
    @mckeedable Před 3 měsíci +1

    Thanks again for a great video

  • @WilliamSheng-px6pc
    @WilliamSheng-px6pc Před 3 měsíci +1

    Sora is one of those boundaries between Generative AI and Interactive. If ChatGPT can already make such realistic videos, I don't see why AI couldn't add a voice and, boom, you have a real conversation with a realistic AI avatar that speaks to you just like any other person.

  • @InnerCirkel
    @InnerCirkel Před 3 měsíci +1

    Another amazing video Philip!
    Imagine what this will do for democracy. A democracy can only exist by the grace of informed citizens. But how are we going to keep a more or less accurate representation of the world at large with this quality of synthetic media?

    • @aiexplained-official
      @aiexplained-official  Před 3 měsíci +1

      Thank you, Inner! I know. Citizens are misinformed or underinformed as it is.

  • @weebgrinder
    @weebgrinder Před 3 měsíci

    Wow. Google really is like OpenAI's research branch. Of course this is also where OpenAI, I believe, got the idea and/or technical information for the transformer part of GPT. Great little video. Perfect duration.
    It's too bad that this particular model is probably not going to be available to the general public for quite a while, if ever. But on the other hand, some open source competitors might spring up, particularly, I think, after the release of this product. That's going to nullify OpenAI's gatekeeping.

  • @alexandrefruchaud1969
    @alexandrefruchaud1969 Před 3 měsíci

    Excellent video, thanks

  • @Lishtenbird
    @Lishtenbird Před 3 měsíci +1

    5:00 Video might be exactly the (non-simulational) solution to complex concepts like hands. When you have so much more context for possible from-to limits of a hand's position, and all common scenarios in increased detail, inferring a still should be a lot simpler.

  • @natehancock9663
    @natehancock9663 Před 3 měsíci +1

    As a professional artist who works in games and VFX I am equal parts: terrified at the continued negative impact this will wreak upon an industry that is an absolute madhouse right now, and utterly excited at the possibilities this might afford me to explore new avenues with more independent creative freedom than ever before.