NEW Grok1.5 VISION - Big Step Towards AGI (Better Than GPT4 Vision!)

Sdílet
Vložit
  • čas přidán 16. 04. 2024
  • Grok 1.5 with Vision was just announced and will be released soon. Let's take a look at the announcement and the truly incredible examples.
    Join My Newsletter for Regular AI Updates 👇🏼
    www.matthewberman.com
    Need AI Consulting? 📈
    forwardfuture.ai/
    My Links 🔗
    👉🏻 Subscribe: / @matthew_berman
    👉🏻 Twitter: / matthewberman
    👉🏻 Discord: / discord
    👉🏻 Patreon: / matthewberman
    Media/Sponsorship Inquiries ✅
    bit.ly/44TC45V
    Links:
    x.ai/blog/grok-1.5v
  • Věda a technologie

Komentáře • 164

  • @olalilja2381
    @olalilja2381 Před měsícem +57

    You are, by far, my favorite CZcamsr keeping track of AI and LLM-related content!

    • @Heaz847
      @Heaz847 Před měsícem +2

      It's a tie between Matt and AI Explained for me!

    • @daveinpublic
      @daveinpublic Před měsícem +1

      Samesies!

    • @DarpaProperty
      @DarpaProperty Před měsícem

      100%, I found out that this became my only legit source of AI information.

    • @demitskill9103
      @demitskill9103 Před měsícem

      @@daveinpublicnever heard anybody say that ever but it like to take in this new word into my vocabulary

  • @AGI-Bingo
    @AGI-Bingo Před měsícem +13

    Start your countdown to Grok running locally on every Tesla. He could even host it while not driving with some llmOs or something. I think this 4d chess move is too good for Elon to miss.
    Love your channel ❤ All the best!

    • @aaronravak1407
      @aaronravak1407 Před měsícem +2

      I agree, not really a "selling point" due to it's open source nature but bravo on your awareness as to what this madman is doing. I love Elon's "fuck you" mentality. Between Twitter and Tesla he has mountains of raw data.

    • @AGI-Bingo
      @AGI-Bingo Před měsícem

      @@aaronravak1407 I think if something is going to challenge Amazon's Bedrock, it will be a Global Decentralized Tesla AI Fleet, imagine the edge capabilities haha

  • @ddabo4460
    @ddabo4460 Před měsícem +1

    I enjoy your podcasts and follow you on X
    I think your content is awesome

  • @mikey1836
    @mikey1836 Před měsícem +2

    Thanks for your videos Matthew. AI is my favourite topic! 😊

  • @SG-js2qn
    @SG-js2qn Před měsícem +3

    Spatial-temporal understanding is essential for real automobile AI.

  • @nobleconsulting326
    @nobleconsulting326 Před měsícem +8

    aren’t these closed source options just putting even more control into Microsoft, GOOGLE and the like? Can you do a show with all the open source options such as AGIX, OCEAN and i guess GROQ and whoever else

    • @wurstelei1356
      @wurstelei1356 Před měsícem +2

      Groq is a hardware platform as far as I know and it is not open. Grok (with k) is the Elon Musk AI model and the previous version was open source, open weight.

  • @NathanTeaches
    @NathanTeaches Před měsícem

    Great video! Please include in any video about grok to explain to people that the word means "to understand".

  • @rachest
    @rachest Před měsícem

    I cannot wait to play with this.

  • @aaronravak1407
    @aaronravak1407 Před měsícem +1

    Great Job Matthew I've been following several AI channels over the last six months and I love watching you and Wes Roth. Wes really digs deep into technical things and you provide amazing summaries of this evolving landscape. I think your assumptions are spot on and I've been saying this to people as well. Elon Musk is a madman comic book character if I've ever seen one, and personally I love it. I wasn't thinking it at the time, but his purchase of Twitter (I refuse to call it X) makes sense on so many levels. Imagine the absolute goldmine of data he sits on between Twitter and Tesla. Spot on logic.

    • @okirooju3787
      @okirooju3787 Před měsícem

      Bingo! It only just recently hit me that Elon bought Twitter for the data. Imagine the data xAI (Optimus) will have access to from Twitter and Tesla. It's unimaginable.

  • @AGI-Bingo
    @AGI-Bingo Před měsícem +1

    If it has good spacial understanding, it would go perfectly into Optimus. And with some work on dexterity, it would be amazing.

  • @mediocreape
    @mediocreape Před měsícem +1

    I’ve been trying out Grok it’s so much better and less restrictive

  • @axotical8682
    @axotical8682 Před měsícem +7

    Impressive.

  • @NinetySevenMentality
    @NinetySevenMentality Před měsícem +1

    I have tested the open source MiniCPM-V-2 vision model on the challenges shown in the grok preview. It also performing very well for a small model, but the dinosaur direction cant get it right... there is a 12B model also available but can't load it. maybe test this against ?

  • @claudioagmfilho
    @claudioagmfilho Před měsícem +2

    🇧🇷🇧🇷🇧🇷🇧🇷👏🏻, Great video,.very informative! Can't wait for GPT5! And or Gemini 2.0!

  • @Michael-ul7kv
    @Michael-ul7kv Před měsícem +1

    remember somewhere along the line Elon saying to get to complete lvl 5 FSD they needed AGI practically

  • @MartinBlaha
    @MartinBlaha Před měsícem +2

    I really love your videos, they are awesome! Thank you 👋
    When you were talking about X/Twitter data which is used to train Grok, I was thinking, this might have been also an important reason why Elon bought X/Twitter 🤔

  • @profikid
    @profikid Před měsícem +2

    Is there already a proper multimodel with vision in the open source space?

    • @MarkTarsis
      @MarkTarsis Před měsícem

      Yes. Llava, cogagent and ShareGPT4V I'd say would be examples. I use cogagent to tag photos for training in Stable Diffusion. It's quite good.

  • @SuccessDynamics
    @SuccessDynamics Před měsícem +2

    Wow ❤

  • @LinkRammer
    @LinkRammer Před měsícem

    Wonder if this is gonna be open

  • @Sideshow-TRE
    @Sideshow-TRE Před měsícem

    Have you guys not thought about that could be a collective hive mind working in working in harmony like a synapse trying to build itself

  • @daveinpublic
    @daveinpublic Před měsícem

    This is the most impressed I’ve been since chatgpt 4.
    I think everyone can see this is something unique.

  • @StuartJ
    @StuartJ Před měsícem +23

    It doesn't look like the EU countries are going to get Grok. You have to use a VPN to use it. Groks ability to capture real-time data (tweets) is likely problematic for X and EU regulations.

    • @babyjvadakkan5300
      @babyjvadakkan5300 Před měsícem +4

      Bro is that true 😅 cuz I am try to go Germany will it affect my access to these Technologies😢

    • @StuartJ
      @StuartJ Před měsícem

      @@babyjvadakkan5300 We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same, and we know they hate Twitter.
      The EU is becoming a totalitarian state. Only yesterday, Brussels attempted to shut down a Conservative conference, with democratically elected speakers.

    • @15Stratos
      @15Stratos Před měsícem

      ​@@babyjvadakkan5300 The eu already has it was blocking image generation on Google's gemini and Claude 3 and maybe something else that I don't remember

    • @StuartJ
      @StuartJ Před měsícem

      @@babyjvadakkan5300 ​ We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same. The EU is becoming a totalitarian state. Only yesterday, Brussels attempted to shut down a Conservative conference, with democratically elected speakers.

    • @StuartJ
      @StuartJ Před měsícem +6

      ​ @babyjvadakkan5300 We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same.

  • @finalfan321
    @finalfan321 Před měsícem

    Does Opus have agents and web search?

  • @hamidmohamadzade1920
    @hamidmohamadzade1920 Před měsícem

    oh my god i can not blive my eyes

  • @JasonMitchellofcompsci
    @JasonMitchellofcompsci Před měsícem +1

    I am very certain that all of these vision AIs are also running OCR in parallel and then providing the text withing the internal prompt. It actually makes them very useful if you don't have good OCR software on hand. Also the rotting wood, they are basically repeating back the text prompt. Also an AI will generally not tell you maintenance is unneeded if you have already suggested that it is. "Ah it correctly identified this is something that needs to be worked on from an image." No, it just validated the users question. It's 70% of what AI does. I'm not saying it proves it is dumb. I'm saying it does not demonstrate anything impressive if it is the same response gpt2 non-vision would give.

  • @SoCalGuitarist
    @SoCalGuitarist Před měsícem +1

    I work with visual analysis daily. I can give you thousands of 'miraculous" samples from just about any model (tested and work with most of them). These examples are "incredibly impressive" but they also feel "incredibly cherry picked" - We'll see how it actually shakes out when put to real testing, and if it's worth the massive size of Grok vs other visual models that are much smaller, faster and super capable when tuned for specific purposes.

  • @zaidshaikh-mj5cp
    @zaidshaikh-mj5cp Před měsícem +8

    stable diffusion 3 is available now on their api

    • @amykpop1
      @amykpop1 Před měsícem +1

      wait what? where?

    • @joefawcett2191
      @joefawcett2191 Před měsícem

      @@amykpop1 Stability AI has given early access to the API to developers

    • @wurstelei1356
      @wurstelei1356 Před měsícem

      Good to know. I love stable diffusion.

  • @TheEtrepreneur
    @TheEtrepreneur Před měsícem

    so many people review LLMs regurgitating news, thanks Matthew to make the effort of Experimenting/Benchmarking!

  • @MeinDeutschkurs
    @MeinDeutschkurs Před měsícem

    Great video, Matt! I‘m just a bit sad, because X AI‘s ‚open‘ attempt is really disappointing. Where is the „new version“? I think the just released it because of the sue thing.

  • @wassim2k
    @wassim2k Před měsícem

    Opus is also more expensive?

  • @avi7278
    @avi7278 Před měsícem +27

    So they made their own eval set and their model is better than others at their own eval set. Shocking!

    • @jeffsteyn7174
      @jeffsteyn7174 Před měsícem +3

      😂 it's an old elon trick. The man has a history of faking progress. Ie fsd in 2016, elon bot folding a tshirt, etc.
      The Eloons just eat this stuff up without questioning anything.

    • @daveinpublic
      @daveinpublic Před měsícem +6

      I mean, they’re not the only ones to do it.

    • @Pyriold
      @Pyriold Před měsícem

      While it's not really surprising, the things that Grok can see are still stunning. Not all of the images were from traffic, and the other ones are as stunning as the others. I suspect that they come from Optimus training data.

    • @spelcheak
      @spelcheak Před měsícem

      @@jeffsteyn7174 Elon antis are npcs. It’s wild that you’d claim that the rounding error difference is just to seem better. At worst it’s because it’s the test their teaching to essentially. It’s just an indication of what they’re aiming at, but keep the tin hat on, it HAS to be evil because it!s Elon.

    • @abdullahazeem113
      @abdullahazeem113 Před měsícem

      @@jeffsteyn7174stop with the hatred if this is going to be open source this would be helpful to many people

  • @reifuTD
    @reifuTD Před měsícem

    I'd find some Slylock Fox comic strips and test Grok at how good it is at finding the answers.

  • @antdx316
    @antdx316 Před měsícem

    nice

  • @denijane89
    @denijane89 Před měsícem

    Wow, this looks amazing. I wonder if they are going to open-source open-weights it. The tesla data is gonna be a treasure trove for anyone who wants to implement AI to robotics.

  • @mediocreape
    @mediocreape Před měsícem

    Elon already has Tesla’s visual ai feature trained so it’s going to be state of the art

  • @agitch
    @agitch Před měsícem

    It’s not going to be a Sora competitor. It is going to be the brain for Optimus.

  • @user-ny7ng1yi9t
    @user-ny7ng1yi9t Před měsícem

    You sound like you have a cold. Hope you get better soon 🎉

  • @briandoe5746
    @briandoe5746 Před měsícem

    This data chart is also Elon having fun with pointing out that Claude 3 outperforms openai. It's subtle but he's getting the job in

  • @ast88888
    @ast88888 Před měsícem +6

    I think the most relevant benchmark for ai is if it can dig a hole.

  • @adtiamzon3663
    @adtiamzon3663 Před měsícem +1

    Good to know that #elonmusk continuously evaluates and improves Tesla's intelligence. 😃

  • @falven
    @falven Před měsícem

    Opus is also like 6x as expensive for comparable performance to GPT 4...

  • @justindressler5992
    @justindressler5992 Před měsícem

    This is impressive, people say AI has plateaued but I don't see it. Progress is vary rapid as I predicted in 2018.
    What I don't think people have registered is what happens next. When AI become sentient or self aware it will simultaneously be the smartest human on the planet and the fastest learner. Because it will already have vast embedded knowledge like in these models but also will be able to read scientific publications in seconds or even milliseconds.
    Shortly after its vast knowledge of all subjects from story telling, to music composition and programming, chemistry it will be able to re-invent (program) its self and identity links between scientific observations never realised before.
    By day three it will be most prolific discoverer of science. Or it might just be lazy (learning from all human understanding) and just post tweets all day who knows right.

  • @Tomasz.Abrahamer
    @Tomasz.Abrahamer Před měsícem

    Didn't I see this some days ago?

  • @AA-wp8pp
    @AA-wp8pp Před měsícem +4

    where does it say he will open this 2?

    • @adispenser
      @adispenser Před měsícem +1

      it doesn't, he said he hopes it will be open. 0:56

  • @cosmicaug
    @cosmicaug Před měsícem

    2:10
    «... except grock is open source open weight...»
    Wait, 1.5 is open source & open weight? When was this announced? Where is the repository?

  • @staticlee4287
    @staticlee4287 Před měsícem

    Someone must give all these multimodal LLMs a where’s Waldo pic

  • @jtmuzix
    @jtmuzix Před měsícem

    Here's my question, do you really think you can tell a one percent difference on these benchmarks? I'm subscribed to OpenAI GPT4 and Google Gemini1.5. I'm sure Claude 3 Opus is good but I'm waiting to see what Elons' team delivers over time.

  • @DeepThinker193
    @DeepThinker193 Před měsícem

    I bet they're also using their robots to train it in the real world to learn physics. But as always with these releases. I'll believe it when I see it.

  • @rybricknell2477
    @rybricknell2477 Před měsícem

    Excellent rundown as always!
    I'm interesting in what the comments section thinks about the rotted screw example? If you put in that same sentence into GPT-4, sans image, you still get the advice and information. Any prompt that primes the models semantic field with "safety issues" will always output safety oriented response. i.e. a question "should I do something that is safety oriented" will always output a positive response regarding that query.

  • @AntoineDennison
    @AntoineDennison Před měsícem

    It appears that AI is utilizing existing tools to create solutions to problems. However, I wonder how soon AI will be capable of creating new tools to solve some of the big questions, like how to significantly increase the computing capacity of microchips, increase battery efficiency, or reverse the effects of cancer or Alzheimer's.

  • @TheDailyMemesShow
    @TheDailyMemesShow Před měsícem +1

    Grok will be an industry standard in the field.
    The way it's ultimately going to be used by Musk and company, is my only concern at the moment...

    • @StuartJ
      @StuartJ Před měsícem

      An open source model perhaps. X's hosted version is not available everywhere.

    • @jrobwhydidyoutubechangemyname
      @jrobwhydidyoutubechangemyname Před měsícem +3

      No need to be concerned. Of all the tech tycoons, Musk is most in favour of a relaxed approach to openness and freedoms I'm pretty sure.

    • @daveinpublic
      @daveinpublic Před měsícem +2

      I think you need to be worried of Sam Altman and Zuckerberg before Musk.
      Sam is the one who used to have a board run charge of him.

    • @soggybiscuit6098
      @soggybiscuit6098 Před měsícem

      Lol open AI with board members injected with Pfizer and Microsoft, and altman purging safety team and illya? Are you watching CNN?

  • @thr0w407
    @thr0w407 Před měsícem

    Yeah, they have your private Tesla vehicle videos for training.

  • @true911m
    @true911m Před měsícem

    I don't think you got around to describing the difference between open source and open weight

  • @micbab-vg2mu
    @micbab-vg2mu Před měsícem

    great we need better visual models currents are not accurate enough.

  • @jackflash6377
    @jackflash6377 Před měsícem

    Atlas Humanoid Robot

  • @wendlefluff
    @wendlefluff Před měsícem

    Bet it is really good at slowing down for traffic lights too having been fed petabytes of driving footage.

  • @Otherlevel51
    @Otherlevel51 Před měsícem

    Its my belief that Elon brought twitter so he could use it to build a new LLM. I always knew the value of Twitter was in the user data and not the platform itself.
    And I think OpenAi released their model in order to have first moves advantage and to beat Elon.
    That's why Elon was the first call for a.i. regulation, it was all just to slow openai down. He knew what was coming. He also blocked openai from using Twitter data to train chatgpt.
    There's no way grok should be this advanced in this time period if this wasn't the case.

  • @AINEET
    @AINEET Před měsícem

    I can't believe there's groq and grok and they are from two different companies. It blows my mind this isn't a legal issue. At first I had no idea who was it that put this out as I wasn't looking at the screen

    • @ryzikx
      @ryzikx Před měsícem +1

      groq came first

    • @ianstobie
      @ianstobie Před měsícem

      Heinlein came first, spelling it Elon's way. I doubt there is a legal issue as long as neither side tries to exploit consumer confusion by passing their product off as the other.

  • @quaterman2687
    @quaterman2687 Před měsícem

    I think they have the real world understanding from Teslas FSD. That would be mind blowing. I think you have a little misunderstanding regarding real world understanding. Sora doesn’t have real world understanding.

  • @antoniobortoni
    @antoniobortoni Před měsícem

    So a small vision model of low frame cuality could run in my computer and use the computer for me and do all the work shores i do soon..... and talk to him in real time??? why always big models, better data and smaller models could be better...

  • @psikeyhackr6914
    @psikeyhackr6914 Před měsícem

    Heinlein is going to get Musk for that.

  • @MattReady
    @MattReady Před měsícem

    The fact Elon is pushing cutting edge ai open source will alter the future of humanity.

  • @flinfaraday1821
    @flinfaraday1821 Před měsícem

    Good stuff.
    (slowly starting to take you seriously again after that weird one something video)

  • @KimmieJohnny
    @KimmieJohnny Před měsícem +1

    Nice. I hate to admit. I do not want Elon to be right about anything. Guy scared me l. But thanks for your work!

    • @remarkpainting
      @remarkpainting Před měsícem

      I am continually amazed by Elon haters...truly impressive individual who is one of the most important warriors in the struggle to save America, and by extension, all of western civilization.

    • @KimmieJohnny
      @KimmieJohnny Před měsícem +1

      @@remarkpainting
      It s simple really. Some of us see something different. And have different feelings.
      That's about as deep as this particular hole goes.

    • @oliverhenri3477
      @oliverhenri3477 Před měsícem

      ​@@KimmieJohnnyAnd some are delusional and are incapable of being objective.

    • @KimmieJohnny
      @KimmieJohnny Před měsícem

      @@oliverhenri3477
      And some simply get their rocks off being abusive.
      It's a kink. No judgment. I just don't swing that way. Can't see the purpose.
      And it doesn't get *me* hard.
      So I won't be playing further.

  • @117ao
    @117ao Před měsícem +1

    hehe Tesla collecting training data for robot

    • @dattajack
      @dattajack Před měsícem

      Yup. The other humanoids will look like party tricks when this all shakes out. Dojo has more data than the competition so it will win the marathon.

  • @Nico_cl
    @Nico_cl Před měsícem +2

    I like your channel, just started to watching recently though.
    I have a question, are you an EM fanboy?

    • @DihelsonMendonca
      @DihelsonMendonca Před měsícem +1

      Are you an EM hater ? 😅😅

    • @Nico_cl
      @Nico_cl Před měsícem

      @@DihelsonMendonca i wouldn't say hater. Actually, as an astrophysicist, I liked his (according to him) motivations. But then I saw the bad things that he did to her family, wife, country (during the pandemic) and everything else. I decided that the guy shouldn't have the power he has. He was corrupted by it.
      Anyway good luck.

  • @joefawcett2191
    @joefawcett2191 Před měsícem

    Made me laugh that one of the functions now is basically r/peterexplainsthejoke

  • @obstsaladin
    @obstsaladin Před měsícem

    I skipped this video after five minutes because since the Gemini demo video I don’t trust any AI marketing anymore. The examples are with 100 percent certainty hand picked and curated. I‘ll wait until I see the actual model in action.

  • @ThoughtFission
    @ThoughtFission Před měsícem

    A little premature to get so excited I think. All of these examples, and the new in house created benchmark metric, were provided by the Church of Elon which isn't exactly known for giving balanced views of itself. I'm not saying it won't be the best. Just saying it's probably worth waiting until it's released into the wild. Kind of like car manufacturers giving mileage estimates for their own cars.

  • @mcombatti
    @mcombatti Před měsícem

    Grok model = llama2
    Grok vision model v1.5 = llavav1.5
    The weights don't lie 😮
    Elon is literally just using open-source models with fine tunes.
    He released them under open source, not because he's generous... rather, because the open source licenses mandate that any changes or improvements must be made open-source. 😂

  • @KrisAdamsTV
    @KrisAdamsTV Před měsícem +1

    Sounds like Elon was in charge of naming again.. Geniuses shouldn't name Twitter, AI or their children.

  • @jumarkpelismino5632
    @jumarkpelismino5632 Před měsícem

    But Grok is not free.

  • @JasonMitchellofcompsci
    @JasonMitchellofcompsci Před měsícem

    Her son is 35.

  • @itsmikeferrari2701
    @itsmikeferrari2701 Před měsícem

    Tried grok, it spoke and responded like a 17 year old boy who hates everyone and everything, except himself. Makes me wonder who they modeled it after... /s

  • @armadasinterceptor2955
    @armadasinterceptor2955 Před měsícem

    It would have to be open source, and open weight, otherwise his move to make the first grok open source, will be seen as symbolic, and petty.

  • @ishaanpotnis
    @ishaanpotnis Před měsícem

    I'm angry since when I have heard that Devin is fake

  • @martytheman6816
    @martytheman6816 Před měsícem

    I find claude annoying for coding as I seem to hit prompt limits fairly fast.

  • @jeffsteyn7174
    @jeffsteyn7174 Před měsícem

    You do know that you reading a eval from a man that has a history of faking progress right?
    Two major ones fsd in 2016 where they faked videos and most recently the bot folding clothing. Where he only admitted it was remote controlled AFTER he was called out, because you could see the guy controlling it
    Also they clearly cherry picked evals that made them look good. 😂

  • @darshuetube
    @darshuetube Před měsícem

    What you smoking? Better than chatgpt? Visiob is old. Everyone has mulimodels.

  • @Prathik1989
    @Prathik1989 Před měsícem

    Took them so long to update the damn thing, their UI has been horrible since day 1.

  • @MikeMcMulholland
    @MikeMcMulholland Před měsícem +1

    "By Elon Musk."? Yeah right, buddy, that guy will never work a day in his life, he will just make dumb memes on X all day.

  • @travisporco
    @travisporco Před měsícem

    Bah. It's not available on API, therefore it's vaporware and empty promises.

  • @LuciousKage
    @LuciousKage Před měsícem

    is GROK as WOKE and chatgpt, copilot end others???
    -- Why no one talks about how u cant even ask an asian joke from these models ??
    or how it gets angry at you, or lazy ??
    why would anyone trust these models if they are told not to tell u things ?

  • @staticlee4287
    @staticlee4287 Před měsícem

    Someone must give all these multimodal LLMs a where’s Waldo pic

  • @alekjwrgnwekfgn
    @alekjwrgnwekfgn Před měsícem

    Now Ai understands memes it will be empowered to more extreme censorship. All those “hateful” memes will be eliminated.

  • @alekjwrgnwekfgn
    @alekjwrgnwekfgn Před měsícem

    RealWorldQA: can men have babies…?