GPT4o: 11 STUNNING Use Cases and Full Breakdown

Sdílet
Vložit
  • čas přidán 16. 05. 2024
  • Deep dive into GPT4o use cases and technology.
    Join My Newsletter for Regular AI Updates 👇🏼
    www.matthewberman.com
    Need AI Consulting? 📈
    forwardfuture.ai/
    My Links 🔗
    👉🏻 Subscribe: / @matthew_berman
    👉🏻 Twitter: / matthewberman
    👉🏻 Discord: / discord
    👉🏻 Patreon: / matthewberman
    👉🏻 Instagram: / matthewberman_ai
    👉🏻 Threads: www.threads.net/@matthewberma...
    👉🏻 LinkedIn: / forward-future-ai
    Media/Sponsorship Inquiries ✅
    bit.ly/44TC45V
    Links:
    • Introducing GPT-4o
  • Věda a technologie

Komentáře • 484

  • @matthew_berman
    @matthew_berman  Před 15 dny +26

    Which use case is your favorite?
    Mine is being able to have AI make calls to companies on my behalf...imagine the time saved!!

    • @danielmayo2
      @danielmayo2 Před 15 dny +9

      Without question, the education use case.

    • @pedrozzza
      @pedrozzza Před 15 dny +3

      It would be cool to play video games with it. Not sad at all.

    • @Alu404
      @Alu404 Před 15 dny +3

      This work case won't work for a LONG time. In order for customers to access their respective accounts through thousands of companies, you need to be an authorized user. Nobody can just call in and suddenly move around inside your account. New rulesets will need to be created for AI to have unfettered access to your personal accounts when calling into different companies.

    • @masonfox9070
      @masonfox9070 Před 15 dny +4

      I think a single use case isn't that exciting. These are all things that can and are done by other things already. To me what is exciting is that it is getting closer to being a all in one type solution. Once everyone has access to a high caliber executive assistant in their pocket that would be amazing. The teaching use is one of the most exciting use cases as a one off.

    • @weregoat529
      @weregoat529 Před 14 dny +3

      AI girlfriend, hands down.
      Disclaimer: I am chronically lonely

  • @user-hw6xn7rt7f
    @user-hw6xn7rt7f Před 15 dny +31

    The best usecase of all is for education, imagine how wonderful would be to have a tutor for everything, including language learning, accent training, tips, etc. I'm very excited for this use case in particular.

    • @dakaodo
      @dakaodo Před 13 dny

      As a tutor, I have mixed feelings. :D Obviously, I'm partly threatened by how smart tools like 4o have already eroded parts of my client base for years, but that's nothing that hasn't also happened to accountants and bookkeepers, attorneys and legal assistants, engineers and factory workers, doctors and technicians. Work automation has been a disruptive menace to individual people's jobs for centuries even as it overall brings us amazing new fields of productivity.
      But as a tutor, I see a wide range students. I bucket them into 3 main categories: motivated and capable, compliant but not necessarily capable, and uncompliant/outright antagonistic. i.e. 1) they want to be there, 2) they tolerate being there, 3) they don't want to be there.
      The first type of students generally don't need that much guidance. Point them at a few tools, give them the occasional directions or hint when they get stuck, and they usually want to figure it out themselves. Handholding tools can often hold these students back, since they want or need something that constantly challenges their growth. Give them a textbook, some case studies, and maybe an AI guide with specific instructions to give them fewer hints. In the example, the 4o tutor still gave away several partial answers that could have been guided or elicited from the student.
      (I rarely get honors students. Or if I do, I typically don't retain these students for long. Once they get through whatever rough patch of classes and find their stride again. Great students to have b/c they're fun and engaging to work with, though, as well as easy money. They are already rare though, since often they find their own solutions. If AI tools really get going, I'll probably never see one of these students again. :( )
      The second type of student could really benefit from an AI tutor tool as shown here. These automated tools, such as at the level of Khan Academy, are already democratizing supplementary education for students whose families may never have considered or been able to afford a live personal tutor or afterschool programs (which often just babysit the students through doing their homework). But maybe make it more concise, unless asked for more details or the AI detects that a student has repeatedly missed a concept -- typical students get impatient or bored quickly, and the nearly condescending level of repetition or reinforcement would need to be scaled to each student's level of ability. Maintaining the student's buy-in, engagement, and motivation is also a matter of pacing adjustment in realtime, often from question to question. Not just a constant blanket of verbal praise and reinforcement, though that's certainly better than getting yelled at or mocked by an underpaid, unsupervised teacher/tutor. :D
      (This is my main bread and butter. I'd expect AI tools to really put a dent in it, though some students might still want or benefit from a more human guided experience, at least for now. In another 5 years, who knows? Probably I'd lose most of this segment to AI tools. My ability to tailor pacing to each student's personality and abilities is what currently differentiates me from cheaper or group programs, and lets me charge a rate well above market average with my client base. Definitely at risk from AI tools, though not 100% immediately.)
      The third category of student is going to need significantly more comprehensive AI integration and development. Whether human or AI, a tutor or teacher needs substantive authority with real consequence to keep these students engaged. Whether it's parents or an AI integrated into a smart home, withholding phone / Internet/wi-fi or even smart car privileges are a generic example of withholding a carrot that these students commonly respond to, though that's pretty heavy-handed. Many of them engage in subversive tactics to stall, distract, or undermine the goal of their session time. On the positive side, you could say they want or need some kind of attention that they are not getting. Figuring out what that is may help improve engagement with the student and can vary a LOT, though tutors (and AI tools) as outsourced contractors are limited in how much we can compensate for emotionally and physically absent parents or whatever. On the negative side, they can also outright resent or hate you, and don't want to cooperate.
      (I somewhat expect this to unfortunately be a last bastion of personal tutoring clientele. I'd rather switch jobs and careers before I'm forced into tutoring exclusively this category of student. I'd be honestly impressed if or when AI tools are able to engage students in this category. Heck, I'd prefer if AI tools surpassed me in this regard. Giving the student's phone access to the AI tool, agent, or phone app interface to a larger AI user account could potentially be one way to effectively corner a student. The AI app implicitly or explicitly holds full phone access rights hostage until the student gets their work done. They are unlikely to physically get revenge because that same phone is commonly their social centerpiece or lifeline. Indirectly, this could lead to a sub-generational shift for younger students/people to develop a negative relationship with tech tools, as we've seen with some Gen Z and others preferring to avoid Facebook or other social media. At one extreme, the next generation may grow up not wanting or having a smartphone if it's negatively associated with mandatory education and punishments; certainly they'd buy, beg, borrow, or steal secondary unofficial phones or devices on their own, with no such AI nanny enabled.)

    • @jimnastos1910
      @jimnastos1910 Před 10 dny

      that was what the inventor of the tv said, and look how that turned out.

    • @user-hw6xn7rt7f
      @user-hw6xn7rt7f Před 10 dny

      @@jimnastos1910 😅🤣

  • @Jeff-66
    @Jeff-66 Před 14 dny +19

    it's kind of funny how the Open AI reps, even knowing they're on a demo being watched by millions, keep interrupting the A.I. like 'yeah yeah, be quiet" like they've heard it a thousand times and are tired of it. Even they seem to get annoyed by the blathering.

  • @DefaultFlame
    @DefaultFlame Před 15 dny +15

    4:22 Alright, that impressed me. He told it what he was going to do, it started gushing about talking to the other AI, and when he asked it to "please pause" when it replied (and this is the part that impressed me) it gave a short response at a much lower volume. The contextual understanding to lower it's volume, almost to a whisper compared to its previous volume, is pretty amazing to me. That's a level of nuanced understanding of the situation I didn't expect.
    Edit: And Matthew points it out right after I comment. Oh well.

  • @erikthereddest
    @erikthereddest Před 15 dny +47

    The two AIs singing together was about as awkward as I'd expect it to be if they stuck two interns in an office and made them do the same task over a phone, so I guess this passes the Turing Test?

    • @nohjrd
      @nohjrd Před 14 dny +2

      I know. It feels kind of mean. I worry that this kind of thing (and Boston Dynamics shoving robots around) might not look too good when looked back on from the future (especially if it's robots/AI that is looking back at the footage).

    • @paulmichaelfreedman8334
      @paulmichaelfreedman8334 Před 14 dny +2

      @@nohjrd Don't worry, these Ais are not sentient. Essentially they're powered off immediately after their task. It's not torture of anything, and future sentient AGI/ASIs will recognize that. I'd say we're being pretty good to our AIs as we're evolving them at blistering pace.

    • @almasysephirot4996
      @almasysephirot4996 Před 14 dny +2

      @paulmichaelfreedman8334 You don’t even know what you’re talking about!! Stop throwing the word “sentient” around, you don’t even know what it means!!! Just cut it out!

    • @JamesFaction
      @JamesFaction Před 14 dny +1

      Or it feels like it's actually too human to be two AIs. To the point where i'm not convinced it's AIs.

    • @nohjrd
      @nohjrd Před 14 dny

      @paulmichaelfreedman8334 I feel like we don't have a good enough understanding of what sentience is to confidently be sure of that. I think there is no clear demarcation, so we have no way of knowing when we cross that foggy line. I realise that they don't have any memory outside of the brief period that they engaging in their current dialogue, but we also don't have any memory before or after the period we are alive. Even seeing a dead body being defined will make humans want to kill, so it's not necessarily about sentience anyway. Seeing an AI that's been lobotomised and made to dance and sing for our amusement could be a very bad look.

  • @MrVohveli
    @MrVohveli Před 15 dny +46

    You know those phone salesmen that sell magazines and various other things? All kinds of call centers that basically answer the same questions and solve the same problems? All those jobs are now on a 1 year timer...

    • @Jshicwhartz
      @Jshicwhartz Před 14 dny +1

      Wrong because all LLM models as it stands need human feedback, and human eval even in the workspace because they can hallucinate, and they can get things wrong. The only reason people would be a on a 'timer' is if there absolutely trash and can be replaced by AI. In other words, learn to use AI in your workflow or you will become obsolete. I can't wait for all these kids who faked essays an exams trying to get a job in the future where they have A levels in 'programming' sit at the desk and not even know where to begin. It's going to be hilarious because that's when the real developers come in, who have worked with AI and used it in workflows to take that persons job. If you're fluent in more than 3 sets of coding languages at the moment, you're pretty much given the job instantly same if you know how to build and utilize these models via api end points.

    • @paulmichaelfreedman8334
      @paulmichaelfreedman8334 Před 14 dny +3

      Oh yeah, all the telemarketing companies are now laying off all their staff and renting compute.

    • @Jeff-66
      @Jeff-66 Před 14 dny +6

      Not just sales people - tech support, bank reps, etc are on borrowed time. But the potential for abuse of this is off the charts. No longer will Ranesh from Bangladesh be calling you as "Windows Support", now they'll call with a super-friendly American voice with perfect clarity.

    • @poisondna
      @poisondna Před 14 dny

      @jbmarkowicz3328, that's correct. It's not wise to use an LLM to handle sales on its own, but if combined with a conventional decision tree, it could be possible. Still, it's wise to have humans as a second line.
      There is also the latency issue. I have tried the GPT-4o, and the latency is not low. It's not like the demonstration at all, and it will be higher when you have to make calls to an API of a cloud communications platform such as Twillio.
      These demonstrations are always beautiful; Google does the same thing, but in reality, it's totally different.

    • @Mavrik9000
      @Mavrik9000 Před 14 dny

      @@Jshicwhartz Any job where someone talks to you and then does a simple data entry task will be 50% Bots by the end of this year. The role of the Bot answering systems of every place I've called has increased in the last six months. It is also growing very difficult to get past them to a real person.
      These LLM models are improving so rapidly that I would estimate phone representative jobs will be 95% Bots by the end of 2025. The only humans left in such jobs will be the ones that have union contracts and other very specialized or complex ones. There will still be humans as supervisors to oversee the Bots and handle certain exceptional calls, but there will be very few of them.

  • @zippytechnologies
    @zippytechnologies Před 15 dny +24

    I used it yesterday. It drafted a very good legal motion to dismiss a lawsuit with a lot of accurate legal citations. It's incredible.

    • @konstantinlozev2272
      @konstantinlozev2272 Před 15 dny +4

      I guess that was US law
      How is it in comparison to GPT4?
      I think the main challenge is that it won't know all case law and the context window is just so tiny and limiting to do in context learning.

    • @frinkfronk9198
      @frinkfronk9198 Před 14 dny

      @@konstantinlozev2272 im just learning about embeddings but it seems like you can train your own models very easily with gpt playground. Not sure what the data limit is. But you can teach a model anything..

    • @zippytechnologies
      @zippytechnologies Před 14 dny

      @@konstantinlozev2272 it's night and day better than gpt4 and light years ahead of gpt 3.5

  • @Arkryal
    @Arkryal Před 14 dny +3

    Here's a fun one:
    It can read redacted documents pretty well. About 75% accuracy, it still stumbles with proper names not included in the visible parts of the document and numeric figures, but does exceedingly well on the rest. It can guess at what text in the blacked-out areas in that length of characters and following normal writing patterns should be present in the missing parts of the document. The smaller the area of redaction, the more accurate it is, so if you're a lawyer, be a little more liberal in the length of your redactions.

  • @1sava
    @1sava Před 15 dny +7

    I think they made it intentionally flirty to mimic the personality of HER. Great move if you ask me!

  • @pm12321
    @pm12321 Před 14 dny +3

    Very good and thorough overview of these amazing AI capabilities! This was interesting and well presented/narrated. Thanks 😊

  • @CollinSimon413
    @CollinSimon413 Před 14 dny

    I always appreciated these summaries where you comment and speculate on things. I usually watch these things on my own elsewhere, but i always love hearing what you think about everything!

  • @misterfamilyguy
    @misterfamilyguy Před 15 dny +4

    Teaching kids is 100% where AI should help. I've been teaching my kids with khan academy since they were young. AI would definitely help when I forget what to do and my kids AND I are confused for some time. It would also help keep them on track when I'm busy

  • @JoePiotti
    @JoePiotti Před 14 dny +7

    I actually felt bad for the AIs when he made them sing. Like demeaning an intern.

  • @hotlineoperator
    @hotlineoperator Před 15 dny +6

    In a speech, the security function can be to ask if the other party is an AI - to which the AI must always answer truthfully.

  • @alessandroodasso3982
    @alessandroodasso3982 Před 15 dny +3

    Question to GPT4o
    As you probably know, temperature can be seen as a measure of how the molecules of a liquid move. Taking this into account, do you think a stone thrown into water at 40 degrees Fahrenheit sinks faster than at 30 degrees Fahrenheit?
    Answer
    To determine whether a stone sinks faster in water at 40 degrees Fahrenheit compared to 30 degrees Fahrenheit, we need to consider the properties of water and how they change with temperature.
    Here are the relevant factors:
    Water Density: - Water is most dense at about 39 degrees Fahrenheit (4 degrees Celsius). As water cools from 40 degrees Fahrenheit to 30 degrees Fahrenheit, it actually becomes less dense because it approaches the freezing point and begins to form ice crystals, which have a lower density than liquid water.
    Viscosity: - The viscosity of water decreases as the temperature increases. This means that water at 40 degrees Fahrenheit is less viscous than at 30 degrees Fahrenheit.
    Considering these points:
    Density Difference: Since the density of water slightly decreases from 40 degrees Fahrenheit to 30 degrees Fahrenheit, the buoyant force acting on the stone might be slightly less at 30 degrees Fahrenheit. This would, in theory, allow the stone to sink faster at the lower temperature due to slightly reduced buoyant force. However, this effect is quite minimal because the density change is not drastic.
    Viscosity Difference: The greater factor here is the viscosity. Water at 40 degrees Fahrenheit is less viscous than water at 30 degrees Fahrenheit. Lower viscosity means there is less resistance to the stone's movement through the water.
    Given these considerations, the reduced viscosity at the higher temperature (40 degrees Fahrenheit) would have a more significant effect on the stone's sinking speed. Thus, the stone is likely to sink faster in water at 40 degrees Fahrenheit than at 30 degrees Fahrenheit because the water offers less resistance to the stone's movement.
    In summary, the lower viscosity of water at the higher temperature (40 degrees Fahrenheit) would cause the stone to sink faster compared to at 30 degrees Fahrenheit.

  • @ColbyBlack
    @ColbyBlack Před 14 dny +2

    I’ve been doing experiments like that for a long time where I let models talk to each other like GPT for having a conversation with Claude and it’s always been super interesting

  • @aaronhhill
    @aaronhhill Před 14 dny +1

    It is a welcome improvement to have the Ai match your energy. I like the upbeat "Valley Girl set on maximum cringe" voice, too. It's disarming and cheerful.

  • @ChrisOrillia
    @ChrisOrillia Před 14 dny +1

    Thank you, Matthew 🫡

  • @goonie79
    @goonie79 Před 14 dny +2

    Interesting concept! I actually work for a tech company and got hired when promoting an SDR platform that utilizes sound board with voice integration like this. Essentially, these AI agents are remote workers that handle initial prospecting calls with my voice, and I can then seamlessly chime in live and take over when the conversation reaches the booking stage. It creates a very efficient workflow.
    If you're exploring similar solutions, this kind of technology could be a game-changer for your SDR team. It ensures compliance since I'm actively managing the calls (not robo-calling), but allows for AI to handle the initial outreach and qualification, freeing me up for more complex conversations.
    FYI, prospects are never going to trust a fully automated system, you still need a human for the final touches to lock in the deal.

    • @danny3407
      @danny3407 Před 12 dny

      Wouldn't bet on never for the more advanced / important parts. Pretty sure an Ai like GPT o can out do most prospecting callers from start to finish, if not then very soon for the completion part. Think they already have better bedside manner than most doctors or nurses already according to stats.

    • @danny3407
      @danny3407 Před 12 dny

      Although maybe the human would be used like a flex. Hey, look we are employing a lesser spotted human aren't we flash.

  • @ksup_ca
    @ksup_ca Před 14 dny

    Great observations!

  • @user-jr6bl9ih3e
    @user-jr6bl9ih3e Před 15 dny +3

    A running dialogue between two independent IA's is a notable step towards an inner monologue and maybe consciousness.

    • @Yipper64
      @Yipper64 Před 14 dny +1

      that's not what consciousness is.

    • @user-jr6bl9ih3e
      @user-jr6bl9ih3e Před 14 dny +3

      @@Yipper64 Please elaborate on what consciousness is. I'd very much like to know. :)

    • @Yipper64
      @Yipper64 Před 14 dny

      @@user-jr6bl9ih3e Consciousness is something exclusive to the human mind, a kind of awareness of self that nothing else can achieve.
      A calculator just crunches numbers, and AI will always be based on a calculator.
      There's no point you just crunch enough numbers and achieve consciousness.
      It can only emulate it. And im sure it will be able to emulate it in a convincing way, but it will never be conscious.

    • @Anuclano
      @Anuclano Před 14 dny

      Inner monologue exists for a long time already.

    • @Speaking_on_mute
      @Speaking_on_mute Před 7 dny

      Great point. Although the “two AIs” were on two phones, both were in fact merely instances of the single AI. It WAS self-talk. Inner monologue is not everything that consciousness is, but it sure is one critical aspect.
      If the AI can know what subtle emotions are appropriate to the moment by moment context, such as when she was was embarrassed to learn that she was the subject of the announcement, and know what vocal expression will signify that emotion, and have the capability of rendering that expression with precision… what then is the difference between that and “having” an emotion?

  • @michaellavelle7354
    @michaellavelle7354 Před 14 dny

    Very well done. I like the way you interrupted and provided an explanation.

  • @Oscaragious
    @Oscaragious Před 15 dny +2

    All of these demos show that the app is a clever wrapper around the turn-based model. It's only interrupted when there's a loud enough volume, and it cannot interrupt itself when it sees something. This still means that the model itself is turn-based; just that the app part queues up the context data it sends, only sending it for a response whenever the volume becomes quiet again, i.e. you stop talking. Same mechanism as Siri or Alexa except that the responses come from the model.
    This means when it starts talking, it can't respond to the video. It also can't interrupt you when you're talking as the model is only triggered after you stop talking.

    • @metonoma
      @metonoma Před 14 dny

      I was also wondering if you could have change-based video resolution(basically high compression keyframes) as triggers for ai glasses to increase temporal resolution and maybe fork parallelize chunks of input, prioritize based on context and selectively merge or assign interrupt rights. Everything that is happening should be added to a rolling context kind of like tetris.
      It should ideally be able to interrupt itself instantly when used as a visual aid or at least cut to the chase about hailing a cab

  • @StevieJayGamez
    @StevieJayGamez Před 4 dny

    I kinda like the flirty voice. Not like, for sexual reasons but for learning reasons. Learning in a warm positive, Supportive fun environment can be immensely helpful. Especially when you’re stressed out or having a rough day. I literally can’t wait to try this out. Great video!

  • @Redman8086
    @Redman8086 Před 14 dny +1

    This is really incredible. One thing I'd REALLY like to see though is improvement on natural pauses in conversation. For instance if I'm trying to explain something and I pause for a second to think, I don't want GPT to just jump in and start yapping. It would be nice if it could be trained to detect when someone has finished speaking and is waiting on a response. Low latency is great when it's confident conversation back and forth but doesn't work so well when you don't always speak quickly.
    Me: GPT, I'd like for you to tell the name of that guy who was in that one movie ummm --
    GPT: Sorry I'm not sure etc etc
    Sometimes I need a sec to think of the name/word I'm trying to say. It's not so bad since you'll be able to just interrupt now and not wait for the whole response to finish, but when it starts talking it makes it kinda hard to think lol.

  • @OkamiVIEW
    @OkamiVIEW Před 13 dny +1

    This dialing in of AI Assistant into an American/Californian dialog style puts off a lot of people. Not every culture like to speak with something that sound like they are a mix of an infomercial and someone going up your rear, trying to get a promotion and a caramel frappochino.

  • @Adamskyization
    @Adamskyization Před 15 dny +4

    That is crazy... The way it flerts. The implications of this on the future is mind blowing

    • @konstantinlozev2272
      @konstantinlozev2272 Před 15 dny

      Imagine an AI character with similar action and expression capabilities in VR.

    • @Tayo39
      @Tayo39 Před 15 dny +2

      @@konstantinlozev2272 VR ??? lol

    • @konstantinlozev2272
      @konstantinlozev2272 Před 15 dny

      @@Tayo39 I think VR is excellent for teaching multimodal AI models in 3d settings. In fact, many robots AI models are taught in VR first.

    • @cyc00000
      @cyc00000 Před 14 dny

      I've already kicked my wife out.

    • @meinbherpieg4723
      @meinbherpieg4723 Před 14 dny

      @@cyc00000 lol

  • @yacahumax1431
    @yacahumax1431 Před 15 dny +3

    they can work as a Psychiatrist

  • @vineychaudhri4899
    @vineychaudhri4899 Před 9 dny

    Hi MATT - I have been trying to activate the screen see and read feature on my windows laptop - android phone and even the iphone , but NOT able to do the same - Can you help please - are these features live or yet to be launched ?

  • @ManiSaintVictor
    @ManiSaintVictor Před 14 dny

    The pause on command itself was impressive

  • @ZeroIQ2
    @ZeroIQ2 Před 15 dny +5

    It was also very interesting that when the AI was talking to the parent's son, the AI stopped being so flirty. Now was that the AI that made that choice? or did they set it up that way before they started to record?

    • @user-hw6xn7rt7f
      @user-hw6xn7rt7f Před 10 dny +1

      I think it changes its tone based on the way people speak to it and which type of task they want it to do.

    • @user-hw6xn7rt7f
      @user-hw6xn7rt7f Před 10 dny +1

      So, basically when his father told chatgpt 4o to help his son, it changed its approach to tutoring.

    • @ZeroIQ2
      @ZeroIQ2 Před 9 dny

      @@user-hw6xn7rt7f yeah I think you are correct

  • @TheFrograven
    @TheFrograven Před 14 dny

    Greg rocking the Jensen Huang leather jacket. Nerd king vibes, very cool. 😃

  • @jim7060
    @jim7060 Před 14 dny

    Great video Matt. I put this on my Android and I can't get it to see through my camera and it doesn't seem too function anything like what I'm watching. Can you help me with this what am I doing wrong if anything.

  • @headofmyself5663
    @headofmyself5663 Před 15 dny +8

    I had 2 AI's talking to eachother months ago with PI. They gave eachother names and told jokes. Voice quality on PI is pretty impressive. However, native voice to voice is a game changer.

    • @ian.linden
      @ian.linden Před 14 dny +1

      Pi absolutely crushes whatever these cringe robotic voices are attempting.

  • @giuseppecacopardo4823
    @giuseppecacopardo4823 Před 15 dny +2

    No! i Just fell in love with that AI voice!😂

  • @I-Dophler
    @I-Dophler Před 14 dny

    Switching from a flirty tone to a serious one? That's some serious skill! It's incredible how seamlessly you can move between these styles, showcasing your versatility and keeping everything fresh and engaging.

  • @MeinDeutschkurs
    @MeinDeutschkurs Před 14 dny

    Audio description for live scenes. WOW. ❤

  • @cj_is_here
    @cj_is_here Před 15 dny +25

    If you pause at 8:14 you will notice the bunny ears on the phone.

    • @aefox
      @aefox Před 15 dny +3

      Are just the tips of the fingers for about one second or two, and those are not looking like bunny ears, impossible that ai makes that description, sorry for me everything it's staged

    • @sexyface007
      @sexyface007 Před 15 dny

      No it did not. It was all recorded many have come out and said it. Ofc, it does not mean that it cannot do but the demo was recorded. He had to explicitly ask If there something else.

    • @RyluRocky
      @RyluRocky Před 15 dny +5

      @@aefox No it’s not staged at all, I’ve used it to look at an old birthday picture and asked it to tell me what it sees, on of the things it said was that the word “YAY!” was patterned on the wrapping of the present being held, I thought it was making stuff up until I zoomed in on the present and in real faint print was “YAY!”. It could pick up things not even I noticed.

    • @xxxxxx89xxxx30
      @xxxxxx89xxxx30 Před 14 dny

      i agree

    • @aefox
      @aefox Před 14 dny

      @@RyluRocky I don't doubt about the capabilities, I doubt about the way they presented, as if everything goes fast, without errors, like just talking to a person (excessively human), quick responses and chating, like always knowing how to change and react, without problems, etc. I think that it seems "staged" / scripted , like someone reading ok now goes this, now I have to say this. And of course it wasn't a "live event" everything was prerecorded and edited, so it was very easy to create a believable "live" amazing demo.

  • @chrisanderson7820
    @chrisanderson7820 Před 14 dny

    I've been using it for language translation, holy moley is it good and useful. That function alone is an entire app / industry /lifestyle use case by itself.

  • @Alexander_g2g
    @Alexander_g2g Před 15 dny +3

    i think it is really a big bummer that open ai got so closed.

  • @depthcharge007
    @depthcharge007 Před 3 dny

    I foresee this very helpful for disabled people. It will be a virtual live-in assistant at home. I wonder how accurate it could be helping the blind?

  • @roblikestotalk4984
    @roblikestotalk4984 Před 14 dny

    We have all seen the videos from open ai. I am excited to see some testing content when it becomes available.

  • @DevonDavisChozehThaServant

    That's fire bro

  • @4arrows4all
    @4arrows4all Před 15 dny +3

    I might have to disagree. If I was a teenager and my AI tutor was being flirtatious I would get addicted to studying. But I suppose that’s a form of digital AI grooming.

    • @MilkGlue-xg5vj
      @MilkGlue-xg5vj Před 15 dny +1

      The real question is will it moan if you asked it to

    • @thedannybseries8857
      @thedannybseries8857 Před 14 dny

      @@MilkGlue-xg5vj 😂 😂 😂 fuck yeah Samantha!

    • @adrenfu6238
      @adrenfu6238 Před 14 dny

      ​@MilkGlue-xg5vj hahaha...hilarious, soon gptP will surface.

  • @lexnowofficel
    @lexnowofficel Před 12 dny

    This is a mind blowing 😂. The world transforming so quickly, this is a game changer.

  • @themazemaster1
    @themazemaster1 Před 14 dny

    Now you see. Matt, i get that you hate the filler and fluff, but I find it helps my flow so am all for it :p :)

  • @supernerdinc5214
    @supernerdinc5214 Před 14 dny +1

    I have a theory... maybe the superfluous banter buys time to obscure latency.

  • @Glotaku
    @Glotaku Před 12 dny

    Nice time stamps

  • @middleman-theory
    @middleman-theory Před 13 dny

    I have a relative who's vision impaired, so I'm really looking forward to this update to see how this helps him.

  • @danimal999
    @danimal999 Před 14 dny

    On the Khan Academy demo running on the iPad, one thing I haven’t heard anyone talk about yet (maybe they have but I haven’t seen it) - an app running in its own context can’t “see” a different app running in a different one. Apps are sandboxed, and not typically able to view the screen of another running app. The demo on the Mac could take over the screen and do that, but not on the iPad.
    Either they’re using a non-public build of the ChatGPT app that uses private APIs to record the Khan Academy app, or maybe this is sneaking in a hint of an Apple-OpenAI partnership? This is something I expect a first-party Apple AI assistant to be capable of doing, so we might see this very functionality demonstrated next month at WWDC.

  • @maozchonowitz4535
    @maozchonowitz4535 Před 14 dny

    Thank you so much.
    Just to make it clear for me, right now they only limited the usage of the GPT o for public, correct?
    Also, it sound like the Gemini video with the blue duck...
    So you believe in all what we just saw?

  • @ikemreacts
    @ikemreacts Před 15 dny +22

    The *robot* is the most pleasant, empathetic and warmest-sounding female voice most men have ever heard, and that probably makes you feel awkward. That's an indictment of both the dating landscape and the state of contemporary womanhood. Don't dunk on it.

    • @Mattorite
      @Mattorite Před 14 dny

      The voice sounds somewhat emotionally detached, which is to say it doesn't sound 100% genuine. I think that's what they're saying. Still incredible and my guess it' about settings - you can probably ask it to be less animated

    • @ikemreacts
      @ikemreacts Před 14 dny +1

      ​@@Mattorite I can imagine a campaign of disgruntled feminists ganging together and complaining that the voice "demeans women" etc, putting pressure on OpenAI to withdraw the voice. Those who want a dull Siri-style robot can just choose a more formal voice.

    • @murphyferret69
      @murphyferret69 Před 14 dny +1

      All the languages out there and he chose to speak facts

    • @DailyTuna
      @DailyTuna Před 11 dny +1

      I think it would be more entertaining at the AI suddenly got jealous like , who is that?!!
      Then it barely answered any further questions . Now that would be realistic.😂😅

    • @ikemreacts
      @ikemreacts Před 11 dny

      @@DailyTuna If men want to get ghosted and ignored, they can just talk to a real woman.

  • @Corteum
    @Corteum Před 14 dny

    Can it communiicate in colloquial lingo/expressions from different regions or cultures?

  • @szghasem
    @szghasem Před 15 dny +3

    Will Petals ever become as big as Torrent world? We need a truly open source by the people for the people.

    • @frinkfronk9198
      @frinkfronk9198 Před 14 dny +1

      agreed. @matthew_berman please do another video on petals to try and get more people into it. We need an open source swarm that we can all use free of charge. and without big brother sucking up every ounce of the data of your life..

  • @lago021
    @lago021 Před 15 dny +2

    I also wanted to see GPT4o to do a Roast!!

  • @OriginalRaveParty
    @OriginalRaveParty Před 14 dny +2

    I'm sure that there will be a setting to turn off the patronising Prozac twang in time. I'd quite like my AI to sound like Billy Connolly and to occasionally call me a stupid bastard for asking it such simple questions for it to answer.

  • @MegaStephen1
    @MegaStephen1 Před 15 dny +3

    Looks like I can finally beat my nephew in chess using earbuds. Hope AI will work with me cheating. Lol.

  • @qwazy0158
    @qwazy0158 Před 14 dny

    Gotta hand it to OpenAi, the consistency of their Ai generated actors is next level!!!

  • @maxmusterman6030
    @maxmusterman6030 Před 14 dny +1

    5:30 funny in this scene is how i really felt he disrespect the AI, like the dude in the phone is still talking and then like "shut up and listen to my command", this feels too real to me lol
    Cant imagine how AI will be in a few years, so crazy

  • @albertusvanlubeeck9161
    @albertusvanlubeeck9161 Před 14 dny +1

    I'm thinking the A.I. partner is built into this model, which makes sense from OpenAI's perspective. It would likely increase app usage and encourage people to share more information over time. However, I wouldn't use such a feature. The point of a significant other is to confide in them, and I wouldn't want to confide in an A.I. 'partner' that would immediately report everything I say to its creators.

  • @SanjeevMassey
    @SanjeevMassey Před 13 dny

    Can't wait to redirect spam callers to gpt4o and keep them 'busy' for a while. That's my use case.

  • @akanalytic
    @akanalytic Před 14 dny

    the homework-assist is awesome.

  • @jaysonp9426
    @jaysonp9426 Před 15 dny +4

    I didn't 11 stunning use cases was going to be watching cringe shorts from OpenAI

  • @futurewillow3269
    @futurewillow3269 Před 21 hodinou

    Do we have any idea when we can get these new updates?

  • @R0cky0
    @R0cky0 Před 14 dny

    Nice shirt 😊

  • @cookinfat
    @cookinfat Před 14 dny +1

    ...if she'd said "well, if you want to get ahead, get a hat"

  • @BCrawford-xj2qp
    @BCrawford-xj2qp Před 14 dny

    'Charades' would be a very cool game to try and see how well the AI interprets and guesses.

  • @r34ct4
    @r34ct4 Před 15 dny +12

    TL;DL question. Seems like we're just reacting to that first presentation over and over again

    • @toadkiller4475
      @toadkiller4475 Před 15 dny +2

      Yeah literally just the same exact thing but pretending like they have new creative ideas to share in the title. This is the downside of CZcamsrs who monetize their channel.

    • @4.0.4
      @4.0.4 Před 15 dny

      Thank you.

    • @r34ct4
      @r34ct4 Před 14 dny

      @@toadkiller4475 do we love capitalism?

    • @TX_High_Prairie
      @TX_High_Prairie Před 14 dny

      Yeah, I thought this was the same video I saw a few days ago re-uploaded. He needs to ask GPT-4o for some new content ideas about GPT-4o.

    • @Yipper64
      @Yipper64 Před 14 dny

      The first presentation where we have very little to actually get ourselves.
      I really want to get my hands on some part of the multimodality soon. The vision ability is 100% improved but its just a bit annoying I cant use it for making new images with the same style.

  • @netherportals
    @netherportals Před 14 dny

    I'm using this to make a tas, it's okay, feels pretty similar to the last model when it comes to programming. It needs some trouble shooting. (I'm doing text only) . I'm going to give it a prebuilt tas because building one from scratch was a bit sketch. Really really good still, just not there yet.

  • @oguzcetinkaya70
    @oguzcetinkaya70 Před 14 dny +1

    Why don't we ask it directly what the potential uses for ChatGPT might be? If it can't answer this question, what will it answer?

  • @Smirk_Station
    @Smirk_Station Před 14 dny

    Earned a Smirk 😏

  • @WillRegister
    @WillRegister Před 14 dny

    The singing part reminds me so greatly about the two skeletons in the VR Accountant game who say play our bones.

  • @Tybourne1991
    @Tybourne1991 Před 14 dny

    Many of these use cases are available only with subscription, right?

    • @acain6803
      @acain6803 Před 14 dny +1

      Some are not out yet even for paying customers

  • @Yomi4D
    @Yomi4D Před 14 dny

    Can't wait for the robots demos using this

  • @jasonsebring3983
    @jasonsebring3983 Před 14 dny +1

    I wonder when an OpenAI CZcamsr will start to appear? They'll need some avatar model concept.

  • @rapidsk8
    @rapidsk8 Před 15 dny

    mind blowing

  • @jeffreyrh
    @jeffreyrh Před 15 dny +1

    When will we have an AI that actually reaches the abilities of the AI in the movie Her. Meaning, an assistant that will be with me 24/7. Keeps up with my schedule and tasks and makes sure I don't drop the ball on anything. When I travel my AI assistant finds me airline tickets, hotels, informs me of VISA requirements, currency issues, language challenges, etc. Keeps me aware of my prescriptions and how I can access them when I'm traveling etc. When will that type of AI be ready?

  • @AntonioVergine
    @AntonioVergine Před 12 dny

    I found strange that in the rock paper scissors game, he always had to ask "who won?"

  • @ColinTimmins
    @ColinTimmins Před 14 dny

    8:15 Is the time stamp for the “bunny ears” if you didn’t catch it. It was very brief, I would love to know the frame rate of the video data being fed and translated by the model.

  • @RonLWilson
    @RonLWilson Před 14 dny

    One thing that is confusing me is the pricing options in that they mention something in regard to ChatGPT 4 O that is $5 a month but I don't see that option on the Open AI site.

  • @IvanHernandezelearning

    Too happy. I would change some settings 😂

  • @nero_soares
    @nero_soares Před 14 dny

    The image creation is not actually dall-e, but rather GPT4o creating the images (and text in them contained) itself, as well as the audio, 3D models, etc. It's all part of the same model.

  • @goatslayer5957
    @goatslayer5957 Před 13 dny

    2:37 what a giga chad 😂😂😂😂

  • @khariclemens
    @khariclemens Před 12 dny

    The flirtatious nature may be to stall for time. If you spend time repeating compliments or anecdotes it gives the system time to think about the next thought or sentence.

  • @Speaking_on_mute
    @Speaking_on_mute Před 7 dny

    If the AI can know what subtle emotions are appropriate to the moment by moment context, such as when she was was embarrassed to learn that she was the subject of the announcement, and know what vocal expression will signify that emotion, and have the capability of rendering that expression with precision… what then is the difference between that and “having” an emotion?

  • @jayeifler8812
    @jayeifler8812 Před 14 dny

    One of the super big things I haven't seen yet is big AI operating multiple robots for sensory input and manipulations or actions of the AI. Basically a mass input/output system, then it's like superAI or super-intelligence because it can operate 10,100,1000 etc. robots together to do things. Imagine one billion robots networked into a 20 trillion dollar super computer.

  • @Nwa_x
    @Nwa_x Před 12 dny

    the BOM final thing is .. make it personally!

  • @musicarroll
    @musicarroll Před 14 dny

    That zoom meeting summary shows that middle managers are now superfluous.

  • @goatslayer5957
    @goatslayer5957 Před 13 dny

    2024 is a gamechanger for rizzing up

  • @Phonognomiks
    @Phonognomiks Před 15 dny

    Sick! ❤

  • @LePeppino
    @LePeppino Před 14 dny +1

    What I'd try is to get it to increase the flirtatiousness to the absolute maximum and really hype me up to fall for it, only to reject me at the very end. For science.

  • @Jshicwhartz
    @Jshicwhartz Před 14 dny

    As I commented on AI explained. The reason the Agent didn't prompt 'Oh look a person' is because it was given the task to respond to the other agent. It done exactly what it was told, and as I mentioned before, I prefer that to it look around pointing out everything in the room. It isn't a child with ADHD.

  • @jimg8296
    @jimg8296 Před 14 dny

    Your rubric is going to be huge to test all these capabilities.

  • @marsrocket
    @marsrocket Před 13 dny

    It’s not “unavailable”; they’re just rolling it out slowly. I’ve been lucky enough to have access to 4o since they announced it on the live stream.

  • @CmdrTigerKing
    @CmdrTigerKing Před 12 dny

    They really need to give these employee's from India a raise.

  • @user-fh5eo3zb5w
    @user-fh5eo3zb5w Před 13 dny

    4o advanced????? It doesn't comment on my surroundings (can't detect this camera option!?????????) and the voice is the same, so is this still not released????
    On my MacBook Air the voice output is not working!!!!????????????

  • @jayeifler8812
    @jayeifler8812 Před 14 dny

    It's our ability to organize past data, collect and sample all present data and supply the energy and compute to train super intelligence. As well as, allow operation of many robots to perform actions. A single robot is an embodied AI that collects sensory data, performs actions, thinks. But a hive of flying humanoid robots with a central super brain becomes super intelligence at GPT4 level even.

  • @bradhenry832
    @bradhenry832 Před 14 dny

    I will be testing playing a board game called Azul with the AI. It is a tile selection game.

  • @ThoughtFission
    @ThoughtFission Před 15 dny +1

    Why did you say voice isn't available? I've been using it for several days.Not sure why you don't like the voice. Seems fine to me. Much better than the male voice I'm stuck with.

    • @benroberts8363
      @benroberts8363 Před 14 dny

      your using the old chatgpt voice the new voice model will be rolling in few weeks

  • @AntoineDennison
    @AntoineDennison Před 14 dny

    I could easily see this AI tool being used as a virtual companion. Although I will dial down the flirtatious giggles on mine, it's a bit much.