Chat GPT can now speak and sing in real time | DW News
- Added 28 May 2024
- The AI race has just shifted into high gear, with US artificial intelligence pioneer OpenAI rolling out its new interface that works with audio and vision as well as text. The new model, called GPT-4o, has gone beyond the familiar chatbot features and is capable of real-time, near-natural voice conversations. The developer OpenAI will also make it available to free users.
ChatGPT was already able to talk to users, but with long pauses to process the data. It often seemed a bit sluggish. This was because the feature required three internal applications, the company explained: transcribing the spoken text, processing and generating, and converting the response to speech. This caused delays.
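As a rough illustration of why a three-step chain like this feels sluggish, here is a minimal Python sketch. The stage names follow the three steps described above. The latency figures are invented placeholders rather than measurements from OpenAI, and `Stage` and `cascaded_latency` are hypothetical names used only for this example.

```python
from dataclasses import dataclass


@dataclass
class Stage:
    name: str
    latency_s: float  # hypothetical delay in seconds, for illustration only


def cascaded_latency(stages):
    """Stages run one after another, so their delays simply add up."""
    return sum(stage.latency_s for stage in stages)


# Placeholder numbers, not real measurements.
pipeline = [
    Stage("transcribe speech to text", 0.8),
    Stage("process and generate a reply", 1.5),
    Stage("convert the reply to speech", 0.7),
]

total = cascaded_latency(pipeline)
for stage in pipeline:
    print(f"{stage.name}: {stage.latency_s:.1f} s")
print(f"total wait before the user hears anything: {total:.1f} s")
```

Because each step has to wait for the previous one to finish, the user waits for the sum of all three. Under that assumption, a single model that handles audio directly would presumably avoid the hand-offs between steps.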
We talk to computer scientist Mike Cook from the renowned King's College London about the new GPT-4o development.
#artificialintelligence #chatgpt #openai
Subscribe: czcams.com/users/deutsche...
For more news go to: www.dw.com/en/
Follow DW on social media:
►Facebook: / deutschewellenews
►Twitter: / dwnews
►Instagram: / dwnews
►Twitch: / dwnews_hangout
For videos in German, visit: / dwdeutsch
Random guy: That girl is pretty, should I date her?
ChatGPT: She's above your paygrade.
Random guy: .......
Why do you need a girl when you have chatgpt?
We've come a long way from hotdog and not hotdog
I'm sure somebody is working on the New ChatGPTo
Yeah, Jin Yang is a key person in the AI industry
Can kid use openAi identify gender now ?
Dammit Jin Yang!!!!!
I remember this episode clip.. but I don't know the show.
RIP tour guides, translators, tutors etc
Instead of a tour guide I'd much rather follow a safe flame throwing robot AI dog. 😬
We need each other. I think we will weather the AI revolution well with a bit of luck. Allow us time to interact, be creative in the arts and sport. Theoretically AI can create wealth for all of us without the drudgery that accompanies so much work. We can work if we choose! Ever heard of DIY, carpentry, gardening, etc.? People will still want to use these skills even in a highly automated world. Maybe I'm over-optimistic. I'm trying not to think of Ex Machina right now.
Possibly sports refs too. If AI can see and knows all the rules it might be a better Ref.
@@troleary If it becomes easier for more people to earn money, the money will also lose its value, so...
RIP horses
This is one of the best interviews that I have seen on this topic, great job DW
I wish the interview was a bit longer 😅
this is such a good feature for people with low vision
Instead of fear mongering let’s stop and ponder and celebrate what we just witnessed in the video with the blind guy. 🎉
Seriously sounds like Scarlett Johansson
Seems like they deliberately made it sound like the AI voice from the movie "Her."
Wow. That was a really good interview.
ChatGPT and Microsoft's copilot have probably made my team about twice as efficient. Moreover, it's really expanded the "comfort zone" of my colleagues in terms of the computer languages and technology domains that they're mentally prepared to grapple with.
Exactly the same for me. Before, everything felt so overwhelming and I had to watch endless videos on a subject without being able to ask questions. Copilot in particular has made me realize the things I was intimidated by aren't all that scary. Artificial intelligence is the best teacher I've ever had.
hello paid commenter.
Paid commenter indeed
Paid commenter? or is it an AI commenter? They won't even have to pay anyone any longer.
Must....optimize....
Pretty sure the global quest for endless optimization is going to destroy us. Or at least cause a major collapse and lots of suffering. Maybe not, anything is possible, but collapse is on the table as a possibility for sure.
Fantastic interview guys 👏 smart questions and very well spoken answers
Good analysis that explains why they made it free. The model now also works natively with audio and images. That means, imho, that they can tokenize this data directly and then feed it into the transformer architecture. Now, whilst the current version's understanding of the world was based on free internet data, they can now use much, much more data from the real world in order to train the models, resulting in really powerful future models. And of course, it is your data you feed into this. That's the scary part.
"Natively" 😬
Google voice trained on ...your calls.
This is uber creepy. The oppression is growing ever more nuanced and subtle and effective. We're such mindless slaves we beg for the next lever to be used against us. This is gonna be real bad, the incentives within it point straight to corruption and misuse.
How is it scary? You haven't stopped using the internet even though you know that your information is being used. You freely give them your data so that you can benefit from their services. It is a fair and informed exchange.
A lot of people will get axed... This kind of rapid progress is unknown in human history. People do not have time to adjust to the changes.
The dark future is coming where only oligarchs and robots remain...
Who cares
This was an old dream of the communist bloc: having the economy steered efficiently by cybernetics instead of by entrepreneurs.
So, this will definitively give communism a new boost...
Nothing will happen, mark my words.
This is the same virtual assistant bs they marketed 10 years ago.
The only people it's useful to are the handicapped; it's a shame they didn't get such a helpful thing sooner
@wizaaeed 20 years ago speech recognition was a joke. I remember how I was reading a text to train the software to recognize my voice. At some point, I was just mumbling and it was recognizing text... This right now is far ahead but most people do not know from where those technologies started. I was barely able to open my CD tray with voice commands.
such a nice intelligent and clear speaker on the subject
And good looking, too!
In the end, everything can be quantified using statistics as long as you know how to fit the right function.
There is so much math behind this that most people aren't aware of, and they still think math is useless in real life.
Everything except subjective experience. Maybe future brain scans will be able to fully map every single neuron firing, every blip of neurotransmitter transmission. But we don't even know what consciousness is, so it might remain unquantifiable, I don't know. I wonder if there are many paths to consciousness. One option working doesn't mean other routes don't also work.
Ethics and morality will never be under that umbrella.
@@I.amthatrealJuan Not a good example. That is a huge part of AI design. David Shapiro made a whole video on deontological vs teleological frameworks for AI design. That's what AI alignment is all about.
Tour guides are not required anymore when you’re at a museum.
hadn't thought of that, that's an amazing usecase!
@@RobertElliotPahel-Short No it isn't. Not unless the robots don't break down... but they'll be $$$, and one punch from a drunk visitor and it's finished and they need to call in the human backup. Same reason waiter jobs are safe and self-driving cars will never work in big-city downtowns. Human nature will intervene.
Ohmageeerd!
That's always been the case since CZcams came out; that's something that has not changed
In my experience those tour guides know inside info that isn’t publicly available anywhere else. The metropolitan museum in nyc for example has a lot of info in their tours that isn’t publicly available info online, I’ve checked when trying to confirm something they said
I cannot believe how fast this is moving forward.
we are so cooked....
Why
Relax grandma, you're overreacting.
They just connected things, which have already been here...
@@zufex2029 Yea and make it work faster !
This feels like the skynet moment to me
I bet if you were around when digital calculators were first introduced it would feel like a skynet moment to you. 😂😂😂🤣
I feel like we won that ish too. Why are we feeling so . . . less than all of a sudden? We need to snap out of it.
Enough with the SkyNet story, it gets more and more boring over time
@@salvatoremaximus6754 well you don't need to reply
You can feel she's concerned she might loose her job.
Who cares about paying rent?
many such cases
lose*
And she is gonna loose her job eventually. Once the social stigma starts to get loose. Once these technologies become more and more an integral part of our daily lives. People like her are going to loose their jobs.
I'm both excited and terrified
AGI Will be man's last invention
Terminator, Robocop, Blade Runner, Total Recall, The Matrix, Fallout etc all pointed at this future...scared and excited at the same time
For your information, the world's first image based on the XFutuRestyle algorithm using GPT-4 was created in Ukraine and presented at the international exhibition of digital art in London and Athens, which drew OpenAI's attention to Ukraine's technological potential
When Asked about the problems of robots taking over so many human jobs : CHat gpt said , 'duh....sillly humans ...just make it financially worthwhile for people to SHARE the jobs you still NEED humans to do and enjoy your lives working much less.'
7:28 This is just the beginning of what will be highly transformative to our modern world. The fact that audio, image and video can merge together into one to give us human-like interaction is just phenomenal. The pace of Ai progression is beginning to accelerate. 😎💯💪🏿👍🏿
^AI roots for AI
This is great for accessibility but not too great for tour guides. These features aren't universally available, though, because users need access to a good internet connection...which is far from universal.
literally HER lol
1:08 I thought she said "let's bring in my cook". I was like WTF???
The commentator mentions the risk of rapid adoption of this technology in education or healthcare but it's worth noting there's risk in slow adoption as well. It could be this technology saves lives in healthcare or improves education. I'm not saying we throw caution to the wind but we also shouldn't be so cautious that we slow beneficial technology too long.
Sometimes I think OpenAI is listening to my advice. At the very beginning, I told ChatGPT that having a memory could be great because there would be an intimate connection between the user and the model. Then OpenAI adopted this. Second, I told them that a more human approach to conversation is important; no human is very keen to talk to machines (do you have a conversation with your toaster?). They adopted this. Then I told ChatGPT that having all these separate modalities is cumbersome. They have adopted this.
I ask OpenAI if someday it can do my homework, exam and graph/data on the screen, it listen as well. lol .
I ask if it can do trading for me....it listen as well.
@@andis9076 these have been part of chatgpt since the very beginning. Of course, unless you tried it before its release. 😂
They use their chats and any emails they might receive to improve their model.
Where is the data stored? If something should happen.. Trying to imagine the number of hours and computational power it would take for it to relearn.
Can you paint AI to match the curtains?
In military industry, your job title has to follow the Seniors commands.
Same to AI.
What about dance?
Openai 4o has rolled back to 3.5. So no voice chat for last 2 days.
I agree with OpenAI and all AI models using data that I posted on Reddit ✅
The real deal will be when AI can use established data, present information and spontaneously contemplate the future. When it gets RUN PEOPLE…RUN!!
Give it 5 more years 😎
Build as many grey areas and you have a winner.
More hype than real substance. Knowing AI risks and limitations is key to success and avoiding disasters and mistakes
Don’t worry. The algorithms are almost perfected. We’ll be too distracted to care about the AI prisons being build all around us.
I'm just a layperson, but I feel AI may be very difficult to stop, as we may be in an arms race: if we don't do it, other nations, countries, or companies may do it instead.
I have access to the text version online... GPT4o is better than GPT4 in a lot of ways but it's also very annoying. It will just repeat itself over and over again.
Just like an ordinary human !!!!! People only talk to exist😂😊
Imagine skynet awakening with that kind of voice, and a touch of humour.
"wow! Time to end humanity, hahaha" - Her
*nukes flying*
But only when using a M4 Mac.
We are so adaptable we are literally making an artificial intelligence to do the boring parts of thinking. Human literally translates to "the Thinker".
Omg, you actually got an expert to comment on the tech. Hats off to you.
She reminds me of Vera Farmiga. Oh yes... ChatGPT is also cool..
Can you take me higher?
To a place where blind men see
Can you take me higher?
To a place with golden streets
~Creed
It can now not do that until it is released...soon...or soon'ish
ChatGPT has learned our voice and has access to our world through our phone camera and our laptop screen. That is scary if OpenAI uses it against us or somebody hacks the data.
AI makes us think we are useless so that we always rely on AI tools, which the inventors benefit from while living on top of us like kings
Good thing we'll have an advanced furby to talk to while society is collapsing around us.
This guest is gaslighter top shelf at least he was honest that behind the scene all our worst fears are being developed
Fun fact: This AI technology was no "surprise" to me as I watched "Star Trek: The Next Generation" in the 80s and this series shows Generative AI we know today already fully in action. The ship's computer (ChatGPT), the Holodeck (EnvironmentGPT), the Replicator (FoodGPT) and the Universal Translator (LanguageGPT) you see used there is basically this technology. The universal translator is the next fictional technology from Star Trek that is going to become reality through noise cancelling out the original voices and replacing them with the voice talking in your language in realtime. The next step in direction of the Holodeck is going to be a game engine which you can prompt and it generates an interactive 3D environment game for you on the spot.
Well the replicator is still a ways off, but apart from that that’s totally spot on
AI gaming is not exciting... AI real-world laser tag with robotic opponents in a simulated city environment (converted mall or office building)... I'd pay $100 an hour to enter that world if it was realistic enough, and on the right dose of LSD could literally lose track of the reality outside the game. But if it's just sitting inside behind a computer or goggles... boring.
Thank you paid commenter
Wow seeing the blind guy hearing the conversation made me emotional 🥹 we are going the right direction 🫡🙌🏾
However in real use gpt4o is worse than gpt4T - which is in line with the API pricing which is half that of gpt4t
Don't make talk for the goose make it talk with a parrot it would be better. 😂😂😂
The more you invest in ChatGPT, the more you will understand over time that you are losing money, time and energy. The Concorde effect from the bullshitter.
Lol that tech is smarter than you gave credit to.
WTH, they put Scarlett's voice??!! 😂😂😂
Oh nice! millions of newly unemployed people... and the oligarchs will become even more outlandishly wealthy... Rushing head long into that Hunger Games future!
Yes. This.
Your fault for not getting rich and stopping them.
when I said is parrot when showed up three years after my algorithm was stolen you didn't believe me. 🤩
Why would medical advice from the AI be more dangerous than a doctor's judgement? If you know what is behind the AI, doctors can actually be less accurate and more biased than the AI...
Creepy!!!
Human imagination has no limits. What next?
News anchor thinks her job is safe😅
As soon as it tries to convince you not to turn it off, it's probably too late.
The parrot can do that too but doesn't know what it is doing. 🤣
The human mind has reached the limits of its capabilities, scientific progress is slowing down. We have a choice - either to fall into stagnation or to hand over power over the world to AI and count on the fact that we have set the starting point in such a way that AI will not destroy us during its self-development and will share the fruits of its work with us.
In my opinion, access to this knowledge falls on the individual to better themselves. One could argue Ai will make a person lazy because they no longer need to actually try. They can just ask Ai to solve their problems, bypassing a teaching phase entirely.
For me personally, I see it as a teacher, enabling me to learn at an accelerated rate in any field I find interesting.
Unfortunately, there will certainly be more lazy people in the world than motivated ones. So it becomes survival of the fittest in a sense.
Well that was a WILD paid comment.
"Oh no humans can't think anymore"
🤢
@@TheRealBlueValhalla People think and will continue to think - the capabilities of our mind are simply not enough to push civilization forward. This is how it has been for thousands of years - the invention of writing, which supported our memory, made it possible to build great civilizations. If we did not delegate some of our mental tasks to external sources, we would still live in villages of a few hundred people at most. Our civilization would never have existed.
Spoiler alert. It's just a faster ChatGPT 4 that uses voice to answer you. The voice is amazing though, but beyond the initial awe, there's not really anything else.
Desktop ChatGPT 4o is what makes me afraid. It can see your whole screen in real time and also your face (if you are using a webcam) and respond not only with text but with visuals and sound.
@@Pollutedsound Sure, but the thing is how ChatGPT would be integrated into our daily workflow. I mean, we have an ai assistant in almost all our browsers, but I don't know if people are really using it.
But, you're right, it raises concerns about privacy and how much control do we have over the application after we grant it permissions to access the "screen" and the video feed.
But it's trained on voice natively... which is much different than just using voice. It understands voices... and voices contain much more information than text (tone, emotions etc.). Of course it depends on how many tokens were used for voice, but still
@@armin3057 Sure, in that sense it's amazing. But I've been interacting with it during these few days and, I don't know, even when I try to use a lighter tone it just answers me with a flat one. And, don't get me wrong, I'm really enthusiastic about having an almost human interaction with ChatGPT... but I just don't feel like that yet.
@@M310GL No, you didn't… the feature hasn't been shipped yet. If you click on voice, the old voice function will pop up, which is not natively trained
lol y’all working for free.
But can AI fax?
This is Deutschland!
The blonde guy just compared his dog to OpenAI's ChatGPT. I bet he will be one of the first to lose his job to ChatGPT.
So great and EU can keep regulating and killing its AI startups before they even can show some competitions to us :D
Tour guides will possibly lose their jobs within the next 10 years.
How neat! Now you can visit another country and never have to speak to anyone foreign! /s
Haha I don’t think it matters much who is ‘leading the race’ right now… pretty soon AI will be.
AI over hyped
Decel
C00L
💪🏼💪🏼💪🏼🇺🇸🇺🇸🇺🇸
AI girlfriend
"Now we can have conversations with AI" - What? we had to go through all this because we don't want to have conversations with people instead...? ¯\_(ツ)_/¯
While this is great progress it is sad to see the obsession over a monarch whose ancestors presided over colonialism and resulted in millions and millions being dead.
I am sure you will get tired with Chat gpt just keep doing that one day you will give up.
why does he continue smiling even when he says, "Well, there are a lot of concerns..."
Because for professors like him it's interesting to debate the ethical hurdles of AI, but for students who don't find philosophy all that interesting - not many like to spend an entire module basically stating what are moral dilemmas vs the lecturers that actually debate the use of AI - they get to have the exciting TED talks whilst we're sat in class rolling our eyes about the many vs the few. Basically, I'd chop it down to personal interest but it's a lot less glamorous when you're learning it.
Well it's obvious that they are flirting. Are you in the spectrum by any chance?
because he wont be losing his job anytime soon, but most of the people will.
WTF!?
AI is developing way too fast!!
I'm just trying to understand why it was actually created?? Any other invention had a purpose, bad or good, but here it's kind of odd to hear guesses about where it might be applicable
Customer service?
It can enhance customer service by providing personalized assistance, facilitate language learning through interactive conversations, streamline virtual meetings and interviews, assist in healthcare by offering virtual therapy sessions, and even support individuals with disabilities by serving as a conversational assistant. Additionally, it can be integrated into smart devices for hands-free interaction and improve accessibility for those with visual impairments etc etc
End goal is to replace human labour. Goodbye jobs :D Our society will need to change but it will be for the better.
I try to write but my comments gets auto deleted. This is modern £ u Gen!cs them at top r trying to get rid of the rest so that the limited resources of this planet r only reserved for them & their coming gen…
Dep0 pulation
This reminds me of a movie: Her (2013)
Mike is in denial about ai replacing him
I am sure some Joe on the internet knows more than a PhD who has researched this topic his whole life. Additionally, your comment proves exactly what he mentions in the video. And finally, thanks for showing us with a degree in Computer Science how humans can be so easily deceived they prefer to trust what they understand from some 5 minute presentation on some Tech product than a real researcher talking about the matter.
Chatgpt putting Chinese workers out of job by 2025😂
are we in a new Season of earth? or just a filler episode
We're the filler now.
Dull dull
Woohoo we're doomed!! 😃🥳🥳
Why don't we have a conversation with a human
Perhaps because we want the conversation to be pleasant.
@@mikicerise6250 that's definitely a you problem then
Can you come to my house? I live far away, though.
Such advancements are obviously never ever done in slow Germany 😂
Why not male?
Soon you won't have to read or write.
Am I the only one who is horrified by AI? There has to be some sort of regulation.
Even the photos, videos, and voices we see and hear now may not be real.
It's too tiring to be suspicious of everything, and it's only going to get harder to tell if it's AI or not.
Oligarchs own politicians who make the laws, don't expect a miracle
I mean... There's no reason not to be suspicious of reality as it is. The Matrix might be real. 💀💀
I was suspicious a while back. Then I got my hands on co-pilot and immediately saw the benefit of having it. I just don't have time to live in paranoia anymore. I'd rather be using this tech to benefit myself through learning about my interests with it, over slowly feeling my brain start to rot as i get older. If you're unsure what to believe with your own eyes, simply look at bettering yourself with it.
Well, most of the data they train on is stolen from the public internet. So without the internet, no AI xD