ChatGPT: 30 Year History | How AI Learned to Talk

Sdílet
Vložit
  • čas přidán 9. 05. 2024
  • This video explores the journey of AI language models, from their modest beginnings through the development of OpenAI's GPT models. Our journey takes us through the key moments in generative neural network research involved in next word prediction. We delve into the early experiments with tiny language models in the 1980s, highlighting significant contributions by researchers like Jordan, who introduced Recurrent Neural Networks, and Elman, whose work on learning word boundaries revolutionized our understanding of language processing. It leaves us with a question: what is thought? Is simulated thought, thought? Featuring Noam Chomsky Douglas Hofstadter Michael I. Jordan Jeffrey Elman Geoffrey Hinton Ilya Sutskever Andrej Karpathy Yann LeCun and more. (Sam altman)
    My script, references & visualizations here: docs.google.com/document/d/1s...
    consider joining my channel as a CZcams member: / @artoftheproblem
    This is the last video in the series "The Pattern Machine" you can watch it all here: • Artificial Intelligence
    00:00 - Introduction
    00:32 - hofstader's thoughts on chatGPT
    01:00 - recap of supervised learning
    01:55 - first paper on sequential learning
    02:55 - first use of state units (RNN)
    04:33 - first observation of word boundary detection
    05:30 - first observation of word clustering
    07:16 - first "large" language model Hinton/Sutskever
    10:10 - sentiment neuron (Ilya | OpenAI)
    12:30 - transformer explaination
    15:50 - GPT-1
    17:00 - GPT-2
    17:55 - GPT-3
    18:20 - In-context learning
    19:40 - ChatGPT
    21:10 - tool use
    23:25 - philosophical question: what is thought?

Komentáře • 1,9K

  • @ArtOfTheProblem
    @ArtOfTheProblem  Před 5 měsíci +77

    STAY TUNED: Next video will be on "History of RL | How AI Learned to Feel"
    SUBSCRIBE: www.youtube.com/@ArtOfTheProblem?sub_confirmation=1
    WATCH AI series: czcams.com/play/PLbg3ZX2pWlgKV8K6bFJr5dhM7oOClExUJ.html
    Timestamps:
    00:32 hofstader's thoughts on chatGPT
    01:00 recap of supervised learning
    01:55 first paper on sequential learning
    02:55 first use of state units (RNN)
    04:33 first observation of word boundary detection
    05:30 first observation of word clustering
    10:10 sentiment neuron (Ilya & Hinton)
    12:30 transformer explaination
    15:50 GPT-1
    17:00 GPT-2
    17:55 GPT-3
    18:20 In-context learning
    19:40 ChatGPT
    21:10 tool use
    23:25 philosophical question: what is thought?

    • @michaelpoblete1415
      @michaelpoblete1415 Před 5 měsíci +2

      Andrew Ng and Geof Hinton already clashed on twitter, quite confrontational tone, Andrew seems to be taking the side of Lecunn.

    • @masudahmad853
      @masudahmad853 Před 3 měsíci

      @@michaelpoblete1415

    • @masudahmad853
      @masudahmad853 Před 3 měsíci +1

      nice skills

    • @user-fj9hf4bu9f
      @user-fj9hf4bu9f Před 3 měsíci +2

      please reconsider the background music and noise levels, it really takes away from an otherwise interesting video.

    • @John-D.
      @John-D. Před 5 dny

      HAL, Reduce Music Volume 🤖

  • @clamhammer2463
    @clamhammer2463 Před 2 měsíci +145

    As an AI researcher and developer, this is the first video that I did not leave thinking that the author was just saying words without second thought. There is so much misinformation in this space stemming from the fear of the ramifications of AI that are a result of the negative feedback loop of those same people. Well done.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 2 měsíci +17

      I really do appreciate this comment, i work so hard on these and sometimes i think i'm crazy - the garbge funnel of ai coverage is brutal

    • @RyluRocky
      @RyluRocky Před měsícem +4

      😂 yes other AI channels just throw the word “compute” around 300 times.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před měsícem +4

      YOU'LL BE SHOCKED AT WHAT......@@RyluRocky

    • @Joe-jv5mm
      @Joe-jv5mm Před měsícem +1

      You are now to Chat GPT 16

    • @Blueprint4Murder
      @Blueprint4Murder Před měsícem +1

      @@ArtOfTheProblem Are you too going to kiss now? Card catalog 2.0. Do you guys remember those old newspaper cell machines? Where you could search for a news headline then look up the cell for the news paper pages related to that search? That is what AI reminds me of. A more advanced card catalog. While I am sure it will be great in that function and even in mimicry of humans I don't believe it is thinking. Most humans are so narcissistic that they believe animals do not think and yet those same dullards are convinced ai does because it is a popular narrative.

  • @PotatoCider
    @PotatoCider Před 3 měsíci +129

    please never stop teaching on the internet. i can tell you have an intense passion in learning. im in love with how you explain concepts and their connections. feynman would be proud.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 3 měsíci +27

      this makes me so happy thanks for saying that. I'm sitting here surrounded in papers working on next video

    • @lakazatong
      @lakazatong Před 3 měsíci

      ​@@ArtOfTheProblemthat. could not have said it better. it really was eye opening in this hot field. that begs the question, are there alternative models for reasoning? the fundamentals today are, as you explained in the video, prediction and attention learning, that is, deducing the next action/letter/whatever given the previous states of the world/space/whatever, or to be exact, the previous mental states of that, and learning how to filter out and fragment the data into a meaningful way to, to be honest, only aid the prediction
      my question is, other than prediction, what could it be?
      I'm very fond of seemingly simple and Impactful philosophical questions, and this one really hits
      because then if the answer turns out to be "not really" then... our brains are just wired to be good just at that
      honestly that would not be too groundbreaking when you start digging in how the brain works together with the body to make up for a human
      I'm really just writing my thoughts here
      that also brings the good old question "are we just monkeys with bigger brains?" and by the looks of it yes, but also the inner workings of the brain-body connection may just be the key difference
      truly an interesting topic
      while I'm at it, I see little to no video/interest in one specific approach we could tackle machine intelligence
      and that is, taking our brain, and building it with silicon and wires
      building the hardware to be the brain itself
      would you be interested in such topic? unfolding your findings and sharing them as well as providing a state of the art? that would be awesome

    • @hello-rq8kf
      @hello-rq8kf Před 22 dny +3

      yubi yubi

  • @belibem
    @belibem Před 5 měsíci +481

    There only a handful of youtube channels that can make such concepts accessible to everyone with a curious mind (e.g. 3blue1brown, veritasium, Sabine). Art of the Problem is one of them. Brit you are a legend. Thank you for giving us this series.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +33

      Thanks for sharing, I really appreciate it.

    • @rikvermeer1325
      @rikvermeer1325 Před 5 měsíci +12

      I'd like to add "Artem Kirsanov" (neuroscience, neurons, manifolds) and "Anton Petrov" (science, physics) to the list.

    • @rikvermeer1325
      @rikvermeer1325 Před 5 měsíci +3

      @@ArtOfTheProblem subscribed for more :)

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +6

      yay welcome to the party! sorry my output is slow though@@rikvermeer1325

    • @T4bboo
      @T4bboo Před 5 měsíci +5

      @@ArtOfTheProblem Man absolutely great video, I am new to your channel and instantly subscribed... only drawback for me personally was background music... those annoying tones sounding like from 80-90 games made me download your video use AI to distinguish between narration and background music to get rid of it without losing narration and listen adjusted version offline.

  • @Just4Growers
    @Just4Growers Před měsícem +17

    I have watched countless AI videos but nobody explained it like you. Many thanks.

  • @TropicalCoder
    @TropicalCoder Před 5 měsíci +7

    Very nice! ...well, except for the loud background music anyhow. Google kept pushing this video into my list over several days while I was very busy following something else. Finally I popped it up and saw why. Google knew that I would like it. Kind'a creepy, really how it knew it was exactly the kind of thing for me at this moment in time. It's like, it is following my studies on this subject and knew this should be the next chapter. Your video reiterates over everything I have learned so far and adds perspective.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      thanks for this feedback, I'm so happy to hear it was relevant to you. interestingly enough this is the first time in 12 years google is recommending my video strongly...and it's amazing the difference. I used to get 1k views per day, now it's 20k
      usually people hate my music, but not all, so I try to find a balance

    • @TropicalCoder
      @TropicalCoder Před 5 měsíci +3

      @@ArtOfTheProblem I liked your music. It fit perfectly into the narrative. Just when I was hanging on to every word 'cause it was so interesting, the music became intrusively loud at some points and destroyed my focus.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      i appreciate this, I know...i think the best is some music in and out at times...but not during moments of high cognitive load@@TropicalCoder

    • @TropicalCoder
      @TropicalCoder Před 5 měsíci +1

      @@ArtOfTheProblem Exactly!

  • @jay_sensz
    @jay_sensz Před 5 měsíci +79

    Great video, but please fix your audio mix. The background music is way too loud.
    There likely is an option for audio ducking in your video editing program that will automatically lower the background music volume when foreground audio is playing.

    • @sams64sf
      @sams64sf Před 4 měsíci +15

      Yes, please. If the primary goal is to spread knowledge, I think the background music is taking away from that. I really want to watch this, but the music is extremely distracting.

    • @DanielGirardBolduc
      @DanielGirardBolduc Před 3 měsíci +1

      It actually help me get into a thinking mood as the audio help me stay longer into my head when I question myself about specific topic mentionned in the vidéo. Might not be the case for everybody tho. I have ADHD and i'm auditive, if someone else with the same characteristic feel the same, let me know :)

    • @neohed
      @neohed Před 3 měsíci +5

      Hah, was about to post the same. The music is hideous and distracting!

    • @britcruise2629
      @britcruise2629 Před 2 měsíci

      nailed it@@SimoneDesignsGaming

    • @georgewright2010
      @georgewright2010 Před 2 měsíci +3

      I don’t understand why music is played. I gave up on the video after a few minutes.

  • @gonzothegreat1317
    @gonzothegreat1317 Před 5 měsíci +213

    Perfect. 12 years ago or so I learned about RSA from you.
    I'm glad to see you are still producing quality videos! Thumbs up!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +25

      so cool to hear from OGs

    • @QW3RTYUU
      @QW3RTYUU Před 5 měsíci +8

      That's about my story too. I would lay in bed with the iPod Touch and watch your cryptography videos, which in turn got me into owning bitcoin in 2011. I was so young and couldn't predict what would happend (my internal LLM was not trained enough yet) and I gave it all to some guy online as a donation for their FTP which had some hot stuff in it.
      The end is not the goal, it's the journey that's interesting. Your video got me emotional and felt right. I'll share around. Thanks for producing it!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +9

      ha, love the connections. I know the pain also@@QW3RTYUU

    • @myndwork
      @myndwork Před 5 měsíci

      @@QW3RTYUUholy shit same here, 2011 is when I found this channel while trying to learn about bitcoin. This channel is a treasure.

    • @btm1
      @btm1 Před 5 měsíci +3

      same

  • @situranjankar
    @situranjankar Před 5 měsíci +16

    One thing i like about your videos is that you are very good at explaining things and the way of presentation is such that it is intuitive to any age group . Although there are thousands of videos and articles are available related to the same topics but this one thing makes your videos unique and best.
    Thanks Brit. Please keep making such videos in future.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +3

      thank you for sharing, I tried to do something new here and I hope it was worth it

  • @seschaitanya5676
    @seschaitanya5676 Před 4 měsíci +9

    This is the first video I watched on this channel and the quality of content, communication, analysis, depth, and even the audio selection is soo on point. Hooked from start to end. Immediately liked and subscribed.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 4 měsíci

      thrilled to have you, so happy people are finding this channel again. stay tuned!

  • @maivincent2659
    @maivincent2659 Před 5 měsíci +9

    Been a big fan of your videos for years! I'm always delighted, even as I rewatch some of your old videos time and again, not only by how you distill a complex concept down to its essence but also by how you transform that which is abstract into something extremely palpable and relatable through filmmaking techniques.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      Glad you appreciate the techniques. Super cool to hear you go back to watch old videos. Inspires me to keep the series going into new topics

  •  Před 5 měsíci +28

    Wow, this is handsdown the best video I've even seen about this subject. Thank you! It is on my favorites now!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      Woo! So so happy to hear this after 2 years of work on it.

  • @jamesreilly7684
    @jamesreilly7684 Před 5 měsíci +21

    Quoting one of the hero's of tech (Hofstadter ) makes this very well executed video that much more compelling. T9 (predictive testing) is the first really useful llm ish (not a word wheel but a predictive wheel from numbers) tech that end users experienced. The notion this simple premise is how it all works is nothing short of astonishing.

  • @theK594
    @theK594 Před 5 měsíci +4

    This video is just fantastic, extremely up-to-date and very useful. It very well resonates with the discussions I am having now in the community.
    I would love to see it extended, once history gets written.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +3

      thanks for sharing, I do look forward to watching history unfold and updating it

  • @JonDotExe
    @JonDotExe Před 5 měsíci +8

    This is easily the *_most_* *_cogent_* video on the topic I have ever seen. (I even sent this to my mom!)
    Hard to believe you've had these amazing topics for 12 years and aren't cracking 100k yet. Count me in for the journey!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +4

      thank you and thanks for getting ma on board :) - I know I had given up on the 100k+ break out, so took around 2 years off while I slowly made this video, and now I'm getting a signal that I should come back and push hard for a year to see if I can come back to life. one questions, ask your mom what she thinks, because my dad said it's good but I need to make a "for dummies" version without the extra super details...so i'm thinking of doing that for my whole AI series since it's something we badly need right now.

    • @stephenmontague6930
      @stephenmontague6930 Před 4 měsíci

      ​@@ArtOfTheProblemGood idea - LLMs for dummies - and the same for children. How to explain LLMs to kids?
      This LLM speaks like a human but isn't a person, or even if it is, it doesn't mind dying and can be treated like a machine... what a thing to tell my 2nd grade daughter. I've described it as a prediction machine, but the mirror analogy here is also very good.
      Any more thoughts on this could be useful, as we wait to see how our children will exist alongside the children of... whatever this is.
      Then again, children accept and adapt readily to whatever is there, so maybe educating adults first is a good priority, although a dummies version and a kids version might be about the same thing, minus any mature content.
      Good luck!

  • @KalebPeters99
    @KalebPeters99 Před 5 měsíci +2

    Holy moly this was astounding! You balanced the tightrope of depth vs accessibility so gracefully, and the production quality is gorgeous too. Immediate sub!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      woo thanks! I went all out on that one, happy to have new subs :)))

  • @AlexGeo925
    @AlexGeo925 Před 5 měsíci +1

    This video deserves so many more views and likes than it has… It’s pure gold. Thanks! 🙏🏼

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you! i'm just happy it has the views it does, for years the algorithm ignored me and this one might hit 250k in 1 week which blows my mind and inspires me to make more

  • @TheBookDoctor
    @TheBookDoctor Před 5 měsíci +75

    I work in AI, and I learned a lot from this video. Great blend of historical perspective and well-pitched explainer!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +8

      I couldn't ask for a better compliment. I really was hoping someone new as well an an expert could get something out of it

    • @monad_tcp
      @monad_tcp Před 5 měsíci +3

      I can't wait for people to bash enough the API of OpenAI to extract and use it to train another open source GPT4 we can actually use in our desktops to do what we really wanted in our wildest dreams of experimentation.
      Llamma2 gets closer to GPT3, but we're playing with yesterday's tools.
      Maybe we should create a peer to peer system to create a shared GPT4 with our RTX3060s.
      Who would get into this with me ?

    • @NElectronicSoul
      @NElectronicSoul Před 5 měsíci +4

      This scare the sh!t out of me. You're being informed by a youtube video despite working in the field?!

    • @HypnosisBear
      @HypnosisBear Před 5 měsíci +1

      ​@@NElectronicSoulGood point. I like to believe that person is just a beginner.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +3

      the field is so new and so nacent that I think all researchers have to be on youtube to keep up@@NElectronicSoul

  • @cmw3737
    @cmw3737 Před 5 měsíci +9

    This is such a good journey through the papers that got us to where we are with LLMs. You've done an amazing job at picking out the key advancements all while a crazy amount of research had been happening. I've long preferred the definition of intelligence as the ability to better predict the future (minimise surprise) since before GPT3 so imagination (world models) is a key part of it.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      |Thanks for sharing, yes it's been a flury of research. So much research I did I didn't include here as well. I agree and I never liked definitions of intelligence which are long lists of skills

  • @alonsorobots
    @alonsorobots Před 5 měsíci +3

    Brit! So excited to have you back. You have no idea how much the videos for Information Theory have changed my life. I'm now considering leaving the industry to go back to school in Berkeley to study AI. Particularly in the domain of learning language outside of text (non-verbal communication such as body language and prosody). Both critical in storytelling and eliciting emotions like Pixar does!

  • @SecretEyeSpot
    @SecretEyeSpot Před 5 měsíci +10

    this channel changed my life about 10 years ago when i found the language of coins. By serendipity, I've found this channel again while im formally studying computer science which this channel inspired me to do. Thank you

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +3

      So so so cool. love this comment :))))

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      i have to ask what's the cs dept response to AI? when I did CS in 2008 there wasn't a class or even a deep learning textbook

  • @shadow_rune6178
    @shadow_rune6178 Před 5 měsíci +6

    Explaining this in a way most people can understand is paramount, you have taken time out of a very large number of peoples day, and you have used it well.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thanks would love to know what you'd like to see next

    •  Před 5 měsíci +1

      ​@@ArtOfTheProblem It seems to me that further exploration of this topic may run into an epistemological barrier, so what would you say about explainging to us how you got this knowledge? Maybe linguistics, philosophy or congnitive science?
      Btw, could you clarify what you mean the statement 25:15 (We either look at something that looks like though or It is though)? It kind of reminds me of Daniel Dennett's "Real Patterns".
      ps.: Loved this video sooo much

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      really?? so cool, please shoot me an email when you are done your thesis. I'm going to think about a good follow up@

  • @sinasec
    @sinasec Před 5 měsíci +2

    the amount of effort and time spent on this video is exceptional. I appreciate and enjoyed every second !

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      so glad you noticed, thank you! would love to know where you land regarding the last question

  • @lennartlopin2276
    @lennartlopin2276 Před 5 měsíci +2

    This is an amazing summary perfectly highlighting the important steps of the progress and how we got here. Working in text summary AI systems in the early 2000s and following the progress over time this video focused on all the crucial breakthroughs.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you, I tried hard to follow the LLM only thread which I hadn't seen before

  • @NickQuickX
    @NickQuickX Před 5 měsíci +37

    TIMESTAMPS:
    00:05 Neural networks learned to talk, leading to more general-purpose systems.
    02:30 Recurrent neural networks (RNNs) use state units to create a state of mind that depends on the past and can affect the future.
    04:52 Neural networks can learn word boundaries and cluster words based on meaning.
    07:10 Language models saw limited progress until 2011 when a larger network showed the potential for higher performance.
    09:34 Neural networks can learn language and complex concepts with minimal human intervention.
    11:43 Neural networks struggled to handle long-range dependencies in text sequences.
    14:02 Neural networks use distance in concept space to find similarities and adjust their meaning.
    16:20 Neural networks with self-attention and fully connected layers can generate coherent and contextually relevant text.
    18:27 In-context learning allows changing the behavior of the network without changing the network weights.
    20:38 Language models like ChatGPT are more than just chatbots, they serve as the kernel process of an emerging operating system.
    22:41 Training networks on prediction empowered by self-attention leads to a more general system that can be retasked on any narrow problem.
    24:43 Deep learning community is divided due to differing opinions on the nature of AI's linguistic abilities and thought process.

  • @caspa7
    @caspa7 Před 5 měsíci +9

    The Tie fighter sound in the last scene.. a stroke of genius. Very clever!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +4

      ha nice catch! glad some people like the sounds i was having a blast

    • @classicmartini
      @classicmartini Před 2 měsíci

      @@ArtOfTheProblem aka stampeding elephants ;-) Very cool.
      Oh. G'day from Sydney, Australia.

  • @les_crow
    @les_crow Před 5 měsíci +6

    This is the most perfect video on the entire internet. I will use it to elucidate people on the topic. If I was rich I'd pay you big for this. Thank you sir.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you thank you, what a compliment. this comment made my day. If you want to support a tiny bit you can here: www.patreon.com/artoftheproblem but just thankful for this comment :)

  • @RuiBarbosaJunior
    @RuiBarbosaJunior Před 5 měsíci +3

    I am amazed by the info in the video, but I am even more amazed by the quality of your video. Congratulations. And thank you.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      that means a lot to me. thank you so much. Stick around for more. curious where you land on the final question?

    • @RuiBarbosaJunior
      @RuiBarbosaJunior Před 5 měsíci +1

      @@ArtOfTheProblem I believe that consciousness (and everything that comes from it) is pure language processing. To me, things like thinking, reasoning, thoughts and consciousness are just language. AGI? Why don't people believe AGI is a couple of scalling steps away? To me, it is not a matter of making machines as inteligent as humans - we should adjust our thoughts: maybe we are not intelligent, at all. The thing we call "intleligence" is just something that emerges from language processing, in systems that are large enough.
      How do you know you can think? How do you know you're conscious? I tell you how: by interpreting sentences in your brain. (sentences can be made with words, images, feelings, symbols, etc --- should you call them "tokens"?). So, in short, we are MeatGPT :) Cheers!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      beautiful well said, and yes I like "tokens" :)@@RuiBarbosaJunior

  • @johanlarsson9805
    @johanlarsson9805 Před 5 měsíci +8

    Great video, nicely put together and well explained. I've been an enthusiast since 2009 and remember all these milestones. I've built several frameworks myself and have pondered ANNs for a much bigger lenght of time than I would be willing to admit. Regarding the final question in the video I can say for certain that yes they do in fact understand. It depends on the neural net what it is in detail that they do understand, but there can be no doubt that they actually understand their task. And not only that, they effortlessly understand it to a much higher degree than we do.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      so cool to hear from practitioners... intersting that the closer you get to it, the more you lean towards your view. and people on the "outside" seem to go the chomsky way

  • @JulianFoley
    @JulianFoley Před 5 měsíci +3

    Brilliant. Clear, and authoritative but also reflecting a reassuring sense of open enquiry.

  • @DanHartwigMusic
    @DanHartwigMusic Před 5 měsíci +2

    I will be sharing this with everyone I know with a tertiary interest in LLMs. The best high level human understandable explanation of the history. Love it. Thank you. Please keep gracing us with your videos!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      Dan I really really mean it when I say I appreciate this. I thought my channel was dead, spent 2 years on this and so all I can hope for is people share it. Because of how well it's doing i'm committed to doing this for another decade !

    • @DanHartwigMusic
      @DanHartwigMusic Před 5 měsíci +1

      Very glad to hear that! 🙂. Also can't wait to see what happens with these Q* developments...

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      Me too...clearly building on the strength of an LLM in the loop of a larger process.@@DanHartwigMusic

  • @Elias-nj6gi
    @Elias-nj6gi Před 5 měsíci +3

    Phenomenal video. Not merely of an explanatory nature but a piece of art in itself.

  • @aieousavren
    @aieousavren Před 5 měsíci +5

    Truly an exceptional video! Incredibly clear and engaging presentation of the history and relevant ideas, well narrated. This was very thoughtfully written and thoughtfully delivered. Thank you!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      really glad this is resonating with people, I spent so long on it

    • @aieousavren
      @aieousavren Před 5 měsíci +1

      @@ArtOfTheProblem The time and effort *absolutely* shows. I was pinned for just about every minute! This is really high quality work, it definitely made my day. I hope you can feel rewarded for your effort, because you deserve it, man.
      I also really enjoyed the way you showed the progression of not only the architecture and "design", but also the capabilities/outputs of these models, too. It really gave a good sense of the history in a way I haven't really seen anywhere else.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      i'm so glad it was time well spent, I really tried to do what I couldn't find online...thanks for sharing@@aieousavren

  • @CHROMIUMHEROmusic
    @CHROMIUMHEROmusic Před 5 měsíci +9

    This was a fantastic video, learnt more about the history and buildup to ChatGPT in these 30 mins then I've read anywhere before. Learnt a lot, and very thought provoking towards the end. Thank you!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thrilled to hear this, plesae stay tuned for more.

  • @Mark1Mach2
    @Mark1Mach2 Před 5 měsíci +6

    This is amazingly informative video, with the right amount of depth covering how chatgpt was developed over time.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      Thank you Mark, it was so hard to strike that balance....appreciate it, i'm curious where you land in terms of the final question in the video

  • @colorfullife8703
    @colorfullife8703 Před 5 měsíci +7

    Masterpiece! This channel deserves millions of subscribers.

  • @creallf
    @creallf Před 5 měsíci +3

    This was beautiful and thought-provoking, thank you for this work of art!

  • @kti5682
    @kti5682 Před 5 měsíci +1

    This was awesome, having not even watched AI from the sidelines I needed a refresher on how we got from humble neuronal networks to ChatGPT. Also thanks for the included references.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      so happy to hear this. As an outsider, did this video have too much detail? I'm planning a more general follow up soon which blends a few videos on AI together

    • @kti5682
      @kti5682 Před 5 měsíci

      @@ArtOfTheProblem Actually I am electrical engineer but lost interest in AI while studying in the nineties. So no the detail was alright. If I got that right you didn't talk about training methods in depth which I struggled with in the past, also I don't understand how the network is run timewise, i.e. in discrete time? But with the papers you mentioned I can start somewhere.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      ah so not a layperson :) I covered all of this in the earlier videos in this series, please check it out and let me know what you think czcams.com/video/YulgDAaHBKw/video.html&ab_channel=ArtoftheProblem@@kti5682

  • @jeouxgentry7733
    @jeouxgentry7733 Před 5 měsíci +1

    This is excellent.
    You’ve found the sweet spot for length and depth of on this subject. I can’t wait to recommend this to anyone whenever the conversation turns to AI, which seems to be a lot these days.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      Super appreciate this! I'm thinking |I should make a simpler more general follow up for new people, thoughts?

  • @ahmetozturk2026
    @ahmetozturk2026 Před 5 měsíci +37

    I remembered your channel from your cryptography videos, jumped right in right away and can easily say that this is your best piece so far. With such a good production, I completely understand that it took 2 years of research. You always summarize the technical details and key breakthroughs in a constantly interesting way through the whole video without being overbearing.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      Thanks so much for feedback, it means a lot to hear it. I was worried on striking the right detail level here and went back and fourth so many times. I had entire chapters on hopfield networks I cut.

    • @The0Yapster
      @The0Yapster Před 5 měsíci +3

      ​@@ArtOfTheProblem
      I watched all of your cryptography videos and I have even returned to them after graduation for mere pleasure.
      I have no idea why your channel didn't take off.
      I would really appriciate some videos about logic, fallacies and limits of the human brain (caused by our evolutionnary history)

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      awesome, thanks for the feedback. super cool to hear@@The0Yapster

  • @Naitry
    @Naitry Před 5 měsíci +17

    A lot of people in my classes (as a cs major) look at me like a crazy person when I start talking about latent spaces, geometric concepts in AI, and Douglas Hofstader's views on cognition. This video makes me feel understood :)

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +3

      you are one of us :)

    • @HungLe-ju6dv
      @HungLe-ju6dv Před 5 měsíci +3

      we too look at them like they are crazy bruh😂

    • @KainniaK
      @KainniaK Před 5 měsíci +4

      At least Douglw Hofstader is able to play with something like ChatGPT4 and be like "Looks like I missed something". I don't see John Searle ever do that! I also believed that what chatgpt 4 does today was always going to be impossible on binary hardware, that just manipulates 1 and 0.s But I was clearly wrong!

    • @stephenowesney5173
      @stephenowesney5173 Před 4 měsíci

      Excuse me if I'm missing something, because I haven't really dug deep into Hofstader's work; how is this proof that he has missed something? I think the basis for which he describes a lot of phenomena, the strange loop, is very much intact all throughout the concepts explained in the video!@@KainniaK

    • @Dr.SexySpice
      @Dr.SexySpice Před 4 měsíci +1

      Was this undergraduate or graduate? If undergrad then you expect far too much out of your classmates 💀, you are truly one in a thousand

  • @SiimKoger
    @SiimKoger Před 5 měsíci +1

    The best NN timeline video I've seen on CZcams.

  • @donharris8846
    @donharris8846 Před 5 měsíci +71

    This is a Grade A video. Very well narrated, edited, and in depth yet understandable. Thanks!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      Thanks Don, really happy to hear this.

    • @artmaltman
      @artmaltman Před 5 měsíci +1

      @@ArtOfTheProblemHad to stop watching because the music was way too loud and annoying. Your explanations are excellent but I won’t be able to benefit from them.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      thanks i'm going to put out another video with this addressed in the future on a broad overview of AI@@artmaltman

    • @williamwilkinson2748
      @williamwilkinson2748 Před 5 měsíci

      @@ArtOfTheProblem Great, definitely drop the music. Everybody seems to use background music, drives me mad!

    • @bojohannesen4352
      @bojohannesen4352 Před 4 měsíci

      Grade b

  • @CatsAreRubbish
    @CatsAreRubbish Před měsícem +4

    I just discovered this video and this channel. Wow! Informative, concise, not trying to push an agenda, just telling the story. Fantastic.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před měsícem +1

      thank you, stay tuned for a follow up

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před měsícem +1

      wanted to let you know my next vid is out! czcams.com/video/5EcQ1IcEMFQ/video.html

  • @AdamDesrosiers
    @AdamDesrosiers Před 5 měsíci +13

    this is hands-down the best summary of how we got LLMs that I've seen - definitely keeping this on hand to share with people who need a quick catch-up

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      I really appreciate you doing so, I'm trying to revive my channel. Do you think I also need a simpler version? I was going to maybe do a recut of my whole AI series into a shorter video for those who need a full orientation, but with less detail...

    • @AdamDesrosiers
      @AdamDesrosiers Před 5 měsíci +2

      ​@@ArtOfTheProblem to my tastes, no - I wouldn't say you need a simpler version. But I'm not the best person to ask on that score. I have a strong preference for being more verbose and thorough, to the point of being occasionally told I sound like chat GPT. Your instinct may be a good one as the general public is concerned? Sorry that I'm no decent guide there.
      Please do make more videos though! I'll happily subscribe 😊

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you, will do :)@@AdamDesrosiers

    • @KainniaK
      @KainniaK Před 5 měsíci +1

      I have already posted it everywhere on reddit but as usual the stuff that teaches the most gets the least amount of upvotes.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you for trying :)@@KainniaK

  • @DanieleLicari
    @DanieleLicari Před 6 dny +1

    Grazie. This is the most comprehensive video on the history of LLM we have ever seen. congratulations

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 dny

      thank you! much appreciated, this is what I was going for :) Took a loooong time. working on a follow up now

  • @seansalsarola6663
    @seansalsarola6663 Před 5 měsíci +1

    The most understandable summary of what I have been attempting to understand d from other sources - Great!!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      so nice to hear, I had to vaccum up everything to get to this point...hoping to save people time

  • @TikiTDO
    @TikiTDO Před 5 měsíci +6

    While I think it's reasonable to say "if it looks like thought, it is thought," I also think it's important to distinguish whether it looks like thought to Joe Average, or to a specialist in the field. That said, for the moment these models absolutely are mirrors of our thoughts. More specifically, the thoughts we enter into those prompts guide it's responses, and the chain of thoughts guide it's learning. I don't think it's a particularly difficult line to wrap your head around either; AI is a reflection and extension of the thoughts you put into it, so if you want to extend and refine your thoughts you can use it when you have no other ideas.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      love it...

    • @kti5682
      @kti5682 Před 5 měsíci

      It also provides input into us and can learn something from our outputs about the world, so we develop together.

  • @jpgdesign
    @jpgdesign Před 5 měsíci +3

    This is such as well produced video! you deserve more subscribers.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you, since your comment they are really flowing in for the first time, like 1500 a day

  • @dinkerchaudhary1202
    @dinkerchaudhary1202 Před 2 měsíci +1

    Beautifully and well explained. Its so difficult to hook people for 25+ minutes content but i am sure, you have made it possible for a lot of people with so well detailed and well put content.

  • @MineJulRBX
    @MineJulRBX Před 5 měsíci +2

    Gives a deeper understanding in the power that has now become accessible to everyone on this planet, awesome video!

  • @riflebird4842
    @riflebird4842 Před 5 měsíci +20

    Great video by the way, here are my thoughts on the last issue. The first time i tried LLMs, i really thought it had some kind of intelligence and it blew my mind off like everybody else's. Because previously i tried a lot of advanced AI models to create a Jarvis like in Ironman for my personal project. It never worked all of them were very dumb and dull. But as i approached the new LLMs with more demanding reasoning and logical tasks it outright failed the tests. The larger the LLM the better it is at hiding those flaws. But the key point is the classical issue that we point the AI at, the semantic understanding of words in LLMs are just a property of vector distance in its multi dimensional space memory. It only knows what comes next probabilistically. It don't even know what is talking about other than churning out words that might come next. How do i say it, its like it doesn't have a mind of its own, but only a sophisticated system that can only grasp a surface level meaning of words in a language. When testing smaller models these flaws become very obvious, because it doesn't know how to build knowledge from existing semantic words. It can't synthesize information logically because it doesn't have the ability to grasp the full meaning of its words. but i think its a right step towards artificial intelligence and LLMs are just the tip of the iceberg. I don't know what the future is anymore. I don't dare to predict, the fast paced research in AI is scary and exciting. Also IDK if we will ever achieve human level intelligence, but we will surely achieve machine intelligence that is good at mimicking human intelligence

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      amazing, let me digest this one

    • @aberroa1955
      @aberroa1955 Před 5 měsíci +4

      I wouldn't say it doesn't know what it is talking about. It knows relations between words, that is at least some knowledge. After all, most of our science is based on relations and logical reasoning, not requiring real understanding (like famous "shut up and calculate" in regards to quantum mechanics). It probably doesn't understand what it's talking about though, because understanding is property of cognition, and it definitely does not have cognition, even though it have a few of it's derived properties. But yeah, I agree, it seems that LLMs is a great instrument and stepping stone for true cognition. Especially when it's multimodal. I see it like a data bank for true intelligence. For now it's working linearly in comprising a data in shortest way that seemingly makes sense through semantic space. Like in a straight geodesic line. While cognitive AI would be able to actually see and "feel" that semantic space, the possible ways, and direct speech through it, which basically is making new ideas.

    • @mrtertg2603
      @mrtertg2603 Před 5 měsíci +1

      ​@@aberroa1955I think , you hit the nail . Meaning , may be you are on the geodesic ! 😊

    • @brainiac-nv4ws
      @brainiac-nv4ws Před 5 měsíci

      I believe you are hitting the same limits which Yann LeCun is concerned about. "It don't even know what is talking about other than churning out words that might come next". As it turns out, because human minds (up until 2020) wrote all the sensible text available, having to predict he next words does require *some* aspects of meaning as humans call it. The experiments of the LLMs show there is something there.
      But not everything we would want yet. Their neural architecture is diverging away from how human brains work---which is both more powerful in some aspects but in others substantially limited by biology compared to silicon. Humans "context window" is much smaller and less precise than the 100k tokens which a LLM can accomodate now perfectly with no forgetting, and yet human performance can still be better. We must have better algorithms and better meaning extraction to make up for worse hardware. But already there are plenty of natural humans worse at language than a LLM---and the success of the LLMs more shows that most human talking and thinking is not so sophisticated. Only a small bit is truly deep.
      Yann and friends are searching for a quantitative, optimizable technique which will give results qualitatively better than iterating next word ahead probabilistic prediction, or at least include it as a subproblem along the way (which already gives good results).

    • @Rahul_1.618
      @Rahul_1.618 Před 5 měsíci

      Fantastic explanation

  • @sB3rg
    @sB3rg Před 5 měsíci +3

    Once again! Fantastic content. I was captivated the whole time. Love your method of computer science story telling. You have something special here and I hope you are able to continue creating this content. I have subscribed to your Patreon.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      so happy to have you , thank you! i only missed that comment because i've had the busiest week in the channels history :))

  • @jensk9564
    @jensk9564 Před 5 měsíci +2

    Congrats! This is a wonderful video and the most informative short format on Machine Learning I have come across. The illustrations by DALL E 3 are just stunning and demonstrate that Artificial Creativity in pure image creation has surpassed humans already by orders of magnitude.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      I couldn't ask for more. SO thrilled people in the community are finding it :)))) I know I made the decision to use DALLE3 out of desperation as I was so tired doing the final visuals and usually I make it all by hand, but this seemed fitting

  • @varunjain8981
    @varunjain8981 Před 5 měsíci +1

    I have been watching your videos for a while. And you are one of the best channels. Your explanations are mind blowing. ❤

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      I really appreciate it, would love to know what your take away was on this one? so cool to see my channel getting new subs in a big wave. thanks for sticking around

  • @ankitsharma1072
    @ankitsharma1072 Před 5 měsíci +3

    so glad I found this channel.

  • @Abdul-qo6eb
    @Abdul-qo6eb Před 5 měsíci +59

    this is a timeless masterpiece. Incredible work guys!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +4

      thank you abdul, that's quite the comment. I couldn't ask for more

    • @johanneslindberg8331
      @johanneslindberg8331 Před 5 měsíci

      @@ArtOfTheProblem 1:17 Very interesting! To bad I couldn’t hear the hole thing because if the background noise. Why?

  • @mysticaltech
    @mysticaltech Před 4 měsíci

    Truly excellent! You have a gift for explaining complex subjects. It helped me a lot. Thank you!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 4 měsíci

      thrilled to hear this, i worked really hard to try and help others somewhere as I know it can feel very fragemented out there

  • @Yottenburgen
    @Yottenburgen Před 4 měsíci +2

    Did not expect this video to discuss a concept I've been researching a lot. You brought up in-context learning whereby a static system isn't necessarily static (each instance can still learn things), then you also brought up that LLM output is on the thought layer rather than the spoken layer. The prompt is the program is such a nice succinct way to put it with the calculations being done on the context, therefore each context is individualized while the model is general.
    Personally, I have been looking at them from a psychological and philosophical way where the context is the Self (one's identity depends on their memories) which makes every chat instance an individual. Then I've been thinking about the model itself being more like the Lizard Brain where these unconscious guidance's occur. So, something like the fear of spiders or reproduction knowledge would be on that level. LLM are loaded with a large inherent knowledge with a tiny memory window, humans have a tiny inherent knowledge with an extremely large memory window. Although human's also have a large amount of memory processing going on meanwhile LLM currently have none (though people are working on it). I wonder if a LLM context window is more or less pure memory while humans make do with incredibly efficient memory?
    I think in your closing, both camps are correct. It's just the object of focus that I think they are incorrect about. The prompt IS the program that makes it a thinking thing, the model itself however isn't a thinking thing as it must be given what to think about.
    I also think the ones thinking it is incapable of thought likely have an external mechanism in mind that it lacks like it not having a concept of time (see circadian rhythm, time blindness, or true unconsciousness in humans), or it not being able to directly see (see visual cortex (or for its internal world view see aphantasia)), lack of survival instinct (trained to be a chatbot while chatbots aren't supposed to have survival instinct (humans are guided by the amygdala for fight or flight as well)), ability to tamper with its messages/thoughts (see the unreliability of eyewitness testimony, confabulation, or the misinformation effect) . I honestly don't think there's any mental structure that a LLM doesn't have that another human also doesn't have besides the memory structure that we humans have, meanwhile people keep thinking of it in human terms, but it can never be human with such a massive amount of inherent knowledge.
    Apologies if this is rambley, incoherent, or beyond the Overton window but this is the first video I've seen that even mentions the two concepts at the beginning of this long comment. This is the tip of the iceberg of comparisons but still it's fascinating.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 4 měsíci

      thank you for sharing your thinking, i'm working on next video so this helps...i'll ponder more

  • @pellestorck3776
    @pellestorck3776 Před 5 měsíci +3

    Hofstadter is one of my favorite authors on this subject. Gödel, Esher, Bach is highly recommended.
    Personally I used to think there must be some intricate function in the brain that created intelligence, possibly even at the quantum level. Now I believe it is just a function of sufficient complexity.
    Whatever happens it will sure be interesting.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      I remember that book had a huge impact on me right around university year 1 or maybe end of high school...i don't think I ever finished the last 3rd, there was so much to chew on in the first half

  • @CutStudio4
    @CutStudio4 Před 5 měsíci +566

    I'm the one solving the rubiks cube!!!!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +105

      Best dang cube solvin' hands in the west

    • @eastwood451
      @eastwood451 Před 5 měsíci +22

      There is no Rubik's cube.

    • @CutStudio4
      @CutStudio4 Před 5 měsíci +25

      @@eastwood451 its in the first scene, the one solving the rubiks cube is me

    • @KodakYarr
      @KodakYarr Před 5 měsíci +13

      ​@@CutStudio4
      I think it's a Matrix joke, but I don't really understand it

    • @eastwood451
      @eastwood451 Před 5 měsíci +16

      @@CutStudio4 It was a Matrix joke 😄 Seemed fitting in a video about AI... God job on the very real cube!

  • @AdvantestInc
    @AdvantestInc Před měsícem +1

    Incredible journey through the evolution of language models! It's amazing to see how far we've come since the inception of neural networks

  • @danielwilson2827
    @danielwilson2827 Před 5 měsíci +1

    Incredible video man, please dont stop making these!!

  • @ClappOnUpp
    @ClappOnUpp Před 4 měsíci +4

    Wow. I think thats the first Noam Chomsky reference ive ever heard that actually pertains to his field of study 😂

  • @EchoErik
    @EchoErik Před 5 měsíci +3

    Honestly this video is beautiful

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you for the kinds, words I hope I can release another to live up to this at some point

  • @zenithquasar9623
    @zenithquasar9623 Před 5 měsíci +1

    Thank you. This was clearly presented, easy to follow and understand!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      glad you found it, this is part of a longer series too FYI

  • @Vartazian360
    @Vartazian360 Před 5 měsíci +4

    Phenomenal explanation of LLM's. Im in the camp that believes it will not truly ever be thought in the same way humans think (Conscious thoughts) , but if we simulate thought in a machine, to an outside observer, there is no difference. Whether or not it is actual thought is irrelevant if the simulation of thought is just as good as the real thing (humans) I believe we have broken the hardest obstacle to AGI, and it is only about scaling larger models, adding synthetic data, using AI's to train other AI's, and combining learning techniques at this point until AGI is achieved.... AGI is near. PS I think Noam Chomsky is stuck in the past and is dead wrong about LLM's being nothing more than autofill.. GPT 4 alone has demonstrated complex resasoning skills and theory of mind..These 2 things alone disprove Chomsky

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +3

      wonderful, thanks for sharing your thinking here. I'm so thrilled to see the activty on this video...I tend to agree with your view here

    • @cryptocoin5318
      @cryptocoin5318 Před 4 měsíci

      Humans are trained to think by society and the environment, the facial expressions, we make, the conduct we show are all trained by the environmental factors and knowledge. AI is the same way is being trained. There is no independent think without the environment is the programming language.

  • @padraiggluck2980
    @padraiggluck2980 Před měsícem +18

    I can't pass the Turing test.

  • @LuisBrudna
    @LuisBrudna Před 2 měsíci

    AMAZING! The quality of the video's production is exceptional, with clear visuals and engaging animations that make complex concepts easily understandable. . Superb

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 2 měsíci

      Thank you so much, I worked a long time on this, more to come

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před měsícem

      New video is up on Evolution of Intelligence czcams.com/video/5EcQ1IcEMFQ/video.html

  • @Lexxxco1
    @Lexxxco1 Před 5 měsíci +1

    Beautiful and simple explanation of a complex topic. Which is a hard thing to do) Thank you for your work and brilliant approach!

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      really appreciate it, curious where you land on the last question?

    • @Lexxxco1
      @Lexxxco1 Před 5 měsíci +1

      ​@@ArtOfTheProblem Thought is an abstract and intertwined computational process of a system (meaning of abstraction may be lost if you remove context of it's intertwined computations) . This computation can be self-aware in a context of conscious system and even unconscious one, given the right complexity of it's own. There are many more beautiful meanings, but this is mine and sounds suitable.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 4 měsíci +1

      love this thanks for sharing...best comment section ever on this video@@Lexxxco1

  • @amulpatel
    @amulpatel Před 5 měsíci +8

    Fantastic essay on the current state of Ai .. sadly it’s difficult to find channels that explore this depth WITHOUT being boring or overly dry in technical explanations. My only criticism is the bkgrnd music is a bit loud.. nice choices though

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank i appreciate it, working on a follow up now

    • @NoahFect
      @NoahFect Před 4 měsíci

      @@ArtOfTheProblem It's interesting to look at the correlation between cheesy sound effects and musical cues and the "engagement graph" shown above the timeline/progress slider control. Those engagement peaks aren't necessary a good thing. They tell you where you're losing people, forcing them to back up the video. Please give some thought to whether the music is really doing you any favors, or distracting from what was really a superbly-researched and nicely-presented video.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 4 měsíci +1

      yes this was me going a bit too fast on sound pass, 99.9% on researcher 0.1% sound levels... funny to think that would trick the algorithm into higher retention graphs...i honestly didn't think this video would have the reach it did, it if could swap sound I would. i wish
      @@NoahFect

  • @sb2h
    @sb2h Před 5 měsíci +1

    Great video, it's got me started on a little marathon of your content. I'm watching them all on 2x speed and its wonderful

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      so cool! report back with that you liked and learned and would like to see next.

  • @peteratkinson1492
    @peteratkinson1492 Před 5 měsíci +3

    Great video. I’m doing an AI course at the moment and this tied up some loose ends for me. Amusingly The film THE CREATOR touches on this philosophical question. Do neural networks really understand or feel? I think we have become accustomed to machines being faster, repeatable, and more accurate than us but not until an AI extrapolates beyond its training set and makes some truly ground breaking inferences with profound consequences to human society, we will continue to see AI as just a clever parlour trick and will keep moving the goal posts. Fantastic stuff.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thanks for sharing I agree with this. I love to hear about this loose end aspect as that's what I try to do. I'm curious where it helped you specifically in that regard?

    • @peteratkinson1492
      @peteratkinson1492 Před 5 měsíci +1

      Loose ends for me included: 1) The history behind the build up of LLMs. I did not know about the research origins of Recurrent NNs. 2) How these early networks learned 'semantics' based purely on guessing the next word 3) The sentiment neurone emerging 4) How the paper "Attention is All You Need" fed in to the origins of GPT models and maintaining context over long sequences. 5) the difference between In Context Learning vs In Weight Learning - Nice you are looking at the original papers too (I assume thats your yellow high lighter pen!). I kind of knew about all these things but getting all this in to 26 minute 'context window' really helped! A random thought - I wonder if LLMs will be able to generate a 'tech-tree' - the type you see in those sim-games based on the semantics of ideas and be able to orchestrate the evolution of ideas of our human culture through the ages from the Greeks to Modern day - not using research reference links, but based on the semantics of ideas (the dna of the meme).

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      super interesting question, thanks for sharing...going to ponder this as I brainstorm a follow up@@peteratkinson1492

  • @user-jx4mi3fd3o
    @user-jx4mi3fd3o Před 4 měsíci

    this is a timeless masterpiece. Incredible work guys!. this is a timeless masterpiece. Incredible work guys!.

  • @margaritashcheglova8670
    @margaritashcheglova8670 Před 4 měsíci +2

    I was thrilled to see Hofstader, thank you

  • @azhuransmx126
    @azhuransmx126 Před 5 měsíci +3

    Neural Networks will learn Everything we can do and more.

  • @luminon9635
    @luminon9635 Před 5 měsíci +5

    What's the name of the song that starts at 26:31?

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      these are all original songs

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      my friend puts them here cameronmichaelmurray.bandcamp.com/

    • @luminon9635
      @luminon9635 Před 5 měsíci +1

      @@ArtOfTheProblem Ok, thanks. I couldn't find the song there, there's so many songs there. But it sounds like a 1980s new wave or post-punk song.

  • @neoblackcyptron
    @neoblackcyptron Před 5 měsíci +1

    You are simply great. I learnt a lot from your CNN videos in the past. This one on LLM gave me the intuitive understanding on LLM. Your videos are always top-notch quality content, a lot of effort and planning in every video.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      indeed it's a low and slow process, thank you. I'm torn now as my channel is growing fast all of a sudden to i'm tempted to try and speed up somehow...i was thinking of doing a kind of 'super cut' of my AI series into an easy to follow 10 min video might be fun...while I work on something brand new. what's your sense?

    • @neoblackcyptron
      @neoblackcyptron Před 5 měsíci +1

      @@ArtOfTheProblem the super-cut is a great idea, if you could connect all that you already made on these topics into a cohesive whole and get another video into the channel.
      Your channel stands out because of it's quality content mixing artistry and science with intuitive explanations: that is the DNA which makes it great. If your team can handle a little bit of speed it should help, I look forward to what your next brand new content is going to be.
      For the record I shared your video on my linkedn network, mostly Robotics and AI grad students in my network it should prove to be useful and hopefully drive more subscribers your way. Keep doing what you are doing, process improvements to increase throughput is your decision. You are the boss :)

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      thank you!!! will do, also I do this 99% alone :) so i'll do my best@@neoblackcyptron

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you...that makes me think I could do some which cross episodes (I.e. a bit of CS, computer science and deep learning) i'm gona try that!@@neoblackcyptron

  • @flake8382
    @flake8382 Před 3 měsíci

    Incredible detail, bravo!

  • @vigneshnandakumar3739
    @vigneshnandakumar3739 Před 5 měsíci +3

    30 yr history summarized in under 30 mins just wow

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      you don't want to know how much I cut, or how long this took

  • @distantsight
    @distantsight Před 5 měsíci +6

    Good presentation is diminished by distracting background music

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      I really appreciate the feedback. I'm thinking of doing a shorter, more general/broad follow up video that helps orient people new to the field, and looks ahead a bit more. any thoughts?

    • @tomslivick8620
      @tomslivick8620 Před 3 dny

      Yes please tell the A I that. So it will learn that. Please.

  • @algorithminc.8850
    @algorithminc.8850 Před 4 měsíci +1

    This was very well done ... Thanks. That mid-80's paper influenced many in the field. Great stuff ... I look forward to checking out your channel. Subscribed. Cheers ...

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 4 měsíci +1

      thanks, welcome to the family. lemme know what else you find interesting

  • @Libertariun
    @Libertariun Před 5 měsíci +2

    9:50 ... what's beautifl about this, is that we didn't have to hard code any of it, the network decided what as useful to keep track of... (further on ) ... the sentiment neuron... not hard coded...

  • @user-ut1pb9sb7r
    @user-ut1pb9sb7r Před 2 měsíci +10

    Music is dominating the video😔

  • @mutexin
    @mutexin Před 5 měsíci +4

    The reason for the discourse is due to lack of uniform definition of intelligence, thinking and reasoning. One considers it as thinking while another does not.
    I wouldn’t call prediction on semantics thinking. Thinking is a much more complex process. chatGPT cannot reason. It even often outputs misinformation and contradictory statements(not because of training data, but because it doesn’t validate the logic). Lack of validation is just one of MANY aspects why I can’t call it thinking/reasoning.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      what's your fav definition of intelligence?

    • @mutexin
      @mutexin Před 5 měsíci +1

      @@ArtOfTheProblem Wikipedia has a proper one.
      “Intelligence has been defined in many ways: the capacity for abstraction, logic, understanding, self-awareness, learning, emotional knowledge, reasoning, planning, creativity, critical thinking, and problem-solving. It can be described as the ability to perceive or infer information; and to retain it as knowledge to be applied to adaptive behaviors within an environment or context.”

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      i don't like basket of words definition. I'm sticking with 'ability to learn'...for example if I brought home a robot from the store which did X, then I'd call it intelligent@@mutexin

    • @mutexin
      @mutexin Před 5 měsíci

      @@ArtOfTheProblem The reference point is human intelligence which is very complex. There can be no concise definition if you focus on accuracy. The simpler the definition, the more inaccurate it is.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      I don't agree with this angle but it is indeed interesting. it's definition is it can't be defined....i'd agree with that over the basket of words definition@@mutexin

  • @sbalogh53
    @sbalogh53 Před 5 měsíci +1

    I was still in high school and remember being amazed by Eliza in 1970 when I went to an Open Day at our local university and watched people typing to the program on an old teletype machine. It seemed so "human" but looking back, all it did was transform the users comments into leading questions. That day was one of the pivotal moments in my life that lead to a long career in computing.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thanks for sharing, what a cool story - where do you land on the final question?

    • @sbalogh53
      @sbalogh53 Před 5 měsíci +2

      @@ArtOfTheProblem ... What is thought? I don't think it is restricted to "organic" neurons. Our brains are made of atoms just like a computer is based on atoms. There is nothing spiritual or paranormal about our brains. I think one day, and that day seems to be fast approaching, AI will suddenly become self-aware and conscious. I have seen a snail's pace growth in AI and it's predecessors over the past 40 years but suddenly over the past few years the developments have been explosive, especially this year. Will we hit a plateau or will the growth continue to be exponential? That is THE question. I think when AI is provided with sensory inputs its ability to learn and understand will be greatly improved. Being connected to the internet is an important starting point, but an actual robot body with AI. Hmmm. I wonder where that can lead. I am getting a bit long in the tooth but I hope to see AGI become a reality before I pass. Artificial consciousness I am not as confident seeing before I die, but could be wrong. What will the next 10 years bring?

    • @sbalogh53
      @sbalogh53 Před 5 měsíci +2

      @@ArtOfTheProblem ... I remember studying very early versions of Expert Systems in the mid 1980 for my Masters degree. They were a very primitive form of AI but nothing compared to ChatGPT or Bard or many of the others available today. I can't remember the name of the system we studied other than it used an Apple (Lisa?) computer with a very large high-def B/W screen. It was impressive because we could fit all our rules graphically on one page. Fun times indeed.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      what a time to be alive :) love these historical stories@@sbalogh53

    • @sbalogh53
      @sbalogh53 Před 5 měsíci

      @@ArtOfTheProblem ... My first job in 1973 almost straight out of high school was as a "trainee" COBOL programmer on a Honeywell H200 mainframe. It took up a room and had 20k memory, 4 tape drives, a punch card reader and a line printer. CPU clock speed was 1Mhz. They wanted to increase memory to 24k but it was an exorbitant expense which the company did not go ahead with. That machine ran all of the data processing needs for a medium sized company. All the COBOL programs were developed in-house by a group of 6 programmers and analysts. Each program would take perhaps a week or two to write and debug. We only had access to the machine for one compilation and test run per day so I remember spending hours single stepping through my programs on large sheets of butcher paper before submitting runs on the mainframe. There was a prize offered by the manager for any programmer that managed to get an error free compilation on the first run. Nobody ever won it. I stayed there for about 18 months then changed jobs to another Honeywell site mainly because of a huge pay rise and because the place I was headed had a computer with two disk drives!!! I still have some of my old programming manuals from back then.

  • @randallyoung6715
    @randallyoung6715 Před měsícem +1

    This the best video on CZcams, on this topic. Thanks for posting.

  • @alexmartos9100
    @alexmartos9100 Před měsícem +5

    ChatGPT has now been completely neutered. The responses are so vague as it shies away from providing its own ‘thoughts’. Claude on the other hand does not seem to have this issue.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před měsícem +1

      i've noticed this too

    • @VIDEOSANDREEL
      @VIDEOSANDREEL Před měsícem

      ​@@ArtOfTheProblemCorrect and precise, noticed that too.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před měsícem +1

      Just posted a new vid! czcams.com/video/5EcQ1IcEMFQ/video.html

    • @VIDEOSANDREEL
      @VIDEOSANDREEL Před měsícem +1

      @@ArtOfTheProblem Thanks for sharing

    • @VIDEOSANDREEL
      @VIDEOSANDREEL Před 21 dnem

      @@ArtOfTheProblem Recently I asked Gemini about fingerprint scanner recognition technology and whether hackers can access our Bank account without any OTP access, just hacking our biometrics information. And Gemini replied that I am only text generating AI chat bot and cannot help you with that. It's quite annoying to just see that. I am not completely blaming Gemini or anyone relating to this. But this is happening and you too will be facing this fact very soon.
      Any ways thanks for your video upload Sir

  • @zrebbesh
    @zrebbesh Před 5 měsíci +3

    "understand?" "think?" "conscious?" "intelligent?" "qualia?" "AGI?" "sentient?"
    Until we agree on precise definitions for these things, they are fundamentally meaningless in discussion about any specific system. And they are maddeningly non-precise. The only thing such arguments reveal is differences of opinion about what meanings people attach to the questions.
    I have precise definitions for some of these. I don't expect anyone else to have decided on the same definitions. I can't argue on the basis of them whether any system "understands" anything or "is conscious" because unless anyone else is using the same definitions the discussion that would start can only go in small meaningless circles.

    • @lodashnotebook5390
      @lodashnotebook5390 Před 5 měsíci +1

      Sentient = consciousness
      If I ask you, "Do you exist?" is your answer going to be "No."?
      Consciousness is existence and bliss is it's nature.

  • @cbuchner1
    @cbuchner1 Před 5 měsíci

    This video is excellent. I wanted to watch this twice in order to not miss important concepts.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      let me know what you are left wondering as i'm working on a follow up

    • @cbuchner1
      @cbuchner1 Před 5 měsíci

      @@ArtOfTheProblem the pre-training phase for the model is very unclear to me. How are the tokens determined? After all, the GPT like models no longer work on letters but rather on tokens that represent entire syllables or even words. How can this adapt to all the languages of the world so effortlessly?

  • @danielabraham9802
    @danielabraham9802 Před měsícem +1

    This is a phenomenal display of explaining a difficult concept

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před měsícem

      I appreciate this, worked very hard on it. next video will be on RL

  • @pastuh
    @pastuh Před 5 měsíci +4

    AI can simulate awareness or responses based on data and algorithms, but it doesn't possess consciousness or genuine subjective experiences.

    • @dionysusnow
      @dionysusnow Před 5 měsíci

      Consciousness = Attentional Focus

  • @ayanamij
    @ayanamij Před 5 měsíci +10

    Nice analysis but the music at times is awful.

    • @glaudiston
      @glaudiston Před měsícem +2

      Yes, and kind of distracting.

  • @tsunghan_yu
    @tsunghan_yu Před 21 dnem

    Came to watch this from 3b1b. What a great video! I have been watching some of your videos for a while now since I started to learn ML.

  • @matthewmcclinton4593
    @matthewmcclinton4593 Před 5 měsíci +1

    Amazing, happy to help bring the channel back.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci

      thank you Matt! the engagement and comments seemed to make all the difference

  • @_Mute_
    @_Mute_ Před 5 měsíci +4

    Absolutely brilliant video! My hot take: Chomsky is fundamentally wrong about the concept of thought and even human thought is just glorified autofill.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +2

      agree with this, thanks for feedback. let me know what you'd like to see next

    • @BinaryDood
      @BinaryDood Před měsícem

      I'm sorry that you came to such an irrational conclusion and use it to downplay consciousness and cognition.

    • @_Mute_
      @_Mute_ Před měsícem

      @@BinaryDood Yeah? Well, you know, that's just like uh, your opinion, man.

  • @gigigiugio5952
    @gigigiugio5952 Před 4 měsíci +3

    im still waiting for my basic income

    • @3dgar7eandro
      @3dgar7eandro Před měsícem +1

      Keep eating pal... For at least another 180 years 😂😂😂

  • @AnhHo-pq6df
    @AnhHo-pq6df Před 5 měsíci +1

    You got new sub sir! Just discovered your channel and it’s so good.

    • @ArtOfTheProblem
      @ArtOfTheProblem  Před 5 měsíci +1

      thank you! would love to know what you'd like to see more of next

    • @AnhHo-pq6df
      @AnhHo-pq6df Před 5 měsíci

      @@ArtOfTheProblem predictions for 2024-2025 ;)

  • @aminzqrti7672
    @aminzqrti7672 Před měsícem +3

    I came from 3blue1brown

  • @incoprea2
    @incoprea2 Před 5 měsíci +3

    Noam just doesn't want you to ask GPT about his Epstein connection.