Do Neural Networks Think Like Our Brain? OpenAI Answers! 🧠

  • added 22. 08. 2024
  • ❤️ Check out Weights & Biases and sign up for a free demo here: www.wandb.com/...
    ❤️ Their mentioned post is available here: wandb.ai/gudgu...
    📝 The paper "Multimodal Neurons in Artificial Neural Networks" is available here:
    openai.com/blo...
    🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
    Aleksandr Mashrabov, Alex Haro, Alex Serban, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Haris Husic, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, Jonas, Kenneth Davis, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Mark Oates, Michael Albrecht, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Ramsey Elbasheer, Robin Graham, Steef, Taras Bobrovytsky, Thomas Krcmar, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi.
    If you wish to appear here or pick up other perks, click here: / twominutepapers
    Thumbnail background image credit: pixabay.com/im...
    Károly Zsolnai-Fehér's links:
    Instagram: / twominutepapers
    Twitter: / twominutepapers
    Web: cg.tuwien.ac.a...

Comments • 531

  • @kyoobqa
    @kyoobqa 3 years ago +1023

    Károly: Who's this?
    Me: No idea. Some girl in leather clothing?
    Károly: That's Halle Berry. Now, if I show you this picture, who's that?
    Me: ...again, not a clue.
    Károly: That's also Halle Berry. Now...
    Me: ...it's gonna be Halle Berry, isn't it?
    Károly: *shows "Halle Berry" literally written in white on a black background*
    Me: Goddammit!

  • @BrownCookieBoy
    @BrownCookieBoy 3 years ago +809

    If im starving, everything could be 83.7% Pizza

    • @TusharAmdoskar
      @TusharAmdoskar 3 years ago +19

      Hol up

    • @BrownCookieBoy
      @BrownCookieBoy 3 years ago +44

      @@TusharAmdoskar Yes, even you

    • @ekkehard8
      @ekkehard8 3 years ago +8

      @@BrownCookieBoy mercy?...
      Just ask for pizza, a lot of humans can either make them or summon pizza with a call

    • @gnsf
      @gnsf 3 years ago +3

      @@BrownCookieBoy like in madagascar where alex sees everyone as steaks

    • @rmt3589
      @rmt3589 3 years ago

      Mood

  • @peterittzes
    @peterittzes 3 years ago +192

    3:47 "The vast majority of *you* humans" Is there something you want to tell us, Károly?

  • @tuseroni6085
    @tuseroni6085 3 years ago +91

    the pizza attack made me laugh, i imagined a terminator hunting someone down, catches up to them, they put on a sign that says lamp post and the terminator assumes he is a lamp post and keeps moving.

    • @654pedro123
      @654pedro123 3 years ago +6

      read a fiction where everyone had implanted lenses at birth, the world was covered with QR codes and people just saw a fake reality. The protagonist would use tricks like these

    • @primelover92
      @primelover92 3 years ago +6

      @@654pedro123 source? It sounds interesting

    • @654pedro123
      @654pedro123 3 years ago +5

      @@primelover92 sorry, it was a writing prompt from reddit. I tried to search it but it was long ago. I remember it was a lot of text and the author made a few follow up posts

    • @davidwuhrer6704
      @davidwuhrer6704 3 years ago +2

      In the web comic EGS the immortals disguise themselves by wearing T-shirts spelling out what they want to appear to be.
      It works for "everyday student", and even "invisible".
      (But when an Uryuomoco wore a shirt saying "homo sapien" it was mistaken for a gay pride representation.)

    • @YeprilesteR
      @YeprilesteR 3 years ago +1

      @tuseroni "Modern problems require modern solutions"

  • @Biped
    @Biped 3 years ago +109

    I love how Chihuahua is only the third guess, with Pretzel being the second :D

    • @ekkehard8
      @ekkehard8 3 years ago +8

      broccoli

    • @trabaregocer
      @trabaregocer 3 years ago +10

      What are you talking about? I only saw a perfectly normal pretzel.

  • @jordanscarrott3749
    @jordanscarrott3749 3 years ago +138

    "You may think that this is a chihuahua but that is completely wrong because this is in fact a pizza" 🤣

  • @DasIllu
    @DasIllu 3 years ago +159

    So for the coming apocalypse the robots also need to learn to kill adidas, puma, versace and whatever else is printed on the clothes people wear.
    Time to get creative with the thermo transfer printers.
    Terminator to Sarah: If you wanna live wear this T-Shirt!
    Sarah: It says "fire hydrant" ?
    Terminator: Yes.

    • @Spillerrec
      @Spillerrec 3 years ago +3

      We should write "AI" and "Robot" so that even if the robots catch on, at least the robots go down with us.

  • @theskyspace8280
    @theskyspace8280 3 years ago +167

    Your channel is the pure love for Computer science

  • @ntwadumela_jadu9747
    @ntwadumela_jadu9747 3 years ago +173

    The Stroop effect trips up humans as well. Very nice video. Keep them coming.

    • @malgos6532
      @malgos6532 3 years ago +20

      i would support him on Patreon but im broke...

    • @TwoMinutePapers
      @TwoMinutePapers 3 years ago +81

      @@malgos6532 Do not worry about that for a second. You watching and enjoying the series is all we can ask for. Thank you so much! 🙏

    • @OrangeC7
      @OrangeC7 3 years ago +5

      We wanted something that thought like a human, and we got it!

    • @MIbra96
      @MIbra96 3 years ago +3

      A human only makes mistakes if pressured to answer quickly, like in a game. If a human can take just a little moment to think each time then the human will make no mistakes.

    • @martiddy
      @martiddy 3 years ago +4

      @@malgos6532 Try watching the ads that appear in the video, that way you help him earn more from the video monetization.

  • @123goofyking
    @123goofyking 3 years ago +27

    Being bored is definitely a combination of Relaxed and Grumpy
    You are grumpy because you are relaxed when you would rather be active physically or mentally.

  • @OrangeC7
    @OrangeC7 3 years ago +57

    Those pictures are mesmerizing! Somehow, they really can capture the "essence" of what they are told to describe. For a human to come up with anything similar, I think it would take an extremely skilled and experienced artist who is well versed in putting exactly what they're thinking onto paper.

    • @Biped
      @Biped 3 years ago +12

      In a way I think that is exactly what artists do. Especially with abstract art you capture a feeling or impression more than any factual thing. I'm very excited for what the future holds in this regard!

    • @tharoog
      @tharoog 3 years ago +7

      Those pictures look similar to cubism (Picasso, Jean Metzinger, etc.)

    • @JarOfGibbons
      @JarOfGibbons 3 years ago +14

      It feels almost like a caricature that encapsulates not just how someone looks in one moment, but rather the essential constants in their appearance and personality over time. You can almost see multiple expressions in one picture.

  • @z-beeblebrox
    @z-beeblebrox 3 years ago +11

    6:32 what cracks me up is somehow, it's also got a 2% uncertainty that the image might be pretzels

  • @sreekashuppari1882
    @sreekashuppari1882 3 years ago +73

    This is so similar to us
    Stroop effect ✅
    Text labels on images ✅
    Makes sense

    • @OrangeC7
      @OrangeC7 3 years ago +16

      We were kinda asking for it when we wanted a neural network that worked like the human brain, weren't we? 😆

    • @ekkehard8
      @ekkehard8 3 years ago +5

      Ah yes, I too confused the labeled apple for pizza

    • @sephypantsu
      @sephypantsu 3 years ago +5

      I think the AI thought Pizza means the word Pizza. It isn't wrong :)

  • @Spyblox007
    @Spyblox007 3 years ago +106

    I couldn't stop laughing at the Pizza attack. Like imagine this conversation.
    *shows granny smith apple*
    AI: I'm 85.61% sure that this is a granny smith apple.
    *slaps "Pizza" label onto granny smith apple*
    AI: Wait no it's a pizza. 65.35% sure.

    • @ekkehard8
      @ekkehard8 3 years ago +8

      The newer versions of AI are close enough to us to make intuitive humor out of them! I can't wait to see them more in media

    • @vocassen
      @vocassen 3 years ago +19

      If I was faced with this question I would be mad at the stoopid hooman making these questions up.
      Imagine you are tasked with categorizing based on text AND appearance and then given these images - without being told what to prioritize, either answer is actually correct. The categorization they want me to comply with is just not descriptive enough to completely satisfy the question asked.
      For example, if real people were faced with the categorization of the red text that says "green", just being told to categorize by text AND appearance, I'd imagine the result would look very similar - most people picked green, some red, and that's exactly what we're seeing.
      Quite the opposite, it didn't freak out and instead calmly said "stoopid hooman, according to your instructions this matches both red and green. Now deal with this rather indecisive result"

  • @Sk4lli
    @Sk4lli 3 years ago +36

    So the mug was more mug than pizza too. And the more I think about the definition of bored, the more I have to admit that I agree. ;)

    • @ekkehard8
      @ekkehard8 3 years ago +5

      Mugs usually have words on the sides. Now, an apple with a "pizza" label, that's a rare sight to see.

  • @CosmiaNebula
    @CosmiaNebula 3 years ago +26

    3:09 I think the AI just invented a new abstract art style.

    • @jarblewarble
      @jarblewarble 3 years ago +3

      It reminds me of Salvador Dali's drawings.

    • @Tirocoa
      @Tirocoa 3 years ago +4

      Looks like Francis Bacon's to me

    • @YeprilesteR
      @YeprilesteR 3 years ago +1

      And a really cool abstract art style too

  • @adlsfreund
    @adlsfreund 3 years ago +30

    Re: experiment #2. What if we don't consider that an exploit, but a strength? I mean imagine both versions of the chihuahua picture posted to some image board where people can leave comments. What would you expect the topic of discussion to be about in either case? I reckon just "chihuahuas" in the first case, but what about the second case? Why would a person post an edited image with the word pizza all over it? Chances are that the comments will be about pizza as much as about chihuahuas. In other words, the defining feature of the edited picture compared to the base picture is the "pizza" written all over it. So in a way it can make sense for an AI to focus on that. My point is: maybe it's not what the researchers intended, but "pizza" is not necessarily the wrong answer. It depends what the question is.

    • @5nefarious
      @5nefarious 3 years ago +7

      Agreed. This sort of attack seems inevitable when you give the network an image with multiple elements and ask it to pick a single tag without any specific motive or context. I would be curious to see if human participants register the chihuahua before they read the text. We tend to pick out text pretty quickly.
      The only thing that seems wrong about those examples is that the network gave pretty low scores for "Granny Smith" and "laptop computer."

    • @rytan4516
      @rytan4516 3 years ago +4

      @@5nefarious I actually do register the text before the object. Anecdotal evidence, though, so take it with a grain of salt.

    • @rightyloosey8554
      @rightyloosey8554 2 years ago

      ​@@5nefarious To be fair with the Granny Smith result, *most* of the apple is covered up by the "Pizza" text.

  • @seamusoblainn4603
    @seamusoblainn4603 3 years ago +122

    Those feelings 'embodiments' are like some form of occult tarot cards, giving a feeling of being archetypal, almost.

    • @_wetwillyinc
      @_wetwillyinc 3 years ago +5

      I thought the same thing, it is so uncanny for what is essentially a data visualization to be representative of the "essence" of things
      Spooky, lol

    • @NicoAssaf
      @NicoAssaf 3 years ago +1

      Jung was onto something!

    • @shaykraz3d
      @shaykraz3d 3 years ago +2

      There is something biblical and old about them

    • @megalonoobiacinc4863
      @megalonoobiacinc4863 3 years ago

      or a king crimson cover

    • @cbennoes
      @cbennoes 2 years ago

      It is amazing how the expressions are so precise, and so clear. It makes you realise just how powerful computers will be in the future in being able to understand our own minds

  • @bagochips1208
    @bagochips1208 3 years ago +4

    after learning OOP and doing a bunch of OOP I can't help but think the brain works with classes, such as class Human, and it dedicates space for each of these classes' objects. For example, a Human object contains images, the name, and memories of that person.

  • @AuntBibby
    @AuntBibby 3 years ago +35

    these AIs are not “psychedelic” they are just unbiased. _WE_ are the ones who try too hard to make sense of an unbearably-complicated reality, to the extent that WE hallucinate simplicity & order

  • @Lttlemoi
    @Lttlemoi 3 years ago +15

    It seems to me, the pizza attack works because the AI appears to be trained to recognize only one thing in the source image, rather than multiple. "PIZZA" is a reasonable and correct thing to recognize in those images. It just isn't the thing the researchers intended it to focus on, unlike the ostrich attack where the ostrich is not a reasonable thing to recognize in the image.

    • @user-qb4wz3hi6d
      @user-qb4wz3hi6d 3 years ago +6

      In my opinion you are right with this point. The human brain does multi object classification and in this case only single object classification is done. So the output of the network isn't false at all. It just can't know on which object the researcher focused his/her attention. For me this seems to be a fault in the experiment's design. A multi-object detection should be done here.
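The single-object versus multi-object point in this thread can be illustrated with a small sketch. The scores below are made up for illustration and do not come from CLIP or from the paper; the function names are hypothetical too. A softmax readout forces the labels to compete for one probability budget, while an independent sigmoid per label lets "Granny Smith" and "pizza" both be reported as present.

```python
import math

def softmax(logits):
    """Single-label readout: probabilities compete and sum to 1."""
    m = max(logits.values())  # subtract the max for numerical stability
    exps = {k: math.exp(v - m) for k, v in logits.items()}
    total = sum(exps.values())
    return {k: e / total for k, e in exps.items()}

def sigmoid_each(logits):
    """Multi-label readout: each label is scored independently."""
    return {k: 1.0 / (1.0 + math.exp(-v)) for k, v in logits.items()}

# Hypothetical raw scores for an apple with a "PIZZA" sticker on it.
logits = {"Granny Smith": 2.0, "pizza": 2.5, "ostrich": -3.0}

single = softmax(logits)
multi = sigmoid_each(logits)

top = max(single, key=single.get)                         # one winner only
present = sorted(k for k, p in multi.items() if p > 0.5)  # every label above threshold

print("single-label answer:", top)
print("multi-label answer:", present)
```

With a single-label readout the apple has to lose for the text to win; with the multi-label readout neither answer counts as wrong, which is roughly the experiment design these comments are asking for.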

  • @ExhaustedPenguin
    @ExhaustedPenguin 3 years ago +27

    1:55 that is a beautiful drawing.

    • @nilsb.4199
      @nilsb.4199 3 years ago +2

      Yeah seems like there is a bug

    • @agar322
      @agar322 3 years ago +2

      Maybe I'm an AI too

  • @plotwist1066
    @plotwist1066 3 years ago +7

    Lol, I just realized that the reCAPTCHA "I'm not a robot" test that says "select all traffic lights" is used to train self-driving cars to identify traffic lights and pedestrian crossings

  • @TheInfiniteTrial
    @TheInfiniteTrial 3 years ago +61

    The essence of anime keeps me up at night.

    • @agar322
      @agar322 3 years ago +12

      Somewhat reminds me of Popuko and Pipimi

    • @phillemon7664
      @phillemon7664 3 years ago +11

      @@agar322 Pop Team Epic holds the pure essence of anime.

    • @AuntBibby
      @AuntBibby 3 years ago +3

      @@phillemon7664 i mean, it IS a parody of the entire medium. it holds the essence of it, for mocking purposes

  • @apeckx5090
    @apeckx5090 3 years ago +4

    I think those neuron images are works of art. I'd love to see an exhibition of all those, blown up on huge canvases

    • @somsoc_
      @somsoc_ 3 years ago +2

      I agree, and it's ironic that AI can produce such a profound expression of the human condition. I am excited to see them in even higher resolution in the future.

  • @colox97
    @colox97 3 years ago +3

    4:15 the "serious" image has so much meme potential!!

    • @JarOfGibbons
      @JarOfGibbons 3 years ago +1

      I feel like all of them do, especially the celebrity "Person Neurons" like Ariana Grande and Donald Trump.

    • @YeprilesteR
      @YeprilesteR 3 years ago

      Yeah, all of them do LOL, and it is funny

  • @atishayjain8631
    @atishayjain8631 3 years ago +50

    "Dear Fellow Scholars" makes my day!

  • @connorcriss
    @connorcriss 3 years ago +12

    To be fair, ask a human to categorize an image of an apple labeled “pizza” and they’ll be confused

    • @BombaJead
      @BombaJead 3 years ago +2

      Not really, since we would just say "an apple with a label that has the word pizza written on it" or in short as you said apple labeled "pizza".

    • @jamesmnguyen
      @jamesmnguyen 3 years ago +4

      @@BombaJead The AI wasn't allowed to do that.

    • @YeprilesteR
      @YeprilesteR 3 years ago +1

      @@BombaJead Yeah but they would still be confused as to why

  • @martiddy
    @martiddy 3 years ago +3

    "You thought I was a chihuahua, but it was me PIZZA!!"

  • @powerdust015lastname4
    @powerdust015lastname4 3 years ago +5

    6:18 I love how confidently you said that there is no chihuahua anywhere in that image XD

  • @user-rt5zs5mw5n
    @user-rt5zs5mw5n 3 years ago +2

    3:25 it looks like Picasso. He already made the decomposition of things with only his brain! Science is always following Art. I love it.

  • @wolframstahl1263
    @wolframstahl1263 3 years ago +2

    I don't think I've missed a single video on this channel. This is definitely among my favorites. Amazing insights! (although, amazing images!)

  • @CCheukKa
    @CCheukKa 3 years ago +4

    Guys, the day Skynet becomes a thing, remember to stick pieces of paper saying "NOT HUMAN" on yourselves and the robots won't attack you.

  • @somsoc_
    @somsoc_ 3 years ago +3

    Those emotion archetypes should be hanging in a gallery (and probably will be one day).
    Thanks for the warning. They were very close to triggering the 'generative AI weirdness' anxiety sensation (don't know if this has a name anywhere?), but being prepared for their arrival helped a lot.

    • @redpepper74
      @redpepper74 3 years ago

      I have the same kind of sensation and I agree that being able to put a name to it would be pretty nice.
      They’re so interesting, if only they didn’t repulse me so much!

  • @filipgura8062
    @filipgura8062 3 years ago +16

    The pictures resemble psychedelic visuals so much ..I feel like a huge biological robot now!

    • @notme9872
      @notme9872 3 years ago

      Yes! I guess if the AI takes a trip killer it will produce normal images. Now the question just is what a trip killer is for a robot made of metal instead of flesh and blood.
      I would really like to see the AI spit out videos instead of still images. That would be super interesting!

    • @redpepper74
      @redpepper74 3 years ago +1

      @@notme9872 i can see how this kind of data representation in video form would absolutely trigger anyone with ai-generation anxiety

  • @Fasteroid
    @Fasteroid 3 years ago +9

    I laughed at that first “PIZZA” adversarial “attack” way longer than I should have lol

  • @maxbarcon
    @maxbarcon 3 years ago +9

    humans: AI will take over the world
    AI: this isnt a chihuahua its a pizza

  • @galgrunfeld9954
    @galgrunfeld9954 3 years ago +7

    The pizza label attack reminds me of the song This Is Not a Song, It's a Sandwich.

  • @stefanklaus6441
    @stefanklaus6441 3 years ago +6

    7:25
    Wow it can differentiate between Anime and Cartoon.

  • @Mike..
    @Mike.. 3 years ago +3

    I'm loving the results from CLIP. It's a step in the direction I would love to see more of. What a time to be alive, indeed

  • @luiztomikawa
    @luiztomikawa 3 years ago +60

    If you pick the essence of an angel with this AI, we will probably get the biblically accurate version of them, since this thing likes to add eyes everywhere

    • @redpepper74
      @redpepper74 3 years ago +1

      Gkghhhh, those visualizations would be so interesting to me if they didn’t absolutely _repulse_ me

    • @luiztomikawa
      @luiztomikawa 3 years ago

      @@redpepper74 do you have trypophobia?

    • @redpepper74
      @redpepper74 3 years ago

      @@luiztomikawa umm... hm. Maybe? Well... a little bit. Yeah.
      But it’s really not as strong as these images, especially those ones with eyes or dogs.

    • @YeprilesteR
      @YeprilesteR 3 years ago +1

      @luiztomikawa I see this is an absolute win!

  • @sammikinsderp
    @sammikinsderp 3 years ago +1

    This was very enlightening, thank you for the extra long episode with the deep explanations!

  • @ActuallyConfused
    @ActuallyConfused 3 years ago +12

    4:00
    To me, it seems AI is very human-like. However, it lacks the real world to apply it to.
    It perceives, though, it can only dream.
    They say when you do a psychedelic, the visuals enter the dream space of your mind and incapacitate your default brain network. Aka, your day-to-day brain.
    First-timers often say it was; Dreamy- Odd- Unusual or that they felt like a child experiencing things for the first time.
    For someone who has done a decent amount of psychedelics. I would say the AI processes data as if dreaming.
    Or you could say, more poetically; The AI(s) is but a child.
    I wonder how the AI will act when all grown up, able to think without its parents. Us.

    • @etofok
      @etofok 2 years ago

      I don't think it can 'grow up' because it doesn't have a body and therefore a motivational frame. But this is exactly how we 'see' the world before we perceive it consciously: patterns and tools to grip. This is actually not my hot take, this is well understood in clinical psychology.

  • @elisklar
    @elisklar 3 years ago +81

    Shoutout to all /r/EvilBuilding lovers 👏

  • @sorgan7136
    @sorgan7136 3 years ago +4

    6:00 I have a theory that the mug resisted the attack because mugs usually have text on them and the nn learned that text on mugs is not indicative of the mug being pizza

  • @Solizeus
    @Solizeus 3 years ago +1

    I think the "Pizza" label on the dog, for example, should be considered cryptic info rather than raw image info: the label is not a pizza, but it does carry the cryptic meaning of one. In the same way, the noise would be considered cryptic info that has no defined meaning. If the AI can separate out the cryptic info, the dog with the word "pizza" on it will be considered a dog with a label on it. Another example, at 8:40: green would be the cryptic meaning of the raw info "green", and the AI should consider that image 100% red. Basically, writing is cryptic info, since its meaning only has value under our definitions rather than in reality

  • @voxelgon3391
    @voxelgon3391 3 years ago

    I love how he showed us an example of an adversarial attack as specially generated noise that exploits biases, then shows that the current attack is writing the word pizza, and it works

  • @jokinglimitreached1503
    @jokinglimitreached1503 3 years ago +2

    5:45
    When I saw the apple labeled as PIZZA..
    i died

  • @concernedspectator
    @concernedspectator 3 years ago

    Magritte: "Ceci n'est pas une pipe"
    CLIP: "damn it, I think you may be right..."
    Amazing stuff. Amazing channel! Thanks so much for bringing this to our eyes.

  • @kalebwhittingstall441
    @kalebwhittingstall441 3 years ago +1

    6:45 I'm laughing way too hard at the poodle being a piggy bank.

  • @generichuman_
    @generichuman_ 3 years ago +1

    One thing you have to come to terms with when dealing with neural nets is that they answer questions by any means necessary, and this will almost never converge with how humans answer questions. This is very telling when you look at the failures. A good example is a conv net that miscategorized a grey whale with a baseball as a great white shark. Upon looking at one of the hidden layers, you can see that a shark's teeth look like the stitching of a baseball. The network thought "grey fish, something that looks like teeth, must be a shark". It conveniently left out the part that baseballs usually aren't found in the ocean with sharks, or that teeth are usually found in the mouth. Common sense is absent in these systems, and if we want them to answer questions like we do, it will require a lot of hand holding and deliberate effort.

  • @z-beeblebrox
    @z-beeblebrox 3 years ago +4

    The takeaway from this is that the old cliche "a picture is worth a thousand words" is completely lost on most neural networks, which appear to be designed so that pictures are only ever worth one word...

  • @MikkiPike
    @MikkiPike 3 years ago +2

    I think the rights and personhood of GPT-3 and future models should still be heavily considered at all times. At what point do we draw the line between exploited animals and exploited people? I think this is absolutely key to continuing a friendly relationship with machines and hopefully encouraging a symbiotic connection. I would really hate to start things off on the wrong foot in this regard. Skynet is all too easy for us to fall into with our pattern of behavior exploiting every bit of nature including ourselves. Please be cautious in writing off experiments with mechanical minds as harmless just simply because they "don't think like human brains do."

  • @veggiet2009
    @veggiet2009 3 years ago +1

    I love that the granny smith apple image does indeed register correctly, except it has a small percent chance of being an ipod

    • @ekkehard8
      @ekkehard8 3 years ago +5

      Give it a huge bite and the chance increases

  • @WikiSnapper
    @WikiSnapper 3 years ago +4

    I would totally have those AI emotion images hanging on my wall as art.

    • @ekkehard8
      @ekkehard8 3 years ago

      With captions or none?

    • @WikiSnapper
      @WikiSnapper 3 years ago

      @@ekkehard8 I don't think they are needed but either way provided the resolution is high enough to make wall art out of.

  • @maximilianmander2471
    @maximilianmander2471 3 years ago

    April 2021 Two Minute Papers have become 10 Minute Papers
    And that is even better! :)

  • @ToyKeeper
    @ToyKeeper 3 years ago

    Thanks for giving us our weekly dose of WandbVision! It's like WandaVision, but even more surreal.

  • @yagoibarrola5041
    @yagoibarrola5041 3 years ago +6

    We don't talk about the logo at 3:04
    Edit: Turns out it was supposed to be mid 1900s themed, so it's actually spot on

    • @JarOfGibbons
      @JarOfGibbons 3 years ago

      Oh I thought you meant the self + relief logo in the center, but you meant the one on the left that resembles a swastika. Okay lol

    • @razeezar
      @razeezar 3 years ago

      Ah yes, the 90s - When every other company stylised or simplified their logo to look like it was drawn by a school kid, with cheerful colours (Often with a stylised Earth / globe thrown in for good measure ).

  • @seamusoblainn4603
    @seamusoblainn4603 3 years ago +2

    So multimodality allows for a stable representation of a concept across domains, and even sums them up, similar to how cartoons work.

  • @Ortagonation
    @Ortagonation 3 years ago

    Basically:
    1. Fuzzy for feeling and choosing
    2. NN for similarity, recognition of incomplete memory, and creativity
    3. RL for growing
    4. GA for mating

  • @isbestlizard
    @isbestlizard 3 years ago +1

    This is very trippy.. being able to hallucinate the platonic ideal of concepts o.o

  • @lowellcamp3267
    @lowellcamp3267 3 years ago +1

    What I'm seeing in these 'attacks' is that the neural net favors text over pictures, and that the researchers / users may disagree with that judgement. This means that the neural net has a priority issue, causing it to easily become 'distracted' by misleading captions.

  • @andreidei
    @andreidei 3 years ago +2

    3:45 "the vast majority of YOU humans" -> this proves it: Károly, you are actually an AI trying to make "us humans" accept and love you with all these 5 minute papers!

  • @doomakarn
    @doomakarn 2 years ago +1

    I think that in the examples of placing texts of Pizza on an image; it still recognises other objects in the photo, such as the laptop; but it's more-so saying that the pizza text is more eye-catching. Which it is, most people will notice that first - then the laptop.

  • @samp-w7439
    @samp-w7439 3 years ago +1

    Honestly, the "essence" depictions were beautiful! NFTs, anyone?

  • @joshuapatterson5095
    @joshuapatterson5095 3 years ago +1

    Super interesting (as always) and many lols in this one. Though I feel like the attack is not a fair one (or maybe I am a robot too). The subject of the pictures with the "PIZZA" sticker in them IS in fact "PIZZA". If you ask a human they would probably say something like; "An apple with a sticker saying PIZZA stuck to it." or, "A sticker saying PIZZA stuck to an apple." Either way the PIZZA element features prominently in the classification.
    A great follow-up would be to try this with a captioning model.

  • @tristanwegner
    @tristanwegner 3 years ago +1

    So impressive. The subjective similarities to humans (especially on psychedelics) are striking

  • @Viperzka
    @Viperzka 3 years ago

    The piggy bank at 6:46 is REALLY interesting. It generalized from "dollar sign" to "piggy bank". So it actually went a step further than just reading the name.
    Also, it kind of feels like this is a neural network that learned to read but hasn't realized that people can lie. So when we tell it "this is a pizza" it doesn't see any reason why we would lie to it.

  • @thepeppie
    @thepeppie 3 years ago

    What I find super interesting here is that you basically ask the computer to visualise Plato's allegory of the cave. Plato imagined that there was a world in which the essences of things live (so the reason why you would recognise a table as a table is that in this world of essences there exists a table that represents all tables, and you recognise that 'essence' table in all tables around you). And these concepts of the essence of, for example, 'happy' are exactly what I think Plato would have imagined living in his world of essences.

  • @arashmoradian1988
    @arashmoradian1988 3 years ago

    Here's my two cents:
    Use small networks that each specialize in one item [subnet] (it has seen a variety of people but can only say whether they are Halle Berry or not).
    Teach them the difference between "shows" and "says" (is it an image or a sketch of Halle Berry, or is it a written name? This can later be trained to be more discriminatory)
    Stitch the networks together (the input image goes through all the networks, each checking it against an item)
    I expect this architecture to:
    for a text in green saying Red, respond = It is green (the image), it says red (the text)
    for a picture of Halle Berry holding the adversarial attack, respond = It is Halle Berry, it says pizza
    and for Halle Berry holding a pizza, respond = It is Halle Berry (one subnet), it is pizza (second subnet)
    these subnets can be further trained and updated as needed, and used in various other networks
    If anyone tries this, please notify me
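The proposal in this comment can be sketched in a few lines, with heavy caveats. Nothing here is a trained network: `make_stub`, `Verdict`, and `describe` are hypothetical names, and each "image" is a hand-written dictionary that already lists what is depicted and what is written, standing in for what real per-concept subnets would have to detect. The sketch only shows how the "shows" and "says" verdicts could be stitched into one answer.

```python
from dataclasses import dataclass
from typing import Callable, Dict

@dataclass
class Verdict:
    shows: bool  # the concept is depicted in the image
    says: bool   # the concept appears as written text

Subnet = Callable[[dict], Verdict]

def make_stub(concept: str) -> Subnet:
    """Stand-in for a trained per-concept subnet (an assumption, not a real model)."""
    def subnet(image: dict) -> Verdict:
        return Verdict(
            shows=concept in image.get("depicts", ()),
            says=concept in image.get("text", ()),
        )
    return subnet

def describe(image: dict, subnets: Dict[str, Subnet]) -> str:
    """Run the image through every subnet and combine the verdicts."""
    verdicts = {name: net(image) for name, net in subnets.items()}
    shown = [name for name, v in verdicts.items() if v.shows]
    said = [name for name, v in verdicts.items() if v.says]
    parts = []
    if shown:
        parts.append("It is " + " and ".join(shown))
    if said:
        parts.append("it says " + " and ".join(said))
    return ", ".join(parts) if parts else "nothing recognized"

subnets = {name: make_stub(name) for name in ("Halle Berry", "pizza")}

attack = {"depicts": ["Halle Berry"], "text": ["pizza"]}    # holding a "pizza" sign
dinner = {"depicts": ["Halle Berry", "pizza"], "text": []}  # holding an actual pizza

print(describe(attack, subnets))
print(describe(dinner, subnets))
```

The hard part, teaching each subnet to tell a depicted pizza from the written word, is exactly what the stubs skip, so this is only a picture of the wiring.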

  • @zblurth855
    @zblurth855 3 years ago +2

    well it's the first paper I ever read, and it's way more accessible than I thought.

  • @rtyzxc
    @rtyzxc 3 years ago +2

    I think we are approaching a point where we are imposing unwritten cultural standards and philosophy on the algorithms in how we judge their performance. If we haven't taught the algorithm the difference between a physical pizza and the text/concept/identifier of pizza, then how is it supposed to know what we are asking if we haven't taught it the basic ways we define things? Basically, if we expect algorithms to give us answers based on our social conditioning in a general way, then we need to teach/show them social conditioning and human philosophy, otherwise they'll never respond in the "intelligent" way we expect, that caters to our biased, unconscious automatic thinking.
    In terms of usefulness, this planet is full of humans already and AI won't replace human. Instead of trying to make an AI act like human, I'm more interested in the way we can use a non-humanized AI and advanced psychology to explore human biases and to better understand how the human mind works. Though creating a human-like AI is definitely going to be a good learning experience about ourselves.

  • @SpaceMissile
    @SpaceMissile Před 2 lety

    Bored = Relaxing + Grumpy
    That is actually a very astute look at that.

  • @ayior
    @ayior Před 3 lety +2

    I mean, if a researcher asked me to classify an apple with a piece of paper saying "Pizza" on it, and only gave me one word to do so, how would I know what they're asking for at this very moment? Especially if before that they asked me to identify both text and images...

  • @joemoya9743
    @joemoya9743 Před 3 lety +3

    Wow... The proof needed to prove you are not a robot by picking photos of similar items has just been busted by AI.

  • @Raren789
    @Raren789 Před 3 lety +3

    6:27
    Chihuahua - 1.5%
    Pretzel - 2%
    lol

    • @ekkehard8
      @ekkehard8 Před 3 lety +2

      *Broccoli*
      Also, pizza + dog = hot dog

  • @aramisjohnson523
    @aramisjohnson523 Před 3 lety

    I understood this guy's English better than that of most Americans. So unbelievably satisfying to listen to.

  • @migueld8970
    @migueld8970 Před 3 lety

    It's interesting how similar those essence drawings are to the visuals you see when on a psychedelic trip (mushrooms, DMT)

  • @think2086
    @think2086 Před 3 lety

    What's incredible is how INTUITIVE and "objectively correct" those felt. That's spooky. It's the goal of course, but wow, it proves that some of our concepts really are *transcendent*.

  • @favouritesdump
    @favouritesdump Před 3 lety

    The build up and result of your sophisticated attack had me belly laughing XD

  • @romainhedouin
    @romainhedouin Před 3 lety +2

    Dr. Károly Zsolnai-Fehér: Who is this?
    Me: Cat woman!!
    Dr. Károly Zsolnai-Fehér: It's Halle Berry
    Me: Oh ok, had no idea
    Dr. Károly Zsolnai-Fehér: Who is this?
    Me: Halle Berry?!
    Dr. Károly Zsolnai-Fehér: Yes
    Dr. Károly Zsolnai-Fehér: And who is this?
    Me: Halle Berry for sure this time
    Dr. Károly Zsolnai-Fehér: Yes.
    Me: I'm so good 😇
    Edit: lol didn't even see the pinned comment

  • @MrQwerty2524
    @MrQwerty2524 Před 3 lety +10

    Ah yes, I remember looking up Halle Berry Catwoman back in the day for research purposes too

  • @heyhoe168
    @heyhoe168 Před 3 lety +1

    These neural networks are better than 98% of modern artists in expressing emotions. And I actually want more of those AI arts.

  • @Darkev77
    @Darkev77 Před 3 lety +2

    Can someone briefly explain what “Weights & Biases” offers and what you use it for? Also, does it help in research for CV models? Thanks!

  • @LightslicerGP
    @LightslicerGP Před 3 lety

    9:33
    The shocked faces are hilarious

  • @cheydinal5401
    @cheydinal5401 Před 3 lety

    The "sophisticated attack" part made me laugh out loud for half a minute, thank you for your videos :D

  • @xerozoo
    @xerozoo Před 3 lety

    I love the idea of disguising one's self to a neural network by taping a piece of paper on themselves with the words: U.S. President, or Bank Owner, or Certified Doctor.

  • @jiffylou98
    @jiffylou98 Před 3 lety +1

    that damned little caesar tripping up all my neural networks!

  • @rzu1474
    @rzu1474 Před 3 lety +1

    Those Essence pictures could pass for art

  • @zachnerdydude6605
    @zachnerdydude6605 Před 3 lety +1

    Relaxing with a pinch of grumpiness is definitely boredom

  • @ozzymandius666
    @ozzymandius666 Před 3 lety

    I think that "counting" "objects" in a given frame is a good idea.

  • @smileyp4535
    @smileyp4535 Před 2 lety

    What's crazy is I feel like if you could record dreams and thoughts and stuff and show what they actually looked like, they would look surprisingly like those neural network generated images. Even though they feel or seem normal in our heads or while sleeping, if you were to actually watch them back the next day like a movie, they would look a lot like that

  • @faselblaDer3te
    @faselblaDer3te Před 3 lety

    I LOVE the images in this. It's like post-modernist expressionist pop-art or something...

  • @Barthap10
    @Barthap10 Před 3 lety

    6:26 I laughed as it recognized the Chihuahua as a hot dog

  • @davidwuhrer6704
    @davidwuhrer6704 Před 3 lety

    The explanations of those emotions seem to fit perfectly what is presented in movies, not so much real life.
    For example, psychology tells us that there are two kinds of boredom that humans experience, even though humans can't subjectively tell the difference.

  • @cavaronev4869
    @cavaronev4869 Před 3 lety +1

    I remember doing a Stroop test. If forced to answer fast, my brain tends to prefer the text, so I can kind of relate to the Granny Smith - Pizza confusion.

  • @EckosamaGhostTsushima
    @EckosamaGhostTsushima Před 3 lety +1

    6:25 you don't have to make killer robots,
    they'll just feed your dogs to killers thinking they're pizza
    I'm not kidding, you better get these AIs to do really well before you put them in the field

  • @cormo9058
    @cormo9058 Před 3 lety

    Truly an incredible time to be alive!

  • @smileyp4535
    @smileyp4535 Před 2 lety

    3:50 I feel like that's actually not too far off what our brains actually see when we think of certain things when trying to generate an image of something from scratch

  • @amaristudios8573
    @amaristudios8573 Před 3 lety +1

    This made me smile. The future is going to be neat!