Transformer models and BERT model: Overview

  • Published Jun 4, 2023
  • Watch this video to learn about the Transformer architecture and the Bidirectional Encoder Representations from Transformers (BERT) model. You learn about the main components of the Transformer architecture, and the different tasks that BERT can be used for, such as text classification, question answering, and natural language inference.
    Enroll on Google Cloud Skills Boost to view the lab walkthrough and participate in a hands-on lab!
    Enroll on Google Cloud Skills Boost → goo.gle/3Wk3jnC
    View the Generative AI Learning path playlist → goo.gle/LearnGenAI
    Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech
  • Science & Technology

Comments • 27

  • @googlecloudtech
    @googlecloudtech  1 year ago +3

    Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

  • @maleehahaider8596
    @maleehahaider8596 2 days ago

    The best explanation of this topic I have seen on the web to date… BRAVO

  • @ollieanntan4478
    @ollieanntan4478 1 year ago +16

    Important info. I'd love to see a similar video but with real life examples to illustrate the points.

  • @Ops_pops
    @Ops_pops 1 year ago +2

    Explained it very well, and the use of relevant examples is awesome. Thank you so much. Your work is very much appreciated.

  • @venkateshpattu1620
    @venkateshpattu1620 2 months ago +1

    Fantastic video. Explaining this much detail in such a short time is incredible. Thank you!

    • @googlecloudtech
      @googlecloudtech  2 months ago

      Thank you for the kind words! 🤗 We're glad you found this video helpful!

  • @notmimul
    @notmimul 1 year ago +7

    8:15 What does this mean? BERT has 12 and 24 Transformer layers, and then the original Transformer has 6… 6 layers of encoders/decoders, right? Those aren't the same kind of layers as BERT's. The layers in BERT are Transformer layers, but the Transformer itself has 6 encoders/decoders.

    • @meltem9078
      @meltem9078 1 year ago

      Since they're referencing the original Transformer architecture, the 6 layers refer to the encoder part of the Transformer (the encoder has six identical layers in the original paper, each with a self-attention sublayer and a feed-forward sublayer, each followed by a normalization step). BERT is an encoder-only Transformer model.

    • @vivekc2303
      @vivekc2303 10 months ago +3

      At 2:45 she says the original research paper's Transformer had 6 encoders stacked on top of each other… At 8:12, by saying "6 layers in the original Transformer", I think she means 6 encoders on top of each other. They should not have used "layers" there, because they already used that word differently to describe the two layers within each encoder (which are the self-attention and feed-forward layers).
      Also, "transformer" doesn't denote a single encoder or a single encoder-decoder pair; it represents the whole model with all its encoders and decoders. So BERT is an advanced type of Transformer model that has more encoder layers than the original Transformer, and each layer within BERT can't be called a Transformer.
      [UPDATE] The layers within each encoder, such as self-attention and feed-forward, are actually called sublayers. So it makes sense: "layers" in a Transformer refers to the stack of encoders, and each layer within an encoder is called a "sublayer".
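The terminology the replies above converge on can be pinned down with a minimal, purely illustrative Python sketch (the function and field names here are hypothetical, not from any real library): a "layer" is one encoder block in the stack, and each layer contains a self-attention sublayer and a feed-forward sublayer. The original Transformer encoder stacks 6 such layers; BERT-Base stacks 12 and BERT-Large 24.

```python
# Illustrative sketch of "layer" vs. "sublayer" terminology (hypothetical names).

def encoder_stack(num_layers):
    """Build a stack of encoder layers; each layer holds two sublayers:
    a self-attention sublayer and a position-wise feed-forward sublayer."""
    return [{"sublayers": ("self-attention", "feed-forward")}
            for _ in range(num_layers)]

original_transformer = encoder_stack(6)   # "Attention Is All You Need" encoder
bert_base = encoder_stack(12)
bert_large = encoder_stack(24)
```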

  • @kartikpodugu
    @kartikpodugu 8 months ago +8

    Need a deeper dive into how BERT works. Can you point to more references?

  • @Philippe_Rougier
    @Philippe_Rougier 11 months ago +3

    Is this a practical exercise where we have to predict the missing word? @2:43: "the encoding component is a stack of encoders of the same number" of what? I assume she meant "of the same structure (entirely identical layers)"? That seems to be confirmed later in the video…
    Is there any review of these videos before they go out? The previous video on the attention mechanism was absolutely confusing, partly because the notation used did not match the presenter's words!

  • @sapnagupta6215
    @sapnagupta6215 10 months ago +6

    Well explained, but I'd love to see real-life examples to illustrate the points.

  • @s0meb0dy78
    @s0meb0dy78 1 month ago +1

    A Lifesaver...

  • @KiranMundy
    @KiranMundy 7 months ago +1

    Since I don't have any background on transformers, I got completely lost at the point where you explain the query, key, and value vectors and how their weights are determined at training time. I had to resort to questioning Bard about this in more detail, and while I'm still lost, that helped me get some understanding of what these three vectors are.
    Can you explain more clearly how the adjustments to the weights of the query, key, and value matrices differ during backpropagation?
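For anyone stuck on the same point, a minimal NumPy sketch of scaled dot-product attention may help (the sizes are illustrative, not BERT's real dimensions). The key idea for the backpropagation question: Q, K, and V are not free parameters; they are computed by multiplying the input embeddings by three learned weight matrices, and it is those matrices (written `W_q`, `W_k`, `W_v` below) that gradient descent adjusts, each receiving its own gradient through the attention formula.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, d_k = 8, 4                      # illustrative sizes
x = rng.normal(size=(3, d_model))        # embeddings for 3 tokens

# Learned parameters (random stand-ins here); backprop updates these matrices,
# not the Q/K/V vectors directly.
W_q = rng.normal(size=(d_model, d_k))
W_k = rng.normal(size=(d_model, d_k))
W_v = rng.normal(size=(d_model, d_k))

Q, K, V = x @ W_q, x @ W_k, x @ W_v      # per-token query/key/value vectors

scores = Q @ K.T / np.sqrt(d_k)          # similarity of each query to each key
weights = np.exp(scores)
weights /= weights.sum(axis=-1, keepdims=True)  # softmax: each row sums to 1
output = weights @ V                     # attention-weighted mix of the values
```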

  • @wohola
    @wohola 9 months ago +3

    I would love to see a real-life example to illustrate the steps.

  • @swarnodipnag
    @swarnodipnag 11 months ago +1

    Well explained ❤

  • @kartikpodugu
    @kartikpodugu 8 months ago +1

    How is the "Next Sentence Prediction" task abbreviated as NPS? Can you elaborate? 9:29

  • @jeromeeusebius
    @jeromeeusebius 9 months ago +1

    Thanks for the video explaining Transformer models and BERT. Good summary and high-level description. Small nitpick: @9:03, the abbreviation for "next sentence prediction" should be NSP, but the slide has NPS.

  • @user-kd6wf6pf3g
    @user-kd6wf6pf3g 4 months ago

    1:08 Wasn't LSTM proposed in 1997?

  • @ferdousihaque9633
    @ferdousihaque9633 11 months ago +2

    Awesome!

  • @nanelikahya9949
    @nanelikahya9949 10 months ago +2

    Confusing

  • @1857kyle
    @1857kyle 6 days ago

    6:44

  • @OkiemTaty
    @OkiemTaty 7 months ago +2

    Why are all videos explaining transformers so frocking boring and uneducative!!!

  • @keenoain6885
    @keenoain6885 7 months ago +2

    Did a generative AI model make this video?
    All the information is probably correct, but:
    1. It is very much NOT clear how the pieces connect to each other.
    2. There is great emphasis on irrelevant, highly technical flows, while the description of the idea behind this structure, its motivation, and its advantages is left out. Explain HOW it solves the problems.
    This video provides quite a lot of useless information.
    For those who are familiar with the subject, it is way too basic; and to those who are not, it "gives" nothing. No understanding whatsoever.
    If you at Google use AI to generate your videos, at least review them before publishing.
    Moreover, her voice is like the voice of a typing machine.

    • @Linguisticsfreak
      @Linguisticsfreak 3 months ago

      And the stress in some words is so off: "percentage", "component", "develop", and some others are said with the wrong stress.

  • @realGynaExpress
    @realGynaExpress 3 months ago +1

    It's such a bad explanation… No examples, just reading the script. I totally get lost listening to it.