155 - How many hidden layers and neurons do you need in your artificial neural network?

Sdílet
Vložit
  • čas přidán 19. 07. 2024
  • No one can give a definite answer to the question about number of neurons and hidden layers. This is because the answer depends on the data itself. This video goes through the thought process of determining the number of hidden layers and neurons using simple code as example.
    #Dataset link:
    cdn.scribbr.com/wp-content/up...
    Code generated in the video can be downloaded from here:
    github.com/bnsreenu/python_fo...
  • Věda a technologie

Komentáře • 86

  • @dinhh4248
    @dinhh4248 Před 3 lety +32

    16:36 is the answer for how many layers do you need

  • @JoaoPedro-px5sx
    @JoaoPedro-px5sx Před 3 lety +24

    Those were some of the best explanations about hidden layers and numbers of neurons I could find, also making it very easy to see in your python plots. Keep up the good work!

  • @nate4511
    @nate4511 Před 3 lety +13

    Finally, someone who speaks math. Thank you sir

  • @abhilashchaturvedi1479

    I really loved your approach. You are explaining the technicalities and discussing the various possibilites while staying on the subject. It's thorough. With other youtubers, I felt like they were too basic and missing the crucial implementation part. Thankyou!

  • @AdobadoFantastico
    @AdobadoFantastico Před 2 lety +1

    This is a fantastic explanation. I really appreciate how you involved the math as you walked through your implementation. A lot of people hand wave the math.

  • @wuzark
    @wuzark Před rokem

    This is such an intuitive and helpful video. I can see that there is a lot of hard work behind this video. Great job!

  • @andrewkicha1628
    @andrewkicha1628 Před rokem +1

    Such a wonderful explanation to the really fundamental question. I wonder why there is so little accessible information for beginners on this topic.
    Thanks a lot for the video.

  • @poffer31
    @poffer31 Před 3 lety +3

    The approach to explaining it trough linear regression was very useful for me, thank you!

  • @PatrickBateman12420
    @PatrickBateman12420 Před rokem

    Finally, an explanation that goes straight into code. Awesome!

  • @devishaarunadevitiwari3988

    Thanks for giving clarity on such an important notion. worth it.

  • @WahranRai
    @WahranRai Před 2 lety +3

    22:55 Rule 3 contradiction with rule 2 : input = 50, output = 50
    . rule 2 gives hidden

  • @aqibfayyaz1619
    @aqibfayyaz1619 Před 3 lety

    Awesome video that is what i was looking for.

  • @troupebase2292
    @troupebase2292 Před 2 lety

    Thank you for this decent explanation

  • @vaibhavs2522
    @vaibhavs2522 Před 3 lety +1

    I don't know why there are too few likes on such an awesome video..you are really great sir.

  • @tahseen4790
    @tahseen4790 Před 3 lety

    Thanks a lot. I regularly watch your Videos.

    • @DigitalSreeni
      @DigitalSreeni  Před 3 lety +3

      Thanks for watching my videos, I donate all money from CZcams advertisements to charity so please thanks for your contribution by watching part of the advertisements.

  • @madhumitagiribabu
    @madhumitagiribabu Před 3 lety

    thanks much sir found right content after long search

  • @cyruskavwele5304
    @cyruskavwele5304 Před rokem

    Fantastic explanation.

  • @RajeshSharma-bd5zo
    @RajeshSharma-bd5zo Před 3 lety +1

    Great video and well explained!!

  • @rfreeman057
    @rfreeman057 Před rokem

    Excellent video!

  • @3DComputing
    @3DComputing Před 3 lety

    Strange isnt it, so many people asking THIS question, and so few people can answer it, THANK YOU

    • @DigitalSreeni
      @DigitalSreeni  Před 3 lety

      Well, I try to answer it but in reality it is difficult to definitively answer this question as so much depends on the nature of input data.

    • @HK-jw2et
      @HK-jw2et Před 2 lety

      @@DigitalSreeni czcams.com/video/pDXdlXlaCco/video.html
      Hey. Can you pls help me in understanding how many nodes he used in this project. It's a project based on recognising sign language

  • @ssonicmoumed
    @ssonicmoumed Před 3 lety

    Thank you, great explanation.

  • @vanithas4336
    @vanithas4336 Před rokem

    Amazing..helps me lot for my research work. Thanks

  • @neerav302
    @neerav302 Před 3 lety

    You are awesome .. you taught me this topic like pro

  • @mojoway9379
    @mojoway9379 Před 3 lety

    Thanks for the video very helpful

  • @momchi2
    @momchi2 Před 2 lety

    many good hints and insights here

  • @shubhamsongire6712
    @shubhamsongire6712 Před 2 lety

    Powerful explination

  • @danielniels22
    @danielniels22 Před 2 lety

    thanks sir, this is such an enlightenment 😂 ive been using 4 or even 6 layers by thinking that the model could learn very deep, like some unrecognized patterns 🤣🤣 but turns out just use 1 to 2 😭😭 thanks sir, im new to your channel this week btw 🙏

  • @pramishprakash
    @pramishprakash Před rokem

    Great video sir

  • @tharindukanchana2077
    @tharindukanchana2077 Před 2 lety

    Thank you sir, very nice explanation

  • @surajshah4317
    @surajshah4317 Před 3 lety +3

    wow, thank you so much for the great video. Sir can you make videos on segmentation using GAN and UNet ??

    • @DigitalSreeni
      @DigitalSreeni  Před 3 lety

      They are already on my channel. Please explore videos on my channel.

  • @ahmedmoayadalhasani
    @ahmedmoayadalhasani Před rokem +1

    Hi Sreeni, I had a significant mistake and training and test data differences. This, in my opinion, is due to the huge values of the output response numbers, which have increased from 64 to over a thousand. Please, how can I resolve this issue? Can I divide them by their maximum value to fix the issue? What do you prefer, please?

  • @tesfayeabera4978
    @tesfayeabera4978 Před 3 lety

    please I need an explanation of how to increase the layer of deep belief network from three-layer to more than 6 and its advantages and disadvantage .

  • @yacineyacine2951
    @yacineyacine2951 Před 2 lety

    this video is like finding gold ... thannnk youuu

    • @HK-jw2et
      @HK-jw2et Před 2 lety

      czcams.com/video/pDXdlXlaCco/video.html
      Hey. Can you pls help me in understanding how many nodes he used in this project. It's a project based on recognising sign language

  • @hahavv7058
    @hahavv7058 Před 3 lety

    excellent vedio that give me great help!think you sir~

  • @error220.5v5
    @error220.5v5 Před 6 měsíci

    great video. inlike ecuations.

  • @abderrahmaneherbadji5478
    @abderrahmaneherbadji5478 Před 3 lety +1

    Many thanks

  • @SomenathChakraborty
    @SomenathChakraborty Před 2 lety

    Could you provide the data source details but it is very small dataset with very limited parameter. But I appreciate your video for clear clarification of the concept.

  • @lakeguy65616
    @lakeguy65616 Před 2 lety

    adding hidden layers without activation functions is essentially linear regression. If the problem is linearly separable, you can find a solution. complex problems with non-linear solutions require hidden layers with activation functions. A more complex solution requires a higher number of hidden layers and activation functions. The "magic" is in the activation function.

  • @kakshah
    @kakshah Před 3 lety +1

    Hi, thank you, may I know which tool is used to make this video?

    • @DigitalSreeni
      @DigitalSreeni  Před 3 lety

      Not sure what you are asking... can you be a bit specific?

  • @tso8761
    @tso8761 Před 3 lety +1

    17:53 we can use dropout technique to reduce overfitting btw

  • @grdev3066
    @grdev3066 Před 3 lety +2

    "If your problem can be solved with linear fitting..."
    Me: trying to survive 2020 ...

  • @somewhereinparallelunivers4226

    I have 7lakhs data.so can you suggest me how many neuron can i use for my neural network.. i am using curve fitting neural network.

  • @shivamsingh-fn8vz
    @shivamsingh-fn8vz Před 2 lety +1

    ok so my doubt is i read on stack exchange and also ur 3 rd point in node section that neuron size should be 2/3 of input size so here the input size is equals to number of unique features or length of features input (len of dataset) and also 2/3 neuron = all the neurons in all the layers or only in single layer

  • @shipengxu
    @shipengxu Před 2 lety

    Thanks!

    • @DigitalSreeni
      @DigitalSreeni  Před 2 lety

      Thank you for your kind contribution. Keep watching.

  • @domenicobezuidenhout1587

    For my thesis I am using weather data to predict future values using the CNN but for my loss and Val loss I get nan values? Do you know of a way I could fix this sir?

    • @DigitalSreeni
      @DigitalSreeni  Před 3 lety

      There are many reason why you’d get a NaN for loss and the most probably reason is high learning rate. If your learning rate is 0.01 try changing it to 0.001 and see if that helps.

  • @tehpson
    @tehpson Před 2 lety

    OMG thank you, I finallyu understand

  • @pallavi_4488
    @pallavi_4488 Před 2 lety

    you are worth listening

  • @heliyahasani6859
    @heliyahasani6859 Před rokem

    Least Squares Optimizer is same as Analytical Solution.(Wrote this comment to avoid confusion :) )

  • @aomo5293
    @aomo5293 Před rokem

    error again when calculating mean squared error !
    line 71
    you should use: np.mean((y_test--pred)**2) not np.mean(y_test-y_pred)**2 !!
    Thank you for good content

  • @dikshitlenka
    @dikshitlenka Před 3 lety

    Can't we use Keras Tunner to find the exact number of layers and neurons required in the network?

    • @DigitalSreeni
      @DigitalSreeni  Před 3 lety +1

      Keras is for hyperparameter tuning and I don't think it is for defining models. I may be wrong as I haven't explored Keras tuner much. If your goal is to find the best model for you problem I recommend AutoKeras.

  • @lerneninverschiedenenforme7513

    I didn't understand what's happenning, when the number of hidden nodes increase. Does that also lead to overfitting?

    • @DigitalSreeni
      @DigitalSreeni  Před 3 lety

      Yes, increasing the number of nodes will also lead to overfitting. Anything that increases the nonlinearity in the model and makes it easy for the model to map training data will lead to overfitting.

    • @lerneninverschiedenenforme7513
      @lerneninverschiedenenforme7513 Před 3 lety

      @@DigitalSreeni Thank you!

  • @ad.donielson
    @ad.donielson Před rokem

    How about neural network without hidden layer for classification?

  • @pfever
    @pfever Před 2 lety

    10:43 I think that if the learning rate is too small it could get 'stuck' in a local minima, isn't it?

  • @TheConsoleMania
    @TheConsoleMania Před 2 lety

    the number of neurons in both hidden layer, should be the same?

    • @DigitalSreeni
      @DigitalSreeni  Před 2 lety +1

      No, they can be anything.

    • @TheConsoleMania
      @TheConsoleMania Před 2 lety

      @@DigitalSreeni yeah, but if I follow the rules in the video, i obtain about 12 neurones. This number should be the same on both hidden layer ? Or maybe the second one should be smaller ?

    • @TheConsoleMania
      @TheConsoleMania Před 2 lety

      @@DigitalSreeni I have 12 input and 1 output

  • @NoamRathaus
    @NoamRathaus Před 2 lety

    TLDR; 1 or 2 hidden layers - or just guess because he doesnt know

    • @DigitalSreeni
      @DigitalSreeni  Před 2 lety

      This is an educational video intended to train the viewer on the implications of number of neurons and hidden layers. In fact, I try to design my content such a way that the viewer gains incremental knowledge on a specific topic. I am sorry if the title set a different expectation to you.

  • @user-mb6mv2do6g
    @user-mb6mv2do6g Před rokem

    سلام.Hi