Build a Custom ASR Model in TensorFlow: A Step-by-Step Tutorial

Sdílet
Vložit
  • čas přidán 26. 02. 2023
  • Learn the basics of speech recognition with TensorFlow and build practical applications with this tutorial. Discover the history of speech recognition and the challenges that come with dealing with human speech variability, similar-sounding words, and low-quality audio. Explore the various techniques used in speech recognition, such as machine learning algorithms like deep learning, Hidden Markov Models (HMM), Dynamic Time Warping (DTW), and phonetic-based approaches. Discover how transformers have transformed the field of speech recognition and how they can be used to recognize different languages, understand natural language, and distinguish between similar words. Follow along with the tutorial to build a basic speech recognition model using TensorFlow, combining a 2D convolutional neural network (CNN), recurrent neural network (RNN), and Connectionist Temporal Classification (CTC), and apply this knowledge to develop practical applications.
    Text Version Tutorial: pylessons.com/speech-recognition
    GitHub: github.com/pythonlessons/mltu...
    pypi: pypi.org/project/mltu/
    #machinelearning #python #tensorflow #opencv #ASR

Komentáře • 33

  • @vkrts9176
    @vkrts9176 Před rokem +2

    A good presentation. Thank you for providing this information.

  • @omochi887
    @omochi887 Před rokem +1

    This is so nice. Thank you very much for sharing your knowledge.

  • @space_x5627
    @space_x5627 Před 28 dny

    thank you for efforts, after train and save model how i use to transcript other audio not the one i trained and exist on csv file ? please tell me ? another thing how i know train is good with curves.

  • @ishanpanta4293
    @ishanpanta4293 Před 10 měsíci

    I am looking for some resources to learn ASR but I couldnot find good resources so could you please share me some ASR resources. Thank You!

  • @kishanbangsi7888
    @kishanbangsi7888 Před rokem

    Will there be a PyTorch version of this tutorial??? It would be great. Thanks for such helpful video.

    • @PyLessons
      @PyLessons  Před rokem +1

      I am not sure if its necessary, I already made a pytorch tutorial for handwritten words recognition, its pretty easy to combine both of them to get same results

  • @tringuyen-iv8yf
    @tringuyen-iv8yf Před 10 měsíci

    That's great, thanks for your sharing.
    After creating the model, can we use this model with openai whisper ?

    • @PyLessons
      @PyLessons  Před 9 měsíci

      Hey, I don't know, never used openai whisper

    • @tringuyen-iv8yf
      @tringuyen-iv8yf Před 9 měsíci

      @@PyLessons thanks a lot, have a nice day 😁

  • @GelanaAbdisa
    @GelanaAbdisa Před 3 měsíci

    thank you for the nice tutorial I think you did it with CTC mode which is sequence to sequence. I want to do the same project by using my dataset by using Listen attend and spell model and there is no any tutorial done on that area can you help me on how to implement it??

    • @PyLessons
      @PyLessons  Před 3 měsíci

      There is plenty tutorial online, I don't have time to try this. Hope you understand me

  • @mustafaa4a
    @mustafaa4a Před rokem

    Thanks.. Fantastic work.. Please can I run it in my own CPU computer??

    • @PyLessons
      @PyLessons  Před rokem

      I think you can't train it on cpu, but if only using it, then yes, you can

  • @mariamjbani-amer6684
    @mariamjbani-amer6684 Před měsícem +1

    when i try your code , on the output folder model I did not get model.onnx file
    and when i test .h model i get error message said "model,onnx not found"
    can you help me ?

    • @PyLessons
      @PyLessons  Před měsícem

      This means there is something wrong with onnx package on your side. Check in terminal, there should be an exception what is wrong

  • @mariamjbani-amer6684
    @mariamjbani-amer6684 Před 2 měsíci +1

    why you select 1000 as epochs number ?

    • @PyLessons
      @PyLessons  Před měsícem

      I select 1000, not to make any limitations. I am using callbacks that will break the training process when model stops improving

  • @navyaanzaheen3233
    @navyaanzaheen3233 Před rokem

    nice explaination but please can you add a method in which user can recognize his own voice by repeating dataset sentences

    • @PyLessons
      @PyLessons  Před rokem

      You will need to do this by your self, I am giving a principle how it works

  • @yashkewlani2878
    @yashkewlani2878 Před rokem

    Could you please make video on project converting text to speech ?

    • @PyLessons
      @PyLessons  Před rokem

      Maybe in the future, its way more difficult to convert text to speech, you should use transformers for this task

    • @eminagamirzeyev3144
      @eminagamirzeyev3144 Před 4 měsíci

      @@PyLessons What kind of transformers?

  • @pesworld2405
    @pesworld2405 Před 2 měsíci +1

    can i use this for making a model for arabic language ?

    • @PyLessons
      @PyLessons  Před 2 měsíci +1

      You can try, I am not sure how it will result

  • @melapobia
    @melapobia Před 11 měsíci

    why dont you put microphone on your model? i just wonder

  • @user-xu6yw9fn2l
    @user-xu6yw9fn2l Před 3 měsíci

    can you provide your pretrained model for use as we cannot train on cpu

    • @PyLessons
      @PyLessons  Před 2 měsíci

      Why you need it then if you can't train it, because I demonstrated it simply as example, not production or something ready model

  • @mariamjbani-amer6684
    @mariamjbani-amer6684 Před 2 měsíci

    can you please answer my questions ?