Real-time Speech Recognition in 15 minutes with AssemblyAI

Sdílet
Vložit
  • čas přidán 28. 08. 2024
  • Get your free speech-to-text API token 👇
    www.assemblyai...
    Transcribing in real-time is a super skill only court reporters can brag about. But luckily, we don’t need to learn how to type fast to get transcriptions of audio quickly. Thanks to Assembly AI’s Streaming Speech-to-Text model (previously real-time speech recognition), it is very simple to set up a python script that can listen for audio and turn it to text.
    In this video, we will see how to create this script on Python with the help of pyaudio, web sockets and asynchronous functions. The app will have the power to listen to audio input through a microphone and display the transcription in real-time. We will integrate this code into a simple Streamlit application to showcase the real-time speech recognition with a touch of interactivity.
    If you’d like to follow along, don’t forget to get your own AssemblyAI API token for free at assemblyai.com
    You can find the code from this tutorial in this GitHub repository: github.com/mis...
    Find the written form of this tutorial here: www.assemblyai...
    AssemblyAI Streaming STT docs: www.assemblyai...

Komentáře • 74

  • @saifullahkhan9837
    @saifullahkhan9837 Před 2 lety +12

    The accuracy and formatting is quite interesting here.

    • @AssemblyAI
      @AssemblyAI  Před 2 lety +2

      Thank you! - Mısra

    • @debojitmandal8670
      @debojitmandal8670 Před 9 měsíci

      ​@@AssemblyAIhi what if I want the input to be not from microphone and i want it from my speaker or laptop speaker how do I do it then.

  • @pjayo
    @pjayo Před 2 lety +6

    Is there a JavaScript version of this video please? Both service side and front end…

  • @otomakannioc8213
    @otomakannioc8213 Před rokem +1

    Very sympathic and engaging presentation. Maybe the most beautiful side of Artificial Intelligence 😊

  • @lfmtube
    @lfmtube Před 2 lety +2

    Most instructional and useful video. Thank you.

  • @MrThought2012
    @MrThought2012 Před rokem +1

    Very nice and easy setup! Took me ages to achieve the same with whisper. However, are you planning to support other languages, german, french or even a multilinugal model?

  • @ashiqashervegar7973
    @ashiqashervegar7973 Před rokem +3

    How can I use this for transcribing particular chrome tabs for online meetings? Can you help me with that?

  •  Před 4 měsíci +1

    Thanks for everything :)

  • @Asparuh.Emilov
    @Asparuh.Emilov Před 2 lety +3

    This is really awesome! I would prefer though to see the final result as a short highlights at the beginning of your videos before you go into the details of how to. But thanks anyway for the effort and the time! Hugs!

    • @AssemblyAI
      @AssemblyAI  Před 2 lety +2

      Thanks for the feedback! It's definitely a good idea to give an impression of the app that is being built. With the newer videos we do a preview at the beginning of the videos indeed. - Mısra

    • @Asparuh.Emilov
      @Asparuh.Emilov Před 2 lety

      @@AssemblyAI 🤗🤗♥️♥️

  • @claudiotassis
    @claudiotassis Před rokem +4

    Incredible video. Would I be able to use chatGPT, as an intermediate, to correct the sentences based on vocabulary and grammar, and after that, get the response from that chatGPT "reviewed" sentences?

    • @mohamedshagie3342
      @mohamedshagie3342 Před rokem +1

      Yup i tried to make it but it worked only text cant use speak 😅

  • @mehdismaeili3743
    @mehdismaeili3743 Před 14 dny

    Excellent .

  • @alexander5429
    @alexander5429 Před 15 dny +1

    @assemblyai : When will you finally support Streaming in German?

  • @fahnub
    @fahnub Před rokem +2

    Does it also offer diarization in real time?

  • @slimyelow
    @slimyelow Před 11 měsíci

    Very kewl it works. However for the live service a $8 minimum is required. - but totally worth it

  • @lookersky6145
    @lookersky6145 Před rokem +1

    I've this installed and worked on windows. My question is that Real-time Speech Recognition only recognize english ? Does it support other languages ? Thank you.

  • @1992kshitizyadav
    @1992kshitizyadav Před 3 měsíci

    As of now, only the English language is supported in the live transcription feature. when can we expect more language support ?

  • @ckames22
    @ckames22 Před 2 lety +3

    Awesome 👍

  • @KashyapJadav
    @KashyapJadav Před rokem +1

    Live transcript is paid version?

  • @usus8420
    @usus8420 Před 4 měsíci

    hi great works but what about smartphone ?

  • @adhikesavan9377
    @adhikesavan9377 Před 2 lety +1

    when i tried to install pyaudio terminal displays this error: "Cannot open include file: 'Python.h': No such file or directory "

  • @Pinkijhabnp
    @Pinkijhabnp Před 9 měsíci

    Thank you for this nice tutorial

  • @borr2749
    @borr2749 Před rokem +1

    Assembly ai real time transcription doesn't have a free trial ?

  • @PoojaVerma-sl6mg
    @PoojaVerma-sl6mg Před 11 měsíci

    Could you please instruct me on how I can include this in my Angular project?

  • @spinal_cord
    @spinal_cord Před rokem +1

    I know this is a little old, but I get a 4002 error, what might cause that?

  • @weebiesoftware6296
    @weebiesoftware6296 Před 4 měsíci

    I want to implement a realtime app using voice recognition on python 3 / android 11 on my samsung s22. It's my understanding portaudio is NOT supported on Android 11. Is portaudio your only way to get to the mic?

  • @onintsoavola5698
    @onintsoavola5698 Před 8 měsíci

    Is it possible to make it faster ? The transcription takes a little time

  • @Miguel-hq1lx
    @Miguel-hq1lx Před 4 měsíci

    is it possible to transcribe in real-time in other languages, such as spanish?

  • @moncefarajdal4582
    @moncefarajdal4582 Před 2 lety +1

    Can you please let me know how can I integrate this in my JAVA Maven project?

    • @AssemblyAI
      @AssemblyAI  Před 2 lety

      Hey Moncef, unfortunately I also don't have experience on that. -Mısra

  • @GiulianoGolfieri
    @GiulianoGolfieri Před rokem +1

    Is it possible to use this service in other languages apart from English?

    • @tiagofyhnesteves74
      @tiagofyhnesteves74 Před rokem

      im also trying to find an answer to this question

    • @GiulianoGolfieri
      @GiulianoGolfieri Před rokem +2

      @@tiagofyhnesteves74 they answered to me privately. It's not possible yet. I switched to Azure cognitive services, which is multi-language.

    • @frizzfrizz3550
      @frizzfrizz3550 Před 10 měsíci +1

      @@GiulianoGolfieri I had taken it for granted that it was a multilingual service, a fucking morning's work wasted. Grazie della info, Giuliano

  • @bakhshizade
    @bakhshizade Před 10 měsíci

    I am here for Freddie.

  • @HomelessRafi
    @HomelessRafi Před rokem

    How can I introduce um, ahs, and other filler words in to the Realtime transcription? I see it is an option for uploading an audio file

  • @loubino18
    @loubino18 Před 5 měsíci

    Should have mentioned cost to go to pro version.... why hide it?

  • @eagold
    @eagold Před 2 lety +2

    buut.. if i have no money to buy the pro key?😕

  • @amineelarif7001
    @amineelarif7001 Před 2 lety

    that is sick! goodjob

  • @user-mx5lv5qp5y
    @user-mx5lv5qp5y Před rokem

    can you pls let me know how to save that text

  • @parameswaranesnsce-cse9491
    @parameswaranesnsce-cse9491 Před 7 měsíci

    can we speak any indic languages , will this endpoint will transcribe or not ?

    • @AssemblyAI
      @AssemblyAI  Před 7 měsíci

      Yes AssemblyAI's API supports Hindi Transcription, check out this tutorial: czcams.com/video/3WlNOCeyyjQ/video.html

  • @MDMUHTADEEFAIAZKHANSOUMIK

    Can we setup Bangla language for this system?

  • @angelfernando8954
    @angelfernando8954 Před 2 lety +1

    Hi. how can i change the lenguage to transcript in spanish?

    • @AssemblyAI
      @AssemblyAI  Před 2 lety +1

      Hey Angel, here is the documentation on transcribing in languages other than English. docs.assemblyai.com/walkthroughs#specifying-a-language

    • @dirtydevil81
      @dirtydevil81 Před 2 lety

      @@AssemblyAI But do different languages work with realtime transcription on this specific endpoint? The documentation, regarding changing the language, is not clear about this.

    • @giovanniied
      @giovanniied Před rokem

      @@dirtydevil81 do you find a solution?

  • @REALVIBESTV
    @REALVIBESTV Před rokem

    Can this work in Unreal Engine 5

  • @IntricateMoon
    @IntricateMoon Před rokem

    I'm on windows, When I try to run it it does nothing, just creates a new line on the terminal. when I cloned the github repo, it was working, hmmm

    • @AssemblyAI
      @AssemblyAI  Před rokem

      Have you tried speaking while the code is running? It might be that you don't have a microphone connected to the computer.

  • @rubibeats
    @rubibeats Před rokem

    how to add custom ui?

  • @benyusu8045
    @benyusu8045 Před 9 měsíci

    received 4001 (private use) Not authorized; then sent 4001 (private use) Not authorized

  • @marlontuquerres6072
    @marlontuquerres6072 Před 2 lety

    THIS IS ONLY AVAILABLE ON MAC/LINUX, RIGHT?

    • @AssemblyAI
      @AssemblyAI  Před 2 lety

      No, it is available independent of the operating system.

  • @siamkamelia87
    @siamkamelia87 Před 2 lety

    does this work for song transcription ? in real time ?

    • @AssemblyAI
      @AssemblyAI  Před 2 lety +1

      Hey Siam, depending on the amount of background music and clarity of pronunciation you'd get varying levels of success with transcribing songs.

  • @ibrahimimohssine8131
    @ibrahimimohssine8131 Před 2 lety

    is assemblyAI support arabic language with vowelization?

    • @AssemblyAI
      @AssemblyAI  Před 2 lety +2

      We are launching support for Arabic in late January!

  • @Homurdan
    @Homurdan Před 2 lety +1

    Aha Türk !

  • @barankaya3333
    @barankaya3333 Před rokem

    Türk müsün?

  • @valerozanoni952
    @valerozanoni952 Před rokem

    When i added this line
    if json.loads(result_str)['message_type'] == 'FinalTranscirpt':
    it wouldnt transcript anything anymore