Live Discussion - How To Read Text From Images Using Pytesseract

Sdílet
Vložit
  • čas přidán 28. 06. 2020
  • Tesseract link: github.com/UB-Mannheim/tesser...
    Please donate if you want to support the channel
    Gpay: krishnaik06@okicici
    Please join as a member in my channel to get additional benefits like materials in Data Science, live streaming for Members and many more
    / @krishnaik06
    Please do subscribe my other channel too
    / @krishnaikhindi
    Connect with me here:
    Twitter: / krishnaik06
    Facebook: / krishnaik06
    instagram: / krishnaik06

Komentáře • 101

  • @pushpendudas3372
    @pushpendudas3372 Před 4 lety +2

    It was nice. Please keep doing session so that our learning curve doesn't stop.

  • @amitjajoo9510
    @amitjajoo9510 Před 4 lety +3

    I like this type of session sir thank you for such a great session

  • @vcjayan8206
    @vcjayan8206 Před 2 lety

    Thank you Krish for the video. Really interesting and useful..!!

  • @harsharora3535
    @harsharora3535 Před 4 lety +3

    Great sir 👍 , before this video i can't imagine that python do this type of extraction also.

  • @shivaprasad9787
    @shivaprasad9787 Před 2 lety

    You have actually played a safe game in the video without resolving the extraction issues

  • @venkateshnallapaneni8932

    Really you are helping me alot
    Thank you very much

  • @suryav3959
    @suryav3959 Před 4 lety +2

    This type of classes are really nice.
    Please do upload advanced topic in tesseracr in future videos.
    Thank you so much for this, one of my most waited video

  • @pattuchiitu8978
    @pattuchiitu8978 Před 2 lety

    Thank you So much!, its really helpful

  • @tarunkumar9528
    @tarunkumar9528 Před 3 lety +5

    Hi sir,
    I am currently working on a project Text Extraction from CPG(Consumer packaged goods) Product Images. Can we use Pytesseract to do the same?

  • @vaibhavghadage8127
    @vaibhavghadage8127 Před 2 lety +4

    This is very helpful session for me ... Can you please make a video on how to convert Image to CSV ... If possible.
    Thank you so much 👍🤝

  • @islamic-videos-together

    which filteration do you suggest to make this more accurate

  • @anoopyadav4081
    @anoopyadav4081 Před 3 lety +1

    I'm doing the same steps still it shows test for simple images, but does not show for other complex images like invoices, traffic signal sign etc. What may be the reason, pls guide.

  • @SuheylBey
    @SuheylBey Před 2 lety

    Hello thank you for the video. Is there a way to get the image preprocessed by the tesseract algorithm? When running tesseract in cmd I can get it by setting tessedit_write_images = 1, but in python I couldn't find a way to get preprocessed image.

  • @ankitdhanore1355
    @ankitdhanore1355 Před rokem

    thank you so much sir...

  • @pinakimishra9057
    @pinakimishra9057 Před 4 lety

    Nice topic , krish

  • @Dailythingsx
    @Dailythingsx Před 2 lety

    you are awesome .. Nice video.

  • @sajithakk4790
    @sajithakk4790 Před 4 lety +1

    Sir please take a class about how to save the model created using cnn for future use using hdf5

  • @Rahul_Singh_Rajput_04
    @Rahul_Singh_Rajput_04 Před 2 lety

    Thankyou so much sir

  • @akshatpant9758
    @akshatpant9758 Před 3 lety

    hello krish,
    i try to upload the same images like you are uploading i.e traffic image and invoice..i choose the exact same image from google but on running, image is displayed but no text is getting printed and for the case i take screen shot of Wikipedia text its is working absolutely fine..what could be the problem??

  • @tahsinasif2612
    @tahsinasif2612 Před 4 lety

    Hi Krish thanks a lot for your videos..I also want to know create container in aws

  • @SICK527
    @SICK527 Před 3 lety

    sir ,how can we do it on multiple images and the extracted text should be created as .txt file as like in notepad

  • @punks0325
    @punks0325 Před 3 lety

    Sir how can we train or retrain the model for new symbol ....
    So that it can detect the symbol ....

  • @aryashailesh12
    @aryashailesh12 Před 3 lety

    Live or recorded Both ways are good, sir

  • @shashankshashi6361
    @shashankshashi6361 Před 4 lety +6

    Sir, how extract data from PDF and separate the names and phone numbers and save it in Excel file

  • @ChetanParmar
    @ChetanParmar Před 4 lety +1

    What is the name of your writing pad

  • @ideygmail
    @ideygmail Před 4 lety +2

    Sir, I have a linux box. What are the steps for me? I have installed tesseract-ocr and pytesseract both the packages

    • @manishankus573
      @manishankus573 Před 3 lety

      Hey, follow this czcams.com/video/-fIlUcp69xo/video.html.

  • @Fast_4ward
    @Fast_4ward Před 6 měsíci

    Hi @Krish, I want to extract text from the yolov8 predicted results which are scanned documents and predicted result images also have bounding boxes with their classes defined as, header, footer, subheading and paragraph. I want to extract text with respect to the class name and the confidence score.

  • @harshavardhanachyuta2055

    Please make a video on post processing of the text that is extracted from ocr ?? It is very important because of the design changes hardcording like contain string does not work. So we need to use nlp I guess.

  • @rohit_mondal__
    @rohit_mondal__ Před 3 lety

    sir i tried this pyteserract on number plate detection...and its not showing great results...can you please make one video on number plate detection also ?

  • @meghnajain7889
    @meghnajain7889 Před 2 lety

    How can I generate character level confidence score using tesseract??

  • @swatibohidar1678
    @swatibohidar1678 Před 3 lety

    I am getting error:
    ImportError: cannot import name 'image_to_string' from 'pytesseract' (c:\python37\lib\site-packages\pytesseract\__init__.py)
    Just after importing tesseract and giving the path.
    Please help!!

  • @panwar2001
    @panwar2001 Před 8 měsíci

    @krish Naik sir could you please tell some way to extract address from a large text corpus? How can tesseract help to extract address from docs?

  • @adityasoni1639
    @adityasoni1639 Před 5 měsíci

    How to know, what's the accuracy of my ocr model ?

  • @heenagirdher6443
    @heenagirdher6443 Před 2 lety

    Helo sir. Could you please make a video on segmentation of handwritten text image to characters. 🙏

  • @adityanjsg99
    @adityanjsg99 Před rokem

    Sir, please make video on custom training and fine tuning! Please!

  • @karishmagermanwala764
    @karishmagermanwala764 Před 3 lety +2

    Do we have any library which can extract text from structured documents like passport, adhar card ,pancard ?

  • @shubhamrajput3148
    @shubhamrajput3148 Před 3 lety

    Sir can you have lecture on OCR USING DEEP LEARNING

  • @krishnirwan4493
    @krishnirwan4493 Před rokem

    I am unable to join ur membership can u guide to join the membership

  • @dswithreddy7384
    @dswithreddy7384 Před 4 lety

    Krish can you make this on Real time video

  • @krishanansamy3479
    @krishanansamy3479 Před 3 lety

    You make one environment to install all installation. Or make every time create new environment and install.plz clear me.

  • @koustavdutta5317
    @koustavdutta5317 Před 4 lety

    Sir Debit Card is not working for getting membership ( Rs. 59 ) of your channel. Please help sir.

  • @AnuRadha-fg8ry
    @AnuRadha-fg8ry Před 3 lety

    sir..can u put a new video for text extraction in azure for arabc and eng ID cards

  • @qzwwzt
    @qzwwzt Před 6 měsíci

    Hi Sir, My challenge is reading texts inside images with wavy lines. The Image was created with a cell phone and just inserted as an image in a PDF file. Any special lib to do this? Pytesseract did work very well. It didn't capture well the wavy lines

  • @payalray8854
    @payalray8854 Před 3 lety

    When I execute import pytesseract....

  • @MaxVerstappen-cj4jw
    @MaxVerstappen-cj4jw Před 4 měsíci

    hey im trying to build a pdf chat bot but i want to install ocr in it so that it recognizes image text too , can someone guide me plz

  • @payalray8854
    @payalray8854 Před 3 lety

    It show me module not found sir

  • @tejasarondekar375
    @tejasarondekar375 Před 3 lety

    what if the language is hindi or sanscrit will it work

  • @xyz-wl9uj
    @xyz-wl9uj Před rokem

    sir plz tell how to implement for multiple images

  • @mashaelabdullah6758
    @mashaelabdullah6758 Před 2 lety

    Hi, I am looking for medical prescriptions dataset where I read the handwritten text using OCR, anyone can share with me this dataset?

  • @lokaranjan4353
    @lokaranjan4353 Před 2 lety

    can we use pytesseract to read kannada text

  • @kushaggrkashyap9831
    @kushaggrkashyap9831 Před 2 lety

    If we draw a circle over a text and take a snap of it then How will we extract that only content which is inside the circle.?

    • @awerqga
      @awerqga Před rokem

      Did you find answer for this?

  • @akshaymitra
    @akshaymitra Před 2 lety

    This is not working in tabular data in scanned images

  • @pinakimishra9057
    @pinakimishra9057 Před 4 lety

    Yes

  • @atharvachouhan474
    @atharvachouhan474 Před 2 lety

    Can it read Doctor's Handwriting?

  • @AshokYadav-is7wx
    @AshokYadav-is7wx Před rokem

    How to extract hindi text in tessract.

  • @anilsharma32g
    @anilsharma32g Před 8 měsíci

    Dear Sir, I am your Subscriber
    I want to create a tool that finds text errors in the image.
    For Example:
    I forgot to write CONTACT US, BUY NOW, CONTACT NUMBER, SPELLING MISTAKE, etc... in my social media post.
    that the tool finds error and suggests what are missing or what is incorrect in social media post.
    🙏 Please guide me and suggest what course I need to buy or what I need to learn to create this tool
    Thank you

  • @AnuRadha-fg8ry
    @AnuRadha-fg8ry Před 3 lety

    Sir..can we extract arabic and english text in pytesseract?if so,can you discuss in tomorrows session or put a video reg the same sir..

  • @lakshikaseksaria845
    @lakshikaseksaria845 Před rokem

    Hey! can u create a model for extracting pan number from pan card

  • @shubairabbas5480
    @shubairabbas5480 Před 4 lety +2

    Sir i am working with teseract and opencv for making an ML based Application on Invoice system. The project is basically on the systwm where you automate the data and convert into excel by training 10 bill of invoice.
    Please help sir

    • @manjubadiger2902
      @manjubadiger2902 Před 3 lety

      Hi you can contact me regarding OCR on invoice projects

    • @shubairabbas5480
      @shubairabbas5480 Před 3 lety

      @@manjubadiger2902 Thanks. But mera project band kar diya gaya ha. Maine bahat help mangi krish sir se, aur bhi loggo se LinkedIn par. But no one help. For this reason i lost my job. Tabse maine inn sab youtuber ki video dekhna band kar diya. Thanks for your favour.

    • @shivangiexclusive123
      @shivangiexclusive123 Před 3 lety

      @@shubairabbas5480 Could you clarify abt the project..and why was it closed?

    • @rubs63
      @rubs63 Před 3 lety

      @@manjubadiger2902 hey buddy.. I need some help, how to extract tables along with other datas from any scanned document??

  • @MuhammadZainE-
    @MuhammadZainE- Před 3 lety

    Great

  • @anannayaaaaa
    @anannayaaaaa Před rokem

    What about other languages

  • @KasimKhan-lk4sw
    @KasimKhan-lk4sw Před rokem

    what about ubuntu path

  • @Gamezone-kq5sx
    @Gamezone-kq5sx Před 2 lety

    You saved me

  • @milindwaykole8187
    @milindwaykole8187 Před 4 lety +1

    Please let me know how we can install it in Linux

    • @manishankus573
      @manishankus573 Před 3 lety

      Hey, follow this czcams.com/video/-fIlUcp69xo/video.html.

  • @nsnilesh604
    @nsnilesh604 Před 3 lety

    please help us with captcha images reading

  • @ragha2846vs
    @ragha2846vs Před 4 lety

    This is amazing. Thanks. Can we extract tabular info from image as tables? how?

    • @ppakpahan
      @ppakpahan Před 2 lety

      I want to know how to do this as well....

  • @telugugamers6402
    @telugugamers6402 Před rokem

    Bro, can u try
    Image_to_boxes

  • @surajmota6415
    @surajmota6415 Před 4 lety

    Sir please build handwritten Oct recognise using CNN...

    • @shubhamrajput3148
      @shubhamrajput3148 Před 3 lety

      Sir have you found any solution for your queary ,as I also need OCR using deep learning tutorial

  • @ChetanParmar
    @ChetanParmar Před 4 lety +1

    Can this also read invoices or bank statements? I think should be able to help my wife who is a CA

    • @krishnaik06
      @krishnaik06  Před 4 lety +2

      Yes I have shown the example

    • @ChetanParmar
      @ChetanParmar Před 4 lety +2

      Oh sorry did I miss it I am was getting my food.

    • @ChetanParmar
      @ChetanParmar Před 4 lety +1

      How to send this data to excel files?

    • @ChetanParmar
      @ChetanParmar Před 4 lety +1

      What is the name of the writing pad

  • @geshusinha6413
    @geshusinha6413 Před rokem

    I want to just read particular part from images after classification

    • @geshusinha6413
      @geshusinha6413 Před rokem

      like only read names from all aadhar cards photos

  • @lokap9662
    @lokap9662 Před 4 lety +1

    Yes yes

  • @MaheshSharma-ud1cj
    @MaheshSharma-ud1cj Před 4 lety +2

    Getting error Exec format error tesseract-ocr-w64-v5.exe

    • @MaheshSharma-ud1cj
      @MaheshSharma-ud1cj Před 4 lety +2

      Running code in colab

    • @venkateshnallapaneni8932
      @venkateshnallapaneni8932 Před 4 lety

      I think this doesn't work on colab because we need to install tessarct exe file on our local system to use it. So use this on your local desktop jupyter notebook.

  • @harjeetsangwan7707
    @harjeetsangwan7707 Před rokem

    sir if video would be recorded then it would have be more helpfull rather than livestreaming

  • @Joel95ify
    @Joel95ify Před 4 lety +2

    Tesseract only works when the image background and texts are clear. I tried to use tesseract on lcd panels and it gave bad results.

    • @adis6867
      @adis6867 Před 3 lety

      There is one otherway to make it, first change the original image to binary image which will basically separate the text and non- text part. And then further feed it into tesseract. It will get improved

    • @aditinagar6688
      @aditinagar6688 Před 3 lety

      @@adis6867 Can you elaborate the steps for it? It would be quite helpful.

  • @ccreddy87009
    @ccreddy87009 Před rokem

    hi krish

  • @lokap9662
    @lokap9662 Před 4 lety +1

    Hi

  • @RAJATSINGH-hk6xi
    @RAJATSINGH-hk6xi Před 3 lety

    Thanks a lot sir ..