Step-by-Step Handwritten Sentence Recognition with TensorFlow and CTC loss
- Added 29. 01. 2023
- Unlock the power of handwritten sentence recognition with TensorFlow and CTC loss. From digitizing notes to transcribing historical documents and automating exam grading.
This tutorial will teach you how to use TensorFlow and CTC loss to master Handwritten Sentence Recognition. This challenging task involves interpreting text written in handwriting and has various applications, such as converting handwritten notes into digital text, transcribing historical documents, and automating the grading of exams. One of the critical challenges in Handwritten Sentence Recognition is handwriting variability, which makes it difficult for a machine-learning model to recognize handwritten text accurately. With this tutorial, you'll be able to address this challenge and use your model to recognize handwritten text with high accuracy. You'll learn how to use CTC loss to handle sequence data, such as text, and how to train your model to recognize handwritten text even with different input and output sequence lengths. Don't miss out on this opportunity to become an expert in Handwritten Sentence Recognition!
Text Version Tutorial: pylessons.com/handwritten-sen...
GitHub: github.com/pythonlessons/mltu...
pypi: pypi.org/project/mltu/
#machinelearning #python #tensorflow #opencv #ocr
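Since the description leans on CTC's ability to map long frame sequences to shorter text, here is a minimal standalone sketch of the greedy CTC decoding step (not the tutorial's code; the blank symbol and alphabet are placeholders): the model emits one label per time step, and decoding collapses consecutive repeats and drops blanks.

```python
# Illustrative sketch: greedy CTC decoding in pure Python.
# A CTC model outputs one label per time step (including a special "blank");
# decoding collapses repeated labels and then removes blanks, which is how a
# long frame sequence maps to a shorter character sequence.

BLANK = "-"  # hypothetical blank symbol

def ctc_greedy_decode(frame_labels):
    """Collapse consecutive repeats, then drop blank symbols."""
    decoded = []
    prev = None
    for label in frame_labels:
        if label != prev and label != BLANK:
            decoded.append(label)
        prev = label
    return "".join(decoded)

# A 10-step output sequence decodes to the 3-character word "cat":
print(ctc_greedy_decode(list("cc-aaa-ttt")))  # cat
# Blanks between repeats preserve genuine double letters:
print(ctc_greedy_decode(list("hh-ee-ll-ll-oo")))  # hello
```

Note how the blank between the two "ll" groups is what lets CTC output a genuine double letter rather than collapsing it away.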
Awesome informational video.
Thanks for sharing such a valuable content..
My pleasure!
Hello, I have a question: how do I continue training the model? I don't want to restart training from scratch all over again.
Can you suggest some steps to extract information from a bank check, like payee name, amount in words, amount in digits, date, MICR code, etc.?
Hello Sir, thanks for sharing this video and knowledge. Could you suggest software that identifies the user's handwriting style (cursive handwriting, Lucida, etc.) and suggests improvements if there are mistakes in the style? For example, if the user writes the word "Apple" in cursive but the letter "e" is too squeezed, the software should identify the letter "e" and suggest that it be written properly. Please suggest if you have any software or code. Thanks a lot once again.
Will this work in a Jupyter notebook with Python? Please help, thanks.
We are not able to download the dataset.
Also, how do we do labelling for our custom data?
Need help! I am making a website where a user can upload a PDF, but I want the PDF to upload only if it contains images of handwritten text only. Thank you for reading.
How do I give external images to this model, sir? Can you please tell me?
Can we get the TensorBoard graphs that you show at the end in Google Colab?
I'll check if I still have them
In which folder should we store the trained model? Please tell me, it is showing an error.
Awesome sir, could you please make another video on how to improve the accuracy rate of the same model?
Hey, way more data and trying to improve the model, that's all. I am not sure it's worth creating another video where everything would be the same.
Did you find a way, bro?
Awesome tutorial.
I have one question: can I use it on an image which contains more than one sentence, like 2 or 3 sentences?
Thanks! It would be way harder to train the model on a couple of sentences, but it should work. Haven't tried it.
Great tutorial! But I have a question: if I download the giant IAM dataset, train my model, and load my model/script into a Streamlit web app, can I delete the IAM database after training my model locally, or do I need to keep it so that my model continues to run? I'm working on an OCR app that copies text from PDFs and converts it into a string. Thanks!
It's not that giant, comparatively. But to answer your question: as long as you complete model training, you don't need this dataset. The model doesn't require the dataset to make predictions ;)
@@PyLessons thank you! cheers 🍻
A neural network is not a database, it does not refer back to anything. As long as you're training the model you will need the dataset. Once done the information is stored in the neural network in the form of weights.
I have a question. After training, how can I use my own data as input to predict/recognize? What should I change, and in which format and how do I give that input for prediction?
Hey, an example is given here:
github.com/pythonlessons/mltu/blob/main/Tutorials/04_sentence_recognition/inferenceModel.py
Can we convert it into TFLite for an Android implementation?
I haven't tried, but yes, I don't see why it shouldn't be possible.
After training, how can I use my own data as input to predict/recognize? What should I change, and in which format and how do I give that input for prediction? How do I give external images to this model, sir? Can you please tell me!
There is an inference script in the tutorial; analyse it.
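For the recurring "how do I feed my own image" question, here is a rough, hedged sketch of the kind of preprocessing an inference script typically does. The target size (1408×96), padding value, and nearest-neighbour resize are assumptions for illustration; the tutorial's actual pipeline uses mltu's own image resizer, so check inferenceModel.py for the real values.

```python
# Hypothetical sketch of preparing an external image for inference:
# resize keeping aspect ratio, pad to the model's fixed input size,
# then add a batch axis. NumPy-only nearest-neighbour resize.
import numpy as np

def preprocess(image, target_w=1408, target_h=96, pad_value=255):
    """Resize (nearest-neighbour) keeping aspect ratio, then pad to fixed size."""
    h, w = image.shape[:2]
    scale = min(target_w / w, target_h / h)
    new_w, new_h = max(1, int(w * scale)), max(1, int(h * scale))
    rows = (np.arange(new_h) / scale).astype(int).clip(0, h - 1)
    cols = (np.arange(new_w) / scale).astype(int).clip(0, w - 1)
    resized = image[rows][:, cols]
    # Pad with white (assumed background) so every input has the same shape
    canvas = np.full((target_h, target_w) + image.shape[2:], pad_value, dtype=image.dtype)
    canvas[:new_h, :new_w] = resized
    return canvas

img = np.random.randint(0, 256, (120, 900, 3), dtype=np.uint8)  # stand-in photo
batch = np.expand_dims(preprocess(img), axis=0)  # model expects a batch axis
print(batch.shape)  # (1, 96, 1408, 3)
```

The batch tensor would then be passed to the loaded model (ONNX or Keras) exactly as the inference script does with dataset images.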
Hi there, how do I make a project where I insert a handwritten PDF and get back a PDF with the text converted?
It is way harder than you think, and I suppose, if you're asking, you haven't done any research yet.
I'm not able to download the dataset from the link you mentioned, can you help me?
You may not be able to access the dataset website from your location; try using a VPN to access the dataset.
Did it start with the loss at inf, and then start to learn after the 20th epoch?
Yes, this is because of a small dataset or a weak neural network architecture. But it works, and people can move further with this :)
How do I create a handwritten image dataset for a regional language?
Get or create annotated dataset for that language :)
How do I print val_accuracy for each epoch? If I add it in the metrics it gives an error.
How did you try to do it? If you add the metric as metrics=['accuracy'] it may not work because of CTC loss; you can try metrics=['sparse_categorical_accuracy']. I haven't tried it, but it should work.
Awesome tutorials! Hi sir, would you mind explaining how to calculate the accuracy of the model we have built? 🙏 And if the real text written in the picture is "i buy clothes" and the predicted text is "i buy clothed", what is the accuracy? Will it be counted as totally inaccurate, or will it get 90% or more (but not 100%)? 🙏🙏
Hey, that's why we use the CER (Character Error Rate) and WER (Word Error Rate) metrics to get these results. And yes, the more similar the predicted sentence is to the real one, the lower these error scores will be.
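To make the CER/WER answer concrete, here is a minimal standalone sketch of how the two metrics can be computed with plain edit distance (the tutorial itself uses mltu's built-in metric classes; this version is just for illustration):

```python
# Minimal CER/WER computation via Levenshtein (edit) distance.

def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (single-row DP)."""
    dp = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        prev, dp[0] = dp[0], i
        for j, h in enumerate(hyp, 1):
            prev, dp[j] = dp[j], min(dp[j] + 1, dp[j - 1] + 1, prev + (r != h))
    return dp[-1]

def cer(ref, hyp):
    """Character Error Rate: character edits / reference length."""
    return edit_distance(list(ref), list(hyp)) / len(ref)

def wer(ref, hyp):
    """Word Error Rate: word edits / reference word count."""
    return edit_distance(ref.split(), hyp.split()) / len(ref.split())

# One wrong character out of 13, but one whole word of three is wrong:
print(cer("i buy clothes", "i buy clothed"))  # ~0.077
print(wer("i buy clothes", "i buy clothed"))  # ~0.333
```

So the "clothes"/"clothed" example from the question scores about 7.7% character error but 33% word error, which is why both metrics are reported.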
@@PyLessons Hello sir, can you provide the dataset? It seems their website is defunct, so I cannot register a new email and hence can't access the data.
I can't access the website to download the datasets. Can you give me a Google Drive folder or something so I can download from that? Thanks, all.
Hey, did you get the folder?
Can you share your datasets, sir? Because I'm not able to download the dataset from the link you mentioned.
You may not be able to access the dataset website from your location; try using a VPN to access the dataset.
@@PyLessons It does not send a verification link to our email.
Hi,
This is Awesome,
Can I use this for other languages?
Hey, thanks! Yes, you can use this for other languages; this tutorial is just an example.
Hi
I am facing an issue regarding the IAM dataset. How much time does it take to get the verification email?
No idea, for me it works fine
I'm able to get the prediction. How do I improve the prediction results?
Bro can you guide us
Could I get your phone number or Insta ID? I need your help.
@@-HarshalMali Yes
Hi, I'm implementing CTC on an attention encoder, and I want to jointly decode it with an attention decoder. After the encoder I added an output layer (trained with CTC); same thing with the decoder (but trained with cross-entropy). But obviously the output shapes are different. I want to do a linear combination of the outputs of the two: CTC gives me (batch, max frame length, vocab size) and the decoder gives me (batch, transcript length, vocab size). Is there some step I'm missing? I can't figure it out. Great video btw 😊
Hey, you should check my tutorial about transformers. I haven't tried to do what you're doing, but everything sounds logical. I think your decoder output is wrong (but I'm not sure); it's hard to say without looking at the code. Usually you run a forward pass on the encoder one time, and on the decoder side you'll need to iterate until the end of the sentence.
Hi
Actually, I am facing an issue regarding the IAM dataset: how much time does it take to get the verification email? Can you please tell me?
Hi sir, the tutorial was awesome, but if the image is a full page of handwritten text, what should we do? Please let me know.
With computer vision techniques, separate each line and do recognition on those lines.
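One classic way to do that line separation (an assumed approach for illustration, not the tutorial's code) is a horizontal projection profile: count the ink pixels in each row of a binarized page, and treat each contiguous band of inked rows as one text line to crop and feed to the model.

```python
# Sketch: find text lines via the horizontal projection profile
# of a binarized page image (nonzero pixels = ink).
import numpy as np

def split_lines(binary_page, min_ink=1):
    """Return (start_row, end_row) ranges of contiguous inked bands."""
    ink_per_row = (binary_page > 0).sum(axis=1)
    lines, start = [], None
    for y, ink in enumerate(ink_per_row):
        if ink >= min_ink and start is None:
            start = y                      # a text band begins
        elif ink < min_ink and start is not None:
            lines.append((start, y))       # the band ends
            start = None
    if start is not None:
        lines.append((start, len(ink_per_row)))
    return lines

# Synthetic page with two "text lines" at rows 10-20 and 40-55:
page = np.zeros((80, 200), dtype=np.uint8)
page[10:20, 30:150] = 255
page[40:55, 20:180] = 255
print(split_lines(page))  # [(10, 20), (40, 55)]
```

On real scans you would binarize first (e.g. thresholding) and raise min_ink to ignore noise; each returned band is then cropped with page[start:end] and recognized separately.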
How to run this code?
I gave all the step-by-step details about this.
Hello,
Thank you for your awesome work !
I'm a French developer, and I'm currently transcribing some family papers dated between the 13th and 19th centuries (French and Latin).
Could you give me some clues for constructing datasets suited to work with your code?
I'm beginning my investigations and I'm sure I'll eventually find a way to achieve the task on my own but your help could save me some time as I'm new to TensorFlow and PyTorch.
You also mention that your model trains in 1h30. Could you share the technical specs of your hardware so I can compare with mine?
Bravo once again. I hope to hear from you.
Yes, I tried to write the code so it would work out of the box. As I remember, I trained it with a GTX 1080 Ti GPU.
@@PyLessons Thank you for your answer. Next gen GPU will allow better training time :)
can we convert these recognize text in to voidable? using raspberry?
can you give more details? voidable?
@@PyLessons I want to make this for the blind, so I need voice output.
Sir, please tell me whether I can use this code to convert a whole page into textual format.
You can't use it to convert a whole page.
@@PyLessons Can you make a tutorial on doing this for a whole page, please? I'm doing this for my major project and can't find code anywhere.
I installed mltu, but it doesn't recognize mltu.losses, nor callbacks, nor metrics. Any answers, or has the same thing happened to anyone else? I'm on Python 3.11.
?
Hi bro, I have a question: there is no such library as mltu.tensorflow. How can I resolve this?
install it
@@PyLessons I'm really sorry, but I have installed the mltu library and TensorFlow 2.10, and all functions are there except mltu.tensorflow, mltu.utils, and mltu.annotations. Can you tell me where to get those files?
I registered on the site (IAM dataset), but I am not able to access the database. There are login issues.
not sure, for me it works fine
@@PyLessons Could you make a Drive folder or something from where we can download it? Because even I can't access it.
Sir, I am getting a "No module named 'onnx'" error. What do I have to do?
pip install onnx
Sir, I want to make a correction system, so please tell me how to do it.
To my knowledge, I think these are the steps involved; please tell me, suggest improvements, and help me.
1. Capturing the real-time paper or scanning.
2. Sentence recognition.
3. Using NLP to get keywords from recognized sentences.
4. Creating a dataset that includes keywords.
5. Tokenizing and comparing the recognized data with the dataset.
6. Allocating marks.
If these are wrong, can you please tell me?
Also, I downloaded the ascii.tgz and sentences.tgz files from that website, but I can't extract the files. How, sir?
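On the extraction question: the IAM archives (ascii.tgz, sentences.tgz) are ordinary gzipped tarballs, so Python's standard tarfile module can unpack them. A small sketch, demonstrated on a dummy archive built on the fly so it runs without the IAM download (the target folder name is an example, not the tutorial's required path):

```python
import io
import os
import tarfile
import tempfile

def extract_tgz(archive_path, target_dir):
    """Unpack a .tgz / .tar.gz archive into target_dir."""
    with tarfile.open(archive_path, "r:gz") as tar:
        tar.extractall(path=target_dir)

# Build a tiny dummy ascii.tgz so this sketch is self-contained;
# for the real files just call extract_tgz("ascii.tgz", "Datasets/IAM").
workdir = tempfile.mkdtemp()
archive = os.path.join(workdir, "ascii.tgz")
with tarfile.open(archive, "w:gz") as tar:
    data = b"sample line of text"
    info = tarfile.TarInfo(name="sentences.txt")
    info.size = len(data)
    tar.addfile(info, io.BytesIO(data))

extract_tgz(archive, workdir)
print(os.path.exists(os.path.join(workdir, "sentences.txt")))  # True
```

Most desktop archive tools (7-Zip, GNOME Archive Manager, `tar -xzf` on the command line) open these files just as well.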
I finished training, but where does the output get shown? Please help.
You mean where the model is saved?
Please, how much time does training take?
It's showing the error: Exception: Model path (Models/04_sentence_recognition\202303182217\model.onnx) does not exist
You need to download a model from the link in my text version tutorial or train your own model
@@PyLessons Thank you sir, I got the output 😀
I installed mltu correctly, but I'm getting a "No module named 'mltu.utils'" error. The rest of the mltu modules were imported without any issues. Could you please help me resolve this?
What version of mltu?
@@PyLessons version 0.1.5
Thanks, there was a bug, and no one had mentioned it... I released version 0.1.7, try it now.
@@PyLessons Yes this version is working. Thanks a lot!!☺
Could you tell me the command to save the model?
Yes
Hey, it's done in the standard way; right now the model is saved by callbacks.
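For readers unfamiliar with that mechanism, here is a minimal standalone sketch of the standard Keras pattern: a ModelCheckpoint callback that writes the best model whenever the monitored validation metric improves. The tiny stand-in model, path, and data here are placeholders, not the tutorial's actual training code.

```python
# Sketch: saving a model during training with a ModelCheckpoint callback.
import os
import tempfile

import numpy as np
from tensorflow import keras

checkpoint_path = os.path.join(tempfile.mkdtemp(), "model.h5")  # example path

# Tiny stand-in model; the tutorial builds its real model elsewhere.
model = keras.Sequential([keras.Input(shape=(4,)), keras.layers.Dense(1)])
model.compile(optimizer="adam", loss="mse")

# Save the best model so far whenever validation loss improves.
checkpoint = keras.callbacks.ModelCheckpoint(
    checkpoint_path, monitor="val_loss", save_best_only=True)

x, y = np.random.rand(32, 4), np.random.rand(32, 1)
model.fit(x, y, validation_split=0.25, epochs=2,
          callbacks=[checkpoint], verbose=0)
# model.save(checkpoint_path) would save it manually instead.
```

With save_best_only=True, interrupted training still leaves the best epoch's weights on disk, which is what the tutorial relies on.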
Hello Sir,
I am getting the error FileNotFoundError: [Errno 2] No such file or directory: 'Models/04_sentence_recognition/202301131202/configs.yaml'
How do I resolve this?
Can you please help? Thank you.
Did you download the model file together with the configs?
@@PyLessons Actually it's not about the downloads; that's the same thing I'm facing too.
Awesome sir, but I got this error when I tried to train the model: TypeError: Input 'y' of 'Less' Op has type int64 that does not match type int32 of argument 'x'.
Install the right version of mltu; that may be the issue.
@@PyLessons Same error; I used mltu version 0.1.5.
@@omarzain3292 Thanks, I will check; you may try the 0.1.6 version.
Hi, I am not able to access the website; does it require a VPN?
NVM, just found out you need a VPN to access it from India.
Great! Because I can access it from my location, I didn't know that you can't access it from other locations.
Hey, I could not download it, could you please send it to me?
drive.google.com/drive/folders/13-U__hphtd1Wc5F7UjmIauT4U4nDZ_yY
You could upload it to the Drive link.
Hi, I have a problem with mltu:
"
ERROR: Cannot install mltu==0.1.3, mltu==0.1.4, mltu==0.1.5, mltu==0.1.6, mltu==0.1.7, mltu==1.0.0, mltu==1.0.1, mltu==1.0.2, mltu==1.0.3, mltu==1.0.4, mltu==1.0.5, mltu==1.0.6, mltu==1.0.7 and mltu==1.0.8 because these package versions have conflicting dependencies.
"
What's the solution?
That's strange. What OS and what Python version do you use?
@@PyLessons I fixed it, thanks,
but it doesn't work; it needs the datasets.
Hi sir, I'm really sorry, but I have installed the mltu library and TensorFlow 2.10, and all functions are there except mltu.tensorflow, mltu.utils, and mltu.annotations. Can you tell me where to get those files?
What mltu version did you install?
@@PyLessons mltu 0.1.6
Install the newest version and follow the tutorial code that is on GitHub.
Thanks sir, it worked, but how can I reduce the training time? It is taking 2 hours to train 1 epoch; if I want to do 1000, it will take much longer. Can you help me out?
@@nahushs Make sure to train on a GPU, and early stopping will kick in, so it won't actually train for 1000 epochs.
Are 1000 epochs essential for training? And how long do I wait to train the model, sir?
Hey sir, I think you haven't watched the complete video or read my text version tutorial. For this you use a validation dataset, and you stop training when your model achieves its best point on that validation dataset.
How do I solve the error: artefact not found?
What OS you use?
@@PyLessons Windows 11 Home Single Language
@@sohambhole4288 The stow package doesn't work with Win11. I'll make a fix in a week, but you can replace the stow package code with the os.path package if you can't wait.
@@PyLessons OK, thanks.
@@PyLessons Sir, please make a detailed video for this error, as lots of folks are getting the same error.
Hi, I am trying to train a model on my own database according to the tutorial, and the training sometimes takes quite a long time, so I wanted to load the model saved by the callback with load_model("{path_to_model}/model.h5") and continue training where I left off. Unfortunately, I get an "Unknown loss function: CTCloss" error, which I tried to solve using the custom_objects parameter, but that caused another error I couldn't solve: CTCloss.__init__() got an unexpected keyword argument 'reduction'. Then I tried saving the file in .tf format, which caused an error related to the metrics; after using custom_objects and passing those metrics, the error looped and was related to the metrics' arguments (which I had entered). So is it possible to load a saved model after training is interrupted and continue training it, while staying in accordance with the tutorial? (For example, I am at epoch 53/1000 and I see that the best value yet was saved to the model.h5 file at epoch 52, so I stop training and then want to load the model saved at epoch 52 and continue from there.)
Open an issue on GitHub; it will be easier to solve this there.
How do I solve "ValueError: not enough values to unpack"?
need more details
@@PyLessons File "C:\Users\Soham\AppData\Local\Programs\Python\Python310\lib\site-packages\mltu\dataProvider.py", line 215, in __getitem__
batch_data, batch_annotations = zip(*[augmentor(data, annotation) for data, annotation in zip(batch_data, batch_annotations)])
ValueError: not enough values to unpack (expected 2, got 0)
Did you change something in the code? Because you are not giving any data to the dataProvider ("expected 2, got 0").
@@PyLessons No, I have just changed the stow package to os.path.
OK, I see it's not enough to make that change. Tomorrow I'll look at it and post you a link to the fixed code.
When I try to train the model I face this error:
stow.exceptions.ArtefactNotFound: Couldn't locate artefact /Users/aliha/AppData/Local/Temp/tmpg50_k80k
Could you please help me with it?
what is your OS?
@@PyLessons
thanks for replying
it's win11
Win11 has problems with the stow package; in future versions I'll try to fix these issues.
@@PyLessons Please fix this issue as soon as possible, sir, we have deadlines.