Coding OCR with machine learning from scratch in Python - no libraries or imports! (From Scratch #2)

Sdílet
Vložit
  • čas přidán 6. 07. 2020
  • In this video, we implement OCR/image recognition using simple machine learning in Python with no imports! This was streamed live on / clumsycomputer on 2020/07/06.
    Our program uses the k-nearest neighbors algorithm to classify images from the mnist and fashion-mnist datasets, meaning it takes digits (1, 2, 3, …) and garments (shoe, t-shirt, …) and guesses what they are (classifies them). It can get up to 93% accuracy on mnist. I wanted this to be a simple introduction to machine learning for those not very familiar with it.
    The full code including more details is available at git.sr.ht/~vladh/clumsycomput...
    Because this is a recording of a live stream, you'll hear me sometimes talking to the chat.
    If you'd like to see more, please follow the clumsy computer Twitch channel at / clumsycomputer .
    If you have any questions, write a comment or contact me on / clumsycomputer .
    What would you like to see me code from scratch next? Let me know in the comments! :)
    Wishing you a swell day,
    Vlad
  • Věda a technologie

Komentáře • 59

  • @flashbao1922
    @flashbao1922 Před 3 lety +40

    He explains everything from scratch. This is the scratch I actually needed!

  • @ArabianKnight63
    @ArabianKnight63 Před 7 dny

    followed, that the from scratch thing everyone was looking for.

  • @haniajarlaczynska4518
    @haniajarlaczynska4518 Před 4 lety +26

    I've just started learning machine learning and I really like this video - you explain everything very clearly. It's nice to see how to do this from scratch instead of using some huge libraries.

    • @clumsycomputer
      @clumsycomputer  Před 4 lety +3

      Thank you for the feedback! It makes me really happy to hear that.

    • @ANSELAbitsxb
      @ANSELAbitsxb Před rokem

      @@clumsycomputer Dude you make really good videos. If you get time again in the future please make more.

  • @Muphet
    @Muphet Před 3 lety +5

    most underrated software programmer and evangelist on youtube. i find this explanation/tutorial better than any other youtube video.

  • @zodiaminc1474
    @zodiaminc1474 Před 3 lety +2

    Please don't get dis-heartened or demotivated, this video is a total gem (sorry too late for encouragement after 10 months) but this really helped, thanks bruv, keep up the good work !!!!!

  • @youssef-attai
    @youssef-attai Před 3 lety +8

    I swear to god you are a legend, no one could have explained it better. Thank you so much.

  • @frindle18
    @frindle18 Před 28 dny

    Brilliant tutorial, everything is so clearly explained, and it actually is from scratch.

  • @adhamsalama4336
    @adhamsalama4336 Před 2 lety +1

    This is amazing! Thanks for this awesome video! Subscribed!

  • @myquoraismine2953
    @myquoraismine2953 Před 3 lety +6

    I was looking around for this stuff, it's amazing, You nailed it
    clumsy computer, can you please implement hyper parameters in it ?

  • @nelhdez3126
    @nelhdez3126 Před 3 lety +3

    This was so refreshing, you made it way better than some full-of-pride professors at some universities. Thanks so much mate. Could you do at some point the most basic “non-lazy-training” algorithm from scratch? Whichever you deem is fine

  • @ruoyuguo3945
    @ruoyuguo3945 Před rokem +4

    Thanks for sharing, PaddleOCR seems friendly for training the OCR models. Could you please shoot a video about how to use PaddleOCR?

  • @benoitd94
    @benoitd94 Před 10 měsíci

    Hi Clumsy VERY GOOD JOB ! Just one question, is it possible to decode Water Counter with that method ? do i need a special MNIST DATASET to download ? thx

  • @SarthakRauts
    @SarthakRauts Před 3 lety

    Amazing video

  • @ElectroProgrammingFreakJuanGCJ

    Very useful.

  • @munteanionut3993
    @munteanionut3993 Před 2 lety

    Does this tutorial only build a digit ocr? Or does it also work for characters as well?

  • @prakharpathak2358
    @prakharpathak2358 Před 2 lety

    How to preprocess a image of PNG file format downloaded from the web to recognise the digit and what changes to do in read images function in the above code to test that image

  • @abdullahshoukat7848
    @abdullahshoukat7848 Před 2 lety

    603. you deserve more, I hope you'll get soon

  • @arvindshelke8889
    @arvindshelke8889 Před 2 lety

    It really helped👌👌

  • @jaycobclavel7098
    @jaycobclavel7098 Před 6 měsíci

    How do you make your own custom datasets.? Provide your own dataset froms cratch. Just want to detect numbers. sometimes keras ocr detects 3 as 5.

  • @SourabhChavan-1997
    @SourabhChavan-1997 Před rokem

    Hey, Thanks for explaining everything, i like this video.
    which IDE are you using ?

    • @clumsycomputer
      @clumsycomputer  Před rokem +1

      Hi! I don't use an IDE, I only ever use vim.

    • @mechpunk
      @mechpunk Před rokem

      @@clumsycomputer your content is really amazing , If you have any spear time consider making more videos.

  • @JorgeGonzalez-jo3zv
    @JorgeGonzalez-jo3zv Před 4 lety +3

    Instructional video about machine learning, it would be good if you could make a video on how to configure vi with python with the basic instructions. Best regards.

  • @snenscoleman7095
    @snenscoleman7095 Před 3 lety

    What features does "extract_features" extract?

  • @PrakashKing8342
    @PrakashKing8342 Před 2 lety +1

    Do you have solutions for alpha numeric recognition I have tried all the method finally I’m getting fail tesseract easy ocr etc for example ABC00OO1 and SI1234H

    • @munteanionut3993
      @munteanionut3993 Před 2 lety

      Did you manage to find a solution for alpha numeric characters too?

  • @johnmele720
    @johnmele720 Před rokem +2

    Hey, the website you've got those files from now requires a password and username to get in. Is there another place we can get the files from?

    • @clumsycomputer
      @clumsycomputer  Před rokem

      Hi! I can download the files fine, could you tell me which website you're referring to?

    • @naufalafels
      @naufalafels Před 10 měsíci +1

      @@clumsycomputer this one, I also tried to get into the website but it needs password and username.

  • @jonas1351
    @jonas1351 Před rokem

    why did you stop dude, your explanation is so well

    • @clumsycomputer
      @clumsycomputer  Před rokem +1

      Thank you! New videos are coming, I just have a lot of other stuff to do. ❤

  • @hemantchauhan6437
    @hemantchauhan6437 Před 3 měsíci

    NEED HELP! I am making a website where user can upload a pdf but I want that pdf to upload only if that pdf has images of only HANDWRITTEN text. Thank you for reading.

  • @trungthanhbp
    @trungthanhbp Před 2 lety

    where are you now, we need your video :D

  • @iamHamza
    @iamHamza Před 3 lety +1

    Can u please tell us which editor did u used?

  • @ilyaosver4987
    @ilyaosver4987 Před 3 lety

    Nice

  • @AJ93-lx6on
    @AJ93-lx6on Před 3 měsíci

    Can this work on seven segment font though

  • @Tech_Inside.
    @Tech_Inside. Před 3 měsíci

    I want to ask how to test it on your img file?

  • @adwaithks
    @adwaithks Před 3 lety

  • @konalaravichandanreddy9886

    How to run this on jupyter notebook? Can someone please tell?

  • @beserious2773
    @beserious2773 Před rokem

    I wanted to create an ocr app for Android for myself only. But I need to implement maths also equations.
    Like Mathpix software did. I don't have money. To pay.
    I want to learn from very much basic.
    Like I am noob don't know coding also.
    But I wanted to learn.
    And wanted to create my own style data to get extracted from image.

  • @xhyhbdka
    @xhyhbdka Před 3 lety +3

    Good video but christ, presentation wise and viewer experience, an absolute car crash!! Max res 720P, you should have zoomed a bit more!!! That background chance from black to white just fucked my eyes up at 14:25

  • @kumaraguru3193
    @kumaraguru3193 Před 2 lety

    can i use my webcam to recognize any face

  • @Champe19
    @Champe19 Před 7 měsíci

    Wtf are the || swiggle characters

  • @truongzgiang
    @truongzgiang Před 7 měsíci

    Too many videos trianing copy from each other but no video talk about how to use it after trianed ,

  • @fahadsajeem2897
    @fahadsajeem2897 Před 3 lety +1

    next time please code in light mode. In dark mode its hard to read.
    Your videos are really useful and helpful. loved it 🙂

  • @MadhuKumar-hi2qs
    @MadhuKumar-hi2qs Před 3 lety

    507th subscriber

  • @shivu.sonwane4429
    @shivu.sonwane4429 Před 2 lety

    777

  • @sammy9453
    @sammy9453 Před 3 lety +1

    I think you should remove ocr tag. I am dissappointed.

  • @siyabonganxumalo4574
    @siyabonganxumalo4574 Před 3 lety

    I like the video but I was looking for something that will teach me from scratch because I don't have a experience in programming so I feel like you skipped a lot of steps because some people are already familiar with programming and I downloaded Python onto my laptop, so your coding was confusing me because I have gotten used to using Python that I downloaded.

    • @clumsycomputer
      @clumsycomputer  Před 3 lety +4

      Thank you for watching! If you just started learning Python, I don't think this is a good introduction, and I definitely don't think you should start with machine learning. In the future I plan to make more basic tutorials too, but until then, I recommend you start with some more beginner-level Python resources! :)

    • @siyabonganxumalo4574
      @siyabonganxumalo4574 Před 3 lety +1

      Thanks man, noted.