How Does Optical Character Recognition (OCR) Work?

  • Added 11. 09. 2024
  • How do computers read text on a page, and how has the technology improved?
    Freshbooks message: Head over to freshbooks.com/... and don’t forget to enter Tech Quickie in the “How Did You Hear About Us” section when signing up for your free trial.
    Techquickie Merch Store: www.designbyhu...
    Techquickie Movie Poster: shop.crowdmade...
    Follow: / linustech
    Join the community: linustechtips.com

Comments • 419

  • @freedomofmotion
    @freedomofmotion 7 years ago +134

    Irish travelers will be deeply hurt that OCR and even you don't accept that dag is a word.
    Has no one ever tried to sell you a dag?
    Or admired your dag?

    • @chantafreak
      @chantafreak 7 years ago +15

      Ya like dags?

    • @ataksnajpera
      @ataksnajpera 7 years ago

      Knackers do not even speak english ;)

    • @GewelReal
      @GewelReal 7 years ago +2

      hey kid, you wanna buy some dags?

    • @EvadingFate
      @EvadingFate 7 years ago +18

      Oh, dogs. Sure, I like dags. I like caravans more.

    • @chantafreak
      @chantafreak 7 years ago +5

      This is the post I was waiting for.

  • @jamesklein4399
    @jamesklein4399 7 years ago +412

    FILE FORMATS AS FAST AS POSSIBLE!
    png vs jpg
    mp4 vs mkv
    mp3 vs ...?

    • @laser5317
      @laser5317 7 years ago +44

      James Klein MP3 vs WAV

    • @RobertHildebrandt
      @RobertHildebrandt 7 years ago +36

      mp3 vs flac

    • @coffeen8128
      @coffeen8128 7 years ago +2

      James Klein PNG keeps the quality

    • @smarthd7749
      @smarthd7749 7 years ago +5

      MP4 and .mkv are not file formats, they are containers. And there's not much difference between mkv and MP4; the only difference is that mkv can hold some more codecs.

    • @cldream
      @cldream 7 years ago +2

      SmartFyrHD Matroska can also embed multiple subtitle formats (SRT, SSA/Advanced SSA)

  • @DustinRodriguez1_0
    @DustinRodriguez1_0 7 years ago +7

    OCR was one of the first practical uses of neural networks back in the 70s or 80s. Maybe even earlier? When I took an AI class in college, we wrote a simple OCR neural net and it was pretty easy.

  • @jandresshade
    @jandresshade 7 years ago +2

    OCR can use different techniques to recognize characters. One is creating a model based on data of different characters and training the software to recognize them (artificial neural networks are an example of this).
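A rough sketch of the "build a model from example characters" idea described above. This is not a neural network: it swaps in a much simpler nearest-centroid classifier so the whole thing fits in a few lines. The 3x3 glyph bitmaps, the `SAMPLES` data and all function names are made up for illustration.

```python
from collections import defaultdict


def bitmap(rows):
    """Flatten rows like '#..' into a list of 0/1 pixel values."""
    return [1 if ch == "#" else 0 for row in rows for ch in row]


def train_centroids(samples):
    """'Training' step: average the pixel vectors seen for each label."""
    sums = {}
    counts = defaultdict(int)
    for label, pixels in samples:
        if label not in sums:
            sums[label] = [0.0] * len(pixels)
        for i, value in enumerate(pixels):
            sums[label][i] += value
        counts[label] += 1
    return {
        label: [total / counts[label] for total in vector]
        for label, vector in sums.items()
    }


def classify(centroids, pixels):
    """Return the label whose averaged glyph is closest to the input."""
    def squared_distance(vector):
        return sum((a - b) ** 2 for a, b in zip(vector, pixels))

    return min(centroids, key=lambda label: squared_distance(centroids[label]))


# Hypothetical training data: a few hand-drawn 3x3 glyphs per character.
SAMPLES = [
    ("I", bitmap([".#.", ".#.", ".#."])),
    ("I", bitmap(["##.", ".#.", ".#."])),
    ("L", bitmap(["#..", "#..", "###"])),
    ("L", bitmap(["#..", "#..", "##."])),
    ("O", bitmap(["###", "#.#", "###"])),
]

centroids = train_centroids(SAMPLES)
# A smudged 'L' with one pixel missing from the bottom row.
print(classify(centroids, bitmap(["#..", "#..", "#.#"])))
```

A real OCR engine works on far larger images and many more examples, but the shape is the same: learn from labelled samples, then pick the best-matching character.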

  • @TheOriginalFayari
    @TheOriginalFayari 7 years ago +30

    That was the smoothest transition to a sponsor spot I've ever seen.

  • @dav2mai
    @dav2mai 7 years ago +70

    Will it also recognize language?
    because "dag" translates to "day" in Danish

    • @Meg_A_Byte
      @Meg_A_Byte 7 years ago +31

      Is there anything in this world that recognizes Danish?

    • @22RH544
      @22RH544 7 years ago +10

      Nope, as a Dutch guy I can read it just fine, but when it is spoken... I quit.

    • @TheDyingFox
      @TheDyingFox 7 years ago +5

      Same result when translated to Swedish xD

    • @Mr.FastZombie
      @Mr.FastZombie 7 years ago +3

      I would assume it sticks to one language, but some can probably change their language. Also perhaps some could be able to determine the language based on what it has already recognized.

    • @crewskater06
      @crewskater06 7 years ago +3

      It's from the movie Snatch

  • @ShreyPandya150
    @ShreyPandya150 7 years ago +7

    When Luke said it wouldn't look as crisp and the video resolution went down I instantly checked if I was at 1080p

  • @littletomatomonkeysmeeeeel8324

    Highly recommend PaddleOCR! 80 languages supported! Good performance! Easy to use! It would be great if bloggers could do a comparative evaluation of the popular OCR tools.

  • @DisbelieverH2o
    @DisbelieverH2o 7 years ago +7

    I gotta say, I really liked this one! Very informative but what really made it for me was the seamless sponsor spot. I'd love to see more in such a way!

  • @SnypeSin
    @SnypeSin 7 years ago +1

    That's good and all, but I would have thought you'd give us an idea of what kind of consumer/business devices use OCR.

  • @ziyitan8996
    @ziyitan8996 7 years ago +3

    I love how Luke explains stuff :D

  • @TheDyingFox
    @TheDyingFox 7 years ago

    I was going to ask "How about Voice Recognition next?" but searched your channel, and I'll be damned, 1 year ago, you guys work fast! (Not sure how I've been missing it though, a lot of content much?).
    It's a shame neither is "How to create your own Voice Recognition and Optical Character Recognition as fast as possible"

  • @quenjankosky7348
    @quenjankosky7348 7 years ago

    Well, with OCR, there is an exception to the lack of accuracy. When basic modern OCR was being developed, they made a series of fonts designed to be read as accurately as possible. These fonts were OCR-A and OCR-B. They are read very accurately by OCR, and there is rarely any error with them.

  • @Mr.FastZombie
    @Mr.FastZombie 7 years ago

    There are also programs for character recognition on your screen.
    Project Naptha is a Chrome extension that can let you copy and paste words in an image.
    And ShareX has OCR that you can use for any program.

  • @jamilangon5798
    @jamilangon5798 7 years ago +1

    Well, Google released an OCRT (optical character recognition translator), which translates even characters outside ASCII (Chinese, Japanese, Thai and other non-alphabetic scripts)... It becomes useful for those who travel and find themselves trapped in a place where no one can speak or understand English.

  • @moenbase1
    @moenbase1 2 years ago

    In my industry, electronics, we use OCR in our automated optical machine to detect component markings on components as small as micro BGAs that are about 400 microns wide. It's amazing to see how far you can push its limits. Sometimes, though, such as when there's enough flux on the components, the marking becomes impossible to read.

    • @Ahmed71616
      @Ahmed71616 2 years ago

      What is the best scanner that does the same job as your devices?

  • @HirooKoslov
    @HirooKoslov 7 years ago +1

    My ScanSnap IX500 uses software to make scans readable. It works pretty well and the IX500 is blisteringly fast.

  • @leivadaros
    @leivadaros 7 years ago +1

    Haven't read a single comment regarding the video's topic... only "First", "Notification Squad where you at" and comments trying to be witty...
    Great video by the way, I love getting general introductory information on the subject of my studies (computer engineering). Keep at it TechQuickie :D

  • @hillppari
    @hillppari 7 years ago +2

    Google translate app with OCR is pretty nifty when you can translate foreign signs etc.

  • @KX36
    @KX36 7 years ago +1

    I did some OCR recently. Tesseract on Linux was the best at recognising the text accurately, but it outputs plain text only. There are 3rd party GUIs, but still none really preserve formatting.
    ABBYY FineReader on Windows (the gold standard for home use) was quite good at preserving formatting but worse at recognising text accurately. My scan was 200 pages of black 12pt Times New Roman on white paper scanned at 300dpi, which should be one of the easiest things to process, and it regularly made mistakes on 1 vs l vs I, y vs v, H vs II etc. And these were often in places where the dictionary should have easily known what it should have been. How often do you get a lower case L in the middle of a long number, or a double upper case I at the start of a word, or a v at the end of a word? It took 3 hours to go through the document correcting the mistakes it highlighted. I don't know how many mistakes are in there that it didn't highlight.
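The "lower case L in the middle of a long number" case above is the kind of thing a context-aware cleanup pass can catch. A minimal sketch, assuming a simple rule: if a token consists only of digits and digit look-alikes and contains at least one real digit, treat it as a number. The function name and the look-alike table are hypothetical, and this is not how Tesseract or FineReader work internally.

```python
import re

# Letters that are easily mistaken for digits, mapped to the digit they resemble.
TO_DIGIT = str.maketrans({"l": "1", "I": "1", "O": "0", "o": "0"})
LOOKALIKES = set("lIOo")


def fix_numeric_tokens(text):
    """Swap look-alike letters for digits inside tokens that look like numbers."""
    def repair(match):
        token = match.group(0)
        digits = sum(ch.isdigit() for ch in token)
        lookalikes = sum(ch in LOOKALIKES for ch in token)
        # Only rewrite tokens made purely of digits and look-alikes,
        # and only if at least one character is already a real digit.
        if digits and digits + lookalikes == len(token):
            return token.translate(TO_DIGIT)
        return token

    return re.sub(r"\w+", repair, text)


print(fix_numeric_tokens("Invoice 2O17 total l0I5 Illinois"))
```

Ordinary words such as "Illinois" are left alone because they contain no real digit; mixed identifiers like part numbers would need a smarter rule than this one.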

  • @HolarMusic
    @HolarMusic 7 years ago +1

    Is that an 8k green-screen video? Looks super clean

  • @angelstrife
    @angelstrife 7 years ago +15

    Hi! Could you do an FPS 1% low explanation? I have seen so many tech reviewers use this term but I have no idea what it means.

    • @sniperunrepeat752
      @sniperunrepeat752 7 years ago +18

      Long Nguyen Games tend to have "stutters" (e.g. briefly running out of VRAM on, say, a 1060 3GB) which can temporarily bring the minimum fps incredibly low. So 1% lows are used. All they mean is the minimum fps that doesn't factor in the bottom 1% of frames, to give a more realistic minimum.

    • @Bayonet1809
      @Bayonet1809 7 years ago

      Could also be called the 99th percentile.
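A small sketch of the "1% low" idea from this thread, following the 99th-percentile reading given above: sort the frame times, skip the slowest 1%, and report the FPS of the slowest frame that remains. Benchmarking tools differ in the exact definition they use, so treat the function name and the nearest-rank method here as assumptions.

```python
def one_percent_low_fps(frame_times_ms):
    """FPS at the 99th-percentile frame time (nearest-rank method)."""
    if not frame_times_ms:
        raise ValueError("need at least one frame time")
    ordered = sorted(frame_times_ms)
    # 1-based nearest rank for the 99th percentile: ceil(0.99 * n),
    # done with integers to avoid floating-point rounding surprises.
    rank = (99 * len(ordered) + 99) // 100
    slow_frame_ms = ordered[rank - 1]
    return 1000.0 / slow_frame_ms


# 1000 frames: mostly 10 ms, a few 20 ms, and a single 200 ms stutter.
frame_times = [10] * 985 + [20] * 14 + [200]
print(one_percent_low_fps(frame_times))  # 1% low
print(1000.0 / max(frame_times))         # absolute minimum FPS
```

The single 200 ms stutter drags the absolute minimum far down, while the 1% low ignores it and reflects the slow frames that happen more regularly.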

  • @OMNIA_RH
    @OMNIA_RH 6 years ago

    Thanks so much for explaining, sir.

  • @fleksimir
    @fleksimir 4 years ago +1

    A Linus ad (Pulseway) on a Linus video. I love this ahahaha

  • @johneygd
    @johneygd 7 years ago

    But can OCR ever distinguish handwritten numbers and letters from each other? Such as 0's & o's, G's & 6's, 1's & i's, H's & 4's, j's & i's, 7's & 1's, 0's & 8's etc., because numbers and letters look similar to each other.

  • @rinoy_43
    @rinoy_43 7 years ago +1

    I've tried Tesseract. It's free and pretty accurate.

  • @JRDev4All
    @JRDev4All 7 years ago

    You should do an as fast as possible on assistive technologies such as screen readers

  • @cestsibon2468
    @cestsibon2468 3 years ago

    This is the first time I've watched a tech video and actually not had a headache after. Waiting for the interpretive Google dance hehe

  • @bradad1111
    @bradad1111 7 years ago +10

    Saw OCR and immediately thought it had something to do with the Exam Board.

    • @craigmalcom6294
      @craigmalcom6294 7 years ago

      bradad111 Lool same

    • @StickyBagel
      @StickyBagel 6 years ago

      So did YouTube, I was watching a revision playlist and here I am??

  • @narutosasuke30
    @narutosasuke30 5 years ago

    Which OCR recognizes Handwritten text that you have shown at the end? I couldn't find anything which actually does that within a permissible error rate :/

  • @macpclinux1
    @macpclinux1 7 years ago +1

    Luke, are you finally using Linux? I saw that little Ubuntu font box :D Good job mate!

  • @unguidedone
    @unguidedone 5 years ago +1

    We need a Firefox plugin that will log which YouTube uploads have paid promotions, skip past them and end the video when the promotion happens.
    This video is an example of native advertising.

  • @sabaamin3179
    @sabaamin3179 2 years ago

    Just what I was looking for. Good Job!

  • @rediculousman
    @rediculousman 7 years ago

    convolutional and LSTM neural networks are the cutting edge for these applications

  • @mickeyhage
    @mickeyhage 7 years ago

    OCRs don't work. I've tried them but they don't work properly. They don't read encrypted documents; they spit out random incorrect letters.

  • @ulashofficial
    @ulashofficial 4 years ago

    Sir, can you tell me how I can find duplicate numbers with any OCR app, or how I should go about making an app for that?

  • @thornejman6467
    @thornejman6467 7 years ago +2

    Thumbs up if anyone else checked the video quality at 0:36 xD

  • @MiMiOrt
    @MiMiOrt 3 years ago

    I downloaded it, but I thought that it would recognize the different fonts that are sometimes on just ONE page. Does anyone know an app/program that can recognize the font on a scanned document?

  • @Juiceman777
    @Juiceman777 2 years ago

    I couldn't help but to think of the line from the movie Snatch when Brad Pitt said "ya like dags?" lol

  • @DeppImAll
    @DeppImAll 7 years ago +1

    I mean tbh ... when I write in OneNote some text and microsoft can figure out what I just wrote and convert it into real characters I'm always astonished since my handwriting is horrible.

  • @terrybell898
    @terrybell898 7 years ago

    Micky: Ya like dags?
    Tommy: Dags?
    Micky: Yea, dags
    Tommy: OH, dogs, sure I like dags

  • @Quack201
    @Quack201 7 years ago +1

    So I guess the real question here is why is Luke only wearing socks while recording this? Doesn't Linus give sandals to all the employees?

  • @howardt12345
    @howardt12345 7 years ago +2

    Dennis: "You are dancing?"

  • @donaldfilbert4832
    @donaldfilbert4832 7 years ago +1

    OneNote has a pretty good built-in OCR for small text articles, and it's free!! ABBYY FineReader does an excellent job converting image PDFs into searchable text-based PDFs!!

  • @rushabmehta
    @rushabmehta 7 years ago

    Can you do a video on virtualization, such as hardware, network and storage virtualization?

  • @182ndNegociator
    @182ndNegociator 7 years ago

    What if it's supposed to say dag? That's also a completely legitimate word used in Australian English, plus it could also be used to describe a Directed Acyclic Graph, a generalization of a tree.

  • @pearls9133
    @pearls9133 7 years ago

    Could you do videos explaining how mastering audio and video works? (if it doesn't already exist)

  • @TheZorch
    @TheZorch 7 years ago

    I've got a Chrome extension that does OCR within images. Sometimes comes in really handy.

  • @antonjohansson1384
    @antonjohansson1384 7 years ago +4

    Dag is "day" in Swedish

  • @zcuipylo
    @zcuipylo 7 years ago

    TPS reports!!!!!! What a perfect example. Almost an easter egg.

  • @stayprofessional2453
    @stayprofessional2453 7 years ago

    Make an episode on network topologies

  • @MotivationAdonis
    @MotivationAdonis 7 years ago

    Linus tech tips as fast as possible

  • @jean-lucasymptotic5083

    Speaking of machine learning..... that would make a good techquickie :D

  • @MrTuffarts
    @MrTuffarts 7 years ago

    Dag is a word. OCR software would not pick this up, and spellcheck does not pick it up either.

  • @arnatsemtappra3822
    @arnatsemtappra3822 6 years ago

    Very useful knowledge and easy to understand provided to the new faces of this technology.

  • @1OldWriter
    @1OldWriter 7 years ago +1

    Techquickie, you do know most scanning software does this as part of its operation. If yours doesn't, perhaps you should get a new one.

  • @metashrew
    @metashrew 7 years ago +2

    If the software were Dutch, the word would be "dag" (which means day in English), and not "dog".

  • @joerider5063
    @joerider5063 7 years ago +1

    Do speech recognition as fast as possible please.

  • @_Disi
    @_Disi 7 years ago

    What about if you're trying to copy the line "D'ya like dags?" from Snatch?

  • @9421Bro
    @9421Bro 5 years ago

    Can you please tell me about any OCR software for the Devanagari script?
    One that costs me less.

  • @feni_1553
    @feni_1553 2 years ago

    Images in video editing?

  • @bas116677
    @bas116677 7 years ago +2

    Dag actually means Hey or day in Dutch!

    • @kdm_6799
      @kdm_6799 7 years ago

      Bas Roelofs dag means bye too

  • @rry1994
    @rry1994 7 years ago +1

    I love u guys man

  • @Golde2Good
    @Golde2Good 7 years ago

    You should explain core parking in the near future.

  • @SuperManitu1
    @SuperManitu1 7 years ago

    Tesseract is the best OCR program out there. It is Open Source and runs on all major OS

    • @9421Bro
      @9421Bro 5 years ago

      How can I run it on Windows?

  • @Pi7on
    @Pi7on 7 years ago

    Why isn't there OCR software to scan videos?
    I mean, there are literally one or two, and they can't do much.
    It should be relatively simple since a video is composed of images.
    But I can't find ONE program that does that.
    And why doesn't Google release a standalone app/software to OCR things, since its OCR is the best? I'd pay for that.

    • @barnstormer322
      @barnstormer322 7 years ago

      I don't think OCR on video is all that practical. Plus you'd have to do things like work out if it's the same text but scrolling between frames, recognise transitions and animations, and also deal with the processor time that analysing at least 24 frames for every second would take.

    • @Pi7on
      @Pi7on 7 years ago

      barnstormer322 Well, yes, but it's not mandatory to analyze every frame in real time, even though I think Google could do it if you had a good enough upload speed to send frames to them in real time.
      Also, I think it would be VERY useful for the anime community, and not only for that.
      Distinguishing text from animations should not be that difficult, since there is freeware that already does that; it just needs to be improved a bit.
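The two points raised in this exchange (not every frame needs analysing, and the same text sits on screen across many frames) can be sketched as a sampling loop. This is only an outline under stated assumptions: `recognize` stands in for whatever OCR engine is used and is passed in by the caller, frames are assumed to be already decoded into a list, and the `step` value is arbitrary.

```python
def ocr_video(frames, recognize, step=12):
    """Run OCR on every `step`-th frame and collapse consecutive repeats.

    `recognize` is a caller-supplied function (hypothetical here) that takes
    one frame and returns the text found in it.
    """
    results = []
    last_text = None
    for index in range(0, len(frames), step):
        text = recognize(frames[index]).strip()
        # Record text only when it differs from the previous sampled frame,
        # so a caption that stays on screen is reported once.
        if text and text != last_text:
            results.append((index, text))
        last_text = text
    return results


# Stand-in data: each "frame" is just the text it would show on screen.
fake_frames = ["Hello"] * 24 + [""] * 12 + ["World"] * 24
print(ocr_video(fake_frames, recognize=lambda frame: frame))
```

Scrolling text and animated transitions, mentioned earlier in the thread, would defeat the simple equality check and need fuzzier matching between frames.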

  • @GroovingPict
    @GroovingPict 7 years ago +3

    do you like dags?

  • @Jinni_SD
    @Jinni_SD 7 years ago

    I really like Tesseract with Homebrew on Mac for OCR.

  • @isabellaereshki
    @isabellaereshki 7 years ago

    I liked your dancing, ignore Dennis. Great video.

  • @Lorten369
    @Lorten369 7 years ago

    YEES More history please. love knowledge.

  • @Shirojm
    @Shirojm 7 years ago

    So use a normal "photographic" scanner, then use OCR services such as Google Drive.

  • @DanRobards
    @DanRobards 7 years ago +1

    Man, the ACR was great. Hardly any recoil

  • @JOELwindows7
    @JOELwindows7 7 years ago +1

    Wow, I saw this video right before my national exam days.

  • @vapexxx
    @vapexxx 7 years ago

    Luke - I actually watched the ad because of your fresh moves!

  • @marcusleung8985
    @marcusleung8985 7 years ago

    what about Fourier transform?

  • @Brusanan
    @Brusanan 7 years ago

    Not one mention of neural networks?

  • @aislius9200
    @aislius9200 7 years ago

    Printing costs like 150 dollars for new ink if you go to retail, if you manage to go online it costs like 10-20 bucks. What the actual fuck?!!?

  • @unvergebeneid
    @unvergebeneid 7 years ago

    4:11 That's not actually writing, is it? Because if it is, it beats _my_ character recognition.

  • @sebon11
    @sebon11 4 years ago

    Cool! Thx a lot.

  • @matthewpurcell5498
    @matthewpurcell5498 7 years ago

    What did Dennis say?

  • @jehdo144
    @jehdo144 7 years ago

    great video!

  • @NineToFiveGamer
    @NineToFiveGamer 7 years ago

    I used to use an augmented translator app for my French tests. Shit just about worked half the time

  • @BenPotts
    @BenPotts 7 years ago

    Nice dancing, Luke

  • @Seag-Gaming
    @Seag-Gaming 7 years ago

    Who else had nostalgia @ 0:36?

  • @levingthedream
    @levingthedream 7 years ago

    Is there any awesome free software that does this? Linux or PC. Besides Google Drive, that is.

  • @ThePiGuy24
    @ThePiGuy24 7 years ago +1

    I WANT INTERPRETIVE DANCE TRANSLATOR NOW!!!

  • @AndyPhu
    @AndyPhu 7 years ago

    This isn't in 4k! :(

  • @nitini.764
    @nitini.764 6 years ago

    I liked this "don't worry, be happy" in your video. Are you a Meher Baba lover too!!!!

  • @sahotaquack1
    @sahotaquack1 7 years ago +1

    Oxford Cambridge RSA

  • @evanvandenberg5805
    @evanvandenberg5805 7 years ago

    Anyone else think the green screen looks a little wonky? (I feel like I shouldn't be noticing that it was green screened)

  • @todddembsky8321
    @todddembsky8321 7 years ago

    Luke, you have to tell me when you go on tour -- I need to leave the country at that point....

  • @TheMasonX23
    @TheMasonX23 7 years ago +6

    OCR is not for "pikies" apparently...

  • @thepalettewhispererasmr1227

    Arizona's audit brought me here 🇺🇸

  • @bassmickey
    @bassmickey 7 years ago

    Funny, I used OCR last night. What a coincidence

  • @blingerang
    @blingerang 6 years ago

    3:33 dag is actually morning in Dutch

  • @Oyamada13
    @Oyamada13 7 years ago

    Soon, we will have OCR AI to eliminate all those errors. For now, we have to worry about putting the data into the correct column in Excel because OCR thinks the second column belongs on the bottom and the third column is broken up into two columns... *sigh*

  • @fa.h.
    @fa.h. 7 years ago +14

    dag is a word in Norwegian :)

    • @22RH544
      @22RH544 7 years ago +1

      Ha en fin dag ("Have a nice day")

    • @fa.h.
      @fa.h. 7 years ago +1

      Vel, ha en fin dag i morgen :) ("Well, have a nice day tomorrow")

    • @forestR1
      @forestR1 7 years ago +2

      It's a word in English too. Poo hanging from a sheep's bum

    • @villenilsson7182
      @villenilsson7182 7 years ago

      Vilken trevlig dag vi har, eller hur? ("What a nice day we're having, aren't we?")

    • @aidantuckwell9191
      @aidantuckwell9191 7 years ago

      It's an English word too, but I think it must mostly be used in Australia/NZ

  • @Ghjklt544
    @Ghjklt544 7 years ago +1

    I want to see the Google interpretive dance translator

  • @svsrkpraveen
    @svsrkpraveen 6 years ago

    When did Dan Reynolds start doing tech stuff?

  • @Exploreyourlife88
    @Exploreyourlife88 4 years ago

    Thanks