EVERYONE Needs to Learn a Little Bit of AWK!

Sdílet
Vložit
  • čas přidán 2. 06. 2024
  • AWK is a brilliant scripting language for processing text files, comma separated values, and text databases. Deal 60% off - Machine Learning & Data Science Bundle: andauth.co/MachineLearningBun...
    If you find yourself creating spreadsheet to process rows of data or numbers then you MUST learn some AWK!
    ####
    Machine Learning & Data Science Bundle - andauth.co/MachineLearningBun...
    ####
    Introduction to Android app development: www.dgitacademy.com
    Let Me Explain T-shirt: teespring.com/gary-explains-l...
    Twitter: / garyexplains
    Instagram: / garyexplains
    #garyexplains

Komentáře • 1,1K

  • @nickiebanchou
    @nickiebanchou Před 4 lety +939

    "forget spreadsheets and excel"
    crowd: ooh?
    "use command line!!"
    crowd : oh...

    • @1MinuteFlipDoc
      @1MinuteFlipDoc Před 4 lety +43

      yulp! LOL Python (heavy duty jobs) > Excel > AWK.
      AWK is a legacy tool. there's a reason people don't use it. hahahah

    • @tanithrosenbaum
      @tanithrosenbaum Před 4 lety +122

      @@1MinuteFlipDoc Except for those people who do use it. There's nothing legacy about it, it's just different from excel. And while that obviously doesn't include you (and that's fine), there are lots of people who prefer to write a short script on the command line over clicking through excel dialogs for 2 hours. For conditioning data to use it in data science and number crunching, awk gives you an amazing amount of productivity much quicker than excel does. If you still prefer excel that's fine though, your choice.

    • @andiriki
      @andiriki Před 4 lety +3

      nickie banchou thanks you save me Time.

    • @JohnR31415
      @JohnR31415 Před 4 lety +40

      1MinuteFlipDoc awk is very powerful, use it pretty much every day. Can’t use excel in pipe chains...

    • @mrclaytron
      @mrclaytron Před 4 lety +36

      @@1MinuteFlipDoc no, people do use it all the time! Plus, it fits perfectly with the Unix philosophy of piping data from one small terminal program to the next. It's been around since the 1970s, but that certainly doesn't mean it's legacy - it's still an amazingly powerful tool that people still choose to use.

  • @SEX_ON_DRUGS
    @SEX_ON_DRUGS Před 4 lety +291

    Gary I will never forgive you for teaching everyone my only impressive linux skill

    • @senantiasa
      @senantiasa Před 4 lety +10

      LOL, oh come one... the FOSS world is about sharing.
      But I completely understand what you're saying. It's also my only Linux skill that no one I know have the slightest understanding, yet they call me in for it...

    • @satibel
      @satibel Před 3 lety +1

      regex-plosion!

    • @mikicerise6250
      @mikicerise6250 Před 3 lety

      Thanks, Gary. ;)

    • @ylc870
      @ylc870 Před 3 lety +5

      Well those who really want learn this still have a bunch of documentation to read... No worries

    • @zonumev
      @zonumev Před 2 lety

      And he did it so effortlessly, made it look easy ... the mark of an expert! I'm happy to know that AWK ain't a dead skill 'cause I still rely on it.

  • @captainz9
    @captainz9 Před 3 lety +14

    Awk and grep were the heart of many scripts I've written over the years.

  • @meowcula
    @meowcula Před 2 lety +7

    awk is one of my favourite tools, not because I'm any good at using it, mind, but just because of its history. It's from 1977! The basic apps of our unix/linux ecosystem have a rich history.

  • @carndt124
    @carndt124 Před 2 lety +17

    Used AWK in 70's and 80's. Had an accounting system written in AWK. Also, had an AWK to C compiler, for the real hardcore number crunchers.

  • @michaelburns8073
    @michaelburns8073 Před 4 lety +39

    If you like using Awk to do stuff like this, there are a few other cool Unix tools you should have a look at. Split, join, uniq will allow you to do DB-like things on text files. Another is tr. Of course sed, cut, grep as well. I took a grad course in which we had to create software development tools only using Unix tools. That class was both illuminating and fascinating. Learned a lot that I still use to this day.

    • @99corncob
      @99corncob Před 2 lety

      A lot of us learned to process tables of data using all these tools before spreadsheets were invented. Welcome to the club.

    • @CARPB147
      @CARPB147 Před 2 lety

      Oh yes. Think of AWK as SQL for text files. You can SELECT rows, and columns (words separated by whatever delimiter applies), and even declare "WHERE" (conditions) using regular expressions and/or boolean operators. Your input could be log files, emails, or whatever you have on text (like text stuff on chats). It could be source code of any programming language if you are interested in gathering quality metrics on them. Your imagination is the limit.

    • @yashwanth.chaudhari339
      @yashwanth.chaudhari339 Před 10 měsíci

      CAN YOU share ur knowledge?

    • @egorsozonov7425
      @egorsozonov7425 Před 9 měsíci +1

      Yes, just like you said, "db-like". This is reinventing SQL with a bunch of half-assed incoherent unorthogonal tools barely cobbled together. It's simpler to just use a real DB like Sqlite.

    • @drwombat
      @drwombat Před 7 měsíci

      As previously asked can you share your program you made? Or anything like it?

  • @jimsteele9261
    @jimsteele9261 Před 2 lety +15

    Back when I was contracting for EDS one of the sys admins handed me a book on AWK and asked me if I could figure out a way to extract billing info from the processing logs. I was hooked. :-) One of the handy features was using strings as array subscripts, and having "sparse" arrays, where only the accessed elements existed. Eventually, I had most of my production automated with AWK scripts.

  • @prettylitch1785
    @prettylitch1785 Před 2 lety +12

    00:00 Intro
    01:15 more show data in file
    01:29 wc -l show number of lines in data
    01:48 awk '{print}' prints file
    02:24 awk '{print $0}' $0 prints every line, $1 prints first field ect.
    03:10 awk '/gcc/ {print $1}' match gcc print file name
    03:41 awk '/^w/ {print $1}' Lines start with a double-u
    04:23 awk '/path/ {print $1,$2}' Lines start with path, include multiple lines
    04:41 awk '/path/ {print $1,$2/1024}' can divide output

  • @rondamon4408
    @rondamon4408 Před 4 lety +310

    AWK? I thought I was the only dinosaur in this world which still uses AWK. Glad to know that I'm not the only one.

    • @wv1138
      @wv1138 Před 4 lety +6

      That guy (or gal) that they don't let out much whips out awk like an old trucker whipping out a snatch block

    • @JohnDavidDunlap
      @JohnDavidDunlap Před 4 lety +6

      Use it every day.

    • @soiledhalo2296
      @soiledhalo2296 Před 4 lety +15

      It's been a while since I've used it. First time I ever had to use it was in the early 2000s at a call centre. Their call detail records were gigabyte size and excel was struggling with it. AWK just crunched through the numbers and spit out the results in less than 5 minutes. Think I used SED initially, but AWK was the answer.

    • @johnnyq4260
      @johnnyq4260 Před 3 lety +11

      Now if you two will mate, the dinosaurs will not go extinct.

    • @johnmcginnis5201
      @johnmcginnis5201 Před 3 lety +14

      I can have a log file parsed with awk faster than that young'n with a spreadsheet. And reformatting an address file that has quotes drives excel nuts. In awk you just manipulate the FS constant. I might be a dinosaur but I would point out that the dinosaur ruled the earth for 165m years and birds are still here.

  • @monikabalint400
    @monikabalint400 Před 3 lety +9

    I survived my PhD thanks to awk and sed!! Command line rules!

  • @RoryIsNotACabbage
    @RoryIsNotACabbage Před 2 lety +1

    Man I've watched so many awk videos and this has been the only one that has said "this is why you do it" rather than just do the thing
    I don't know how people expect you to learn when they don't explain what you're doing

  • @ripwolfe
    @ripwolfe Před 3 lety +3

    I used awk to do data analysis of a large database that output data as a .csv. The company hired an "analyst" (really, he just ran DB scripts) to put the data into Excel, format it and run a few math functions on it. It took 3 hours to complete their way. The awk way took less than 15 minutes.

    • @lacs83
      @lacs83 Před 3 lety

      Una historia de exito en el uso de AWK

    • @CARPB147
      @CARPB147 Před 2 lety

      Excel is good at many things, but doing what that analyst did was shstoopid... "A hammer thinks that every problem is a nail."

    • @d3vilscry666
      @d3vilscry666 Před 3 měsíci

      Was it accurate tho?

  • @zeke6099
    @zeke6099 Před 4 lety +8

    I've tried reading more than one awk intro and I've never made it very far. You've successfully taught me way more than all of them, thank you!

  • @AllanKobelansky
    @AllanKobelansky Před 3 lety +31

    AWK isn’t a number manipulation tool. It’s a text processing tool that can do math. And so much more. Explore GAWK, the Gnu version.
    And use the tools you know, as best as you can to get the job done. And don’t stop learning.

  • @AuctorisVideo
    @AuctorisVideo Před 4 lety +4

    Nice video Gary. In case you (or anyone else watching) didn't know - you can simplify your "rounding up the file sizes" example using printf() and a format specifier.
    For example to print the file sizes to one decimal place you could use:
    {printf("%s %.1fKb
    ",$1, $2/1024)}
    Or:
    {printf("%s %.0fKb
    ",$1, $2/1024)}
    To recreate your exmaple & round to the nearest integer…

  • @FrankyDesign
    @FrankyDesign Před 4 lety +106

    Awk is really amaz...
    Syntax error: Missing ending '}'

    • @ThePsyForce
      @ThePsyForce Před 3 lety

      % in vim will skip between matching parentheses

    • @dosmastrify
      @dosmastrify Před 2 lety

      Java too

    • @dosmastrify
      @dosmastrify Před 2 lety

      Missing bracket here
      (insert bracket)
      Extra bracket here
      (smashes keyboard)

    • @CARPB147
      @CARPB147 Před 2 lety

      That'll do it.

  • @nickskywalker2568
    @nickskywalker2568 Před 3 lety +4

    Best introduction I've ever seen! I've always been kind of reluctant to learn, but knowing inside that I should do it...
    Thanks for the video!

  • @juzujuzu4555
    @juzujuzu4555 Před 4 lety +31

    Gary, I love these introductions to Linux/unix commands/software.

  • @erectlocution
    @erectlocution Před 3 lety +1

    I really appreciate how you build up here. I've done most of this in my tinkering but this adds some context that's really helpful.

  • @aravindhreturns
    @aravindhreturns Před 3 lety +7

    WHOA! As the class went on, My eyes only widened. Thank you Gary! Much love

  • @srirachahero4679
    @srirachahero4679 Před 2 lety +3

    It's 2:00AM and I'm watching Gary Explain awk... and it was amazing!
    Thank you, Mr. Simms!

  • @IreshDissanayakaM
    @IreshDissanayakaM Před 3 lety +9

    This is the best introduction to awk I have encountered.

  • @asandax6
    @asandax6 Před 2 lety

    I've been just doing this from any programming language I was learning when I get to the read and write files section of the documentation. Nice to see it can be done directly on the command line.

  • @talbech
    @talbech Před 4 lety +2

    Thanks. Brought back some great memories of data manipulation of huge point cloud datasets on SGIs. We had to do very similar things before piping data into the OpenGL 3D engine for visualisation purposes. Awk is very flexible and fast and still have many usecases in todays system administration tasks.

  • @inwhichidie7171
    @inwhichidie7171 Před 3 lety +11

    Just want you to know you saved my ass with this video. Procrastinated on an assignment for my CS class and this really helped me understand some stuff I'd missed and get the assignment done in time. Thanks a ton!

  • @stumbling
    @stumbling Před 3 lety +4

    Used quite a bit of AWK in my 3rd year physics project. I had hundreds of experimental data files to process and it was a good choice.

  • @aseerose5684
    @aseerose5684 Před 4 lety +3

    I have used it to generate a useful database of user information from emails after concatenating the emails for processing. It was not hard to learn and ended up being a very useful multi-purpose tool in addition to its primary mission success. Thanks for an excellent video!

  • @LoneIgadzra
    @LoneIgadzra Před 2 lety +2

    For small files there are more user-friendly tools, but awk really shines when you have some huge text file that you need to massage because it is just about as fast as you can get. Really important to know about in those cases, because you could easily be lead down an unnecessarily slow or expensive "big data" path.

  • @senantiasa
    @senantiasa Před 3 lety +130

    This is the only skill in my job and my previous job where people turn to no one other than me..:*( Getting sentimental here...

    • @paschikshehu7988
      @paschikshehu7988 Před 3 lety +2

      what do you do?

    • @senantiasa
      @senantiasa Před 3 lety +14

      @@paschikshehu7988 Systems engineer, but it's programmers who come to me for this, usually because they need simple parsing or data manipulation which doesn't take a lot of effort. Then, their programs run my AWK script. These programmers know Sed (which is even simpler than AWK), but their case is usually where Sed is too simplistic and using their programming language would be overkill.

    • @Frankx520
      @Frankx520 Před 3 lety +1

      kaluq system engineer is so cool. I just found a good PID value for my motor speed control using Matlab earlier today.

    • @afonsosalbrecht
      @afonsosalbrecht Před 2 lety

      omg, if you are working as software engineer and they can't do it... change jobs. I doubt your are technically growing in this company hahahah

    • @aneeshprasobhan
      @aneeshprasobhan Před 2 lety +1

      @@Frankx520 he said systems engineer, not Control Engineer.

  • @scottkelbell5003
    @scottkelbell5003 Před 4 lety +5

    Easy to learn, too. I love awk! Thank you Gary!

  • @BryanChance
    @BryanChance Před 2 lety +2

    Awk and sed one of the most useful and powerful text manipulation and formatting tools I ever learned to use.

  • @MrMusic25
    @MrMusic25 Před 3 lety +1

    Very glad this video popped up on my feed! I've been currently working with data using sed but after watching this i think awk it much more suited for me, especially knowing I can write my own functions that run faster than Bash can! Great video, thanks for the explanation!

  • @NexuJin
    @NexuJin Před 4 lety +23

    I uses awk/sed on a daily basis at work. I uses AWK primary to analyze excel(exported to csv) or other data files for audits. That is on Windows! In both MINGW64 or WSL2 Linux.

  • @cheaterman49
    @cheaterman49 Před 3 lety +10

    Thanks Gary, that was simple and useful. I usually write small Python scripts for such data processing because I'm more fluent in it, but if it were CSVs or tab-delimited outputs (like from other shell programs) awk is just way simpler. Always wished I got some proper simple intro to it, now it's done :-) - perhaps you could make a follow-up episode or two? Thanks again!

  • @Chris-op7yt
    @Chris-op7yt Před 4 lety +2

    awk is useful as a very short/quick way to get at Nth field in a text file, especially as it treats consecutive delimiters as one. for example, fixed width input files.
    But where awk really shines is multi-levelled line delimited files, like old style config files etc., as you dont need to write loops and keep flags about which section of the input file you're in.

  • @randomsalgari
    @randomsalgari Před 2 lety +2

    I really love awk, it was one of the main tools I used for my thesis.

  • @apmcd47
    @apmcd47 Před 3 lety +2

    A long time ago when someone was telling me how wonderful Excel was, I simply said "ed, perl, tbl, troff" as in edit your data using ed (actually, I never use ed), process it with Perl (I don't know awk), and finally format it with troff using the tbl preprocessor.

  • @johng7rwf419
    @johng7rwf419 Před 4 lety +6

    Good old integer arithmetic, takes me back to when I was a lad..
    Awk would have been very useful in a former life. Thank you very interesting.

  • @damiancosta2588
    @damiancosta2588 Před 2 lety +2

    Gary, awesome job giving me the basic understanding of awk. All my little failed projects have been revived since Your walk thru of the AWK!

  • @Rovsau
    @Rovsau Před 3 lety +2

    Thanks for the info. Spreadsheets are more practical for me, and more practical for newbie experimentation, but AWK really bridges the gap between sheet logic and optimizing work. Plus, you get support from a lot more scripting capabilities. Great stuff!

  • @mahmoodabdulla5499
    @mahmoodabdulla5499 Před 3 lety +4

    This is the best awk tutorial I've seen so far.. please make a video for SED

  • @adam872
    @adam872 Před 4 lety +3

    Love me some AWK and have made plenty of use of it over the years.

  • @websciencenl7994
    @websciencenl7994 Před 4 lety

    Wow, i've just stumbled upon your channel and i'm very very impressed with your teaching skills. Absolutely brilliant. Thank you soooo much for doing this!

  • @richa9719
    @richa9719 Před 2 lety +18

    I used awk in the early 1990’s as a developer. Not sure it is something that a non developer should really use. Good demo, I wrote complex code with Awk to parse flat files.

    • @CARPB147
      @CARPB147 Před 2 lety

      That is what AWK is perfect for. Great for processing log files or other UNIX/Linux sysadmin stuff.

    • @SteveJones172pilot
      @SteveJones172pilot Před 2 lety +4

      Not sure a non-developer should be using?! What's wrong with learning to use a tool? Should a non-sysadmin never open a command prompt because they're not an expert?! How do people become developers then? Any tool like this you can learn to use is a huge plus! Let's encourage experimentation and learning!

    • @CARPB147
      @CARPB147 Před 2 lety

      @@SteveJones172pilot - Totally agree with your sentiment on this. People will either be interested (or not) in doing this stuff. That will naturally weed out people that program (programmers/developers, etc.) from people that don't. I don't see any reason to have arbitrary mandates against writing AWK commands (or scripts) for people only because they are not *professional* developers. Anyhow, the 40+ year secret is out! 😏😆

    • @CARPB147
      @CARPB147 Před 2 lety

      Oddly, my very first AWK script was my most complex! Only a page and a half, but it replaced a 40-page SQR program that tried to parse CSV files (ugh) written by a hammer who thought every problem looked like a nail.
      After that I would use it in smaller piped sequences with several AWK commands like:
      awk '{if ( FNR == 1 ){print "FILENAME=",FILENAME}; if (NF && $0 !~ /^--/){print}}' Database/*.sql \
      | awk '/^FILENAME=/{files++}; $0!~/FILENAME=/ && NF {loc++}; END{avgloc=loc/files; printf "%d Database LoC in %d files at avg lines per file = %0.f
      ", loc, files, avgloc}' > $countfile
      (sorry about the look of the run-in line).
      For more complex problems, like ETL cases, etc., I just used Perl which was a natural progression from using Shell + AWK.

  • @ibuprofenPill
    @ibuprofenPill Před 2 lety +56

    As a senior software engineer, I understand the advantages of what you’re demonstrating. However, the common person will never use this on a wide basis. But, your heart is in the right place.

    • @johnmcginnis5201
      @johnmcginnis5201 Před 2 lety +2

      That is indeed a problem. I have overcome it for many tasks for mere mortals by writing awk code for the task then front ending that with zenity. Good enough that when I need to see the division manager she makes sure I am at the top of the list in the morning. :)

    • @CARPB147
      @CARPB147 Před 2 lety +1

      @@johnmcginnis5201 - I had the same experience. I remember the first time I took a *formatted report*, ran it through AWK, and provided my astonished manager with information she needed selected and totaled in a totally different way in a matter of minutes.

    • @not_herobrine3752
      @not_herobrine3752 Před rokem

      yeah the comment section goes "just use python"

    • @syte_y
      @syte_y Před 7 měsíci

      When i was doing a lot of data processing on csv files this would’ve been useful. don’t think it applies to a lot of people like you said though

  • @yjk_ch
    @yjk_ch Před 3 lety +4

    I remember using Awk for extracting a column from a command result(using something like {print $1}, but I didn’t know that it could do much more than that.

  • @huistelefoon5375
    @huistelefoon5375 Před 2 lety +1

    AWK has been used to create a full parser/tokenizer, and other purposes that are arguably way more complex for its intended use. Using AWK you can also make advantage of pattern matching with regular expressions, and AWK has many other tools for text manipulation. But I think one of the most powerful aspects of AWK is using it as a complimentary Unix tool. Use it together with other Unix command-line utilities with pipelining, not everything has to be done in AWK. For example, you can use AWK to parse out formatted words from a complex text file, and now pipe this data to be processed by another utility.

  • @MadMetsFan
    @MadMetsFan Před 4 lety +2

    I learned a lot of AWK about 20 years ago - very useful

  • @SpeccyMan
    @SpeccyMan Před 4 lety +46

    Just thought I'd explain, that 'up arrow' is a caret or circumflex.

    • @peppigue
      @peppigue Před 3 lety +3

      That's why ppl call it up arrow

    • @hecko-yes
      @hecko-yes Před 3 lety +3

      @@peppigue you wouldn't call v a down arrow even though it's used that way sometimes
      < > can be less/greater than symbols or angle brackets or left/right arrows, but in a programming context you'd probably use the former
      ...except when it's a "shift left" operation in which case it'd make sense to call them arrows
      hm
      maybe left v and right v??

    • @randycreasi
      @randycreasi Před 3 lety +2

      Sometimes people call it "hat" referring to the hat operator in mathematics.

    • @gorgolyt
      @gorgolyt Před 3 lety +6

      Weird 'flex but okay.

    • @jwadaow
      @jwadaow Před 3 lety +2

      @@gorgolyt very circumspect

  • @shikhanshu
    @shikhanshu Před 4 lety +16

    I use awk at my job and I am always in awe of it. This video is a great little intro and the rounding logic was pretty neat too! Thanks Gary!

  • @ORagnar
    @ORagnar Před 3 lety +2

    He has the smoothest advertising transitions I've come across. Great job! Great content, too!

  • @jameschums
    @jameschums Před 2 lety +1

    I have been using AWK for 20 years, it rocks!

  • @basilburgess8415
    @basilburgess8415 Před 3 lety +7

    I love awk for text formatting and, arguably informal reporting. Also admin scripts. Honestly, though, you can all this and more with PERL, which I recommend.

    • @xrafter
      @xrafter Před 2 lety

      Isn't perl a prolang ?
      Prolang = PROgramming LANGuage

    • @CARPB147
      @CARPB147 Před 2 lety

      Yes! (upvoted you for being spot on)

  • @hereb4theend
    @hereb4theend Před 4 lety +16

    AWK, Sed, Bash, TCL, GREP, Perl and Nvim are my command line friends. 😍

    • @josephdoyle5304
      @josephdoyle5304 Před 3 lety +4

      Actually, grep was taken out of the line editor ed. The command in ed is
      g/RE/p ( globally search for a regular expression and print the line). Hence "grep RE filename"
      nawk has more capabilties.
      BTW "awk" are the initials for Aho, Wineberger, and Kerinigan, the developers who created awk.

    • @hereb4theend
      @hereb4theend Před 3 lety

      @@josephdoyle5304 you're a prince amongst men. 😊

    • @thaddeusolczyk5909
      @thaddeusolczyk5909 Před 3 lety

      Philistine emacs!

    • @zyan983
      @zyan983 Před 3 lety

      @@thaddeusolczyk5909 Emacs is nice, but I don't see a reason to use it, so I stick with nvim.

    • @AbhishekBM
      @AbhishekBM Před 3 lety

      @@thaddeusolczyk5909 the command line version of Emacs is terrible.

  • @traal
    @traal Před 2 lety +1

    GNU awk was the first scripting language I learned really well, and I wrote most of my early Bourne shell scripts as basically wrappers around huge chunks of awk code. Then I graduated to Perl, which is absolutely unmatched if you love regexes (I do!), and nowadays I write everything in Python if it's too much for a simple bash script. 😊
    I still use awk and Perl daily for oneliners when I do data wrangling. The awk syntax is super comfortable for the things that it is good at. 👍🏻

  • @SteveJones172pilot
    @SteveJones172pilot Před 2 lety +2

    Great video! I have always used grep to search strings in linux and never bothered to figure out what awk did.. This was a great introduction - Just what I need so that next time I have a use case I will remember this and figure out how to do it in awk!

  • @spaceiswater6539
    @spaceiswater6539 Před 4 lety +5

    Please can you make some more AWK videos Gary?
    I'm learning AWK at the moment spent a few days on it, its hard to learn but the rewards in knowing how to use it is worth the reward. This is a great video to get people into using it and seeing the power of it.

    • @johnmcginnis5201
      @johnmcginnis5201 Před 3 lety +1

      1) Learn a middling amount of 'C', K&R please, none of that C++/# crud.
      2) Have a good understanding of regular expressions.
      3) Realize that each line is processed in the order received by the program statements after BEGIN and before END. Process order can be important.

    • @xrafter
      @xrafter Před 3 lety

      @@johnmcginnis5201
      Why we need to put the {} between print .
      What does it mean?

  • @villesyrjala3354
    @villesyrjala3354 Před 4 lety +8

    I've used awk to extract useful information from pdf documents. The problem was that the information was awkwardly (pun intended) split into several tables throughout the document so I had to first process each table to collect up all the pieces for each element I wanted to output. The solution I came up with was pdttotext + awk to do the processing. The few hours I spent on that awk script has paid off nicely since I've had to reprocess new versions of the same document several times over the years.
    The alternatives would have been:
    a) Manually copy paste all the information. One thing I've learned over the years is you *never* trust anything copy pasted by a human (least of all myself)! Also would have been extremely tedious (which adds to the chance to making a mistake), and I would've had to repeat it whenever a new version of the document came out.
    b) Find some pdf library for my favorite programming language to extract more structured data from the document. Couldn't quickly find anything that worked and I didn't want to start debugging pdf libraries.

    • @PraveenKumar-xq7xb
      @PraveenKumar-xq7xb Před 4 lety

      Hi I am interested in pdf extraction. Can you kind of give some clue codes to me to explore further.

    • @mychromebook9935
      @mychromebook9935 Před 3 lety +1

      Another solution is to use a command line pdf editing tool called, pdfTK. You can read out pdf files from command line and even fill in pdf forms with it.

    • @stevebabiak6997
      @stevebabiak6997 Před 3 lety

      IIRC, Perl has some modules (think libraries) for handling PDF files, and Excel files too.

    • @CARPB147
      @CARPB147 Před 2 lety

      Cool.

  • @JR-pk1fr
    @JR-pk1fr Před 3 lety +1

    I love it! The best awk demo on CZcams. More awk vids in the future???

  • @Ureallydontknow
    @Ureallydontknow Před 4 lety

    high quality video. perfect audio. the pace was perfect. the explanation and examples were perfect.

  • @nickharrison3748
    @nickharrison3748 Před 3 lety +3

    awk -F "/" {print ...} . for specifying the different seperator other than default whitespace

  • @1MarkKeller
    @1MarkKeller Před 4 lety +31

    *GARY!!!*
    *Good Morning Professor!*
    *Good Morning Fellow Classmates!*

    • @GaryExplains
      @GaryExplains  Před 4 lety +10

      MARK!!!

    • @jamess1787
      @jamess1787 Před 4 lety +2

      Mark, sit back down and turn to page 33 in the 2020 edition of GE

  • @SimonJM
    @SimonJM Před 2 lety +1

    My ex-colleagues used to hate me writing awk scripts! Brilliant little language. One happy use was to take the output from informix commands to detail table descriptions and create an output shell script to recreate the database for disaster recovery purposes.

    • @CARPB147
      @CARPB147 Před 2 lety

      Perfect application for AWK. Nice.

  • @Mildimage
    @Mildimage Před 3 lety +1

    Awesome video man!. This was my first contact with awk command. Very very good explanation!

  • @robbo916
    @robbo916 Před 4 lety +3

    I won't go into the specifics but AWK holds a special place in my heart. I know that might sound a bit weird but it's true. Even though I've only ever had to use it in anger twice it was well worth learning just for them.

    • @philipcoltharp918
      @philipcoltharp918 Před 2 lety +1

      Thanks, Im glad I clicked in. I never would have searched out this otherwise. I do use SED, GREP, and GVIM. The next time I have the opportunity, I'll have to try to apply these lessons.

  • @adityathapan2544
    @adityathapan2544 Před 4 lety +6

    Your way of explaining topice is very easy... Please make more videos on linux

  • @merlin-ju6fu
    @merlin-ju6fu Před 2 lety +1

    This is the first I have heard of AWK. I am number crunching sports recreational sports handicapper, so AWK might be useful to me.

  • @dragonsage6909
    @dragonsage6909 Před 2 lety +2

    Great tutorial, subbed.. thank you!

  • @oguzaranay
    @oguzaranay Před 4 lety +5

    My first time hearing and knowing this language!

    • @CARPB147
      @CARPB147 Před 2 lety +1

      Oh, oh. The secret is coming out!

  • @ricsanders69
    @ricsanders69 Před 4 lety +7

    Reminds me of my days on the Sparc 2!! :-D Those were the days. *sigh*

  • @greyman1104
    @greyman1104 Před 4 lety +1

    Haven't understood awk before. Thanks, Gary!

  • @olimpiudehelean5885
    @olimpiudehelean5885 Před rokem +1

    This has to be the best clarification i've ever seen . Thanks a lot !

  • @watcherquek263
    @watcherquek263 Před 4 lety +8

    perl >> (sed, awk ). You can do all of sed and awk in perl (and there are even conversion scripts for it, called a2p and s2p) but not the other way.

    • @jrd33
      @jrd33 Před 4 lety

      Sure. And an 18-wheeler can carry more than a pickup. But a lot of people find a pickup works just fine for day-to-day tasks.

    • @CARPB147
      @CARPB147 Před 2 lety

      @@jrd33 - Certainly. There are tools more suited for certain jobs than others. I think it is good to have variety of choice.

  • @harrikoivula3932
    @harrikoivula3932 Před 3 lety +13

    I used to use GREP, AWK and SED in the 80's while porting a CAD program from on operating system to other. But nowdays I tend to use PERL and many times with excel. You can do many things with excel, but complex data manipulation tasks are much easier with perl. One of the best concepts in data manipulation with PERL and AWK are associative arrays.

  • @mfolarin
    @mfolarin Před 3 lety

    Gary! I'm excited to find your channel. Pleasing. Subscribed!

  • @12Q46HPRN
    @12Q46HPRN Před 3 lety +1

    Excellent! You explained awk very well!

  • @jzmmm
    @jzmmm Před 4 lety +148

    In the early 90s: hey everyone, learn guis!!
    Today: hey let's go back to the command line!!!

    • @Hassan8Ola
      @Hassan8Ola Před 4 lety +3

      GUIs are good for repeated, error-prone tasks. If you find yourself doing a task over and over again in which the task never changes, then build a GUI for it. But probably, that it is not likely, since the task can always improve and change. If you can isolate something so well that it can get its own GUI, then go for it. Nowadays, that is not easy to do.

    • @walker9893
      @walker9893 Před 4 lety +46

      @@Hassan8Ola this seems like the criteria for scripted automation....

    • @xijinpingpong4426
      @xijinpingpong4426 Před 4 lety +11

      GUI is nice if you need to see some visualized information or for entertainment. Terminal is nice for fast programs that have a specific task and work together with other programs.
      People who refuse one of them (GUI or Terminal) limit them self.

    • @diederick76
      @diederick76 Před 3 lety +7

      I think the general idea in the 90s was that for GUIs you didn't have to learn anything.

    • @MsHojat
      @MsHojat Před 3 lety +3

      There's nothing stopping a program that is GUI that has all the functionality of a command line program, or even having a command line entry area inside of it. The problem is more with the fact that most or practically all GUI programs don't do this for some stupid reason!

  • @khatharrmalkavian3306
    @khatharrmalkavian3306 Před 3 lety +3

    Kay, maybe if - for god knows what reason - I'm writing a super complex bash script.
    Even then, probably not. I generally just sub in a proper scripting language for that.

  • @xtnctr
    @xtnctr Před 3 lety +3

    Outstanding explaining skills!

  • @prabhujeeva2228
    @prabhujeeva2228 Před 4 lety +2

    Thanks Garry!!. It's really interesting

  • @lactobacillusprime
    @lactobacillusprime Před 4 lety +53

    Databases! I was using SQL and databases / Dbase for a long time. Can't stand using spreadsheets as databases.

    • @makoto_0
      @makoto_0 Před 4 lety +9

      Spreadsheet is just for office work, can't do anything for big data (It blows up at the moment you open the file). Database is the real deal.

    • @1MinuteFlipDoc
      @1MinuteFlipDoc Před 4 lety +16

      the right tool for the right job!
      Nobody ever: I hate having to commute from LA to NY on a skateboard!

    • @jamess1787
      @jamess1787 Před 4 lety

      AMEN

    • @kencheng2929
      @kencheng2929 Před 4 lety +4

      Hi, I’m not a db admin but my feeling is spreadsheets are easier to use and they’re right in front of you. Databases need some kind of ui or they use the cli (inserts, selects). Please confirm/correct.

    • @jamess1787
      @jamess1787 Před 4 lety +3

      @@kencheng2929 hi Ken, you have a valid point. If you have a small set of data points to keep track of: then spreadsheets make sense. When you start to get into the 1000's+ then it's time to start looking into a database solution.
      Spreadsheets should be more for temporary data that has no long-term value. Like forecasting or basic customer metrics. =)

  • @trueriver1950
    @trueriver1950 Před 4 lety +3

    awk one liners are great for ad hoc queries and I use it for that, but as soon as you go to scripting surely perl is the way to go?
    Or if you don't already know perl, then maybe Python which is more friendly for beginners?

    • @jrd33
      @jrd33 Před 4 lety +1

      Awk is great for awk-shaped problems (basically, report generation on files of simply-formatted ASCII data). If you have a different-shaped problem, don't use awk.

  • @dmays67
    @dmays67 Před měsícem

    Currently assessing how to extract useful data from multiple differently formatted fuel receipts here. Found your lovely little primer video very helpful - thanks!

  • @shaileshsharma1173
    @shaileshsharma1173 Před 4 lety +1

    Thanks Gary for explaining awk in cmd

  • @jinchoung
    @jinchoung Před 4 lety +4

    subscribed. dammit, I'm not a coder (much of) but you made that understandable even while talking blisteringly fast! kudos.

    • @CARPB147
      @CARPB147 Před 2 lety

      You can slow playback of CZcams videos when clicking on the cogwheel icon and selecting your playback speed.

  • @hinkhall5291
    @hinkhall5291 Před 2 lety +7

    Most young devs don’t even know about commands that aren’t cd, ls, mv, cp or docker.

    • @xrafter
      @xrafter Před 2 lety

      pacman -S firefox .
      Help this line above doesn't work .
      The error is weird 😞

    • @not_herobrine3752
      @not_herobrine3752 Před rokem

      they have heard about the command line?

  • @mikeCavalle
    @mikeCavalle Před 2 lety +1

    lol - awk in the late '70's -- life saver then and now.

  • @rish1459
    @rish1459 Před 3 lety +2

    Regex, SED, & AWK are awesome tools!

  • @marcoandreknottjung3418
    @marcoandreknottjung3418 Před 4 lety +34

    Just a question from a newbie:
    what can I do with those informations?
    I come to your channel, via ColdFusion and the graphene battery!
    Thanks in advance

    • @fuseteam
      @fuseteam Před 4 lety

      anything you want really ;)

    • @cfbasz
      @cfbasz Před 4 lety +11

      You can manipulate information from files and extract what you want in the way you need. It is just pure formated text being manipulated. No spreadsheets needed. Cheers!

    • @hextz
      @hextz Před 4 lety +10

      you can redirect the formated output of the awk script to another file, for example: I wanted to create a test file like the one used in the video but didn't know how to do it using only ls, so I used a "ls -la /usr/bin > ls-output.txt", then used awk to select only the fields in the order I wanted with "awk '{print $9,$5} ls-output.txt > ls-awk-output.txt". It's very handy to manipulate formated text files like csv, config files, logs, program outputs, whatever you can imagine...

    • @berndeckenfels
      @berndeckenfels Před 4 lety +3

      See 14:47 for example

    • @claudehebert3131
      @claudehebert3131 Před 4 lety +7

      One example: Extract data from a not very user-friendly system, in a tab-delimited format. Convert it into SQL commands (using loads of "printf"). Run the generated SQL code to load the data in a database. AWK can be the glue between otherwise incompatible systems.

  • @piiumlkj6497
    @piiumlkj6497 Před 4 lety +119

    Let's become terminal ninjas ;) , AWK is very used in linux commands

    • @jpdj2715
      @jpdj2715 Před 3 lety +1

      What do you expect from something that was developed in the terminal age? First stable release 1985. Developed by Aho, Weinberger and Kernighan (yes the Kernighan who wrote "The C Programming Language" with colleague Ritchie 1st ed. 1978). Or, when Linus scribbled his Ux, awk was already age old.

    • @uhmasyt
      @uhmasyt Před 3 lety +2

      awk is slow, too.
      I remember just moving a script from awk to Perl I could reduce processing time by 70% and ended up with something like 30 minutes on a well equipped Unix machine.

    • @piiumlkj6497
      @piiumlkj6497 Před 3 lety +1

      Why perl when you've got python which has a much better syntax ? The best tool is the one available to you and the one you feel most comfortable using .

    • @lxathu
      @lxathu Před 3 lety

      @@uhmasyt Really complex matches that perl can do in one step with its advanced regexps but one must has to write subroutines for can lead to perl's victory but most of the cases the burden that comes with that draws perl back.
      Awk if FAST with the task it was designed for.

    • @CARPB147
      @CARPB147 Před 2 lety

      @@lxathu - "but one must has to write subroutines for can lead to perl's victory".
      Not always. We can write Perl one-liners about as well as we can write AWK one-liners (that is, an entire script in a single command, without script files, or subroutines). It depends on the problem being solved, of course. Perl is pre-compiled and will run faster than AWK in most (all?) cases, and certainly scales-up more gracefully to solve more complex problems.

  • @kahlschlag17
    @kahlschlag17 Před 2 lety

    Awesome tutorial. Thanks.

  • @aminebouaita9202
    @aminebouaita9202 Před 3 lety

    Thank you for the crystal clear demo !

  • @faaaszoooom6778
    @faaaszoooom6778 Před 3 lety +7

    I use AWK a lot. And while I'm a C++ dev, I'd still recomment Python as a replacement for excel sheets for quick calculations.
    AWK has severe limitations, which makes it a bit harder to use for anything more complex than basic arithmetics (or string manipulations, but even that is a bit difficult sometimes.)

    • @huistelefoon5375
      @huistelefoon5375 Před 2 lety

      AWK has been used to create a full parser/tokenizer, and other purposes that are arguably way more complex for its intended use. Using AWK you can also make advantage of pattern matching with regular expressions, and AWK has many other tools for text manipulation. But I think one of the most powerful aspects of AWK is using it as a complimentary Unix tool. Use it together with other Unix command-line utilities with pipelining, not everything has to be done in AWK. For example, you can use AWK to parse out formatted words from a complex text file, and now pipe this data to be processed by another utility.

  • @lawrencedoliveiro9104
    @lawrencedoliveiro9104 Před 4 lety +87

    Every time I hear the word “Awk”, I reach for my Perl.

    • @jamess1787
      @jamess1787 Před 4 lety +23

      Oh no. Not nondescernible perl.
      You're that guy from stack overflow? Always posting a solution in perl that no one understands. 😂

    • @lawrencedoliveiro9104
      @lawrencedoliveiro9104 Před 4 lety +14

      @@jamess1787 I don’t claim to be a Perl expert. But I know more Perl than I do Awk, and I do know that Perl can do everything Awk can do, just as concisely, and more.

    • @TerenceKearns
      @TerenceKearns Před 4 lety +25

      I used to work in the IT deptartment at a university. We soon realised that the entire university IT infrastructure was glued together with perl scripts all over the place. They were written by one dude over the decause. He ended up being the department head. His name was bob. We just called them .bob scripts. He retired during my tenure there. To this day, there will still be .bob scripts in the uni it matrix holding certain things together.

    • @lawrencedoliveiro9104
      @lawrencedoliveiro9104 Před 4 lety +9

      @@TerenceKearns That was an absolutely ubiquitous use of Perl. It made it so much faster to throw together quick utility programs, rather than having to write C/C++ or even Java code. And the ever-growing collection of add-on modules collected at CPAN let you interface to network stacks, graphics APIs, DBMSes-just about anything you need to handle the low-level stuff that was not built into Perl itself.
      That was the model that Python followed-and took to the next level.

    • @jinchoung
      @jinchoung Před 4 lety +17

      such a perl clutcher

  • @MrDaskon
    @MrDaskon Před 2 lety +1

    Thanks Garry! Really interesting video.

  • @pierrec1590
    @pierrec1590 Před 2 lety +1

    My favorite tool for making sense out of long log files.

  • @skyler4517
    @skyler4517 Před 3 lety +13

    Idk, the idea of learning awk has been rattling around in the back of my head for a while, I just don't feel like it's worth the overhead when I could do all this just as easily in Python.

    • @lxathu
      @lxathu Před 3 lety +1

      Of course you can, but when it comes to tricky interval pattern matching that "just as easily" can fade away quickly.

    • @MathieuDuponchelle
      @MathieuDuponchelle Před 3 lety +3

      @@lxathu import re; ?

    • @gorgolyt
      @gorgolyt Před 3 lety

      @@lxathu I don't think you're aware that `pandas` is a thing, and that it has regular expression functionality.

    • @user-ze7tl2dw4i
      @user-ze7tl2dw4i Před 3 lety +1

      That's cool and pandas is great, but it doesn't beat efficient command line scripting. That's one of the areas I think perl is actually preferable to python

    • @lxathu
      @lxathu Před 3 lety

      @@MathieuDuponchelle I don't mean interval of characters but interval of consecutive records.
      RE is nice, but RE without any ifs and mandatory indents in expressions that match ranges of records because the first one matches the first RE (or any expression) and the second one matches the second is nicer.
      Python can do anything. (G)awk can't but what it can, it can with beautifully short but still understandable codes.

  • @MH-in8hz
    @MH-in8hz Před 3 lety +3

    *Meanwhile*: *Dying in remorse for all the time I've wasted on learning how to use batch files syntax for Windows*
    what makes it even sadder is that I've always wanted to make use of what I've learned from Java especially when it comes to file management, bash scripts look a lot similar to Java, didn't expect Linux os to be this awesome, I've got bored from all the propaganda for Linux os but, now I understand. I'm woken at last 😂
    btw, you did a brilliant job on the rounding function, so satisfying.🤩

    • @0x007A
      @0x007A Před 2 lety

      Sullying bash by comparisons to Java is heresy. Wash out your mouth with a bar of soap.

  • @marklyttle5449
    @marklyttle5449 Před 2 lety

    I used to work for a Telecom's company and at one point was involved in integrating our CMIP stack (millions of lines of C/C++ code) with network management solutions from vendors like IBM and HP etc. I remember the DEC solution to get their OSI network event logs into our stack was based around 5000 lines of AWK code. AWK is awesome.

  • @mywebuniversity
    @mywebuniversity Před rokem +1

    Great quick demo of awk power.