Word File Processing in Python

  • Added 5 Sep 2024
  • Today we learn how to create, edit and parse Word (docx) files in Python.
    ◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾
    📚 Programming Books & Merch 📚
    🐍 The Python Bible Book: www.neuralnine...
    💻 The Algorithm Bible Book: www.neuralnine...
    👕 Programming Merch: www.neuralnine...
    🌐 Social Media & Contact 🌐
    📱 Website: www.neuralnine...
    📷 Instagram: / neuralnine
    🐦 Twitter: / neuralnine
    🤵 LinkedIn: / neuralnine
    📁 GitHub: github.com/Neu...
    🎙 Discord: / discord
    🎵 Outro Music From: www.bensound.com/

Comments • 66

  • @Saint_Oscar 2 years ago +4

    I've literally been looking for something like this for my job. 🙏 Thank you!

  • @tomasgonzalez4819 11 months ago +2

    I will certainly read the docs. Good intro. However, clearly you don't use Word documents all that much, since the ONE thing you didn't touch on was find and replace! Either way, I appreciate bringing this library to our attention.

    • @jeedub6142 11 months ago +1

      The documentation is horrendous… I’ve been tearing my hair out with it all day

    • @tomasgonzalez4819 11 months ago

      @jeedub6142 I know! 😂🤦‍♂️ It appears the only way to find and replace is to loop through each run with a regex. In other words, it has nothing to do with the library itself.
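
The run-by-run approach mentioned in this thread could look roughly like the sketch below. It is a minimal illustration, not a built-in python-docx feature: `replace_in_runs` and `replace_in_docx` are hypothetical helper names, and the helper only assumes objects that expose `.runs` and `.text`, which is why it can be tried on stand-in objects without a Word file.

```python
import re
from types import SimpleNamespace


def replace_in_runs(paragraphs, pattern, replacement):
    """Apply a regex substitution to every run; return the number of replacements."""
    regex = re.compile(pattern)
    total = 0
    for paragraph in paragraphs:
        for run in paragraph.runs:
            new_text, hits = regex.subn(replacement, run.text)
            if hits:
                # Only the text changes; the run object (and its formatting) stays.
                run.text = new_text
                total += hits
    return total


def replace_in_docx(src_path, dst_path, pattern, replacement):
    """Hypothetical wrapper: load a .docx, replace in body paragraphs, save a copy."""
    from docx import Document  # python-docx, imported lazily so the helper above has no dependency

    document = Document(src_path)
    hits = replace_in_runs(document.paragraphs, pattern, replacement)
    document.save(dst_path)
    return hits


# Stand-in objects that mimic the .runs / .text attributes of python-docx paragraphs
demo = [SimpleNamespace(runs=[SimpleNamespace(text="Dear {Client},"),
                              SimpleNamespace(text=" see {TermTitle}.")])]
count = replace_in_runs(demo, r"\{Client\}", "ACME Corp")
print(count, demo[0].runs[0].text)
```

Two caveats apply to this sketch: a placeholder that Word has split across two runs will not match, and `document.paragraphs` covers only body paragraphs, so text inside tables, headers, and footers would need its own loop.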

  • @TheFlorin1998 2 years ago +1

    So this is LaTeX, but for Word documents. Kinda neat!

  • @asianpsalmshymns 6 months ago

    This is the real mission man... Keep it up❤

  • @erosserver8142 10 months ago +1

    Very good tutorial, but I have a question: if we use placeholders such as {Client} or [Name], how can we do this? Financial contracts are much more complex. Example:
    {TermTitle}
    {Senior} {Currency} {AggPrinAmount} {Collared}{Type} Notes due {MaturityDate format “yyyy”}

  • @EdwardTilley 9 months ago

    Great introduction to this topic. I wanted to automatically edit an existing CV and your tutorial covered most of this. Do you have another that just edits and updates an existing doc based on text found in the document?

  • @ThePianist51 1 year ago +2

    Wonderful, mate! You helped me with my legal analysis! 🎉

  • @iaconst4.0 6 months ago

    I'm saying this in Spanish, just as I understood you perfectly in English: good video!! Thanks for sharing your knowledge!

  • @divyarathnakar4245 7 months ago

    Great video, thank you! Is there a way to update the header text in an existing Word document? Can you please share your ideas on it?

  • @frankstranathan2352 1 year ago +1

    Hey I am trying to figure out how to split a word document by paragraphs. I can't figure out how to handle paragraphs that split pages. For example the paragraph starts on page one and ends on page two. No matter how I try to do it it always splits a paragraph that spans pages into two paragraphs.

  • @embracingchange2191 1 year ago +1

    Thank you so much for this video!!! This has been a massive help. Thanks again.

  • @shreyas_._ 2 years ago +1

    A little exaggerated up to "we gonna use an external library called ...", but whatever... Your content is "PAKKA" 🔥, and so is the selection of subject.

  • @tanuj05 7 months ago

    Hi!
    Can you please make a video on merging multiple Word documents into one combined document, with a table of contents for the merged documents? TIA

  • @oussmayo 1 year ago +1

    How can I extract text from the footnotes of a Word document? Do you have a solution for that? I've been looking everywhere on the internet but can't seem to find a working solution. Thanks in advance!

  • @elbadrey 1 year ago

    I enjoyed the tutorial. Please tell us about finding and replacing text.

  • @trruthawareness 1 month ago +1

    brilliant.

  • @gustavojuantorena 2 years ago +2

    Great! Super useful for automation

    • @JLSXMK8 2 years ago

      This would especially be AWESOME for custom template generation. @NeuralNine I wonder if there would be something similar to this for PowerPoint slides. If there is, can you also show that to us?

  • @gregphillipsmusic 1 year ago

    Great video, thanks! The reason I am poking around in this topic is to find out how to *modify* text in a Word document. In particular, I have an interview with two people, and I want all of the interviewer's text to be in italics. Also, I don't have a Python environment; I assume this is not too hard to set up. Do you have videos on this? Thanks

  • @eliotharreau7627 2 years ago +1

    Nice work. How can I write to a plain-text (.txt) document, like in Notepad? Thanks, bro.

  • @innocent8484 1 year ago

    Thanks, but I have a problem: my content starts with tags. How can I transfer them without losing style properties?

  • @kuniverse2200 1 year ago

    for anyone wondering: second part starts at 10:11

  • @ryanhamilton3609 1 year ago

    Incredibly useful, thank you very very much !!!!!

  • @dr.fritza959 2 years ago

    Would love to see more codewar videos.

  • @JLSXMK8 2 years ago

    Intro song for channel identified - £g0 - "Keep Talkin'" (2020); I gotta warn you all though, the lyrics are extremely explicit!!!
    But seriously, if there's something similar to the library shown here for use with Microsoft Powerpoint, could you please show that to us? Would be super awesome!

  • @wtarenta 6 months ago

    Why is IntelliSense in my VS Code not working? I can't see the tips for functions...

  • @CemYilmaz-zh4ft 4 months ago

    Is it possible to get the text out of a shape?

  • @bobvance9519 5 months ago

    How can I create a hyperlink in the document?

  • @abzalomkyeuwkyeuw 4 months ago

    How do I set the column width?

  • @iqragul136 1 year ago

    The file shows an error when run from VS Code. Is there an extension that should be installed in VS Code?

  • @epicplot_a_z 8 months ago

    Can we count the number of pages in a DOCX file from a Python script?

  • @ellyeroms-qy1ym 8 months ago

    There's a new release of python-docx. Could you create a video on that, please?

  • @Hex-Scholar 1 year ago

    Hey, how can we insert one document file into another document file using Python?

  • @Zombitopia 2 years ago +1

    I downloaded python from the main site but I can't find the terminal, what did I do wrong and should I reinstall it? Thank you.

    • @hartoflearning 2 years ago +1

      If you are on Windows, the terminal is the Command Prompt.
      Type cmd into the search box on Windows 10 to open it.
      This is where you can run commands like pip install.

    • @Zombitopia 2 years ago +1

      @hartoflearning So the terminal is CMD? Thank you, I thought it was a different program.

    • @hartoflearning 2 years ago +1

      @Zombitopia Yes. However, in a code editor like Visual Studio Code, you can usually do much the same thing in a window called the terminal.
      Also, if you are just getting started, are using Windows, and decide to use something like Visual Studio Code or PyCharm, right-click and choose "Run As Administrator."

  • @lubnalukman105 7 months ago

    Can anyone help me with an assignment? The question is: create a Django project and prepare a Word document describing the project and app files. Please, can anyone help?

  • @zubairshaikh8444 1 year ago

    Can we keep the Word file open while the code runs, to see if everything is happening correctly?

  • @dvrchipro 1 year ago

    Hi! Can we automate with this workflow?

  • @sbksdg7019 2 years ago

    Thank you!

  • @petehinch3871 2 years ago

    Thanks

  • @teofilomayol 1 year ago

    Great video!

  • @Victor_Marius 2 years ago

    Isn't there a run.text property?

  • @SpartaniXGames 11 months ago

    Why are add_heading, add_paragraph, add_run, etc. not highlighted in my VS Code editor?

  • @ammarfasih3866 1 year ago

    How can I extract the hyperlinks from the document? Any suggestions?

    • @trruthawareness 1 month ago

      Although I'm quite the beginner, I could suggest that for each portion you are going through, you match against a regex for a hyperlink. Something close to this one: pattern = r'(https?://)'. Then you can maybe do something like match = re.search(pattern, cell) followed by if match: and so on.
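
The regex idea from this reply could be sketched as follows. This is only an illustration under stated assumptions: `find_urls` and `urls_in_docx` are hypothetical helper names, and the pattern is widened from the reply's `r'(https?://)'` so that it captures the whole URL rather than just the scheme.

```python
import re

# Match http:// or https:// followed by everything up to whitespace, a quote, or an angle bracket.
URL_PATTERN = re.compile(r"https?://[^\s<>\"']+")


def find_urls(texts):
    """Return every URL that appears literally in the given strings, in order."""
    urls = []
    for text in texts:
        urls.extend(URL_PATTERN.findall(text))
    return urls


def urls_in_docx(path):
    """Hypothetical wrapper: scan the body paragraphs of a .docx file."""
    from docx import Document  # python-docx, imported lazily

    document = Document(path)
    return find_urls(paragraph.text for paragraph in document.paragraphs)


sample = ["Docs: https://python-docx.readthedocs.io and more", "no link here"]
found = find_urls(sample)
print(found)
```

This only finds URLs that are visible in the text. A link displayed as, say, "click here" keeps its target outside the paragraph text, so a text search will not see it, and depending on the python-docx version, hyperlink text may not be included in `paragraph.text` at all.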

  • @LegendsGaming620 5 months ago

    Bro, I am not able to make a new file in Python.

  • @lawrencedoliveiro9104

    I use odfpy, which lets you read and write documents conforming to ISO 26300. That’s a *proper* document standard.

  • @philtoa334 2 years ago

    Thx . : )

  • @davelag7838 1 year ago

    Thanks for the video, this is incredibly useful.

  • @youknowit1001 10 months ago

    How can I split a Word document page by page using Python? If anyone has code for this, please let me know by replying here.
    Thanks

  • @harryneuhoff9115 1 year ago

    I got error messages. It didn’t recognize pip.

    • @StefanoVerugi 1 year ago

      When installing Python, make sure you tick the box about pip. You also need to have Python in your PATH; see the environment variables. Hope it helps.

  • @user-lt6uu5ud9u 1 year ago

    8:41

  • @kapibara2440 6 months ago

    That was a cool video, thank you!

  • @404errorpagenotfound.6

    Err, why not just use word??

    • @NeuralNine 2 years ago +6

      Let's say you have 200 Word files that have a similar structure, and you want to extract the data from all their tables, for example to create a dataframe from them. Want to do it manually?

    • @404errorpagenotfound.6 2 years ago

      @NeuralNine Point taken. Also, thanks for your work.

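  • @innocent8484 1 year ago

    How can I export the Word file? My function is below:

    from bs4 import BeautifulSoup
    from django.shortcuts import redirect
    from docx import Document
    # Duyuru is the app's own model; its import is omitted because the module path is not given.

    def export_file(request):
        if request.method == "POST":
            document = Document()
            duyurular = Duyuru.objects.all()
            for duyuru in duyurular:
                print(duyuru.name)
                document.add_heading(duyuru.name, 1)
                # Strip the HTML tags from the description before adding it
                tagsiz_paragraf = BeautifulSoup(duyuru.aciklama, "lxml").text
                paragraph = document.add_paragraph(tagsiz_paragraf)
                paragraph.alignment = 3  # 0 for left, 1 for center, 2 right, 3 justify
            # Save once, after all content has been added
            document.save("icerikler.docx")
        return redirect('list_duyuru')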
