Shortrocity EP3: Overlaying Transcript on the Video

Sdílet
Vložit
  • čas přidán 27. 07. 2024
  • In this video I add a transcript overlay to my Shortrocity AI CZcams short generator.
    GitHub: github.com/unconv/shortrocity
    Support: buymeacoffee.com/unconv
    Consultations: www.buymeacoffee.com/unconv/e...
    Memberships: www.buymeacoffee.com/unconv/m...
    00:00 Intro & system message update
    08:29 Plan for the video
    09:30 Figuring out how to draw text with CV2
    12:49 Styling and positioning the text
    18:48 Drawing text based on timings
    23:09 Drawing the narration on the video
    29:32 Adding narration audio to video
    31:30 Syncing transcript with narration
    39:08 Putting it all together into a single script
  • Věda a technologie

Komentáře • 26

  • @amandamate9117
    @amandamate9117 Před 6 měsíci +3

    This video establishes a strong foundation, offering ideas or techniques that others can further develop into something truly unique.

  • @nicklansbury3166
    @nicklansbury3166 Před 6 měsíci

    Liked and Subbed. This is a very interesting project.

  • @spicer41282
    @spicer41282 Před 6 měsíci

    Destroy??!
    Heck NO!
    a DIAMOND! 💎 to be polished more is what it is.
    I saw you got bored with it? about 3/4 of the way.
    We're learning a Lot!
    Please complete the polish!
    Make this Diamond 💎 glisten!!

  • @HealthyHive-oo6vc
    @HealthyHive-oo6vc Před 6 měsíci

    Very Insightful and interesting as usual. Love it! Just wondering if Blender's 3D models and rigs can be incorporated into this workflow. It would be intriguing to see how they can be manipulated within the context of this video creation process.

  • @luiscarlosrico2304
    @luiscarlosrico2304 Před 6 měsíci +1

    Use a text to video model, or with those images use a image to video model, use youtube api so it auto uploads the shorts, and use web scraping for finding articles, it could post about 1,000 shorts per day, with that much content it is more probable to get a popular one by luck, your lacking vision, keep improving this project

  • @Alf-Dee
    @Alf-Dee Před 6 měsíci +1

    What about using some stock footage API instead of generating images?
    I haven't tried it myself, but it should be more cost-effective since the cost should be per month and not per token.
    This way, you can ask GPT API to generate the search terms for the correct stock footage category and select a random stock video using their API.
    (I guess this is more or less what InVideo AI does, and I wanted to build a tool like that myself)

    • @unconv
      @unconv  Před 6 měsíci +1

      Very good idea!

    • @Alf-Dee
      @Alf-Dee Před 6 měsíci +2

      ​@@unconv please, please, please make a Part 4 with that!
      I've already subscribed to your channel because of this series ;)

  • @PDragonLabs
    @PDragonLabs Před 4 měsíci

    👍

  • @Semiotica_Tumbada
    @Semiotica_Tumbada Před 6 měsíci +1

    as a video editor, this makes me calm, that was a very boring output :d as a programmer enthusiast, that was interesting

    • @spicer41282
      @spicer41282 Před 6 měsíci

      I'm feelin'..Ya!
      Hope he keeps it up though.
      Cause it's just a matter of time... Until someone else gets it right!
      So keep saving your Bread and Cookies!

  • @AmerikaMeraklisi-yr2xe
    @AmerikaMeraklisi-yr2xe Před 6 měsíci

    Can we see stable diffusion image generate in this project ? It could be amazing

  • @AmerikaMeraklisi-yr2xe
    @AmerikaMeraklisi-yr2xe Před 6 měsíci

    Hello, My images are not resized, I generated 512x512 dal e-2 images becauce of free :(

  • @wobble_cat
    @wobble_cat Před 6 měsíci +1

    you opened a pandora box 😆

  • @shorts_faceless
    @shorts_faceless Před 6 měsíci

    on average, how much does it cost to generate a video like those that you tried?

    • @truehighs7845
      @truehighs7845 Před 6 měsíci

      couple of dollars, so many images become quite expensive.

    • @unconv
      @unconv  Před 6 měsíci +1

      Around $0.61 USD for a 6-image 57 second short + ElevenLabs cost ($5/ month)

    • @truehighs7845
      @truehighs7845 Před 6 měsíci +1

      @@unconv Yes you are right, I made longer movies of 3 minutes, and TTS1 is not that expensive, nor is GPT3.5, it's the Dall-e that quite dear.
      I have a system with an Nvidia, I want to try to run it from Stable Diffusion locally, I don't know if I can use something like LiteLLm or ollma to use the openai protocol locally.
      With solar and SD locally you would have only the TTS to pay for, unless there is an opensource TTS model.
      But potentially, with the current latest MOE models you could potentially run it for free.

  • @ea02ca6f
    @ea02ca6f Před 6 měsíci

    I really wish you used any language other than Python. It's just low-class.

    • @unconv
      @unconv  Před 6 měsíci +1

      Alright. You choose the next language

    • @ea02ca6f
      @ea02ca6f Před 6 měsíci

      i will wait for it and subbed now @@unconv

    • @ea02ca6f
      @ea02ca6f Před 6 měsíci +1

      C# OR JS/TS @@unconv