Skip to content

Latest commit

 

History

History
36 lines (30 loc) · 1.53 KB

README.md

File metadata and controls

36 lines (30 loc) · 1.53 KB

Simple Ai Video Generator (PoC)

^^ The amazing Dalle-2 API /s

Output Example:

A demo output video generated by this project can be found here:

How to run:

  • Install ffmpeg: https://ffmpeg.org/download.html (or, you can probably install it as a package: brew install ffmpeg, apt install ffmpeg, etc.)
  • Install Pip Environemnt: pipenv install && pipenv shell
  • Install Gemfile: gem install bundler && bundle install
  • Set OpenAI API Key: export OPENAI_API_KEY=<KEY>
  • Set Elevenlabs API Key: export ELEVEN_API_KEY=<KEY>
  • In ./src/main.py (bottom): set the channel name, the desired topic, voice name, and the destination dir.
  • Run ./src/main.py

PoC development:

  • GPT-turbo to generate video script and captions
  • Generate images with Dalle2 (or something much better if possible)
  • Elevenlabs api to generate narration
  • Get length of mp3 speech
  • Calculate time of each slide (equal times for now)
  • Generate output mp4 video
  • Background music
    • TBD

Known issues:

  • QuickTime Player on MacOS plays the audio incorrectly after a few seconds of playback. There is no problem with the video and it's audio, it's an issue with this specific player. The video can be uploaded without issues to YouTube and played with other players.

Contribution:

Feel free to fork, suggest ideas, report issues, and give general constructive feedback.

^(;,;)^