Settings

Theme

Show HN: New Audiobook Generator for Nvidia Using Chatterbox TTS

github.com

4 points by beboplifa 5 months ago · 3 comments · 1 min read

Reader

I am an audiobook addict that coded this https://github.com/cpttripzz/Chatterblez. I am using it all the time and it works nice. I have only bothered to get it working on windows but it should be cross-platform as it uses pyqt, I would be happy for contributors to help get it working on macos and linux and also ATI and other video cards.

If you are stuck without a video card I recommend using https://github.com/cpttripzz/audiblez it can generate an audiobook in around 4 hours with a decent CPU

drewbitt 5 months ago

With Chatterbox this finally feels almost possible. I find that I am sensitive to pacing issues which it often has. Kokoro was just alright. I'm using a tool I hacked together that runs Minimax Speech-02-HD which is still a whole other level, IMO, but not that cheap. Inworld-TTS-1-max is cheaper - I'm trialing it these days. async.ai seems promising too.

Thanks for the tool! I'm also quite interested in this space.

BinaryIgor 5 months ago

Interesting; I was thinking about creating something like that a few years ago - since I love listening to information a lot while doing some chores/walking - but back then, all available text-to-speech converters were unbearably robotic.

How much time does it take to convert a book/doc into audio using your approach? Also, as I understood it all runs locally, so you don't need to pay for any API access/usage?

  • beboplifaOP 5 months ago

    on an nvidia rtx 2060 mobile about half a day for a medium sized novel. Chatterbox TTS is really emotive, sometimes too much so.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection