Show HN: Combining LLMs and Voice Models – Part 1

nitric.io

4 points by tholm a year ago · 0 comments · 1 min read

This is a guide that I wrote to showcase a new batch inference feature for an OSS framework that I author (nitric.io).

I know tools such as NotebookLM's podcast generation and NotebookLlama already exist, but I wanted to demonstrate a case where an API could be built and then orchestrated in the cloud.

This is just the first part, which produces audio using suno/bark via an API. I'm currently working on a Part 2 that will introduce an LLM to generate scripts from a short prompt, which will then be piped to the code introduced in Part 1.

I'm looking for feedback on improving this. There are a few things I'd like to clean up, but overall I'm pretty happy with the outputs it produces so far.

Thanks in advance for any feedback given.
