Settings

Theme

Show HN: Voice to Text, but User Friendly

sona.wtf

3 points by floriankiem a year ago · 2 comments · 2 min read

Reader

We started having issues getting our notes done after having conversations with many other builders in 2023 Therefore, my friend and I started building Sona Insight, an AI-powered transcription app (iOS).

I know every "indie-hacker" is doing something similar now (which I want to speak about later in this post), but our app has some features (e.g. creation of summary templates, own AI backend built on top of OpenAI's whisper, and a unique architecture) that make it unique. Everything is auto-saved to your account and synced to the cloud (so when we release our web app you can see and edit everything on your desktop).

As we built this for ourselves the goal never was to make a gigantic thing out of it as we worked on it in the evenings. But the app gets used successfully and we have a lot of subscribers compared to the marketing effort we invested till now.

What I want to leave for discussion here, with the context given, is why on earth are that many people building something so similar? When we started posting about it on X, we only saw apps from 2018 that had another approach, however, in October and November of this year it felt like every second post was about an app built similarly. Looking into it they always seemed to only do one API call to OpenAI or Deepgram directly, but some of them didn't even work right.

nils-e13 a year ago

what makes your backend architecture unique compared to just calling an API?

  • floriankiemOP a year ago

    speaker recognition, faster responses and we'll be able to make our subscriptions much cheaper starting next month

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection