Show HN: AI Getting Started – A template helps you start a AI SaaS with ease

20 points by timqian 3 years ago · 6 comments

Reader

Given that there's plenty of options for every point in the README.md, one thing missing is how to guarantee that your stack does not miss requests from paying customers, metering usage & avoid ballooning server costs. I see a lot of YC startups trying to solve this Lago, Paigo etc.

I'm trying to evaluate best serverless solutions for inference without compromising on client usage & reducing idle time on GPU boxes. So far its down to base10, HF, Banana, I'll end up pooling them all & then sending requests between them. For dedicated training boxes Lambda, Modal, Oblivus, Runpod are the contenders.

ilaksh 3 years ago

If it's a small startup I don't think you can beat Replicate as far as costs.
To track usage you need a credits system. Basically you have to read a number from a file, subtract the cost for the operation, and if it's less than zero throw an exception. It does take a little bit of development, but I don't think you need a whole other startup to handle that. You can do the core part of it for one type of operation/cost in less than a day. Maybe a few days to debug something.

MildlySerious 3 years ago

Is there a reason not to use Supabase Auth if you use Supabase anyway?

timqianOP 3 years ago

We did not use supabase pg-vector at first (using pinecone)
- MildlySerious 3 years ago
  
  That makes sense. Thanks.

ano88888 3 years ago

are they trying to sell products of their porforlio companies?

Settings

Show HN: AI Getting Started – A template helps you start a AI SaaS with ease

Keyboard Shortcuts