Show HN: Run Llama 3.1 8B in the browser
app.wiz.chatOP here, You'd need a GPU and the latest version of chrome to run this.
Worked on my M1 Macbook! Is there a cost on your side to hosting this type of program that you can share? Or is the computation done on the user's side?
Computation's done entirely on the client side.