Build a Claude Alternative in 20 Minutes
Running DAX, OpenWork, and GPT OSS 20B on GCP
Here are a few steps to set up and run a Claude alternative using open-source software that runs in your own cloud. Everything stays fully under your control, with no external API or data sharing required.
DAX provides automated GPU instance provisioning in your cloud, so you do not have to deal with complexity such as NVIDIA drivers, stalled downloads of large models, Docker setup, and many other things. This step requires about 5 minutes of setup. Make sure GPU quota is enabled in your GCP project.
2. Download GPT OSS 20B and vLLM
Start by caching the Docker images and the model, around 100 GB in total.
Step 1: Cache the vLLM Docker image
Step 2: Cache GPT OSS 20B from Hugging Face
You can then reach the model either by tunneling from your host into the GCP VM or by using a VPN.
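As a minimal sketch of the tunneling option, the snippet below builds a `gcloud compute ssh` command that forwards a local port to the same port on the VM. The VM name `gptoss-vm` and zone `us-central1-a` are hypothetical placeholders, not values from this guide. Port 8000 is assumed because it matches the `baseURL` used in the OpenWork config.

```shell
# Hypothetical placeholders: replace with your own VM name and zone.
VM_NAME="${VM_NAME:-gptoss-vm}"
ZONE="${ZONE:-us-central1-a}"
# Assumed port: matches http://localhost:8000/v1 in the OpenWork config.
PORT="${PORT:-8000}"

# Print the tunnel command. Arguments after "--" are passed to ssh:
# -N runs no remote command, -L forwards local PORT to PORT on the VM.
tunnel_cmd() {
  printf 'gcloud compute ssh %s --zone %s -- -N -L %s:localhost:%s\n' \
    "$VM_NAME" "$ZONE" "$PORT" "$PORT"
}

tunnel_cmd
```

Copy the printed command into a separate terminal and leave it running while you use OpenWork. The tunnel closes when that process exits.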
3. Install OpenWork on Your Laptop
Connect OpenWork to your GPT OSS 20B VM instance by following their tutorial. On Linux, you need to edit ~/.config/opencode/opencode.json, for example with vim ~/.config/opencode/opencode.json.
Make sure the model, port, and URL match your VM instance.
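One way to check the URL, port, and model name before opening OpenWork is to query the model list of the OpenAI-compatible endpoint. This is a sketch that assumes the tunnel or VPN is already up and that the server listens at `http://localhost:8000/v1`, as in the config. The `BASE_URL` variable is introduced here for illustration only.

```shell
# Assumed endpoint: same value as "baseURL" in opencode.json.
BASE_URL="${BASE_URL:-http://localhost:8000/v1}"

# GET /models lists the models the server is serving.
# -s hides progress output, -f makes HTTP errors return a non-zero status,
# and --max-time avoids hanging when the tunnel is down.
if curl -sf --max-time 5 "$BASE_URL/models"; then
  echo
  echo "Endpoint reachable at $BASE_URL"
else
  echo "Endpoint not reachable at $BASE_URL" >&2
fi
```

The `id` values in the JSON response are the model names the server accepts in requests, so compare them with the model entry in your config.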
Or, learn more about Dagploy
dax run download_docker vllm/vllm-openai:nightly,ghcr.io/open-webui/open-webui:main --images vllm-lib --image-size 100
dax run download_hf openai/gpt-oss-20b --image-size 50
dax run create_vm_inference --stack-name gptoss --config-json '{"images":["models--openai
{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "my-api": {
      "npm": "@ai-sdk/openai-compatible",
      "name": "GPT OSS 20B",
      "options": {
        "baseURL": "http://localhost:8000/v1"
      },
      "models": {
        "model-name": {
          "name": "model"
        }
      }
    }
  }
}