fzysingularity
- Karma
- 103
- Created
- 11 years ago
About
computer-vision, ml and all things meta.Recent Submissions
- 1. ▲ Unified Vision-Language Agents – Detect, Segment, OCR, Generate and More (github.com)
- 2. ▲ VLM Showdown: GPT vs. Gemini vs. Claude vs. Orion (chat.vlm.run)
- 3. ▲ Show HN: Chat with Orion – a visual agent that sees, reasons and acts (chat.vlm.run)
- 4. ▲ ChatGPT uses YOLOv8 to detect UI elements (twitter.com)
- 5. ▲ Build visual AI workflows from a prompt – OCR, detection, editing and more (colab.research.google.com)
- 6. ▲ How we solved multi-modal tool-calling in MCP agents – VLM Run MCP (docs.vlm.run)