fzysingularity
- Karma: 105
- Created: 12 years ago
About
computer-vision, ml and all things meta.
Recent Submissions
- 1. DeepSeek OCR 2: Visual Causal Flow (huggingface.co)
- 2. Unified Vision-Language Agents – Detect, Segment, OCR, Generate and More (github.com)
- 3. VLM Showdown: GPT vs. Gemini vs. Claude vs. Orion (chat.vlm.run)
- 4. Show HN: Chat with Orion – a visual agent that sees, reasons and acts (chat.vlm.run)
- 5. ChatGPT uses YOLOv8 to detect UI elements (twitter.com)
- 6. Build visual AI workflows from a prompt – OCR, detection, editing and more (colab.research.google.com)