fzysingularity

Karma
103
Created
11 years ago

About

computer-vision, ml and all things meta.

Recent Submissions

  1. Unified Vision-Language Agents – Detect, Segment, OCR, Generate and More (github.com)
  2. VLM Showdown: GPT vs. Gemini vs. Claude vs. Orion (chat.vlm.run)
  3. Show HN: Chat with Orion – a visual agent that sees, reasons and acts (chat.vlm.run)
  4. ChatGPT uses YOLOv8 to detect UI elements (twitter.com)
  5. Build visual AI workflows from a prompt – OCR, detection, editing and more (colab.research.google.com)
  6. How we solved multi-modal tool-calling in MCP agents – VLM Run MCP (docs.vlm.run)
