LLaVA-Plus: Large Language and Vision Assistants That Learn to Use Skills

llava-vl.github.io

1 point by readyplayeremma 2 years ago · 1 comment

readyplayeremma (OP) 2 years ago

LLaVA-Plus maintains a skill repository that contains a wide range of vision and vision-language pre-trained models (tools), and is able to activate relevant tools, given users’ multimodal inputs, to compose their execution results on the fly to fulfill many real-world tasks.
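
To make the "skill repository" idea concrete, here is a rough sketch of the pattern the summary describes: register tools, pick the relevant ones for a request, run them, and combine the outputs. Everything in it is hypothetical and is not the LLaVA-Plus code or API: the names `SkillRepository`, `select_tools`, `compose`, and `answer` are made up for illustration, the tools are stubs that return strings, and the keyword matching merely stands in for the model's learned decision about which tools to activate.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical tool signature: (image path, text prompt) -> text result.
Tool = Callable[[str, str], str]


@dataclass
class SkillRepository:
    """Toy registry of named tools (an assumption, not the real repository)."""
    tools: Dict[str, Tool] = field(default_factory=dict)

    def register(self, name: str, tool: Tool) -> None:
        self.tools[name] = tool

    def run(self, names: List[str], image: str, prompt: str) -> Dict[str, str]:
        # Execute each requested tool and collect its output by name.
        return {name: self.tools[name](image, prompt) for name in names}


# Keyword routing is a stand-in for the model choosing tools itself.
KEYWORDS: Dict[str, List[str]] = {
    "detect": ["find", "where", "detect"],
    "caption": ["describe", "caption"],
}


def select_tools(prompt: str) -> List[str]:
    text = prompt.lower()
    return [name for name, words in KEYWORDS.items()
            if any(word in text for word in words)]


def compose(results: Dict[str, str]) -> str:
    # Merge tool outputs into one reply; a real system would feed them
    # back to the language model instead of joining strings.
    return "; ".join(f"{name}: {out}" for name, out in results.items())


def answer(repo: SkillRepository, image: str, prompt: str) -> str:
    return compose(repo.run(select_tools(prompt), image, prompt))


def build_demo_repo() -> SkillRepository:
    repo = SkillRepository()
    # Stub tools; real ones would call pre-trained vision models.
    repo.register("detect", lambda image, prompt: f"stub boxes for {image}")
    repo.register("caption", lambda image, prompt: f"stub caption for {image}")
    return repo


if __name__ == "__main__":
    print(answer(build_demo_repo(), "cat.png", "Describe the scene and find the cat"))
```

The point of the sketch is only the shape of the loop (select, execute, compose); in the system described above the selection step is learned rather than hard-coded.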
