Qwen

Qwen3-VL-8B-Instruct

A vision-language model built for deep visual understanding, GUI agent interaction, spatial reasoning, and long video comprehension with native 256K context.

Available on FriendliAI

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Container

Run this model inference with full control and performance in your environment.

Learn more

Model Details

Model Provider

Qwen

Model Tree

Base

this model

Input Modalities

Text

Image

Output Modalities

Text

Supported Functionality

Dedicated Endpoints

Container

Qwen3-VL-8B-Instruct

Explore FriendliAI today