Qwen3-VL - Vision Language Model

Powered by Qwen3-VL-235B-A22B-Instruct on ZeroGPU.

Capabilities:

  • Image understanding and VQA
  • Video analysis and description
  • OCR and text extraction
  • Multi-frame temporal reasoning

API Endpoints for EagleEye:

  • POST /call/api_analyze_image - Single image analysis
  • POST /call/api_analyze_video - Video analysis
  • POST /call/api_analyze_frames - Multi-frame analysis