Computer Vision Toolbox Model for Grounding DINO Object Detection

Grounding DINO is a zero-shot pre-trained Vision Language Model (VLM) that enables open vocabulary, text-prompted object detection.

MathWorks Computer Vision Toolbox Team

61 Downloads

(0)

17 Jun 2026

Download

Grounding DINO enables zero-shot object detection from textual inputs, without requiring dedicated class training on the input term. It can therefore detect objects outside of its training set. It combines a Transformer-based DINO object detector with grounded pre-training.

MATLAB Release Compatibility

Compatible with R2026a to R2026b

Platform Compatibility

Windows
macOS (Apple Silicon)
macOS (Intel)
Linux

Computer Vision Toolbox Model for Grounding DINO Object Detection

Tags

Requires

MATLAB Release Compatibility

Platform Compatibility