You are now following this Submission
- You will see updates in your followed content feed
- You may receive emails, depending on your communication preferences
The CLIP network uses contrastive learning to encode image and textual data into a shared feature space for joint classification. Images and text with high similarity will be close in this feature space, and have a high CLIP score. This further enables image search from input text, and text search from an input image.
MATLAB Release Compatibility
- Compatible with R2026a
Platform Compatibility
- Windows
- macOS (Apple Silicon)
- macOS (Intel)
- Linux
