Return to Article Details A text-guided vision model for enhanced recognition of small instances Download Download PDF