"As I turned on the wacky waving inflatable tube man, I knew I'd have a friend for life." —One weird and happy customer View ...
Therefore, in this work, we propose to build class prototypes from text descriptions instead of limited visual instances by leveraging a classical pretrained VLM named CLIP. Concretely, we generate ...