#Computer Science#🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Translation - Mapping variable-length sentences to fixed-length vectors using a BERT model
#IOS#Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.
#Natural Language Processing#Simple implementation of OpenAI CLIP model in PyTorch.
[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models
[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.
#Computer Science#A tool for searching local images by text description, powered by Rust + candle + CLIP
#Awesome#The most impactful papers related to contrastive pretraining for multimodal models!
#Vector Search Engine#Semantic search demo featuring UForm, USearch, UCall, and Streamlit, to visualize and retrieve from image datasets, similar to "CLIP Retrieval"
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
#Computer Science#Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!
YouTube video moment searcher by text or photo
[ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"
#Computer Science#A lightweight deep learning model with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocab...
#Vector Search Engine#Text to image search & Image Similarity Search using @Typesense
Semantic Emoji Search Plugin for FiftyOne
[NeurIPS 2023 R0-FoMo Workshop] Official Codebase for "Estimating Uncertainty in Multimodal Foundation Models using Public Internet Data"
Traverse the space of concepts with a multi-modal similarity index in FiftyOne
#Computer Science#OpenAI's CLIP neural network