#计算机科学#A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
翻译 - 一个简单的命令行工具,用于图片生成的文本,使用Openai的剪辑和Biggan
#大语言模型#The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, ...
#大语言模型#Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
A Comparative Framework for Multimodal Recommender Systems
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era
#计算机科学#Automated modeling and machine learning framework FEDOT
#大语言模型#✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models
#大语言模型#GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
#计算机科学#A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
#大语言模型#🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
#计算机科学#A knowledge base construction engine for richly formatted data
#自然语言处理#This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
#计算机科学#Sequence-to-Sequence Framework in PyTorch
#计算机科学#Towards Generalist Biomedical AI
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
#计算机科学#DANCE: a deep learning library and benchmark platform for single-cell analysis