GIT: A Generative Image-to-text Transformer for Vision and Language
Generative Adversarial Text to Image Synthesis
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
MMagic (Multimodal Advanced, Generative, and Intelligent Creation) is an open-source AIGC toolbox for professional AI researchers and machine learning engineers to process, edit, and generate images and videos
PyTorch implementation of the Generative Adversarial Text-to-Image Synthesis paper
TextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.
NeMo: a scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in PyTorch
Sparse Additive Generative Model of Text
Auto-regressive flow-based generative network for text to speech synthesis
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Code for "Deep Recurrent Generative Decoder for Abstractive Text Summarization"
Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text (EMNLP 2018)
Stable Diffusion is a text-to-image diffusion model
The Generative AI Landscape - A Collection of Awesome Generative AI Applications
Generative Models by Stability AI