#计算机科学#A Collection of Variational Autoencoders (VAE) in PyTorch.
翻译 - PyTorch中的变种自动编码器(VAE)的集合。
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.
Language Quantized AutoEncoders
#计算机科学#Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
#计算机科学#Voice conversion (VC) investigation using three variants of VAE
This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE
#计算机科学#VQ-VAE/GAN implementation in pytorch-lightning
#自然语言处理#Inverse DALL-E for Optical Character Recognition
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
#计算机科学#Large-Scale Bidirectional Training for Zero-Shot Image Captioning
#计算机科学#Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"
#计算机科学#Image Generation using VQVAE and GPT Models
Vector-Quantized Generative Adversarial Networks