vqvae · GitHub Topics

fishaudio / fish-speech

SOTA Open Source TTS

llama transformer tts valle vits vqgan vqvae

Python 20.62 k

14 hours ago

AntixK / PyTorch-VAE

#Computer Science# A Collection of Variational Autoencoders (VAE) in PyTorch.

PyTorch pytorch-implementation vae vae-implementation deep-learning reproducible-research paper-implementations pytorch-vae variational-autoencoders architecture vqvae

Python 7.06 k

22 days ago

v-iashin / SpecVQGAN

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

transformer vqvae Generative Adversarial Network PyTorch audio-generation melgan multi-modal video-understanding evaluation-metrics audio Video

Jupyter Notebook 360

9 months ago

FoundationVision / OmniTokenizer

[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

auto-regressive-model image-generation tokenization vae video-generation vqvae

Python 288

9 months ago

k2kobayashi / crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

speech-synthesis voice-conversion vqvae adversarial-learning vocoder

Python 171

9 months ago

Vermeille / Torchelie

Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.

PyTorch utils perceptual loss Generative Adversarial Network vqvae torch

Python 110

4 months ago

haoliuhl / language-quantized-autoencoders

Language Quantized AutoEncoders

bert large-language-models multimodal roberta vqvae

Python 103

2 years ago

mahmoodlab / SISH

#Computer Science# Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering

pathology image-retrieval image-search-engine histopathology friendly interactive shell deep-learning vqvae

Python 101

2 years ago

ZhengdiYu / SignAvatars

(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark

human-pose-estimation motion-generation smplx vqvae eccv2024

Python 92

2 months ago

hqyyqh888 / RobustSemanComm

Demo of robust semantic communication against semantic noise

mask vqvae

Python 76

1 year ago

zbr17 / OptVQ

Towards training VQ-VAE models robustly!

optimal-transport vq-vae vqgan vqvae

Python 66

3 months ago

vsimkus / vae-voice-conversion

#Computer Science# Voice conversion (VC) investigation using three variants of VAE

vae voice-conversion machine-learning vqvae

Python 57

5 years ago

explainingai-code / VQVAE-Pytorch

This repo implements VQVAE on MNIST as well as on a colored version of MNIST images. It also implements a simple LSTM for generating sample numbers using the encoder outputs of the trained VQVAE.

PyTorch vq-vae vqvae

Python 54

1 year ago

SerezD / vqvae-vqgan-pytorch-lightning

#Computer Science# VQ-VAE/GAN implementation in pytorch-lightning

deep-learning PyTorch pytorch-lightning vqgan vqvae

Python 44

5 months ago

affjljoo3581 / Inverse-DALL-E-for-Optical-Character-Recognition

#Natural Language Processing# Inverse DALL-E for Optical Character Recognition

dalle natural-language-processing gpt2 huggingface image-captioning image-generation image-to-text multimodal OCR optical-character-recognition PyTorch text-to-image transformers vqvae

Python 38

2 years ago

amzn / sparse-vqvae

Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper

vqvae

Python 34

1 year ago

MIMICLab / BITTERS

#Computer Science# Large-Scale Bidirectional Training for Zero-Shot Image Captioning

deep-learning image-captioning PyTorch pytorch-lightning vqvae bitters transformer

Python 21

2 years ago

BhanuPrakashPebbeti / Image-Generation-Using-VQVAE

#Computer Science# Image Generation using VQVAE and GPT Models

deep-learning vqvae gpt image-generation artificial-intelligence

Jupyter Notebook 15

1 month ago

jaywalnut310 / Vector-Quantized-Autoencoders

#Computer Science# Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"

vqvae vae autoencoder transformer Tensorflow deep-learning

Python 14

6 years ago

sayedmohamedscu / VQGAN

Vector-Quantized Generative Adversarial Networks

codebook encoder-decoder-model Generative Adversarial Network PyTorch vq-vae vqgan vqvae

Python 9

1 year ago