The repository for the code of the UltraFastBERT paper
#计算机科学#Deploy BERT for Sentiment Analysis as REST API using FastAPI, Transformers by Hugging Face and PyTorch
#计算机科学#Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
#自然语言处理#a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
#自然语言处理#Fast and memory-efficient library for WordPiece tokenization as it is used by BERT.
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
BERT-related papers
A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.
#自然语言处理#Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
#自然语言处理#[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
#自然语言处理#Google AI 2018 BERT pytorch implementation
Transformer related optimization, including BERT, GPT
Bert-classification and bert-dssm implementation with keras.
bert-base-chinese example