#计算机科学#Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.
Awesome Protein Representation Learning
📖🔬☕ BioJava is an open-source project dedicated to providing a Java library for processing biological data.
Generation of protein sequences and evolutionary alignments via discrete diffusion models
#计算机科学#Get protein embeddings from protein sequences
A Python API for the RCSB Protein Data Bank (PDB)
#计算机科学#Implementation of ProteinBERT in Pytorch
Neural Networks for Protein Sequence Alignment
#计算机科学#Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology. (DEPRECATED)
FAPLM: A Drop-in Efficient Pytorch Implementation of Protein Language Models
Open-source usearch
Inference of couplings in proteins and RNAs from sequence variation
#计算机科学#Multi-task and masked language model-based protein sequence embedding models.
A collection of tasks to probe the effectiveness of protein sequence representations in modeling aspects of protein design
Lbster: Language models for Biological Sequence Transformation and Evolutionary Representation
Variational autoencoder for protein sequences - add metal binding sites and generate sequences for novel topologies
An R package to calculate indices and theoretical physicochemical properties of peptides and protein sequences.
#计算机科学#Affinity Protein-Protein Transformers—State of the art protein-protein binding affinity in seconds!
Protein 3D structure prediction pipeline