Python API

The functionality of bio_embeddings is split into five modules:

bio_embeddings.embed

Language models to translate amino acid sequences into vector representations
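To make "vector representations" concrete, the sketch below shows the two shapes such representations usually take: one vector per residue, and a single fixed-size vector per protein obtained by averaging over residues. It is a toy illustration only. The one-hot encoding stands in for a real language model, and the names `toy_embed` and `mean_pool` are hypothetical, not part of the bio_embeddings API.

```python
from statistics import fmean

# The 20 standard amino acids, one letter each.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def toy_embed(sequence):
    """Return one vector per residue (one-hot); a stand-in for a language model."""
    vectors = []
    for residue in sequence:
        vector = [0.0] * len(AMINO_ACIDS)
        # str.index raises ValueError for letters outside the 20 standard residues.
        vector[AMINO_ACIDS.index(residue)] = 1.0
        vectors.append(vector)
    return vectors


def mean_pool(per_residue):
    """Average over residues to get one fixed-size vector for the whole protein."""
    return [fmean(column) for column in zip(*per_residue)]


per_residue = toy_embed("SEQVENCE")   # 8 residues, 20 values each
per_protein = mean_pool(per_residue)  # 20 values, independent of sequence length
```

A real embedder produces dense, learned vectors with far more dimensions, but the per-residue versus per-protein distinction is the same.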

bio_embeddings.extract

Methods for predicting properties of proteins at both the per-residue and the per-protein level, including supervised (pre-trained) and unsupervised (nearest neighbour search) methods
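The unsupervised approach mentioned above can be sketched as annotation transfer: find the reference protein whose embedding is closest to the query and reuse its annotation. This is a minimal sketch under stated assumptions, not the library's implementation. The function `nearest_neighbour`, the three-dimensional embeddings, and the annotations are all made up for illustration, and Euclidean distance is assumed as the metric.

```python
import math


def nearest_neighbour(query, references):
    """Return (identifier, distance) of the reference embedding closest to query."""
    best_id, best_distance = None, math.inf
    for identifier, embedding in references.items():
        distance = math.dist(query, embedding)  # Euclidean distance (assumed metric)
        if distance < best_distance:
            best_id, best_distance = identifier, distance
    return best_id, best_distance


# Hypothetical per-protein embeddings and annotations for two reference proteins.
reference_embeddings = {
    "protein_a": [0.0, 1.0, 0.0],
    "protein_b": [1.0, 0.0, 0.0],
}
reference_annotations = {"protein_a": "membrane", "protein_b": "nucleus"}

query_embedding = [0.9, 0.1, 0.0]
hit, distance = nearest_neighbour(query_embedding, reference_embeddings)
predicted = reference_annotations[hit]  # annotation transferred from the closest hit
```

The returned distance is worth keeping alongside the prediction: a large distance to the closest reference suggests the transferred annotation is less reliable.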

bio_embeddings.project

Visualize high-dimensional data with t-SNE or UMAP projections, or project BERT embeddings with Tucker

bio_embeddings.utilities

Various helpers

bio_embeddings.visualize

Display t-SNE or UMAP projections from the project stage