ConSurf-DB | Evolutionary conservation profiles of proteins

67103 homologues were collected from the UNIPROT database using HMMER.
Of these, 12429 homologues passed the thresholds (min/max similarity, coverage, etc), 6017 of them are CD-HIT unique.
The calculations were conducted on 300 hits (query included), sampled from the unique hits. Click here if you wish to view the list of sequences which produced significant alignments, but were not chosen as hits.
Average pairwise distance : 1.24
Lower bound : 0.05
Upper bound : 2.23
Residue variety per position in the MSA (The table is best viewed with an editor that respects Comma-Separated Values)
View MSA and phylogenetic tree using WASABI
Download Phylogenetic Tree (Newick format)
The best evolutionary model was selected to be: WAG. See details here

Homologues were collected from UNIREF90 database, a clustered version of the UniProt database.
Homologues search algorithm is HMMER.
HMMER E-value cutoff is 0.0001.
Number of HMMER Iterations is 1.

CD-HIT cutoff is 95% (This is the maximal sequence identity between homologues).
Maximal number of final homologues is 300. These are sampled from the list of unique homologues.
Maximal overlap between homologues is 10% (If overlap between two homologues exceeds 10%, the highest scoring homologue is chosen).
Coverage is 60% (This is the minimal percentage of the query sequence covered by the homologue).

THE CONSURF DATABASE