MSA Prediction Results
Summary
View basic information related to this submission
Request ID: 2cc3a5af-14be-42af-b5e3-4be85158c5be
Predictor Type: MSA Tools (DynaMine, DisoMine, EarlyFolding, AgMata)
Status: FINISHED
Sequences: 28
Avg Length: 291
Min Length: 268
Max Length: 295
Alignment Length: 295
Created At: 2026-01-08 13:25:32
Last Updated: 2026-01-08 13:27:07
Input File:
>sp|P07949|RET_HUMAN Proto-oncogene tyrosine-protein kinase receptor Ret OS=Homo sapiens OX=9606 GN=RET PE=1 SV=3
LVLGKTLGEGEFGKVVKATAFHLKGRAGYTTVAVKMLKENASPSELRDLLSEFNVLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISQGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSQGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKRRDYL
>tr|A0A0D9REZ9|A0A0D9REZ9_CHLSB Proto-oncogene tyrosine-protein kinase receptor Ret OS=Chlorocebus sabaeus OX=60711 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQDAPGPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2K5MKU6|A0A2K5MKU6_CERAT Proto-oncogene tyrosine-protein kinase receptor Ret OS=Cercocebus atys OX=9531 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2K5V8L5|A0A2K5V8L5_MACFA Proto-oncogene tyrosine-protein kinase receptor Ret OS=Macaca fascicularis OX=9541 GN=RET PE=3 SV=2
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2K6CCS4|A0A2K6CCS4_MACNE Proto-oncogene tyrosine-protein kinase receptor Ret OS=Macaca nemestrina OX=9545 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2R9B0Q5|A0A2R9B0Q5_PANPA Proto-oncogene tyrosine-protein kinase receptor Ret OS=Pan paniscus OX=9597 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|F6S299|F6S299_MACMU Proto-oncogene tyrosine-protein kinase receptor Ret OS=Macaca mulatta OX=9544 GN=RET PE=3 SV=3
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|G1S142|G1S142_NOMLE Proto-oncogene tyrosine-protein kinase receptor Ret OS=Nomascus leucogenys OX=61853 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDDP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|G3QLZ3|G3QLZ3_GORGO Proto-oncogene tyrosine-protein kinase receptor Ret OS=Gorilla gorilla gorilla OX=9595 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|H2NA70|H2NA70_PONAB Proto-oncogene tyrosine-protein kinase receptor Ret OS=Pongo abelii OX=9601 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|H2Q1T9|H2Q1T9_PANTR Proto-oncogene tyrosine-protein kinase receptor Ret OS=Pan troglodytes OX=9598 GN=RET PE=3 SV=2
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2I3LXM4|A0A2I3LXM4_PAPAN Proto-oncogene tyrosine-protein kinase receptor Ret OS=Papio anubis OX=9555 GN=RET PE=3 SV=2
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2I3TR47|A0A2I3TR47_PANTR Proto-oncogene tyrosine-protein kinase receptor Ret OS=Pan troglodytes OX=9598 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A7N9CS34|A0A7N9CS34_MACFA Proto-oncogene tyrosine-protein kinase receptor Ret OS=Macaca fascicularis OX=9541 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A8C9HBV6|A0A8C9HBV6_9PRIM Proto-oncogene tyrosine-protein kinase receptor Ret OS=Piliocolobus tephrosceles OX=591936 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A8D2EQF8|A0A8D2EQF8_THEGE Proto-oncogene tyrosine-protein kinase receptor Ret OS=Theropithecus gelada OX=9565 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2I2ZX13|A0A2I2ZX13_GORGO Proto-oncogene tyrosine-protein kinase receptor Ret OS=Gorilla gorilla gorilla OX=9595 GN=RET PE=3 SV=1
-SL-----------------------SDRLQPALCCISENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2I3H8C4|A0A2I3H8C4_NOMLE Proto-oncogene tyrosine-protein kinase receptor Ret OS=Nomascus leucogenys OX=61853 GN=RET PE=3 SV=1
-------------------------EKDRLQPALCCISENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDDP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2J8IX92|A0A2J8IX92_PANTR Proto-oncogene tyrosine-protein kinase receptor Ret OS=Pan troglodytes OX=9598 GN=CK820_G0052381 PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2J8IX93|A0A2J8IX93_PANTR Proto-oncogene tyrosine-protein kinase receptor Ret OS=Pan troglodytes OX=9598 GN=CK820_G0052381 PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2J8XTA5|A0A2J8XTA5_PONAB Proto-oncogene tyrosine-protein kinase receptor Ret OS=Pongo abelii OX=9601 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2K5HW95|A0A2K5HW95_COLAP Proto-oncogene tyrosine-protein kinase receptor Ret OS=Colobus angolensis palliatus OX=336983 PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2K5YH50|A0A2K5YH50_MANLE Proto-oncogene tyrosine-protein kinase receptor Ret OS=Mandrillus leucophaeus OX=9568 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2K6R0I7|A0A2K6R0I7_RHIRO Proto-oncogene tyrosine-protein kinase receptor Ret OS=Rhinopithecus roxellana OX=61622 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|A0A2R9AWY2|A0A2R9AWY2_PANPA Proto-oncogene tyrosine-protein kinase receptor Ret OS=Pan paniscus OX=9597 GN=RET PE=3 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|F6S7W7|F6S7W7_MACMU Proto-oncogene tyrosine-protein kinase receptor Ret OS=Macaca mulatta OX=9544 GN=RET PE=3 SV=3
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|H9ZBZ9|H9ZBZ9_MACMU Proto-oncogene tyrosine-protein kinase receptor Ret OS=Macaca mulatta OX=9544 GN=RET PE=2 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
>tr|H9ZC00|H9ZC00_MACMU Proto-oncogene tyrosine-protein kinase receptor Ret OS=Macaca mulatta OX=9544 GN=RET PE=2 SV=1
LVLGKTLGEGEFGKVVKATAFRLKGRAGYTTVAVKMLKENASPSELRDLLSEFNLLKQVN
HPHVIKLYGACSQD--GPLLLIVEYAKYGSLRGFLRESRKVGPGYLGSGGSRNSSSLDHP
DERALTMGDLISFAWQISRGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYE
EDSYVKRSKGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLF
NLLKTGHRMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKSRDYL
Whole-MSA median distribution profile:
Backbone Propensities
Disorder
Early Folding Events
Conservation
Sequence "1A2C"
Distribution Analysis focused on "backbone" predictor
The distribution of the biophysical predictions for the selected sequence is shown below including the sequence conservation colour map (using Shannon's entropy).
MSA Biophysical Predictions
The biophysical predictions calculated for the selected sequence are shown below including the gaps in the alignment and the sequence conservation colour map (using Shannon's entropy).
DynaMine's backbone dynamics
- Values > 1.0: Membrane spanning regions
- Values > 0.80 and < 1.0: Rigid conformations
- Values > 0.69 and < 0.80: Context dependent
- Values < 0.69: Flexible regions
DynaMine's side chain dynamics
Higher values mean more likely rigid. These values are highly dependent on the amino acid type (i.e. a Trp will be rigid, an Asp flexible).
DynaMine's conformational propensities
- Sheet: Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a generally twisted, pleated sheet.
- Helix: Alpha helices are right-handed coiled or spiral structures where each amino acid residue is hydrogen bonded to residues four positions away, forming a stable helical conformation.
- Coil: Coil regions lack regular secondary structure and are characterized by their flexibility and lack of specific hydrogen bonding patterns.
- ppII (polyproline II): A left-handed helical structure commonly found in proline-rich regions, characterized by three residues per turn and specific dihedral angles.
Higher values indicate higher propensities for each structural element.
EFoldMine's earlyFolding propensity
Values above 0.169 indicate residues that are likely to start the protein folding process, based on only local interactions with other amino acids.
DisoMine's disorder prediction
Values above 0.50 indicate that this is likely a disordered residue.
Agmata beta-sheet aggregation propensity
These values are divided by a factor of 20 from the original. Peaks indicate residues likely to be involved in beta-sheet aggregation.
Biophysical Profile
This tab provides a detailed analysis of the biophysical profile of the selected sequence compared to the MSA median values.
Backbone Propensities
Disorder
Early Folding Events
Comparison Table
| Feature | Category | Whole-MSA Median (%) | Selected Sequence (%) | Difference |
|---|
Predictions for the selected sequence
The biophysical predictions calculated for the seleected sequence including the gaps in the alignment are shown below.
DynaMine's backbone dynamics
- Values > 1.0: Membrane spanning regions
- Values > 0.80 and < 1.0: Rigid conformations
- Values > 0.69 and < 0.80: Context dependent
- Values < 0.69: Flexible regions
DynaMine's side chain dynamics
Higher values mean more likely rigid. These values are highly dependent on the amino acid type (i.e. a Trp will be rigid, an Asp flexible).
DynaMine's conformational propensities
- Sheet: Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a generally twisted, pleated sheet.
- Helix: Alpha helices are right-handed coiled or spiral structures where each amino acid residue is hydrogen bonded to residues four positions away, forming a stable helical conformation.
- Coil: Coil regions lack regular secondary structure and are characterized by their flexibility and lack of specific hydrogen bonding patterns.
- ppII (polyproline II): A left-handed helical structure commonly found in proline-rich regions, characterized by three residues per turn and specific dihedral angles.
Higher values indicate higher propensities for each structural element.
EFoldMine's earlyFolding propensity
Values above 0.169 indicate residues that are likely to start the protein folding process, based on only local interactions with other amino acids.
DisoMine's disorder prediction
Values above 0.50 indicate that this is likely a disordered residue.
Agmata beta-sheet aggregation propensity
These values are divided by a factor of 20 from the original. Peaks indicate residues likely to be involved in beta-sheet aggregation.
Predictions for the selected sequence
The biophysical predictions calculated for the seleected sequence ignoring the gaps in the alignment are shown below.
DynaMine's backbone dynamics
- Values > 1.0: Membrane spanning regions
- Values > 0.80 and < 1.0: Rigid conformations
- Values > 0.69 and < 0.80: Context dependent
- Values < 0.69: Flexible regions
DynaMine's side chain dynamics
Higher values mean more likely rigid. These values are highly dependent on the amino acid type (i.e. a Trp will be rigid, an Asp flexible).
DynaMine's conformational propensities
- Sheet: Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a generally twisted, pleated sheet.
- Helix: Alpha helices are right-handed coiled or spiral structures where each amino acid residue is hydrogen bonded to residues four positions away, forming a stable helical conformation.
- Coil: Coil regions lack regular secondary structure and are characterized by their flexibility and lack of specific hydrogen bonding patterns.
- ppII (polyproline II): A left-handed helical structure commonly found in proline-rich regions, characterized by three residues per turn and specific dihedral angles.
Higher values indicate higher propensities for each structural element.
EFoldMine's earlyFolding propensity
Values above 0.169 indicate residues that are likely to start the protein folding process, based on only local interactions with other amino acids.
DisoMine's disorder prediction
Values above 0.50 indicate that this is likely a disordered residue.
Agmata beta-sheet aggregation propensity
These values are divided by a factor of 20 from the original. Peaks indicate residues likely to be involved in beta-sheet aggregation.
3D Structure with Biophysical Annotations
This visualization integrates the 1D sequence features with 3D structural information, providing a comprehensive view of the protein. The biophysical properties are mapped onto the sequence track, allowing you to correlate structural features with predicted properties.
The 3D structure is provided by AlphaFold DB and rendered using the Molstar library. For the tracks, we use the Saguaro3D component to link the sequence features to the 3D structure.
The following tracks are available:
- Bio2Byte biophysical predictions
- Bio2Byte biophysical blocks for backbone dynamics and disorder
- Bio2Byte early folding events
- AlphaFold DB model's pLDDT provided by AlphaSync API
- AlphaFold DB model's secondary structure by AlphaSync API (using DSSP classifications from DSSP)
- AlphaFold DB model's surface by AlphaSync API (based on methodology described in Nature Structural & Molecular Biology)
- InterPro annotations (domains, families, superfamilies) fetched via InterPro API using UniProt ID. If no data is shown, it means there is no data associated with the requested UniProt ID.
DSSP Secondary Structure Classification
DSSP uses 8 types of secondary structure, which we've grouped into 5 categories for visualization:
- Alpha Helix - Includes DSSP codes H (4-turn α helix), G (3-turn 310 helix), and I (5-turn π helix)
- Strands - Includes DSSP codes E (extended strand in β sheet) and B (isolated β-bridge)
- Loop - Includes DSSP codes T (hydrogen-bonded turn) and S (bend)
- Coil - Includes DSSP code C (coil, which can also be represented as space or dash)
- Unstructured - Any other code not listed above (not displayed as blocks in the visualization)
Sequence Logo Visualization
WebLogo displays the conservation of each position in the alignment. The height of each letter indicates the relative frequency of that amino acid at that position. The overall height of the stack represents the sequence conservation at that position.
Gaussian mixture model (GMM) Analysis
The biophysical predictions calculated for all the individual sequences in the MSA are fitted on-the-fly using a Gaussian mixture model (GMM) with a single component using the Scikit-learn Python library (more information here).
This allows reduction of the number of dimensions from the biophysical predictions (backbone and side chain dynamics, helix, sheet, coil, ppII, early folding propensities, beta-aggregation propensities and disorder) to a single scoring value. The more negative the value, the further that residue is from the MSA-based model. This enables determination of residues that are over the 95, 99 and 99.9 percentiles as estimated by the GMM classification model.
Results Table
The biophysical predictions calculated for all the individual sequences in the MSA are shown in the table below.
| Position | Residue | Backbone | Side chain | DisoMine | Coil | Helix | PPII | Sheet | Early Folding | Agmata |
|---|