SGP |
| General Information |
| Protegen ID |
1651 |
|
Sequence Strain (Species/Organism) |
Sudan ebolavirus strain Boniface |
|
Taxonomy ID |
186540
|
|
Other Database IDs |
CDD:279888 CDD:197367 |
|
Molecule Role |
Protective antigen |
| References |
|
| Gene Information |
|
Gene Name |
SGP |
|
NCBI Nucleotide GI |
1041223
|
|
DNA Sequence |
>gi|1041223|gb|U28134.1|EVU28134 Sudan Ebola virus strain Boniface virion spike glycoprotein (SP) gene, complete cds, and small/secreted glycoprotein precursor (SGP) gene, complete cds
ATTTGATGAAGATTAAGCCTGATTAAGGCCCAACCTTCATCTTTTTACCATAATCTTGTTCTCAATACCA
TTTAATAGGGGTATACTTGCCAAAGCGCCCCCATCCTCAGGATCTCGCAATGGAGGGTCTTAGCCTACTC
CAATTGCCCAGAGATAAATTTCGAAAAAGCTCTTTCTTTGTTTGGGTCATCATCTTATTTCAAAAGGCCT
TTTCCATGCCTTTGGGTGTTGTGACCAACAGCACTTTAGAAGTAACAGAGATTGACCAGCTAGTCTGCAA
GGATCATCTTGCATCAACTGACCAGCTGAAATCAGTTGGTCTCAACCTCGAGGGGAGCGGAGTATCTACT
GATATCCCATCTGCGACAAAGCGTTGGGGCTTCAGATCTGGTGTGCCTCCCCAAGTGGTCAGCTATGAAG
CAGGAGAATGGGCTGAAAATTGCTACAATCTTGAAATAAAGAAACCGGACGGGAGCGAATGCTTACCCCC
ACCGCCGGATGGTGTCAGAGGCTTTCCAAGGTGCCGCTATGTTCACAAAGCCCAAGGAACCGGGCCCTGC
CCGGGTGACTATGCCTTTCACAAGGATGGAGCTTTCTTCCTCTATGACAGGCTGGCTTCAACTGTAATTT
ACAGAGGAGTCAATTTTGCTGAGGGGGTAATCGCATTCTTGATATTGGCTAAACCAAAGGAAACGTTCCT
TCAATCACCCCCCATTCGAGAGGCAGCAAACTACACTGAAAATACATCAAGTTACTATGCCACATCCTAC
TTGGAGTACGAAATCGAAAATTTTGGTGCTCAACACTCCACGACCCTTTTCAAAATTAACAATAATACTT
TTGTTCTTCTGGACAGGCCCCACACGCCTCAGTTCCTTTTCCAGCTGAATGATACCATTCAACTTCACCA
ACAGTTGAGCAACACAACTGGGAAACTAATTTGGACACTAGATGCTAATATCAATGCTGATATTGGTGAA
TGGGCTTTTTGGGAAAATAAAAAAATCTCTCCGAACAACTACGTGGAGAAGAGCTGTCTTTCGAAACTTT
ATCGCTCAACGAGACAGAAGACGATGATGCGACATCGTCGAGAACTACAAAGGGAAGAATCTCCGACCGG
GCCACCAGGAAGTATTCGGACCTGGTTCCAAAGGATTCCCCTGGGATGGTTTCATTGCACGTACCAGAAG
GGGAAACAACATTGCCGTCTCAGAATTCGACAGAAGGTCGAAGAGTAGATGTGAATACTCAGGAAACTAT
CACAGAGACAACTGCAACAATCATAGGCACTAACGGTAACAACATGCAGATCTCCACCATCGGGACAGGA
CTGAGCTCCAGCCAAATCCTGAGTTCCTCACCGACCATGGCACCAAGCCCTGAGACTCAGACCTCCACAA
CCTACACACCAAAACTACCAGTGATGACCACCGAGGAATCAACAACACCACCGAGAAACTCTCCTGGCTC
AACAACAGAAGCACCCACTCTCACCACCCCAGAGAATATAACAACAGCGGTTAAAACTGTTTGGCCACAA
GAGTCCACAAGCAACGGTCTAATAACTTCAACAGTAACAGGGATTCTTGGGAGCCTTGGACTTCGAAAAC
GCAGCAGAAGACAAGTTAACACCAGGGCCACGGGTAAATGCAATCCCAACTTACACTACTGGACTGCACA
AGAACAACATAATGCTGCTGGGATTGCCTGGATCCCGTACTTTGGACCGGGTGCAGAAGGCATATACACT
GAAGGCCTTATGCACAACCAAAATGCCTTAGTCTGTGGACTCAGACAACTTGCAAATGAAACAACTCAAG
CTCTGCAGCTTTTCTTAAGGGCCACGACGGAGCTGCGGACATATACCATACTCAATAGGAAGGCCATAGA
TTTCCTTCTGCGACGATGGGGCGGGACATGTAGGATCCTGGGACCAGATTGTTGCATTGAGCCACATGAT
TGGACCAAAAACATCACTGATAAAATCAACCAAATCATCCATGATTTCATCGACAACCCTTTACCCAATC
AGGATAATGATGATAATTGGTGGACGGGCTGGAGACAGTGGATCCCTGCAGGAATAGGCATTACTGGAAT
TATTATTGCAATCATTGCTCTTCTTTGCGTCTGCAAGCTGCTTTGTTGAATATCAACTTGAATCATTAAT
TTAAAGTTGATACATTTCTAACATTATAAATTATAATCTGATATTAATACTTGAAAATAAGGCTAATGCC
AAATTCTGTGCCAAACTTGAAAGTAGGTTTACCAAAATCCTTTGAACTGGAATGCTTTAATGCTCTTTCT
CAATACTATATAAGTTCCTTCCCAAAATAATATTGATGAAGATTAAGAAAAA
|
| Protein Information |
|
Protein Name |
virion spike glycoprotein precursor |
|
NCBI Protein GI |
1041225
|
|
Protein Accession |
AAB37096.1 |
|
Protein pI |
5.67 |
|
Protein Weight |
72335.57 |
|
Protein Length |
754 |
|
Protein Note |
subtype: Sudan |
|
Protein Sequence |
>AAB37096.1 virion spike glycoprotein precursor [Sudan ebolavirus]
MEGLSLLQLPRDKFRKSSFFVWVIILFQKAFSMPLGVVTNSTLEVTEIDQLVCKDHLASTDQLKSVGLNL
EGSGVSTDIPSATKRWGFRSGVPPQVVSYEAGEWAENCYNLEIKKPDGSECLPPPPDGVRGFPRCRYVHK
AQGTGPCPGDYAFHKDGAFFLYDRLASTVIYRGVNFAEGVIAFLILAKPKETFLQSPPIREAANYTENTS
SYYATSYLEYEIENFGAQHSTTLFKINNNTFVLLDRPHTPQFLFQLNDTIQLHQQLSNTTGKLIWTLDAN
INADIGEWAFWENKKNLSEQLRGEELSFETLSLNETEDDDATSSRTTKGRISDRATRKYSDLVPKDSPGM
VSLHVPEGETTLPSQNSTEGRRVDVNTQETITETTATIIGTNGNNMQISTIGTGLSSSQILSSSPTMAPS
PETQTSTTYTPKLPVMTTEESTTPPRNSPGSTTEAPTLTTPENITTAVKTVWPQESTSNGLITSTVTGIL
GSLGLRKRSRRQVNTRATGKCNPNLHYWTAQEQHNAAGIAWIPYFGPGAEGIYTEGLMHNQNALVCGLRQ
LANETTQALQLFLRATTELRTYTILNRKAIDFLLRRWGGTCRILGPDCCIEPHDWTKNITDKINQIIHDF
IDNPLPNQDNDDNWWTGWRQWIPAGIGITGIIIAIIALLCVCKLLC
|
| Epitope Information |
| IEDB Linear Epitope |
|
| IEDB ID |
Epitope |
Starting position |
Ending position |
| 852598 |
GAFFLYDRLAST |
157 |
169 |
| 832628 |
ENCYNLEIKKPDGSEC |
106 |
122 |
| 147815 |
IHDFIDNPLPNQDNDD |
627 |
643 |
| 478550 |
GEWAF |
286 |
291 |
| 739466 |
LFLRATTELRT |
571 |
582 |
| 833854 |
LHVPEGETT |
353 |
362 |
| 769763 |
TDKINQIIHDFIDNPL |
620 |
636 |
| 858649 |
LEIKKPDGSE |
111 |
121 |
| 47064 |
PDCCIEPHDWTKNIT |
606 |
621 |
|
|
|
MEGLSLLQLPRDKFRKSSFFVWVIILFQKAFSMPLGVVTNSTLEVTEIDQLVCKDHLASTDQLKSVGLNLEGSGVSTDIPSATKRWGFRSGVPPQVVSYEAGEWAENCYNLEIKKPDGSECLPPPPDGVRGFPRCRYVHKAQGTGPCPGDYAFHKDGAFFLYDRLASTVIYRGVNFAEGVIAFLILAKPKETFLQSPPIREAANYTENTSSYYATSYLEYEIENFGAQHSTTLFKINNNTFVLLDRPHTPQFLFQLNDTIQLHQQLSNTTGKLIWTLDANINADIGEWAFWENKKNLSEQLRGEELSFETLSLNETEDDDATSSRTTKGRISDRATRKYSDLVPKDSPGMVSLHVPEGETTLPSQNSTEGRRVDVNTQETITETTATIIGTNGNNMQISTIGTGLSSSQILSSSPTMAPSPETQTSTTYTPKLPVMTTEESTTPPRNSPGSTTEAPTLTTPENITTAVKTVWPQESTSNGLITSTVTGILGSLGLRKRSRRQVNTRATGKCNPNLHYWTAQEQHNAAGIAWIPYFGPGAEGIYTEGLMHNQNALVCGLRQLANETTQALQLFLRATTELRTYTILNRKAIDFLLRRWGGTCRILGPDCCIEPHDWTKNITDKINQIIHDFIDNPLPNQDNDDNWWTGWRQWIPAGIGITGIIIAIIALLCVCKLLC
|