ZGP |
General Information |
Protegen ID |
1652 |
Sequence Strain (Species/Organism) |
Zaire ebolavirus strain Zaire95 |
Taxonomy ID |
186538
|
Other Database IDs |
CDD:279888 CDD:333284 CDD:197367 |
Molecule Role |
Protective antigen |
Related Vaccines(s) |
Ebola Virus Vaccine Ad5-ZGP
|
References |
|
Gene Information |
Gene Name |
ZGP |
NCBI Nucleotide GI |
1695251
|
DNA Sequence |
>gi|1695251|gb|U28077.1|EVU28077 Zaire Ebola virus strain Zaire95 virion spike glycoprotein (SP) gene, complete cds, and small/secreted glycoprotein precursor (SGP) gene, complete cds
GCGATGAAGATTAAGCCGACAGTGAGCGTAATCTTCATCTCTCTTAGATTATTTGTCCTCCAGAGTAGGG
ATCGTCAGGTCCTTTTCAATCGTATAACCAAAATAAACTTCACTAGAAGGATATTGTGGGGCAACAACAC
AATGGGTGTTACAGGAATATTGCAGTTACCTCGTGATCGATTCAAGAGGACATCATTCTTTCTTTGGGTA
ATTATCCTTTTCCAAAGAACATTTTCCATCCCACTTGGAGTCATCCACAATAGCACATTACAGGTTAGTG
ATGTCGACAAACTGGTTTGCCGTGACAAACTGTCATCCACAAATCAATTGAGATCAGTTGGACTGAATCT
CGAAGGGAATGGAGTGGCAACTGACGTGCCATCTGCAACTAAAAGATGGGGCTTCAGGTCCGGTGTCCCA
CCAAAGGTGGTCAATTATGAAGCTGGTGAATGGGCTGAAAACTGCTACAATCTTGAAATCAAAAAACCTG
ACGGGAGTGAGTGTCTACCAGCAGCGCCAGACGGGATTCGGGGCTTCCCCCGGTGCCGGTATGTGCACAA
AGTATCAGGAACGGGACCGTGTGCCGGAGACTTTGCCTTCCACAAAGAGGGTGCTTTCTTCCTGTATGAC
CGACTTGCTTCCACAGTTATCTACCGAGGAACGACTTTCGCTGAAGGTGTCGTTGCATTTCTGATACTGC
CCCAAGCTAAGAAGGACTTCTTCAGCTCACACCCCTTGAGAGAGCCGGTCAATGCAACGGAGGACCCGTC
TAGTGGCTACTATTCTACCACAATTAGATATCAAGCTACCGGTTTTGGAACCAATGAGACAGAGTATTTG
TTCGAGGTTGACAATTTGACCTACGTCCAACTTGAATCAAGATTCACACCACAGTTTCTGCTCCAGCTGA
ATGAGACAATATATACAAGTGGGAAAAGGAGCAATACCACGGGAAAACTAATTTGGAAGGTCAACCCCGA
AATTGATACAACAATCGGGGAGTGGGCCTTCTGGGAAACTAAAAAAACCTCACTAGAAAAATTCGCAGTG
AAGAGTTGTCTTTCACAGCTGTATCAAACAGAGCCAAAAACATCAGTGGTCAGAGTCCGGCGCGAACTTC
TTCCGACCCAGGGACCAACACAACAACTGAAGACCACAAAATCATGGCTTCAGAAAATTCCTCTGCAATG
GTTCAAGTGCACAGTCAAGGAAGGGAAGCTGCAGTGTCGCATCTGACAACCCTTGCCACAATCTCCACGA
GTCCTCAACCCCCCACAACCAAACCAGGTCCGGACAACAGCACCCACAATACACCCGTGTATAAACTTGA
CATCTCTGAGGCAACTCAAGTTGAACAACATCACCGCAGAACAGACAACGACAGCACAGCCTCCGACACT
CCCCCCGCCACGACCGCAGCCGGACCCCTAAAAGCAGAGAACACCAACACGAGCAAGGGTACCGACCTCC
TGGACCCCGCCACCACAACAAGTCCCCAAAACCACAGCGAGACCGCTGGCAACAACAACACTCATCACCA
AGATACCGGAGAAGAGAGTGCCAGCAGCGGGAAGCTAGGCTTAATTACCAATACTATTGCTGGAGTCGCA
GGACTGATCACAGGCGGGAGGAGAGCTCGAAGAGAAGCAATTGTCAATGCTCAACCCAAATGCAACCCTA
ATTTACATTACTGGACTACTCAGGATGAAGGTGCTGCAATCGGACTGGCCTGGATACCATATTTCGGGCC
AGCAGCCGAGGGAATTTACACAGAGGGGCTGATGCACAATCAAGATGGTTTAATCTGTGGGTTGAGACAG
CTGGCCAACGAGACGACTCAAGCTCTTCAACTGTTCCTGAGAGCCACAACCGAGCTACGCACCTTTTCAA
TCCTCAACCGTAAGGCAATTGATTTCTTGCTGCAGCGATGGGGCGGCACATGCCACATTTTGGGACCGGA
CTGCTGTATCGAACCACATGATTGGACCAAGAACATAACAGACAAAATTGATCAGATTATTCATGATTTT
GTTGATAAAACCCTTCCGGACCAGGGGGACAATGACAATTGGTGGACAGGATGGAGACAATGGATACCGG
CAGGTATTGGAGTTACAGGCGTTATAATTGCAGTTATCGCTTTATTCTGTATATGCAAATTTGTCTTTTA
GTTTTTCTTCAGATTGCTTCATGGCAAAGCTCAGCCTCAAATCAATGAAACCAGGATTTAATTATATGGA
TTACTTGAATCTAAGATTACTTGACAAATGATAATATAATACACTGGAGCTTTAAACATAGCCAATGTGA
TTCTAACTCTTTTAAACTCACAGTTAATCATAAATAAGGTTTGACATCAATCTAGTTATCTCTTTGAGAA
TGATAAACTTGATGAAGATTAAGAAAAA
|
Protein Information |
Protein Name |
virion spike glycoprotein |
NCBI Protein GI |
1695253
|
Protein Accession |
AAB37095.1 |
Protein pI |
6.69 |
Protein Weight |
71315.12 |
Protein Length |
744 |
Protein Note |
Filovirus glycoprotein; pfam01611 |
Protein Sequence |
>AAB37095.1 virion spike glycoprotein [Zaire ebolavirus]
MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQVSDVDKLVCRDKLSSTNQLRSVGLNL
EGNGVATDVPSATKRWGFRSGVPPKVVNYEAGEWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHK
VSGTGPCAGDFAFHKEGAFFLYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPS
SGYYSTTIRYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYTSGKRSNTTGKLIWKVNPE
IDTTIGEWAFWETKKNLTRKIRSEELSFTAVSNRAKNISGQSPARTSSDPGTNTTTEDHKIMASENSSAM
VQVHSQGREAAVSHLTTLATISTSPQPPTTKPGPDNSTHNTPVYKLDISEATQVEQHHRRTDNDSTASDT
PPATTAAGPLKAENTNTSKGTDLLDPATTTSPQNHSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVA
GLITGGRRARREAIVNAQPKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYTEGLMHNQDGLICGLRQ
LANETTQALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKIDQIIHDF
VDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF
|
Vaxign Prediction |
Localization(Probability) |
(Prob.=0) |
Adhesin Probability |
0.667 |
Trans-membrane Helices |
1 |
Detailed Vaxign Results |
Vaxign Results |
Epitope Information |
IEDB Linear Epitope |
|
IEDB ID |
Epitope |
Starting position |
Ending position |
6391 |
CHILGPDCCIEPHDW |
601 |
616 |
8777 |
DISEATQVEQHHRRTDND |
397 |
415 |
8885 |
DKIDQIIHDFVDKTL |
621 |
636 |
13658 |
EPHDWTKNITDKIDQ |
611 |
626 |
13781 |
EPVNATEDPSSGYYS |
201 |
216 |
13837 |
EQHHRRTDN |
405 |
414 |
26343 |
IGVTGVIIAVIALFC |
656 |
671 |
26510 |
IIHDFVDKTLPDQGD |
626 |
641 |
27448 |
IMASENSSAMVQVHS |
341 |
356 |
43502 |
NDNWWTGWRQWIPAG |
641 |
656 |
47064 |
PDCCIEPHDWTKNIT |
606 |
621 |
47167 |
PDQGDNDNWWTGWRQ |
636 |
651 |
50368 |
QATGFGTNETEYLFE |
221 |
236 |
52394 |
QSPARTSSDPGTNTT |
321 |
336 |
57019 |
SATKRWGFRSGVPPK |
81 |
96 |
61752 |
STLQVSDVDKLVCRD |
41 |
56 |
64624 |
TKNITDKIDQIIHDF |
616 |
631 |
65808 |
TPVYKLDISEATQVEQHH |
391 |
409 |
68003 |
VDKTLPDQGDNDNWW |
631 |
646 |
68320 |
VEQHHRRTD |
404 |
413 |
72065 |
VYKLDISEA |
393 |
402 |
72638 |
WIPAGIGVTGVIIAV |
651 |
666 |
76393 |
YVQLESRFTPQFLLQ |
241 |
256 |
91431 |
GVIHNSTLQ |
36 |
45 |
91451 |
HHRRTDNDS |
407 |
416 |
91945 |
SETAGNNNT |
456 |
465 |
162327 |
GKLGLITNTIAGVAGLI |
477 |
494 |
187296 |
DISEATQVEQHHRRT |
397 |
412 |
187300 |
DNSTHNTPVYKLDIS |
385 |
400 |
187314 |
EDHKIMASENSSAMV |
337 |
352 |
187509 |
RTSSDPGTNTTTEDH |
325 |
340 |
187541 |
TGEESASSGKLGLIT |
469 |
484 |
187549 |
TIRYQATGFGTNETE |
217 |
232 |
227064 |
CRDKLSSTNQLRSVG |
53 |
68 |
227097 |
DNLTYVQLESRFTPQ |
237 |
252 |
227099 |
DPGTNTTTEDHKIMA |
329 |
344 |
227141 |
ENSSAMVQVHSQGRE |
345 |
360 |
227170 |
FGTNETEYLFEVDNL |
225 |
240 |
227248 |
HNTPVYKLDISEATQ |
389 |
404 |
227269 |
IKKPDGSECLPAAPD |
113 |
128 |
227307 |
KPGPDNSTHNTPVYK |
381 |
396 |
227409 |
NISGQSPARTSSDPG |
317 |
332 |
227422 |
NTTTEDHKIMASENS |
333 |
348 |
227450 |
PPKVVNYEAGEWAEN |
93 |
108 |
227687 |
VYKLDISEATQVEQH |
393 |
408 |
233152 |
ATQVEQHHRRTDNDSTA |
401 |
418 |
233197 |
HNTPVYKLDISEATQVE |
389 |
406 |
475611 |
AIGLAWIPYF |
526 |
536 |
478550 |
GEWAF |
286 |
291 |
738477 |
AKKDFFSSHPLREPVNATEDPS |
189 |
211 |
738527 |
ASENSSAMVQVHSQGREAAVSHLTTLA |
343 |
370 |
738955 |
FSILNRKAIDFLLQRWGGTCHILGPD |
582 |
608 |
738996 |
GEESASSGKLGLITNTIAGVAGLIT |
470 |
495 |
739075 |
GTNTTTEDHKIM |
331 |
343 |
739152 |
IDTTIGEWAFWETKKNLTRKIRSE |
281 |
305 |
739153 |
IDTTIGEWAFWETKKNLTRKIRSEELSF |
281 |
309 |
739163 |
IEPHDWTKNITDKIDQIIHDFVDKTLPDQGDND |
610 |
643 |
739164 |
IEPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNWWTGWRQWI |
610 |
653 |
739279 |
IYTSGKRSNTTGKLIWKVNPE |
260 |
281 |
739438 |
LDISEATQVEQHHRRTDNDSTASDT |
396 |
421 |
739466 |
LFLRATTELRT |
571 |
582 |
739533 |
LNETIYTSGKRSNTTGKLIWKVN |
256 |
279 |
739659 |
MHNQDGLICGLRQLANETTQALQ |
548 |
571 |
739719 |
NAQPKCNPN |
506 |
515 |
739800 |
NNNTHHQDTGEESASSGKLG |
461 |
481 |
740269 |
SGYYSTTIRYQATGFGTNETEYLFEVDNLT |
211 |
241 |
740342 |
SPQNHSETAG |
451 |
461 |
740397 |
STLQVSDVDKLVCRDKLSSTNQLRS |
41 |
66 |
740442 |
TEGLMHNQDGLICGLRQLANETTQALQLFLRATTELRT |
544 |
582 |
740516 |
TPVYKLDISEATQVEQHHRRTDNDS |
391 |
416 |
740578 |
VAFLILPQAKKDFFSSHPLREPVNATEDPS |
181 |
211 |
769762 |
TDKIDQIIHDFVDKTL |
620 |
636 |
832524 |
ATTELRTFSILNRKAI |
575 |
591 |
832618 |
EDHKIMASENSSAMVQ |
337 |
353 |
832628 |
ENCYNLEIKKPDGSEC |
106 |
122 |
832979 |
GEESASSGKLGLITNTIAGVAGL |
470 |
493 |
833006 |
GPLKAENTN |
428 |
437 |
833196 |
IDFLLQRWGGTCHILGPDCCIE |
590 |
612 |
833465 |
IWKVNPEIDTTIGEWA |
274 |
290 |
833550 |
KEGAFFLYDRLASTVIYRGTTFA |
155 |
178 |
833625 |
KNISGQSPAR |
316 |
326 |
833870 |
LMHNQDGLICGLRQLA |
547 |
563 |
834038 |
NATEDPSSGYYSTTIR |
204 |
220 |
834427 |
RSEELSFTAV |
302 |
312 |
835856 |
VDKTLPDQGDNDNWWT |
631 |
647 |
835888 |
VGLNLEGNGVATDVPSATKRW |
66 |
87 |
836134 |
VQVHSQGREAAVSHLT |
351 |
367 |
852598 |
GAFFLYDRLAST |
157 |
169 |
858307 |
DNDSTASDTPPATTAAGPLK |
412 |
432 |
858614 |
DKIDQIIHDFV |
621 |
632 |
858649 |
LEIKKPDGSE |
111 |
121 |
929450 |
NSTHNTPVYKLDISE |
386 |
401 |
931784 |
TKNITDKIDQIIHDFVDKTL |
616 |
636 |
932264 |
VDKTLPDQGDNDNWWTGWRQWIPAG |
631 |
656 |
|
|
MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQVSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSATKRWGFRSGVPPKVVNYEAGEWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFFLYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTIRYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYTSGKRSNTTGKLIWKVNPEIDTTIGEWAFWETKKNLTRKIRSEELSFTAVSNRAKNISGQSPARTSSDPGTNTTTEDHKIMASENSSAMVQVHSQGREAAVSHLTTLATISTSPQPPTTKPGPDNSTHNTPVYKLDISEATQVEQHHRRTDNDSTASDTPPATTAAGPLKAENTNTSKGTDLLDPATTTSPQNHSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVAGLITGGRRARREAIVNAQPKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYTEGLMHNQDGLICGLRQLANETTQALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKIDQIIHDFVDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF
|