HA |
General Information |
Protegen ID |
1620 |
Sequence Strain (Species/Organism) |
synthetic construct |
Taxonomy ID |
32630
|
Other Database IDs |
CDD:278910 |
Molecule Role |
Protective antigen |
References |
|
Gene Information |
Gene Name |
HA |
NCBI Nucleotide GI |
89994009
|
DNA Sequence |
>gi|89994009|gb|DQ420166.1| Synthetic construct codon-optimized hemagglutinin (HA) gene, complete cds
ATGGAGAAGATCGTGCTGCTGCTGGCCATCGTGAGCCTGGTGAAGAGCGACCAGATCTGCATCGGCTACC
ACGCCAACAACAGCACCGAGCAGGTGGACACCATCATGGAGAAGAACGTGACCGTGACCCACGCCCAGGA
CATCCTGGAGAAGACCCACAACGGCAAGCTGTGCGACCTGAACGGCGTGAAGCCCCTGATCCTGCGCGAC
TGCAGCGTGGCCGGCTGGCTGCTGGGCAACCCCATGTGCGACGAGTTCATCAACGTGCCCGAGTGGAGCT
ACATCGTGGAGAAGGCCAGCCCCGCCAACGACCTGTGCTACCCCGGCGACTTCAACGACTACGAGGAGCT
GAAGCACCTGCTGAGCCGCACCAACCACTTCGAGAAGATCCAGATCATCCCCAAGAGCAGCTGGAGCAAC
CACGACGCCAGCAGCGGCGTGAGCAGCGCCTGCCCCTACCACGGCCGCAGCAGCTTCTTCCGCAACGTGG
TGTGGCTGATCAAGAAGAACAGCGCCTACCCCACCATCAAGCGCAGCTACAACAACACCAACCAGGAGGA
CCTGCTGGTGCTGTGGGGCATCCACCACCCCAACGACGCCGCCGAGCAGACCAAGCTGTACCAGAACCCC
ACCACCTACATCAGCGTGGGCACCAGCACCCTGAACCAGCGCCTGGTGCCCGAGATCGCCACCCGCCCCA
AGGTGAACGGGCAGAGCGGCCGCATGGAGTTCTTCTGGACCATCCTGAAGCCCAACGACGCCATCAACTT
CGAGAGCAACGGCAACTTCATCGCCCCCGAGTACGCCTACAAGATCGTGAAGAAGGGCGACAGCGCCATC
ATGAAGAGCGAGCTGGAGTACGGCAACTGCAACACCAAGTGCCAGACCCCCATGGGCGCCATCAACAGCA
GCATGCCCTTCCACAACATCCACCCCCTGACCATCGGCGAGTGCCCCAAGTACGTGAAGAGCAACCGCCT
GGTGCTGGCCACCGGCCTGCGCAACACCCCCCAGCGCGAGCGCCGCCGCAAGAAGCGCGGCCTGTTCGGC
GCCATCGCCGGCTTCATCGAGGGCGGCTGGCAGGGCATGGTGGACGGCTGGTACGGGTACCACCACAGCA
ACGAGCAGGGCAGCGGCTACGCCGCCGACAAGGAGAGCACCCAGAAGGCCATCGACGGCGTGACCAACAA
GGTGAACAGCATCATCGACAAGATGAACACCCAGTTCGAGGCCGTGGGCCGCGAGTTCAACAACCTGGAG
CGCCGCATCGAGAACCTGAACAAGCAGATGGAGGACGGCTTCCTGGACGTGTGGACCTACAACGCCGAGC
TGCTGGTGCTGATGGAGAACGAGCGCACCCTGGACTTCCACGACAGCAACGTGAAGAACCTGTACGACAA
GGTGCGCCTGCAGCTGCGCGACAACGCCAAGGAGCTGGGCAACGGCTGCTTCGAGTTCTACCACAAGTGC
GACAACGAGTGCATGGAGAGCGTGAAGAACGGCACCTACGACTACCCCCAGTACAGCGAGGAGGCCCGCC
TGAACCGCGAGGAGATCAGCGGCGTGAAGCTGGAGAGCATGGGCACCTACCAGATCCTGAGCATCTACAG
CACCGTGGCCAGCAGCCTGGCCCTGGCCATCATGGTGGCCGGCCTGAGCCTGTGGATGTGCAGCAACGGC
AGCCTGCAGTGCCGCATCTGCATCTAA
|
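The coding sequence above can be checked against the protein sequence listed in this record by translating it with the standard genetic code. The following is a minimal sketch, not part of the original record: the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 27 nucleotides of the listing are used so the example stays short.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional compact layout with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the DNA sequence listed above.
HEAD = "ATGGAGAAGATCGTGCTGCTGCTGGCCATC"
# Final line of the listing (27 nucleotides, in frame, ending in TAA).
TAIL = "AGCCTGCAGTGCCGCATCTGCATCTAA"

print(translate(HEAD))  # MEKIVLLLAI
print(translate(TAIL))  # SLQCRICI
```

Both fragments agree with the start and end of the protein sequence for ABD83813.1 given in this record. Passing the full concatenated DNA listing to `translate` would allow the same comparison over the whole protein.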
Protein Information |
Protein Name |
codon-optimized hemagglutinin |
NCBI Protein GI |
89994010
|
Protein Accession |
ABD83813.1 |
Protein pI |
6.56 |
Protein Weight |
61265.95 |
Protein Length |
568 |
Protein Note |
derived from influenza A virus strain A/goose/Guangdong/1/96(H5N1); GSGD96 |
Protein Sequence |
>ABD83813.1 codon-optimized hemagglutinin [synthetic construct]
MEKIVLLLAIVSLVKSDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLNGVKPLILRD
CSVAGWLLGNPMCDEFINVPEWSYIVEKASPANDLCYPGDFNDYEELKHLLSRTNHFEKIQIIPKSSWSN
HDASSGVSSACPYHGRSSFFRNVVWLIKKNSAYPTIKRSYNNTNQEDLLVLWGIHHPNDAAEQTKLYQNP
TTYISVGTSTLNQRLVPEIATRPKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSAI
MKSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRNTPQRERRRKKRGLFG
AIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKESTQKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLE
RRIENLNKQMEDGFLDVWTYNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKC
DNECMESVKNGTYDYPQYSEEARLNREEISGVKLESMGTYQILSIYSTVASSLALAIMVAGLSLWMCSNG
SLQCRICI
|
Vaxign Prediction |
Localization(Probability) |
(Prob.=0) |
Adhesin Probability |
0.230 |
Trans-membrane Helices |
1 |
Epitope Information |
IEDB Linear Epitope |
|
IEDB ID | Epitope | Starting position | Ending position |
20836 | GLFGAIAGF | 347 | 356 |
20838 | GLFGAIAGFIE | 347 | 358 |
29690 | IYSTVASSL | 535 | 544 |
97231 | CNTKCQTP | 290 | 298 |
97301 | FFWTILKP | 244 | 252 |
97348 | GLFGAIAGFIEGGW | 347 | 361 |
97376 | GYHHSNEQGSGYAADKESTQKAIDGVTNK | 369 | 398 |
97413 | KCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRN | 293 | 336 |
97441 | LFGAIAGFIEGGWQGMVDGWYG | 348 | 370 |
97578 | QKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNK | 388 | 429 |
97700 | TNQEDLLVLWGIHHPNDAAEQT | 183 | 205 |
97771 | WYGYHHSN | 367 | 375 |
102130 | TKLYQNPTTYISVGT | 204 | 219 |
133590 | GMVDGWYG | 362 | 370 |
133748 | YNAELLV | 440 | 447 |
138194 | IIPKSSWS | 132 | 140 |
148521 | CNTKCQTPMGAINSS | 290 | 305 |
162644 | KESTQKAIDGVTNKVNS | 384 | 401 |
163243 | RKKRGLFGAIAGFIE | 343 | 358 |
167382 | RERRRKKRGLFGAIA | 339 | 354 |
167383 | RERRRKKRGLFGAIAGFI | 339 | 357 |
176979 | CYPGDFNDYEELK | 106 | 119 |
177093 | IATRPKVNGQSGRM | 229 | 243 |
177236 | RKKRGLFGAIAGFIEGGW | 343 | 361 |
177300 | STQKAIDGVTNKVNSIIDK | 386 | 405 |
177340 | VDGWYGYHHSNEQGSGYA | 364 | 382 |
221495 | WMCSNGSLQCRICI | 555 | 569 |
224827 | KPNDAINF | 250 | 258 |
243006 | NGNFIAPEYAYKI | 260 | 273 |
243067 | QEDLLVLWGIH | 185 | 196 |
243602 | VKPLI | 63 | 68 |
243849 | WLLGNP | 76 | 82 |
538659 | FGAIAGFIEG | 349 | 359 |
738340 | GLFGAIAGFIEGGWQGMVDGWYGYHHSN | 347 | 375 |
745428 | WGIHH | 192 | 197 |
745433 | WYGYHH | 367 | 373 |
837494 | HSNEQGSGYAADKE | 372 | 386 |
837985 | MENERTLDFHDSNV | 448 | 462 |
840897 | LVLWGIHHP | 189 | 198 |
840943 | SVKNGTYD | 497 | 505 |
840970 | WSYIVE | 92 | 98 |
923970 | YAADKESTQKAIDGVTNKVN | 380 | 400 |
|
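The epitope coordinates listed above can be reproduced by searching for each peptide in the protein sequence. The sketch below is illustrative only and makes two assumptions that the record itself does not state: starting positions are 1-based, and each ending position equals the start plus the peptide length (one past the last residue). Both assumptions match the listed values for the entries checked here. The helper name `locate` and the dictionary `LISTED` are hypothetical, and only the first 140 residues of ABD83813.1 are used to keep the example short.

```python
from typing import Optional, Tuple

# First 140 residues of ABD83813.1, copied from the protein sequence
# in this record (the first two 70-residue lines).
PROTEIN_PREFIX = (
    "MEKIVLLLAIVSLVKSDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLNGVKPLILRD"
    "CSVAGWLLGNPMCDEFINVPEWSYIVEKASPANDLCYPGDFNDYEELKHLLSRTNHFEKIQIIPKSSWSN"
)

# Epitopes from the table that fall within those 140 residues,
# mapped to their listed (start, end) positions.
LISTED = {
    "VKPLI": (63, 68),
    "WLLGNP": (76, 82),
    "WSYIVE": (92, 98),
    "CYPGDFNDYEELK": (106, 119),
    "IIPKSSWS": (132, 140),
}


def locate(epitope: str, protein: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) for the first match, or None if absent.

    Assumption: start is 1-based and end is start + len(epitope),
    which is the convention the listed coordinates appear to follow.
    """
    index = protein.find(epitope)  # 0-based, -1 when not found
    if index < 0:
        return None
    start = index + 1
    return start, start + len(epitope)


for peptide, listed in LISTED.items():
    found = locate(peptide, PROTEIN_PREFIX)
    status = "match" if found == listed else "MISMATCH"
    print(f"{peptide}: listed={listed} computed={found} {status}")
```

Replacing `PROTEIN_PREFIX` with the full protein sequence would let the same check run over every row of the table.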