HA |
| General Information |
| Protegen ID |
1620 |
|
Sequence Strain (Species/Organism) |
synthetic construct |
|
Taxonomy ID |
32630
|
|
Other Database IDs |
CDD:278910 |
|
Molecule Role |
Protective antigen |
| References |
|
| Gene Information |
|
Gene Name |
HA |
|
NCBI Nucleotide GI |
89994009
|
|
DNA Sequence |
>gi|89994009|gb|DQ420166.1| Synthetic construct codon-optimized hemagglutinin (HA) gene, complete cds
ATGGAGAAGATCGTGCTGCTGCTGGCCATCGTGAGCCTGGTGAAGAGCGACCAGATCTGCATCGGCTACC
ACGCCAACAACAGCACCGAGCAGGTGGACACCATCATGGAGAAGAACGTGACCGTGACCCACGCCCAGGA
CATCCTGGAGAAGACCCACAACGGCAAGCTGTGCGACCTGAACGGCGTGAAGCCCCTGATCCTGCGCGAC
TGCAGCGTGGCCGGCTGGCTGCTGGGCAACCCCATGTGCGACGAGTTCATCAACGTGCCCGAGTGGAGCT
ACATCGTGGAGAAGGCCAGCCCCGCCAACGACCTGTGCTACCCCGGCGACTTCAACGACTACGAGGAGCT
GAAGCACCTGCTGAGCCGCACCAACCACTTCGAGAAGATCCAGATCATCCCCAAGAGCAGCTGGAGCAAC
CACGACGCCAGCAGCGGCGTGAGCAGCGCCTGCCCCTACCACGGCCGCAGCAGCTTCTTCCGCAACGTGG
TGTGGCTGATCAAGAAGAACAGCGCCTACCCCACCATCAAGCGCAGCTACAACAACACCAACCAGGAGGA
CCTGCTGGTGCTGTGGGGCATCCACCACCCCAACGACGCCGCCGAGCAGACCAAGCTGTACCAGAACCCC
ACCACCTACATCAGCGTGGGCACCAGCACCCTGAACCAGCGCCTGGTGCCCGAGATCGCCACCCGCCCCA
AGGTGAACGGGCAGAGCGGCCGCATGGAGTTCTTCTGGACCATCCTGAAGCCCAACGACGCCATCAACTT
CGAGAGCAACGGCAACTTCATCGCCCCCGAGTACGCCTACAAGATCGTGAAGAAGGGCGACAGCGCCATC
ATGAAGAGCGAGCTGGAGTACGGCAACTGCAACACCAAGTGCCAGACCCCCATGGGCGCCATCAACAGCA
GCATGCCCTTCCACAACATCCACCCCCTGACCATCGGCGAGTGCCCCAAGTACGTGAAGAGCAACCGCCT
GGTGCTGGCCACCGGCCTGCGCAACACCCCCCAGCGCGAGCGCCGCCGCAAGAAGCGCGGCCTGTTCGGC
GCCATCGCCGGCTTCATCGAGGGCGGCTGGCAGGGCATGGTGGACGGCTGGTACGGGTACCACCACAGCA
ACGAGCAGGGCAGCGGCTACGCCGCCGACAAGGAGAGCACCCAGAAGGCCATCGACGGCGTGACCAACAA
GGTGAACAGCATCATCGACAAGATGAACACCCAGTTCGAGGCCGTGGGCCGCGAGTTCAACAACCTGGAG
CGCCGCATCGAGAACCTGAACAAGCAGATGGAGGACGGCTTCCTGGACGTGTGGACCTACAACGCCGAGC
TGCTGGTGCTGATGGAGAACGAGCGCACCCTGGACTTCCACGACAGCAACGTGAAGAACCTGTACGACAA
GGTGCGCCTGCAGCTGCGCGACAACGCCAAGGAGCTGGGCAACGGCTGCTTCGAGTTCTACCACAAGTGC
GACAACGAGTGCATGGAGAGCGTGAAGAACGGCACCTACGACTACCCCCAGTACAGCGAGGAGGCCCGCC
TGAACCGCGAGGAGATCAGCGGCGTGAAGCTGGAGAGCATGGGCACCTACCAGATCCTGAGCATCTACAG
CACCGTGGCCAGCAGCCTGGCCCTGGCCATCATGGTGGCCGGCCTGAGCCTGTGGATGTGCAGCAACGGC
AGCCTGCAGTGCCGCATCTGCATCTAA
|
| Protein Information |
|
Protein Name |
codon-optimized hemagglutinin |
|
NCBI Protein GI |
89994010
|
|
Protein Accession |
ABD83813.1 |
|
Protein pI |
6.56 |
|
Protein Weight |
61265.95 |
|
Protein Length |
642 |
|
Protein Note |
derived from influenza A virus strain A/goose/Guangdong/1/96(H5N1); GSGD96 |
|
Protein Sequence |
>ABD83813.1 codon-optimized hemagglutinin [synthetic construct]
MEKIVLLLAIVSLVKSDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLNGVKPLILRD
CSVAGWLLGNPMCDEFINVPEWSYIVEKASPANDLCYPGDFNDYEELKHLLSRTNHFEKIQIIPKSSWSN
HDASSGVSSACPYHGRSSFFRNVVWLIKKNSAYPTIKRSYNNTNQEDLLVLWGIHHPNDAAEQTKLYQNP
TTYISVGTSTLNQRLVPEIATRPKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSAI
MKSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRNTPQRERRRKKRGLFG
AIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKESTQKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLE
RRIENLNKQMEDGFLDVWTYNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKC
DNECMESVKNGTYDYPQYSEEARLNREEISGVKLESMGTYQILSIYSTVASSLALAIMVAGLSLWMCSNG
SLQCRICI
|
| Epitope Information |
| IEDB Linear Epitope |
|
| IEDB ID |
Epitope |
Starting position |
Ending position |
| 20838 |
GLFGAIAGFIE |
347 |
358 |
| 538659 |
FGAIAGFIEG |
349 |
359 |
| 97578 |
QKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNK |
388 |
429 |
| 97413 |
KCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRN |
293 |
336 |
| 97700 |
TNQEDLLVLWGIHHPNDAAEQT |
183 |
205 |
| 97441 |
LFGAIAGFIEGGWQGMVDGWYG |
348 |
370 |
| 97376 |
GYHHSNEQGSGYAADKESTQKAIDGVTNK |
369 |
398 |
| 97348 |
GLFGAIAGFIEGGW |
347 |
361 |
| 97301 |
FFWTILKP |
244 |
252 |
| 20836 |
GLFGAIAGF |
347 |
356 |
| 97771 |
WYGYHHSN |
367 |
375 |
| 102130 |
TKLYQNPTTYISVGT |
204 |
219 |
| 97231 |
CNTKCQTP |
290 |
298 |
| 224827 |
KPNDAINF |
250 |
258 |
| 138194 |
IIPKSSWS |
132 |
140 |
| 167382 |
RERRRKKRGLFGAIA |
339 |
354 |
| 148521 |
CNTKCQTPMGAINSS |
290 |
305 |
| 163243 |
RKKRGLFGAIAGFIE |
343 |
358 |
| 162644 |
KESTQKAIDGVTNKVNS |
384 |
401 |
| 243067 |
QEDLLVLWGIH |
185 |
196 |
| 133590 |
GMVDGWYG |
362 |
370 |
| 133748 |
YNAELLV |
440 |
447 |
| 167383 |
RERRRKKRGLFGAIAGFI |
339 |
357 |
| 177300 |
STQKAIDGVTNKVNSIIDK |
386 |
405 |
| 176979 |
CYPGDFNDYEELK |
106 |
119 |
| 177093 |
IATRPKVNGQSGRM |
229 |
243 |
| 243602 |
VKPLI |
63 |
68 |
| 243006 |
NGNFIAPEYAYKI |
260 |
273 |
| 243849 |
WLLGNP |
76 |
82 |
| 745433 |
WYGYHH |
367 |
373 |
| 745428 |
WGIHH |
192 |
197 |
| 29690 |
IYSTVASSL |
535 |
544 |
| 221495 |
WMCSNGSLQCRICI |
555 |
569 |
| 738340 |
GLFGAIAGFIEGGWQGMVDGWYGYHHSN |
347 |
375 |
| 837494 |
HSNEQGSGYAADKE |
372 |
386 |
| 837985 |
MENERTLDFHDSNV |
448 |
462 |
| 840897 |
LVLWGIHHP |
189 |
198 |
| 840970 |
WSYIVE |
92 |
98 |
| 840943 |
SVKNGTYD |
497 |
505 |
| 177340 |
VDGWYGYHHSNEQGSGYA |
364 |
382 |
| 177236 |
RKKRGLFGAIAGFIEGGW |
343 |
361 |
| 923970 |
YAADKESTQKAIDGVTNKVN |
380 |
400 |
|
|
|
MEKIVLLLAIVSLVKSDQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLNGVKPLILRDCSVAGWLLGNPMCDEFINVPEWSYIVEKASPANDLCYPGDFNDYEELKHLLSRTNHFEKIQIIPKSSWSNHDASSGVSSACPYHGRSSFFRNVVWLIKKNSAYPTIKRSYNNTNQEDLLVLWGIHHPNDAAEQTKLYQNPTTYISVGTSTLNQRLVPEIATRPKVNGQSGRMEFFWTILKPNDAINFESNGNFIAPEYAYKIVKKGDSAIMKSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIGECPKYVKSNRLVLATGLRNTPQRERRRKKRGLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKESTQKAIDGVTNKVNSIIDKMNTQFEAVGREFNNLERRIENLNKQMEDGFLDVWTYNAELLVLMENERTLDFHDSNVKNLYDKVRLQLRDNAKELGNGCFEFYHKCDNECMESVKNGTYDYPQYSEEARLNREEISGVKLESMGTYQILSIYSTVASSLALAIMVAGLSLWMCSNGSLQCRICI
|