ST6GALNAC4

General Information
Protegen ID: 2231
Sequence Strain (Species/Organism): Homo sapiens
Taxonomy ID: 9606
Molecule Role: Protective antigen
Related Vaccine(s): YF-Vax
References  
Gene Information
Gene Name: ST6GALNAC4
NCBI Gene ID: 27090
GenBank Accession: AF162789
Chromosome No.: 9
Locus Tag: RP11-203J24.4
Gene Starting Position: 127907885
Gene Ending Position: 127917051
DNA Sequence
>NC_000009.12:127907885-127917051 Homo sapiens chromosome 9, GRCh38.p12 Primary Assembly
ATCCAGAAAACACACGTTTACCCGTACGTTCCTGATACGGCCCCGGCAGTCACAACTCCAAGGCCCCTTC
ACCAATGAACCCCAGGCTCTCGGCTGCCCCAACCAGGGCGGCCAGCCTGAGAAGTGTGGCACCAAAAGGT
GGTAGGAGCGGCTGGGGAGGGAGGACCAGGACTGGCACAAGGTCATGATGCTGGTGAGTTGGGGGCCATC
AGCCCCAGGGACAAAGGCCACTTGACAGACCTGGAGGTGGGGTGAGCCCCCAAATGCCGTGAATTACTCA
GGGAGTGGAGGGGGGAGACACGGCTCCCCCCTCCACTCCCCTTCAAGTCATGAGGCCTGAGATGGCTCCA
AGTGTCGCCAGAATGGGGCGGGAAGGAACCTTAGGTCCACCCAGCACGTGGCAGGAGGCAGGATCCATAA
ATTAAATGTTTTTGTGGAGTGTGATGGCTTGGGATGGGACATCCCGGAGGCCTCGCAACGGCATGGCGAC
TGGCAGGACGACGGAAGCTACTCAGTCCTCCAGGACGGATGGGCGAACACGATGGGCCTCTTCTTGGCCC
AGCGGGAGAAGACCGCCTTCTCAGTGATGAAGCGGTGGGCGCTTCGGGGCGCCTGCTCGTGTGCCAGGTA
CATCTGACACTCATCTAGCCGGCCCTTCTCAAAGTAGTGGTAAGGCACTGAGGGGTGGCTCTTCTCCCTG
CGGGGTGGGGAACAGGGCTGGGGTGGGACAGAACCCACTCGCCTCCGGCCACACCCTGCCGCCTACCTTG
ACCACCCCTCGCCAGGCCCATGGGCTCGTGAAGACCCACGCCTGATCCCTGGGGAGGCCAAGACACATAG
GGGAGTAGCAGATCCCGAGAGCCAGGGTTCGAGTCCTGGCTCTGCCTCTTGCAAGCTGGATGACTCTGGG
CCAGAGATGGACGCTGTGAGCCTCAAGAGCCCGTGTGTAAAACGCAGATGGTCACATCCGCCTTCCAGAA
CAAGTGCGTAGATCAAGAGATGGAGACTGCAGGCACCCTGCTAGTGGTCCTGGGCTCCTGGTGAGGGAGG
AGCCCCTCATTCCCATGTCACAGAGCCACCAGCCCTGCACGCCCACCTCCCCATCAGCATCACGGGGGCC
AGCCGTATGGCCAGGGCCATGGCTGGGGCACAGGGACACCTGTCACCTCGGTGACCTGTTGTGAACTCGT
TAGTTTCCATAATATTGAAATATAATAATATAGGCTGGTGCGGTGCTCACACCTGTAATCCCTGATCCCT
GGGGAGGCCAAGGCGGGAGGATCACCTGAGGTCAGGAGTTCCAGACCAGCCTGGCCAACATGATGAAACC
CCGTCTCTACTAAAACTACAAAAAATTAGCTGGGCGTGGTGGCACGTGCCTGTAATCCCAGCGACTAGGG
AGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGTGGAGGTTGCAGTGAGCCAAGATCGCGCCACTGCA
CTCCAGCCTGGGCAACAAGAGCAAAACTCCATCTCAAAAAAAAAAAAAGAAATATAATAATATAACAATG
GTACAAAATCATACCATAAACATATCTGAACTTGATATCATGCTGGTATTATAATATCAAATATTATATA
ATATCAAAAATCACATATAATATCAAAAATTAAAACTTATATAAAACATATCACTGATATATATATATAT
ATATATTTAGGCCCTATATATATAGGCCCTGACTGTGGTAAGCACTTTTGTAGATTATTCCCTCCCTAAA
TGTTTCAACAACCTTCCAAGTAAGATGTGGTTCTTTTCTCTATTTTCCACATAAGGAAACAGAGGGTCTG
CGGGGTGGCATGACTGGCCCGTGAACAGCCTGTGACCACAACAGCCACACTGGGGTTGGTAAGGCAGCCT
GTTTGACACTGAGCTCTTTCCCAAGGCCCAGAGCGCAAGGGGTATCTGTGCTATGGTCACCCACCATGAA
GCCCACACTCCCTCTGCCCCCACAGCTCCCAAATCCTCTGGTTTAAGGCCTAGGAGGGCTCACAGTCCTC
CCCGCGTCCCCGCGTGCCCGGCCTGGCCGGTCTGACCTGCAGTAGCTGTCGCTGACCATCCCATAGACCA
CGATCTCCTCACACAGCTCCAGCGCGAGGATCATGGTGAACCAGCCGGTGCTGAGGAAGGAGCCCGACTG
CCTCCTGGGGGTGGCGGGGGGACAGGGGGACGCTGTCAACAAGAGGCCATGTGGTAGCTCCAGGCCCCAG
CAGCACTTGGTACTTTGCCAGCCCGGGGCTATGCACCTCAACCTCTTGCGGGGGGCTCTGGGGGAGGCTG
GGGAGGGGACAGAGCCAGCCCAAGGCCAGCAGTGGGTGACAGGCAGAGCGGCACACAAACCCAGGCCTGA
TCACCTCCTCAGCACCCAGCGGGGCTCGGGCCCAGCTACGCTTTGCAGGGTTTGAGCAAACCCCTGGATG
ATGGGTGTGGGGGCTCTGGCCACGTCCTCTCCTCATCTAAGGGACCAGGCCTGGGGAGACAGCATGAGGC
AGCAGGGCACCGTGGTTTGAAGTCAGGTGGGGGTTCTCTAGAGTCCAAGGATCTTCCACTTGAACCCTGG
CTCTGCTATGGGCATCACATTTGCTCGTGAACCTCTCTGAGCCTCATGACCAGTCTGTGAAGTGGAGATA
ATTTTAGGGGGTTATAGGGAGGGAACACTGTCAGCCCAGCCCCTGCCCGCATGCCCCCATGCCCCACTCA
GAGCAAGGGGCTAGGTGGTGGCCCCAGCCTCAGCTCCCTGGAGGCAGCAGCCCTGGGTACTCTTGCAGCC
AAGGGCCACCCCTCTCAACCTCTACAGAGCGGGTATGAACTGGTTCATTCATTCAAACTGAGCCCCACCC
AGGCCCTGTGCTGGCCCAGAACCTACCTCATCTGGTGCTAACATTCCACGGAAGCACCAGATGGGAAGGG
AAGGAGAAAAATAGATAAACAAGTAGCTAGGAGACAAATCTCTAAGCAAGATGATGGCAGAGGGTGACAC
CTGCTGCCAAGACAAGGAAACAAAGGAATGGATAAAGAATGCCAGGGCGGGAGGCACTCCTTCTGATGGG
GTGGTCAGGGCGGGCTTCCCTGAGGAGGTGACCTGAGTAACAAGAGGAGCCTGTCATGGGAAGATCTGGG
AGAAGAACATTCCAGGGAGAGGGAACAGCAGGTGCAAAGTCCTGCAGGTAGAAATAGGCATGGATGTCTG
AGCCCCAGACGGCCAGTATGGGTAGAACAGGGAGACTGAAGGAGACACAAAGTCACAGAGGCAGGCAGAG
GCAGGATCACGCAGGGCAGGCAGGCCAGGAGTGGTGGGGGCGGGAAGCCTAAGTCAAGGTGGCAGCCATG
GGCCATTGTACGTGGCAGATGATGATACGAGAGGAATCAGAGGCACAGCCCAATTTTCTGGGGGATACAC
CAGAGGCTGACTGTTCTATCTCCTGAGCACAGAGGCCAGTGGAGAGCAGGTGTGTGGAGGAGGGGCTGGC
AAGAGTCATGAGTGTGCTTTTGGATATGACATAAAATGAAACCTTGAATGCGGAGTGCTTCACCCAATGC
CTGGTGAGCAAAGAGGGCTGTTATTGCTCTGTTTTGTTTTGCTTTTTGAGACAGAGTCTCGCTCTGTCAC
CCAGGCTGGAGTGCAGTGGTGCGATCTTGGCTCACTGCAACCTCTGCCTCCAGGTTCAAACGATTCTCCT
GCCTCAGCTACTCTCCTGAGTAGCTAGAATTACAGGCATGCGCCACCACACCCGGCTAATTTTTGTATTT
TTAGTACAGATGGGGTTTCACCATGTTGGCCAGGGTGGTCTTGATCTCTTGACCTCGTGATCCTTCCACC
TTGGCCTCCCAAAGTGCTCGGATTACAGGCGTGAGCCACCGGGCCTAGCCCTAATTTTTGTATTTTTGGT
AGAGCCCAAGGTTTCACCATGTCAGCCAGGCTGGCCTCAAACTCCTGACCTCAAGTGATCCTCCCGCCTT
GGCCTCCCAAAATGCTGGGATTACAGGCGTGAGCCACCGCACCCGGCCTCTAACTTTTATTATCCAGAAG
GGATATCAGAGACCAAGTTAGACAGACCATTCCCTCCAGAGAGAAGTGCTCCCTCATGCCCCTACACACT
GGCTAGTGCTTCCCATCAGGTGCTCAAAAGGCCTGACAGCCCTCAACACCATCTGGGATTCTAGATCTGT
GTGACAGTTTCCTGAGTGACATCCCCTCTGCCTCCTGCCCCCCATTAGAGGGCAGGACCATGTCTGCCTC
CATGAGTGCTGACCTCCCGGGCCTGACAGAGCAGCCCCTGATGGACAAGAGACTCCCAGAGAAGGAAGGC
CCCAGCTGCCAGAGAGACCCCAGCAGGCAGGCCCCAGGCTCACCGGTTCTTGCCCGTCTCGTCCTGGAAG
ATCTGGTCGCAGTAGGCCATCATGCGCTCCGTGAAGGTGTACACCTGCAGGCCGGGGTACATCCTGGTGA
GCTGCAGCAGCGTGCGGTAGGTGCGGCCGCCGAGCACCCGGTCCATGTGCCTGCCCTGGCCCCACACCAT
GTAGAGCGTGTCTCGGGCCTTCTGGAAGTAGTGTGAATAGTTGCGCAGCAGCAGCGGCACGCTTGTGTGT
GAGACGACACGCAGGGTGCTGCGCTGGCCCACATCCGCCTCAAAGCCCACGGTGGGCGCCTGGTTCATGC
GGAACACGCACTCGGCACTGTCGATCTCAGCACCCAGGCCTGAGCCCAGCATTTGGCCGGAGCTGGACAC
CACGGCACAGCTGCGGCAGGGCTCGCGGACCAGCGGCTGCAGGGCAGGCAGGGAGAAAGAGACAGAGAGG
CATGAACACGCAGCTTACACCCCCTGCAGCTCCCCCTGGAATATCCACCAATCTTGCTTGATCAGGTATC
GGCTTGAAGGACTGTTATGAACCCATTTTGCAGGCTGGGCAGCAGGGATGACTTTCCCGAGATCACAGTG
ATACCCAGTGTCAGAGCTAAGGTTCTGCACTCAGCCCCCACTTCCTCATCAGCCATATTTGTGGTCCTTA
TTCCTGTTGCTGCCTCTGAGGCCCTGTCTCTACGTAATGAGTCCTTATGAGGTTTGCTTGAGCAAATAGT
TATAAAACACTTAGAACAGTGGCTGGCGCAGACTGGAGGTCATATAAGTATTTGTCAAGTAAACATGAAC
CTCAGCTCTTCAGAATCATTCTTAGGCCACTCCCTCCCCTGCACATGCCTCACCCACACTGTGCCCAGTT
TCTAAGTAAGAGCAGGGTCATCAGCGCCCAGGACAGCTGTGGGACTCCCTGGTCCTGAGCCTGAATCCTG
GCTCTGCCCTCTCCTAGCTATGTGGCCTGAAGCCAAGTCACCGGCCTCTGTGCGCCTCAAGTCTCTCATC
TATAAAATGGGGAAGAAAACAACATCTGGGCCGGGAGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGG
GAGGCCAAGGTGGGTGAATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGTCAACATGGCAAAACCCCA
TCTCTACTAAAAACACAAAAATTAGCTGGGCGTGGTAGCACACGCCTGTAATCCCAGCTACTCAGGAGAC
TGAGGCAGAAGAATCCCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCGAGATTGCGCCACTGTACTCC
AGCCTGTTGACAGAGTGAGCTCTTGTCTAAAATAAATAAACAAATAAAAATATAAATACAAAAAAAAGAA
AAAGAAAAAGAAAACAACATCTGGCCGGGCACGGTGGCTCACACCTGTAATCCCAGCACTCTGGGAGGCC
AAGGTGGGTGGATCACTTGAGATCAGGAGTTCAAGACCTGCCTAGCCAATATGGTGAAACCCCATCTCTA
AAAATTAACCAGGTGTGGTGGCATGAGCCTGTAGTCCCAGGGCAGGGCTAGGGCAGGCACATGGACGGGG
TGCTCGGCGGCCACACCTACTGCACACTGCTGCAGCTCACCAGGATGTACCCCGGCCTGCAGGTGTACAC
CTTCACCGAGCGCATGATGGCCTACTGTGACCAGATCTTCCAGGACGAGACAGGCAAGAACCGGTGGCGG
GAGGCTGAAGCATGAGAATCACTTGAATCCGGCAGGTGGAGGTTGCAGTGAGCTGAGATCGCACTACTGC
ACTCCAGCCTGGGTGACAAAGCTAGACTCTCTCTCTCTCAAACAAACAAACAAAAAAAAACATTTCTTGG
GCGGGCGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGAAAGGCCAAGGCAGGCAGATCACCTGAGGTT
GGGAGTTCGAGACCAGCCTGACCAACATGGAGAAACCCCATCCCTACTAAAAATACAAAATTAGCCAGGT
GTGGTGGTACATGCCTATAATCCCAGCTACTCGGGAGGGTGAGGCAGGAGAATCGCTTGAACCTGGGAGG
CAGAGGTTGCGTGAGCTGAGATCATGCCATTGCACTCCAGCCTGGGCAACAAGAGCAAAACTCCATCTCA
AAAAAAAAAAAAAAAATCTCTTTTTAGAATCTAGGGCTCAGAGCCGAGATCGCGCCACTGCACTCTAGCC
TGGGAGAAACAGAAAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAACCTAGGGCTCAG
AGAGGTGATGTGATTGGCCAGGATCACATGGCTGGTGAGTAGAGGGGCCAAGCTCAAGGTCCGGTTTGGG
ACAAGCCCAGCTCTCACTACCCACCCCGGATGCATTGCCTGGCCCACTCACCTTCCCATCTGGCACACTG
CTATATCCACTGAAGTGCAGGGGTCCCGGCACAGTGGGCCTGGAGCCTGTGGGGAAGTGGTGGTCCAGGC
AGGTGGCCAGGCAGAGGGGCAGGCCGGCCCAGCAGCACAGGAGGATGTAGACGGCAGAGAAGACCACGGA
GCACAGGATGATGAGCACGAGCCGACCCTGAGGGAGACAGTGGCCATGGGGGGGTGTATGTGGGGAGGGG
CTCCCAGAAGCCCCTTCAGCGCTGCACCTGGCCTCCTGGGTCTAGAGAAGCTCCTGGGTCTAGAGCAGCT
CCTAGAACAGCAGCAGAATCCAGTCTCCATCCACTGGGCCTTACCTGGGGATGCCAAGCTCTGGGCACAT
AAGGCCCCGGAGATAACACTGTTATCATCCCCACATTCTAGATAAGGAAAGCAGTTTGGAGTGAAGTGTG
TAGTACAAGGCCATACAGCAAAGAAGCAGCACAGTGGGGCCTTAAATACTGGCTGGAAAGCCCATATTCC
TAACTGGAGGCTAGGGTGTACTGTGCGACCTATGGCTCTCCATCCCTCTGGGCCTGTTTCCTCCTCTACA
AACTGGGGACTGATAATGCCCACCGTGTGAACTTACGGGAGGGGGGCAGTGAAAGCCTTCTGTATACTGT
AAAGTGCTGTGCACATGACAACGGGAAAGTGGTGCCTCAGAAGGGCCACCAGCCCCTTCAGCCTAGGACC
AGGGCTGCCTAGTGTGGCCCCGCCTCCTCCCCACCCCCAGCTGACTCAGTGGCTGTCCCCTGGTGATCCG
GATCTGGGTCAGCAGGCCAGGTGGCACAAGGAGATTGGAGCAAAGGTCACTCAAGAAATGGTTTTAATAA
TTGAAAAGCTGGTGCCTGGAGGCTACCGCCTCCAGAAACCATGTGCCAAAGGCCATTGGAACCAGGAAGC
TCTTCTAGCCATCTGGCCTAGACCCAGCCTGGGGCTGGTCCTGCTGGCAGACTTAGTGAATGCCCCTACC
ACCACCAAGCCTCCCAGGAGGGGAAGGGCTTGGTGGCAGCAGAACTGTGGGCTAGTCACTGGGTAGTGGG
CCACCCTGGTCACCACGTGTGTCTCTCTGGGCTCCAGGTTTCTAGACTCGAGGCACAAGTCCCTAGGCCA
CCCTTCAGAAGCCACCAAGCAGTTGGCTGGTGCTCTCCAAAAATCAGAAGTGGGAAAGTGGCCCCCACCC
CCTACCCCATCACCCCACCTGCCCCGGCGCATCCTGCCTGAGGCCCACCATTTTCTGCCCGCAAGCTACG
TCAGCTTCTAAAAATAAGTCTTTTCTGTGTGGCCAGCTGTGCAGGGGAGGGAGTTCTGCCTGGCACCTGC
TCCCCTGCCCATGAGGAACCAGAGACCTTGACTCAGGCAGTCGAGATGAACTGCCCCCCACATCCAGCTC
TCTCTCTCACAGCCCGGGGCTTCTGCAGTAAAGACAAGAGCGGCTGAACTGAACTTCACAACCCCAGTTC
CACGTAGGCTTTGCCTGGTGAGCTCGTAGCGTTACCTCCGTTTTACAGGGATAGCAAGGTACTGACACTG
CAGAGTGGATGCCACCTGCCTGGGGCCACACGGTCTGCAAGAACAGGGTTGGACTTTGAGGCCATTCTGC
CCTTCTCTGCTCAGCAGTAGGGAGGTCAGAAGTGGCAGTGGGTCCTGGGGTGGAGAGGCTGCTGAGCTGC
TCCGCCAGTCGGGGCCTCTGCGTTCCCTGGAAGTCCTCCTTCTTCCCACTTACCGGAGCCTTCATGCTGT
CGCTGTCCCTCAGTAGTCTAGGGGCTGCTGTCTCCAGGGCTGGGGCTGGGAAGGGGTGGGAGCCGGGCAC
CTGCCAAGACCCAGAAACTCAGAGCCGGGAGGGGTAAGGCAGGTGGGGTTCTAAGCTATAGGGAAAACTG
AGGCAGGAGAGGGTACAGGCCGGGGTGAAAGCACATGGCACCCGGGAATCCAACCCCTTGACGTCTCCAA
ACACCGGAAGGCAATGTCCCCATCCTAAAGGAGCAGGAATCAGCGACGTTTGCGAGACCTCCTGCCAACC
CAATTCCCAGTGCGCAGATGGGGAGGAAGAGGCAGCGAGGAGGCGCCCCCAGCTCAAGGTCACCCATCAG
GTCTGGGGCAGAGAGAGCCAGAAGCCCGGAATTCCACCCCATCCATCCTACCTCTCTACCCGGGGGACGA
ATGGAGAGGCCGGGCGAGGGCAGGACGTAGGTTTTTCACCGCCCTTCCAGATTCCACTGCCGCATCTCCC
GGCCGAATGCTAACCGGGCGTGCAGGCTGGGGTTCGCAGGAGGCTCGCGATCCGCCGCTCGGAGCTCCAT
GGCCCCCGGGCCGGGCCGACTGGGCTGGAAGCTGATCCGCGCGGCCAGAGGCAGGGGGCGGGGCCGG
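
The FASTA header of the DNA sequence encodes the same start and end positions as the gene fields, so the expected sequence length can be derived from it. The following is a minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive (the usual convention for NCBI region identifiers). The function names `parse_region_header`, `region_length`, and `sequence_length` are illustrative and not part of Protegen or any NCBI tool.

```python
import re

# Header copied from the DNA Sequence field of this record.
HEADER = (">NC_000009.12:127907885-127917051 "
          "Homo sapiens chromosome 9, GRCh38.p12 Primary Assembly")


def parse_region_header(header):
    """Split a header of the form '>ACCESSION:START-END description'."""
    match = re.match(r">(\S+?):(\d+)-(\d+)\s*(.*)", header)
    if match is None:
        raise ValueError(f"not a region-style FASTA header: {header!r}")
    accession, start, end, description = match.groups()
    return accession, int(start), int(end), description


def region_length(start, end):
    """Length of a closed interval [start, end] (assumes 1-based, inclusive)."""
    return end - start + 1


def sequence_length(fasta_text):
    """Count residues in a single-record FASTA string, skipping the header line."""
    lines = fasta_text.strip().splitlines()
    return sum(len(line.strip()) for line in lines[1:])


accession, start, end, description = parse_region_header(HEADER)
print(accession, region_length(start, end))  # prints: NC_000009.12 9167
```

Passing the full FASTA text of the record to `sequence_length` and comparing the result with `region_length(start, end)` gives a quick integrity check on a downloaded copy.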

Protein Information
Protein Name: transcript variant X2
NCBI Protein GI: 28373092
Protein Accession: NP_778204
Protein pI: 8.65
Protein Weight: 31819.16 Da
Protein Length: 302 aa
Protein Note: Also known as IV; SIAT3C; SIAT7D; SIAT3-C; SIAT7-D; ST6GalNAc; ST6GALNACIV
Protein Sequence
>NP_778204.1 alpha-N-acetyl-neuraminyl-2,3-beta-galactosyl-1,3-N-acetyl-galactosaminide alpha-2,6-sialyltransferase isoform a [Homo sapiens]
MKAPGRLVLIILCSVVFSAVYILLCCWAGLPLCLATCLDHHFPTGSRPTVPGPLHFSGYSSVPDGKPLVR
EPCRSCAVVSSSGQMLGSGLGAEIDSAECVFRMNQAPTVGFEADVGQRSTLRVVSHTSVPLLLRNYSHYF
QKARDTLYMVWGQGRHMDRVLGGRTYRTLLQLTRMYPGLQVYTFTERMMAYCDQIFQDETGKNRRQSGSF
LSTGWFTMILALELCEEIVVYGMVSDSYCREKSHPSVPYHYFEKGRLDECQMYLAHEQAPRSAHRFITEK
AVFSRWAKKRPIVFAHPSWRTE

Epitope Information
IEDB Linear Epitope