THYN1

General Information
Protegen ID 3576
Sequence Strain (Species/Organism) Homo sapiens
Taxonomy ID 9606
Molecule Role Protective antigen
Related Vaccine(s) FluMist
References  
Gene Information
Gene Name THYN1
NCBI Gene ID 29087
Genbank Accession AP000859
Chromosome No 11
Gene Starting Position 134248264
Gene Ending Position 134253406
DNA Sequence
>NC_000011.10:134248264-134253406 Homo sapiens chromosome 11, GRCh38.p12 Primary Assembly
GTTTTAGAAGAAAAGTTCTTCAACTTTGATATTTTATTGAAAAAAGTACACAAAAACCACTGAGGTACAC
AAGCCCAGGTAAGCATACCAAGCAAGCCCCCTCACACCTTTTGTCTGAAAAAAGCTTGACTTCTTTGCAG
CAATGTCTCGCCCATTCCAGCAGCAGTATCTCAGTTAACTTGGTTCCTTTTCCTCCAGGCTCAAAACAAA
ATCAAACTCTTCTACAAAAAAAAAAGGGAAAAAGGAAGGATTAGTTTTTAATGTCCCCTCTTGAACAGGA
AAGTTGTGCCAGGCACAGGTACATCTTACATGGTACTAAGGCGAGCAGTCATTCGTTCAAAACTATTTAC
CAGGTGCCAAGCACTATAGAGTACTGGGTAATAGTAACAGGTAACTCTGGCCACATTCCCTCCCTCATGG
GTGTTACAGGTGAGGATGCTGACTGAGAAAGGTGAAGTACCATGCCCAAGTTTAATCAGCTAGTAAGTGG
GAATTCATGAGGTCATTCTGGTATATGGACAATAAAGGAGAGGGCCCACTCTTACCCTGGGTCAGGGGCT
GGATTGATAATCTCTGGCGAGTGAAGAGAACCATATTTTTTAAGGGGCCACCAGTAGCTTTGTGAGCTTG
ATGATAGGATTTGAGCTCAGCCAGGGGAATGAAACGTTTCATCATCCGAACAAACTGTACATCCACCTAT
GTGACAACAGTGTACATGACAGAAAGACGTGTCTAGGCTGACTGTGCCCACTGTATTTTTAACCGCCCAC
AGGAAGCTCCTATCTCAGTATCACCTCATCTGAGTAAGTGCTGGGCCGTGTACCTTGACCTACAGTCACC
CCAACCGTCTTCTTCCCCTGGTTCATCCTTCCCCTGTCATGAAATAGAAAGCATAGTCTTTACCATGGAC
CACTTAGGGTTGTCCTCTTTGCTAGATGGGTCATAATGGGGATTGTTTTTCTCAAACTGTGTGTGGTCTG
GGTAAGCCTCTTTCACGATCTGAAACCAAAACAAAAGCTTTGAATCATCTGCCTGCAGTCAGCTCCACAG
CCAATCTTTTAATCCATATCCAACTTTCATTTCTTCCCTGCACATTCTTAACCCATCTGCCTTGCTGTAC
AGCCAATGGGGGCTCAGATATTTGGAAACAATATATAGTCTAATATATAGACTAGGAAATCAACTCTGTT
TTTACAAAAGTCTTAAAAACATGCTCTAAAAACAAATGGCAAAGGTTATTACTGGTACCACTGTTTTTTT
CCCCTCTGCACAGTTAATAATTTTATCCTGATAAGCTTGGAGGAGCTATCATCAGGCCTGTTTTCGAGTG
AGATCAAGACTTAGAGGTGGAGTGAAGTCTGTGCTAGTCGGACCTGAATGAAGGCCTGCTTTGAGTTATC
AGATAGGAAGGGGCCAGGGCAGTTTTTCCCTATTCCTGTAAAAGGGGGATGGGGTGGGGGGGAGTGTACT
TTTTTGTTGCCTAATTAAGGTATATCCCTTTATATCTACTTAAGTTAGTCTCCCTGGAGGAAAAGGGATT
CACACCAAAACCTGCCCAACTAACCTTCATGAGTCCTGCGATGCCTGGCTCTTTGCAGTTGCTATGGTAG
AAGAAGGCTTCTTCTCCCAGCTTCATGGCTCTAAGGAAGTTCCGAGCCTGTCCCGGGAGAAGAAAGAGTA
ACTATCCTCCCACCAGCTGGAAAGTACTAGCTTTTAGCAGAAATCAAGTCTGCTTTTAATAGATGGGTGA
TTCTTGTCTGCAACTCCTCAATTCATTAGAAAGTCCTTCCCCTTTGCCAGGACACCCAGACTTTTACCAG
GACATTGGCCAGAACACAGGCCAATTGAAAATTTCATGCAGAGTACTTAAGGCATTTTTTAATACATTAT
CAGGTTTGCTTCTGAGAAAAAGCAACCTTGGCCAAGAGGTTCTGCTACTGAGTACGTCATGAGGAGGAGG
CAGGCATTCATCCCACCTGGGTCTCAGGCGACCTCCTCCTTACCCTCTTACCTGGTAGTTACGAACACCA
TCCCAGCATGTTGTCTGTTTGGGCTGTGCTTTGAGATCCTCAATGCTGAACTAGGCAAGAGGAAATAAAT
TCAATCATTAAACAATCCTTGGCCAGTTAGTAAGCACATATGGAAGCAGACAGTTCCTTCCCCTAAAATG
AGAGGGGCCACTGGCTATAGAGGAAGTATGTAGTTCACCGGGAAGTGGCTCCACAGAGACACTTTTGCTG
GGTGAATGTAAGTAAGCCTTGTGTTACCTTGTTATCCTTGCTAATTATGAGGATTAGCTGAATGAATAAT
CTAGGTGATAAAGTAATCTGCTAGTTACTTGCATTTTACTTTATATTTATAATAGTTCTGCTTATATTTT
TCCCAGGCTATATTAAATTCCCTGAAATATACATCAACTGTTGGCAGGGAGACACCTAGGAGCATGGCAG
GTGTTCAGAGAAAGGTAACCTAAGATTGAGGTTGGCAGACTATAGCCTTCAGGACAAATCTAACCCCATG
CTGGATTTTGTGAATAAAGTTGAACTTGAACACAGCCACAGCTGTTTACTTATTATCTGTGGCCACTCTC
TTGCTACAAGGGCAGGGTTGCAACAGAGACTGTAACACGCACAAGCCTACAAAATTTACTATCTGGCCCT
TTGTGGCAAAGGTTAGCCAACCCCTGCCCTAGATACACAGGTTCACAAACCAGCTGATGATAATAAACCC
TGCCGATTTTATTCATTTTTACCTTTTTCCTTCCCTTCAAAGATAAGAAGCTGGGGATGGAACCCTATGG
TGTCCCATATTCACTTCTCATTTAAAAAACACCATGAAGGGACTGGAGACAGGTCAAGGGTCTCACCTTC
ACATCTACACCTTTCTCTAGGCGGCTCTCTGGCTCTGACTTCATCAGCCAGTGGCTGCTTAGATTCTTCA
AACAGTTTTTAGTGGCTGAAGTCTTCTGAGGGTTGGAGTCCTCCACTTTAGCTAATGCCTCACCTGAGTT
CTCAGTTTTGGTGCGTTTTCCTGATAGTCCCTTGTCTGCAATAGGAGAAGCAAAAAGAAAAGATCATGCT
TGTTAAAGGCAATGTTTCATAATGGAAAAAGCTTAGCTTTGGAGTCAGACCTGGATTCAAATCATGGATC
CATTACCAGCTGTAGGAACTTAATCTTACTGTTAAGAAGCTTTCCTTATCTATAAAAATTAGACAAGAAT
ACCTACCTTACAGGGTATCTTCTGGATTATCTATTCAATAGGGTAAGAACACCTTACAGGGTTGTTCAGG
ATTAAAGGTCCGGACAGTAGTCAATACATGTTTTCCTTTCCTCACTCTCTCTTCTATGCCTTTGCATAGG
AAAGAGGAAACTGAGCACCTACTTGCAACTATTTTGAAGCTTAAATGAGGTAATACAAAGGGCCTAGACA
ATGCCTGGCATTACAATAAGTGCATAATAAAAGCATTATTATTCTCATCTCATACGTCTTTCCTTCTTGC
TCACTGTGGCTCCTGGCCCATCTCCTTTCTGCTCCTTGAAGGTGCCAATAAAAGTTGAAAGAAACAGTTC
ATTCAGGACATTGTAGAAAAATGGTGATGATTCGATTTCATCCCAAGACAATCAGAATGTGCCTGGAACA
CAGTAGGCATTCAGTAGCTAAGTTGTTAAATTACTATATGACAGGGCCCAAGGTCTATGTGTTACCCTAA
CCCTAACTCGGTTAATCTACATAACCTGTGAGATGTTCTACTCTTCATCTGGGAAGCATGCTCCAGATCA
CACATCCAGTAATGGTGGAATCAAGATATGAGGTCAGCACCTGTCACTGAGGTCTATCTGAAGCTACTTA
ATGACTATACTGCCTCTGAATTTAGTCAAAGTCCCTACATCAATTGTCATTAAATACCATTTATGGCTGT
CTAATGCTCTGAACCCAGCATACCTTCCTAACCTAGTTTCCTAATGGCCCCTTAGGAGTTAACAGGCCCA
CTGCAAACACCACATATTGATAAATAACTGCAATGTGCTGTAGTATATCTGGAACCCCTGTCATTCCCAA
AAGAGGTACTTTCTGTACCTTCCAATGTTCCTCCTCCTCCATCCACTTAGAATGCCATCCATCTTTGTGA
TTATAAAAACGACATCACGTTTTCTTAGTACTAAACTTTGCCCGTGGAAAGTCTCCCATTCTTCAAGGCT
GGGCTCAAATTCCACATCCTTCTCGAGGCCTTCCCCGTGATTCCTCCCTCCATAAACTTAACAATCCATT
TTCCTGAACCCCATCACATCTGTTTGCAGTTCACCCGCAAGCAGGCGCTTTATATTCTTTAGCACGCCAC
CTGGTATCTCAAAGAGTCTTTGTGCACAACAACCATTGCCTGAATAAACAAGAAAAAGGAACTGTAGGTA
GGAGCTTGGTGAGACAAGTAAAGAAGATGGAGAGTGATTAAGGATAATGGAGTAAAAATATCACAAAGGG
TTCTCAAAAAGCAGGGAGTGAAAAATGAGTACTAAACAGGCTCTTCGAGAAAAGTTTTTTCTTGGGCTGC
CTTTGTGCAGCCTAGGGGCAGCTCACCTGAACCAGAAGTCCCAGCCAGCCTCTTCCGGGGTCTCGACATG
GTCACGCTGCAGGGGACTTTAGTGCGGACGATTCTGGAATCAACGGAGATACAAGAAAGAATGCTAATGT
CCTCCAAAACCCGCGCAGAGCGAGATGGAGGCAACGAGAGGCAGCCTAGAACGTCTCCAACTTTTGCGAA
ACACAGACGCCTACGTTTGAGCCCTCAAATCCTTCCCTCCGTTATCTGCGCCATTTCCATCTCGCATAAC
CTGCCCCTAAACTCTTCTCGGTTCTGTCCTTGGTCCTTCTCATCCAGGAACCCCTATCTCAGTATGTAGT
TCACTGGGAAGTGGCTCCACAGAGACACTTTTGTTGGGTGAATATAAGTAAGCCTTGTGTTACCTTGTTA
TCCTTGCTAATTAGGATCAACTGAATGGATAATCTAGATGATGAAGTAATTTGCTGGCTACAGACCCAAC
ACTCTGCAGACTCGACTGCGGTCCGCCTCCGCTGCGCCGCAGGCTGTGCAGCGCGACCCCCGCGGCGCTT
GGTGGGCGGTGCATCTCTGCGGCGCCGAGGCCC
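
The gene coordinates listed above and the range in the FASTA header can be cross-checked against the sequence length. The following is a minimal sketch in Python, assuming the range in the header is 1-based and inclusive (the usual convention for NCBI region headers); the helper names `region_length` and `read_fasta_sequence` are hypothetical and not part of any Protegen tool.

```python
import re


def region_length(fasta_header: str) -> int:
    """Return the inclusive length of the 'accession:start-end' range in a FASTA header."""
    match = re.search(r":(\d+)-(\d+)", fasta_header)
    if match is None:
        raise ValueError("no start-end range found in header")
    start, end = int(match.group(1)), int(match.group(2))
    # Assumption: both coordinates are 1-based and inclusive, hence the +1.
    return end - start + 1


def read_fasta_sequence(text: str) -> str:
    """Join the sequence lines of a single-record FASTA string, skipping the header."""
    lines = text.strip().splitlines()
    return "".join(line.strip() for line in lines if not line.startswith(">"))


header = (
    ">NC_000011.10:134248264-134253406 "
    "Homo sapiens chromosome 11, GRCh38.p12 Primary Assembly"
)
expected = region_length(header)
print(expected)  # 134253406 - 134248264 + 1 = 5143
```

Passing the full record above to `read_fasta_sequence` and comparing `len(...)` with `expected` would confirm that the downloaded sequence covers the whole annotated region.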

Protein Information
Protein Name transcript variant X1
NCBI Protein GI 83267859
Protein Accession NP_001032381
Protein pI 9.93
Protein Weight 18315.39
Protein Length 166
Protein Note Also known as MY105; THY28; MDS012; HSPC144; THY28KD
Protein Sequence
>NP_001032381.1 thymocyte nuclear protein 1 isoform 2 [Homo sapiens]
MSRPRKRLAGTSGSDKGLSGKRTKTENSGEALAKVEDSNPQKTSATKNCLKNLSSHWLMKSEPESRLEKG
VDVKFSIEDLKAQPKQTTCWDGVRNYQARNFLRAMKLGEEAFFYHSNCKEPGIAGLMKIVKEAYPDHTQF
EKNNPHYDPSSKEDNPKWSMKSLILF
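
The protein length and the basic pI reported above can be sanity-checked directly from the sequence. The sketch below is a rough check only, not the method Protegen uses to compute pI: it counts lysine and arginine against aspartate and glutamate and ignores histidine, the termini, and all pKa values. The function name `charge_summary` is hypothetical.

```python
from collections import Counter

# Protein sequence copied from the NP_001032381.1 record above.
PROTEIN = (
    "MSRPRKRLAGTSGSDKGLSGKRTKTENSGEALAKVEDSNPQKTSATKNCLKNLSSHWLMKSEPESRLEKG"
    "VDVKFSIEDLKAQPKQTTCWDGVRNYQARNFLRAMKLGEEAFFYHSNCKEPGIAGLMKIVKEAYPDHTQF"
    "EKNNPHYDPSSKEDNPKWSMKSLILF"
)


def charge_summary(sequence: str) -> tuple[int, int]:
    """Count basic (K, R) and acidic (D, E) residues in a protein sequence."""
    counts = Counter(sequence)
    basic = counts["K"] + counts["R"]
    acidic = counts["D"] + counts["E"]
    return basic, acidic


length = len(PROTEIN)
basic, acidic = charge_summary(PROTEIN)
print(f"length={length}, basic={basic}, acidic={acidic}")
```

The length should match the Protein Length field (166), and an excess of basic over acidic residues is consistent with a pI well above 7.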

Epitope Information
IEDB Linear Epitope