VIOLIN Logo
VO Banner
Search: for Help
Protegen Home
Introduction
Statistics
News and Updates
Protegen Query
Protegen BLAST
Selected Bacteria
Brucella spp. (26)
B. anthracis (14)
E. coli (40)
M. tuberculosis (28)
N. meningitidis (18)
Selected Viruses
Ebola virus (19)
HIV (41)
Influenza virus (50)
Selected Parasites
Plasmodium (47)
Data Submission
Data Exchange
Data Download
Documentation
FAQs
Disclaimer
Contact Us
UMMS Logo


Muc1

General Information
Protegen ID 1340
Sequence Strain (Species/Organism) Mus musculus
Taxonomy ID 10090
Molecule Role Protective antigen
Related Vaccines(s) Lung metastasis DNA vaccine pCEP4-MUC1 encoding MUC1 , MUC1-FSL-1-TT tricomponent vaccine
References  
Gene Information
Gene Name Muc1
NCBI Gene ID 17829
Genbank Accession AC132327
Chromosome No 3
Gene Starting Position 89229055
Gene Ending Position 89233380
DNA Sequence
>NC_000069.6:89229055-89233380 Mus musculus strain C57BL/6J chromosome 3, GRCm38.p4 C57BL/6J
CCTCACACACGGAGCGCCAGCCTTGAGTTTGTTTTCTAGCCCCTTCCCGCCTGTTCACCACCACCATGAC
CCCGGGCATTCGGGCTCCTTTCTTCCTGCTGCTACTTCTAGCAAGTCTAAAAGGTGAGAGGCGCAAGGTG
GGGAGGGGCTGCGCTGTTCAGGTGGGACTCCCAGTCTTTCTGTGGAGTTTGCTTACTGGTAGATTATGAG
AGTCATAAATGGCCATCCATACAGGTATGGCCCAGTATGTGCCAGGGGGGAAAAGAGGCTAAGGAAGGAG
GCTGAGAAGAGGTGACCCCTCAGGAGAGTGGGCACTATAGTAGGCAAATAATACTCTTTGTGGGGAGTAG
AGAAGAGAGGCTGTGTCAGGAGAAGTAGCTCCGGAGTGGAAGAATGGGGCCGTGTGCAGTGGTTCCAGTC
TATAGTTCCCACACCCTGGACGCTCAGGTTCACAGAGAGTTCAAGGCCAGCTTAGGCTACAGTATGAATA
CTTGTCTCAAGTAGAGGAATAAATGAGCCAGAAGCAGAGGTGAGACCTCTGCTTAGGGAGGGAGGCTGTC
AGGATGAAGGCAAACCAGGGGCTACCTGATGCTAACATGATCCCCCTTCCTTCCCCGGTTACTTTGTTTT
CAGAGTTTATATTTTTCTTCTTGCTCTGATTTTTATTCATCAACCCAGTAGGTTTTTTTTTTGTTGTTGT
TGTTGTTTTGTTTTTTTTTTCTTCAACTTTGGCTCTAGCCGGAACTAAAACATCACACATGTTATCCAAA
GCGACCCAGTGTCCAACTCTGTTGATTAGTGCTGCGCTGCTGTGAGCATAAAAACAGTTTACCCCTAATC
TTCCACTCTGGTTCAGGTTTTCTTGCCCTTCCAAGTGAGGAAAACAGTGTCACCTCATCTCAGGACACCA
GCAGTTCCTTAGCATCGACTACCACTCCAGTCCACAGCAGCAACTCAGACCCAGCCACCAGACCTCCAGG
GGACTCCACCAGCTCTCCAGTCCAGAGTAGCACCTCTTCTCCAGCCACCAGAGCTCCTGAAGACTCTACC
AGTACTGCAGTCCTCAGTGGCACCTCCTCCCCAGCCACCACAGCTCCAGTGAACTCCGCCAGCTCTCCAG
TAGCCCATGGTGACACCTCTTCCCCAGCCACTAGCCTTTCAAAAGACTCCAACAGCTCTCCAGTAGTCCA
CAGTGGCACCTCTTCAGCTCCGGCCACCACAGCTCCAGTGGATTCCACCAGCTCTCCAGTAGTCCACGGT
GGTACCTCGTCCCCAGCCACCAGCCCTCCAGGGGACTCCACCAGCTCTCCAGACCATAGTAGCACCTCTT
CTCCAGCCACCAGAGCTCCCGAAGACTCTACCAGTACTGCAGTCCTCAGTGGCACCTCCTCCCCAGCCAC
CACAGCTCCAGTGGACTCCACCAGCTCTCCAGTAGCCCATGATGACACCTCTTCCCCAGCCACTAGCCTT
TCAGAAGACTCCGCCAGCTCTCCAGTAGCCCACGGTGGCACCTCTTCTCCAGCCACCAGCCCTCTAAGGG
ACTCCACCAGTTCTCCAGTCCACAGTAGTGCCTCCATCCAAAACATCAAGACTACATCAGACTTAGCTAG
CACTCCAGACCACAATGGCACCTCAGTCACAACTACCAGCTCTGCACTGGGCTCAGCCACCAGTCCAGAC
CACAGTGGTACCTCAACTACAACTAACAGCTCTGAATCAGTCTTGGCCACCACTCCAGTTTACAGTAGCA
TGCCATTCTCTACTACCAAAGTGACGTCAGGCTCAGCTATCATTCCAGACCACAATGGCTCCTCGGTGCT
ACCTACCAGTTCTGTGTTGGGCTCAGCTACCAGTCTAGTCTATAATACCTCTGCAATAGCTACAACTCCA
GTCAGCAATGGCACTCAGCCTTCAGTGCCAAGTCAATACCCTGTTTCTCCTACCATGGCCACCACCTCCA
GCCACAGCACTATTGCCAGCAGCTCTTACTATAGCACAGTACCATTTTCTACCTTCTCCAGTAACAGTTC
ACCCCAGTTGTCTGTTGGGGTCTCCTTCTTCTTCTTGTCTTTTTACATTCAAAACCACCCATTTAATTCT
TCTCTGGAAGACCCCAGCTCCAACTACTACCAAGAACTGAAGAGGAACATTTCTGGATTGGTGGGTATCA
GCCTAGCCTCTGCCATGTGTCCCCTGACGTAGCTCTTCAGGACTGCATGGCTTTCACATCACTCCTGAGT
CTTCTCCTCTTCTCCCAGTTTCTGCAGATTTTTAACGGAGATTTTCTGGGGATCTCTAGCATCAAGTTCA
GGTACAGTTCTGGATTTGACTTGGGGGAGGAATGGTCAGTCTCGTGACTTTGTGGTGTCGGGATGGGGGT
GGGGTGGGGAGAGGAGTGCTGAGCTATAAGCTCAGTCTATCTGAGCTCCCTATTTCCTGTGACCAGGTCA
GGCTCCGTGGTGGTAGAATCGACTGTGGTTTTCCGGGAGGGTACTTTTAGTGCCTCTGACGTGAAGTCAC
AGCTTATACAGCATAAGAAGGAGGCAGATGACTATAATCTGACTATTTCAGAAGTCAAAGGTGAGGTGAT
AGCCCCAGCTGCAGCCTGGCACCATACTATGGGGCTTTACCACCTGTTTACTTCTGGCGCCAGGAGTGGG
AAATCCACCTCCCTTGGGGACTTCCCTGACCACCGCTTTCCCTTCTAGTGAATGAGATGCAGTTCCCTCC
CTCTGCCCAGTCCCGGCCGGGGGTACCAGGCTGGGGCATTGCCCTGCTGGTGCTGGTCTGTATTTTGGTT
GCTTTGGCTATCGTCTATTTCCTTGCCCTGGTAAGTCTCAAGCCTTCTGCGGCGCGGTGTGCCCTTGGTA
AATGGAACCCCACTGGCCAATCCAATCTCCTGTCTCCCTAGGCAGTGTGCCAGTGCCGCCGAAAGAGCTA
TGGGCAGCTGGACATCTTTCCAACCCAGGACACCTACCATCCTATGAGTGAATACCCTACCTACCACACT
CACGGACGCTACGTGCCCCCTGGCAGTACCAAGCGTAGCCCCTATGAGGAGGTAAAGTGTATCCCGAAGA
AGCTTGGGCCATCGACCTGGGCAGGGTGGGGCCTTCTGAAGGGGAACTTGGGAAAAACCCAAGGAAGCCC
AGATAGGTGAGCAGTTGGAGCCTGGCAAGGTTGGGAATGGAGGTCGAGTTCTGGGGGCGGGGGGAGGGGG
TCAGACCTTGGGAGATGACAGAAAAAAGTGTCTGCACTACCAGTCTTCCCCAAAAAGCCTCGCCGTCTAC
CTGTACCATTCTGTACTCATTAGGATGCCCAGAGGTGAGGCTAGGAGATCCTGTGCCCCTGGGAGCCCCG
CCCACTGCATTTCCCGCTGACCTCCTAGGGGTCTGGGAGTAGATACATGGGGCTGGGGTAGGGTGGGAGG
GCTCTGATGGAAGAGAAGTTTCCTCTTGCTAGATGAGTGTGCTGAGGGAGTTTTAGGAAAGCAAGGGTAC
CTCTCTCTTCCCTCCCTCAGGCCTTTCCCTGTTGAGTCTCGTAAATAATTACTGTGGCCACCTGAGACCG
AAAGGCCAGTGGGTGGGGAAGGAGACAAGCACGTCCTTTTCTCGCCCCACTTTCCTCCTCCTCCAGATAC
ACACACACACACACAAACACACACACACACACACACACACACACACACACACACACACAGCCCTTTGGGC
CAGAGGATCAAAAACTAAAAGGAGAGAAAGATAGGTGGGGCCTACAAGAGGGCTAGGGATCTACAGCCAA
GCGGAGGGGAGTGTGGGGGGGGAGGGGGCAGTGGAGTTGTGTGTTTACAGTCACCTGGCAGCTCTGGGGC
CAGCAGGTGCACACTCCGAATTAACCCCTTGAGAGCTTGCCAGGCCCCTTGGCCCCAGGTGCCATCGGGG
TTTTACCTGGAAGATTCTTTTTCCCTGAAATTATTTCCCCTTCACTCAGGTTTCGGCAGGTAATGGCAGT
AGCAGTCTCTCTTATACCAACCCAGCTGTGGTGACCACTTCTGCCAACTTGTAGGAGCAAGTCACCCCAC
CCACTTGGGGCAGCTTTGGCGGTCTGCTCCCTCAGTGGTCACTGCCAGACCCCTGCACTCTGATCTGGGC
TGGTGAGCCAGGACTTCTGGTAGGCTGTTCATGCCCTTTGTCAAGCGCCTCAACTACGTAAGCCTGGTGA
AGCCCAGCCCTGCCCTGGGGGACACTGGGGCAGTTAGTGGTGGCTCTCAGAAGGACTGGCCTGGAAAACT
GGAGACAGGGATGGGAACCCAAACATAGCTGAATAAAAGATGGCCTCCTGTTAGTT

Protein Information
Protein Name mucin 1, transmembrane
NCBI Protein GI 7305293
Protein Accession NP_038633
Protein pI 5.45
Protein Weight 59972.84
Protein Length 631
Protein Note Also known as EMA; CD227; Muc-1
Protein Sequence
>NP_038633.1 mucin-1 precursor [Mus musculus]
MTPGIRAPFFLLLLLASLKGFLALPSEENSVTSSQDTSSSLASTTTPVHSSNSDPATRPPGDSTSSPVQS
STSSPATRAPEDSTSTAVLSGTSSPATTAPVNSASSPVAHGDTSSPATSLSKDSNSSPVVHSGTSSAPAT
TAPVDSTSSPVVHGGTSSPATSPPGDSTSSPDHSSTSSPATRAPEDSTSTAVLSGTSSPATTAPVDSTSS
PVAHDDTSSPATSLSEDSASSPVAHGGTSSPATSPLRDSTSSPVHSSASIQNIKTTSDLASTPDHNGTSV
TTTSSALGSATSPDHSGTSTTTNSSESVLATTPVYSSMPFSTTKVTSGSAIIPDHNGSSVLPTSSVLGSA
TSLVYNTSAIATTPVSNGTQPSVPSQYPVSPTMATTSSHSTIASSSYYSTVPFSTFSSNSSPQLSVGVSF
FFLSFYIQNHPFNSSLEDPSSNYYQELKRNISGLFLQIFNGDFLGISSIKFRSGSVVVESTVVFREGTFS
ASDVKSQLIQHKKEADDYNLTISEVKVNEMQFPPSAQSRPGVPGWGIALLVLVCILVALAIVYFLALAVC
QCRRKSYGQLDIFPTQDTYHPMSEYPTYHTHGRYVPPGSTKRSPYEEVSAGNGSSSLSYTNPAVVTTSAN
L

Epitope Information
IEDB Linear Epitope