VIOLIN Logo
VO Banner
Search: for Help
Protegen Home
Introduction
Statistics
News and Updates
Protegen Query
Protegen BLAST
Selected Bacteria
Brucella spp. (26)
B. anthracis (14)
E. coli (40)
M. tuberculosis (26)
N. meningitidis (18)
Selected Viruses
Ebola virus (19)
HIV (41)
Influenza virus (50)
Selected Parasites
Plasmodium (47)
Data Submission
Data Exchange
Data Download
Documentation
FAQs
Disclaimer
Contact Us
UMMS Logo


C4424

General Information
Protegen ID 284
Sequence Strain (Species/Organism) Escherichia coli CFT073
VO ID VO_0010990
Taxonomy ID 199310
Molecule Role Protective antigen
Molecule Role Annotation Active immunization of BALB/c mice with C4424 antigen in Freund's adjuvant protects mice from lethal challenge with ExPEC strain S26 (Durant et al., 2007).
COG COG5295UW, under U: Intracellular trafficking, secretion, and vesicular transport; W: Extracellular structures
Related Vaccines(s) E. coli C4424 protein vaccine
References
Durant et al., 2007: Durant L, Metais A, Soulama-Mouze C, Genevard JM, Nassif X, Escaich S. Identification of candidates for a subunit vaccine against extraintestinal pathogenic Escherichia coli. Infection and immunity. 2007; 75(4); 1916-1925. [PubMed: 17145948].
Gene Information
Gene Name C4424
NCBI Gene ID 1038067
Genbank Accession AE014075
Locus Tag c4424
Gene Starting Position 4205983
Gene Ending Position 4211319
DNA Sequence
>NC_004431.1:4205983-4211319 Escherichia coli CFT073, complete genome
AATGAACAAAATATTTAAAGTTATCTGGAATCCGGCAACAGGCAGTTACACCGTTGCCAGCGAAACGGCG
AAGAGCCGTGGTAAAAAAAGCGGGCGCAGTAAGCTGTTAATTTCTGCACTGGTTGCGGGTGGGTTGTTGT
CGTCGTTTGGGGCAAGTGCAGATAATTACACTGGGCAGCCAACTGATTATGGCGATGGCTCAGCAGGTGA
CGGCTGGGTTGCTATCGGTAAAGGGGCAAAAGCAAATACCTTTATGAACACTAGTGGCGCGAGTACAGCT
TTAGGATATGACGCGATAGCCGAAGGTGAGTACAGTTCTGCCATCGGGTCAAAAACCCTTGCAACTGGTG
GAGCATCCATGGCGTTCGGGGTTAGTGCAAAAGCAATGGGTGACAGAAGTGTCGCGCTAGGTGCATCGTC
AGTAGCAAATGGCGATCGTTCGATGGCTTTTGGTCGTTACGCAAAGACGAATGGTTTTACATCTCTTGCT
ATTGGGGACTCCTCCCTTGCCGATGGTGAAAAAACTATTGCGTTAGGAAATACGGCTAAAGCTTACGAAA
TTATGAGCATCGCCCTCGGTGATAATGCCAATGCGTCAAAAGAGTATGCAATGGCGCTGGGAGCAAGTAG
CAAAGCTGGCGGTGCTGATAGCCTCGCATTCGGCAGAAAATCTACAGCTAATAGCACTGGCTCACTGGCA
ATAGGTGCTGACAGTAGCAGTTCGAACGATAACGCCATCGCGATAGGGAACAAAACGCAAGCCCTGGGAG
TGAATTCGATGGCCCTGGGTAATGCAAGTCAGGCATCTGGCGAATCCAGTATTGCATTAGGTAACACCAG
TGAAGCCAGCGAACAAAATGCGATTGCGCTGGGGCAAGGTAGCATTGCAAGCAAAGTGAACTCAATCGCG
TTGGGAAGTAACAGTTTGTCCTCGGGAGAGAATGCCATCGCATTGGGAGAGGGTAGTGCCGCTGGTGGCA
GCAACAGCCTTGCTTTCGGTAGCCAGTCCAGGGCAAACGGCAATGATTCTGTCGCCATCGGTGTAGGGGC
TGCAGCAGCGACCGACAATTCTGTCGCTATCGGCGCAGGATCGACCACAGATGCAAGCAATACGGTTTCA
GTTGGCAACAGCGCAACAAAACGCAAAATTGTTAATATGGCTGCTGGTGCCATAAGCAACACCAGTACCG
ATGCCATCAACGGCTCACAGCTTTATACGATCAGTGATTCAGTCGCCAAGCGACTCGGAGGAGGCGCTAC
TGTAGGCAGCGATGGCACCGTAACCGCAGTAAGCTACGCGTTGAGAAGCGGAACCTATAATAACGTGGGT
GATGCTCTGTCAGGAATCGACAATAATACCCTACAATGGAATAAAACCGCGGGGGCGTTCAGCGCCAATC
ACGGTGCAAATGCCACCAACAAAATCACTAATGTTGCTAAAGGTACGGTTTCTGCAACCAGCACCGATGT
AGTAAACGGCTCTCAATTGTACGACCTGCAGCAGGATGCTCTGTTGTGGAACGGCACAGCATTCAGTGCC
GCACACGGCACCGAAGCCACCAGCAAAATCACTAACGTCACCGCTGGCAACCTGACTGCCGGCAGCACTG
ACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGACAACGTGACGACCAACACCACCAACATCGCCAC
TAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAAC
AAAGCAGCTGGCGCATTCAGCGCCGCGCACGGCACCGAAGCCACCAGCAAAATCACCAACGTCACCGCTG
GCAACCTGACTGCCGGTAGCACTGACGCCGTTAACGGCTCCCAGCTCAAAACCACCAACGACAACGTGAC
GACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTC
GGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCGCCGCGCACGGCACTGACGCCACCA
GCAAGATCACCAACGTCACCGCTGGCAACCTGACTGCCGGCAGCACTGACGCCGTTAACGGCTCCCAGCT
CAAAACCACCAACGACAACGTGACGACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAAC
CTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCG
CCGCGCACGGCACTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGGTGACCTGACAGCTGGCAGCAC
TGACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGATAACGTGTCGACCAACACCACCAACATCACC
AACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCA
GCGCCGCTCACGGCACTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGGTGACCTGACAGCTGGCAG
CACTGACGCCGTTAACGGCTCCCAGCTCAAAACCACCAACGATAACGTGTCGACCAACACCACCAACATC
ACTAACCTGACGGATTCCGTTGGCGACCTTAAGGACGATTCTCTGCTGTGGAACAAAGCGGCTGGCGCAT
TCAGCGCCGCGCACGGTACCGAAGCTACCAGCAAGATCACCAACTTACTGGCTGGCAAGATATCTTCTAA
CAGCACTGATGCCATTAATGGCTCACAACTTTATGGCGTAGCGGATTCATTTACGTCATATCTTGGTGGT
GGTGCTGATATCAGCGATACGGGTGTATTAAGTGGGCCAACCTACACTATTGGTGGTACTGACTACACTA
ACGTCGGTGATGCTCTGGCAGCCATTAACACATCATTTAGCACATCACTCGGCGACGCCCTACTTTGGGA
TGCAACCGCAGGCAAATTCAGCGCCAAACACGGCATTAATAATGCTCCCAGTGTAATCACTGATGTTGCA
AACGGTGCAGTCTCGTCCACCAGCAGCGACGCCATTAACGGTTCACAACTTTATGGTGTTAGTGACTACA
TTGCCGATGCTCTGGGCGGGAATGCTGTGGTGAACACTGACGGCAGTATCACTACACCAACTTATGCCAT
CGCTGGCGGCAGTTACAACAACGTCGGTGACGCGCTGGAAGCGATCGATACCACGCTGGATGATGCTCTG
CTGTGGGATACAACAGCCAATGGCGGTAACGGTGCATTTAGCGCCGCTCACGGGAAAGATAAAACTGCCA
GTGTAATCACTAACGTCGCTAACGGTGCAGTCTCTGCCACCAGCAACGATGCCATTAATGGCTCACAGCT
CTATAGCACTAATAAGTACATCGCTGATGCGCTGGGTGGTGATGCAGAAGTCAACGCTGACGGTACTATC
ACTGCACCGACTTACACCATTGCAAATACCGATTACAACAACGTCGGTGAAGCCCTGGATGCGCTCGATA
ATAACGCGCTGCTGTGGGATGAAGACGCAGGTGCCTACAACGCCAGCCATGATGGCAATGCCAGCAAAAT
CACCAACGTTGCGGCTGGTGATCTCTCCACAACCAGTACCGATGCTGTTAACGGTTCCCAGTTAAACGCA
ACCAATATTCTGGTTACGCAAAATAGCCAAATGATTAACCAGCTTGCTGGTAACACTAGCGAAACCTACA
TCGAGGAAAACGGTGCGGGTATTAACTATGTACGTACCAACGACAGCGGCTTAGCGTTCAACGATGCCAG
CGCTTCAGGTATTGGCGCTACAGCTGTAGGTTATAACGCAGTTGCCTCTCATGCCAGCAGTGTAGCCATC
GGTCAGGACAGCATCAGCGAAGTTGATACGGGTATCGCTCTGGGTAGCAGTTCCGTTTCCAGCCGTGTAA
TAGTTAAAGGGACTCGTAACACCAGCGTATCGGAAGAAGGTGTTGTGATTGGTTATGACACCACGGATGG
CGAACTGCTTGGCGCGTTGTCGATTGGTGATGACGGTAAATATCGTCAAATCATCAACGTCGCGGATGGT
TCTGAAGCCCATGATGCGGTCACTGTTCGCCAGTTGCAAAACGCCATTGGTGCAGTCGCAACCACACCAA
CCAAATACTATCACGCCAACTCAACGGCTGAAGACTCACTGGCAGTCGGTGAAGACTCGCTGGCAATGGG
CGCGAAAACCATCGTTAATGGTAATGCGGGTATTGGTATCGGCCTGAACACGCTGGTTCTGGCTGATGCG
ATCAACGGTATTGCTATCGGTTCTAACGCACGCGCAAATCATGCCGACAGCATTGCAATGGGTAATGGTT
CTCAGACTACCCGTGGTGCGCAGACCAACTACACTGCCTACAACATGGATGCACCGCAGAACTCTGTGGG
TGAGTTCTCTGTCGGCAGTGAAGACGGTCAACGTCAGATCACCAACGTCGCAGCAGGTTCGGCGGATACC
GATGCGGTTAACGTGGGTCAGTTGAAAGTAACGGACGCGCAGGTTTCCCAGAATACCCAGAGCATTACTA
ACCTGAACACTCAGGTCACTAATCTGGATACTCGCGTGACCAATATCGAAAACGGCATTGGCGATATCGT
AACCACCGGTAGCACTAAGTACTTCAAGACCAACACCGATGGCGCAGATGCCAACGCGCAGGGTAAAGAC
AGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACAACAGCGTCGCACTGGGCACGGGTTCCGTAG
CAGACGAAGAAAACACCATCTCTGTGGGTTCTTCTACCAACCAGCGTCGTATCACCAACGTTGCTGCCGG
TGTTAATGCCACCGATGCGGTTAACGTTTCGCAACTGAAGTCTTCTGAAGCAGGCGGCGTTCGCTACGAC
ACCAAAGCTGATGGCTCTATCGACTACAGCAACATCACTCTCGGTGGCGGCAATAGCGGTACGACTCGCA
TCAGCAACGTTTCTGCTGGCGTGAACAACAACGACGCAGTGAACTATGCGCAGTTGAAGCAAAGTGTGCA
GGAAACGAAGCAATACACCGATCAGCGCATGGTTGAGATGGATAACAAACTGTCCAAAACTGAAAGCAAG
CTGAGTGGTGGTATCGCTTCTGCAATGGCAATGACCGGTCTGCCGCAGGCTTACACGCCGGGTGCCAGCA
TGGCCTCTATTGGTGGCGGTACTTACAACGGTGAATCGGCTGTTGCTTTAGGTGTGTCGATGGTGAGCGC
CAATGGTCGTTGGGTCTACAAATTACAAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGT
GCCGGTATTCAGTGGTA

Protein Information
Protein Name adhesin
NCBI Protein GI 26250246
Protein Accession NP_756286
Protein pI 4.25
Protein Weight 166959.01
Protein Length 1778
Protein Note Escherichia coli O157:H7 ortholog: z5029
Protein Sequence
>NP_756286.1 adhesin [Escherichia coli CFT073]
MNKIFKVIWNPATGSYTVASETAKSRGKKSGRSKLLISALVAGGLLSSFGASADNYTGQPTDYGDGSAGD
GWVAIGKGAKANTFMNTSGASTALGYDAIAEGEYSSAIGSKTLATGGASMAFGVSAKAMGDRSVALGASS
VANGDRSMAFGRYAKTNGFTSLAIGDSSLADGEKTIALGNTAKAYEIMSIALGDNANASKEYAMALGASS
KAGGADSLAFGRKSTANSTGSLAIGADSSSSNDNAIAIGNKTQALGVNSMALGNASQASGESSIALGNTS
EASEQNAIALGQGSIASKVNSIALGSNSLSSGENAIALGEGSAAGGSNSLAFGSQSRANGNDSVAIGVGA
AAATDNSVAIGAGSTTDASNTVSVGNSATKRKIVNMAAGAISNTSTDAINGSQLYTISDSVAKRLGGGAT
VGSDGTVTAVSYALRSGTYNNVGDALSGIDNNTLQWNKTAGAFSANHGANATNKITNVAKGTVSATSTDV
VNGSQLYDLQQDALLWNGTAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIAT
NTTNITNLTDAVNGLGDDSLLWNKAAGAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVT
TNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVTAGNLTAGSTDAVNGSQL
KTTNDNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGST
DAVNGSQLKTTNDNVSTNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGS
TDAVNGSQLKTTNDNVSTNTTNITNLTDSVGDLKDDSLLWNKAAGAFSAAHGTEATSKITNLLAGKISSN
STDAINGSQLYGVADSFTSYLGGGADISDTGVLSGPTYTIGGTDYTNVGDALAAINTSFSTSLGDALLWD
ATAGKFSAKHGINNAPSVITDVANGAVSSTSSDAINGSQLYGVSDYIADALGGNAVVNTDGSITTPTYAI
AGGSYNNVGDALEAIDTTLDDALLWDTTANGGNGAFSAAHGKDKTASVITNVANGAVSATSNDAINGSQL
YSTNKYIADALGGDAEVNADGTITAPTYTIANTDYNNVGEALDALDNNALLWDEDAGAYNASHDGNASKI
TNVAAGDLSTTSTDAVNGSQLNATNILVTQNSQMINQLAGNTSETYIEENGAGINYVRTNDSGLAFNDAS
ASGIGATAVGYNAVASHASSVAIGQDSISEVDTGIALGSSSVSSRVIVKGTRNTSVSEEGVVIGYDTTDG
ELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQLQNAIGAVATTPTKYYHANSTAEDSLAVGEDSLAMG
AKTIVNGNAGIGIGLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVG
EFSVGSEDGQRQITNVAAGSADTDAVNVGQLKVTDAQVSQNTQSITNLNTQVTNLDTRVTNIENGIGDIV
TTGSTKYFKTNTDGADANAQGKDSVAIGSGSIAAADNSVALGTGSVADEENTISVGSSTNQRRITNVAAG
VNATDAVNVSQLKSSEAGGVRYDTKADGSIDYSNITLGGGNSGTTRISNVSAGVNNNDAVNYAQLKQSVQ
ETKQYTDQRMVEMDNKLSKTESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSA
NGRWVYKLQGSTNSQGEYSAALGAGIQW

Vaxign Prediction
Localization(Probability) Outer Membrane
(Prob.=0.995)
Adhesin Probability 0.939
Trans-membrane Helices 0
Detailed Vaxign Results Vaxign Results
Epitope Information
IEDB Linear Epitope