C4424
General Information
Protegen ID
284
Sequence Strain (Species/Organism)
Escherichia coli CFT073
VO ID
VO_0010990
Taxonomy ID
199310
Molecule Role
Protective antigen
Molecule Role Annotation
Active immunization of BALB/c mice with C4424 antigen in Freund's adjuvant protects mice from lethal challenge with ExPEC strain S26 (Durant et al. , 2007 ).
COG
COG5295UW, under U: Intracellular trafficking, secretion, and vesicular transport; W: Extracellular structures
Related Vaccines(s)
E. coli C4424 protein vaccine
References
Durant et al. , 2007: Durant L, Metais A, Soulama-Mouze C, Genevard JM, Nassif X, Escaich S. Identification of candidates for a subunit vaccine against extraintestinal pathogenic Escherichia coli . Infection and immunity . 2007; 75(4); 1916-1925. [PubMed: 17145948 ].
Gene Information
Gene Name
C4424
NCBI Gene ID
1038067
Genbank Accession
AE014075
Locus Tag
c4424
Gene Starting Position
4205983
Gene Ending Position
4211319
DNA Sequence
>NC_004431.1:4205983-4211319 Escherichia coli CFT073, complete genome
AATGAACAAAATATTTAAAGTTATCTGGAATCCGGCAACAGGCAGTTACACCGTTGCCAGCGAAACGGCG
AAGAGCCGTGGTAAAAAAAGCGGGCGCAGTAAGCTGTTAATTTCTGCACTGGTTGCGGGTGGGTTGTTGT
CGTCGTTTGGGGCAAGTGCAGATAATTACACTGGGCAGCCAACTGATTATGGCGATGGCTCAGCAGGTGA
CGGCTGGGTTGCTATCGGTAAAGGGGCAAAAGCAAATACCTTTATGAACACTAGTGGCGCGAGTACAGCT
TTAGGATATGACGCGATAGCCGAAGGTGAGTACAGTTCTGCCATCGGGTCAAAAACCCTTGCAACTGGTG
GAGCATCCATGGCGTTCGGGGTTAGTGCAAAAGCAATGGGTGACAGAAGTGTCGCGCTAGGTGCATCGTC
AGTAGCAAATGGCGATCGTTCGATGGCTTTTGGTCGTTACGCAAAGACGAATGGTTTTACATCTCTTGCT
ATTGGGGACTCCTCCCTTGCCGATGGTGAAAAAACTATTGCGTTAGGAAATACGGCTAAAGCTTACGAAA
TTATGAGCATCGCCCTCGGTGATAATGCCAATGCGTCAAAAGAGTATGCAATGGCGCTGGGAGCAAGTAG
CAAAGCTGGCGGTGCTGATAGCCTCGCATTCGGCAGAAAATCTACAGCTAATAGCACTGGCTCACTGGCA
ATAGGTGCTGACAGTAGCAGTTCGAACGATAACGCCATCGCGATAGGGAACAAAACGCAAGCCCTGGGAG
TGAATTCGATGGCCCTGGGTAATGCAAGTCAGGCATCTGGCGAATCCAGTATTGCATTAGGTAACACCAG
TGAAGCCAGCGAACAAAATGCGATTGCGCTGGGGCAAGGTAGCATTGCAAGCAAAGTGAACTCAATCGCG
TTGGGAAGTAACAGTTTGTCCTCGGGAGAGAATGCCATCGCATTGGGAGAGGGTAGTGCCGCTGGTGGCA
GCAACAGCCTTGCTTTCGGTAGCCAGTCCAGGGCAAACGGCAATGATTCTGTCGCCATCGGTGTAGGGGC
TGCAGCAGCGACCGACAATTCTGTCGCTATCGGCGCAGGATCGACCACAGATGCAAGCAATACGGTTTCA
GTTGGCAACAGCGCAACAAAACGCAAAATTGTTAATATGGCTGCTGGTGCCATAAGCAACACCAGTACCG
ATGCCATCAACGGCTCACAGCTTTATACGATCAGTGATTCAGTCGCCAAGCGACTCGGAGGAGGCGCTAC
TGTAGGCAGCGATGGCACCGTAACCGCAGTAAGCTACGCGTTGAGAAGCGGAACCTATAATAACGTGGGT
GATGCTCTGTCAGGAATCGACAATAATACCCTACAATGGAATAAAACCGCGGGGGCGTTCAGCGCCAATC
ACGGTGCAAATGCCACCAACAAAATCACTAATGTTGCTAAAGGTACGGTTTCTGCAACCAGCACCGATGT
AGTAAACGGCTCTCAATTGTACGACCTGCAGCAGGATGCTCTGTTGTGGAACGGCACAGCATTCAGTGCC
GCACACGGCACCGAAGCCACCAGCAAAATCACTAACGTCACCGCTGGCAACCTGACTGCCGGCAGCACTG
ACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGACAACGTGACGACCAACACCACCAACATCGCCAC
TAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAAC
AAAGCAGCTGGCGCATTCAGCGCCGCGCACGGCACCGAAGCCACCAGCAAAATCACCAACGTCACCGCTG
GCAACCTGACTGCCGGTAGCACTGACGCCGTTAACGGCTCCCAGCTCAAAACCACCAACGACAACGTGAC
GACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTC
GGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCGCCGCGCACGGCACTGACGCCACCA
GCAAGATCACCAACGTCACCGCTGGCAACCTGACTGCCGGCAGCACTGACGCCGTTAACGGCTCCCAGCT
CAAAACCACCAACGACAACGTGACGACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAAC
CTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCG
CCGCGCACGGCACTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGGTGACCTGACAGCTGGCAGCAC
TGACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGATAACGTGTCGACCAACACCACCAACATCACC
AACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCA
GCGCCGCTCACGGCACTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGGTGACCTGACAGCTGGCAG
CACTGACGCCGTTAACGGCTCCCAGCTCAAAACCACCAACGATAACGTGTCGACCAACACCACCAACATC
ACTAACCTGACGGATTCCGTTGGCGACCTTAAGGACGATTCTCTGCTGTGGAACAAAGCGGCTGGCGCAT
TCAGCGCCGCGCACGGTACCGAAGCTACCAGCAAGATCACCAACTTACTGGCTGGCAAGATATCTTCTAA
CAGCACTGATGCCATTAATGGCTCACAACTTTATGGCGTAGCGGATTCATTTACGTCATATCTTGGTGGT
GGTGCTGATATCAGCGATACGGGTGTATTAAGTGGGCCAACCTACACTATTGGTGGTACTGACTACACTA
ACGTCGGTGATGCTCTGGCAGCCATTAACACATCATTTAGCACATCACTCGGCGACGCCCTACTTTGGGA
TGCAACCGCAGGCAAATTCAGCGCCAAACACGGCATTAATAATGCTCCCAGTGTAATCACTGATGTTGCA
AACGGTGCAGTCTCGTCCACCAGCAGCGACGCCATTAACGGTTCACAACTTTATGGTGTTAGTGACTACA
TTGCCGATGCTCTGGGCGGGAATGCTGTGGTGAACACTGACGGCAGTATCACTACACCAACTTATGCCAT
CGCTGGCGGCAGTTACAACAACGTCGGTGACGCGCTGGAAGCGATCGATACCACGCTGGATGATGCTCTG
CTGTGGGATACAACAGCCAATGGCGGTAACGGTGCATTTAGCGCCGCTCACGGGAAAGATAAAACTGCCA
GTGTAATCACTAACGTCGCTAACGGTGCAGTCTCTGCCACCAGCAACGATGCCATTAATGGCTCACAGCT
CTATAGCACTAATAAGTACATCGCTGATGCGCTGGGTGGTGATGCAGAAGTCAACGCTGACGGTACTATC
ACTGCACCGACTTACACCATTGCAAATACCGATTACAACAACGTCGGTGAAGCCCTGGATGCGCTCGATA
ATAACGCGCTGCTGTGGGATGAAGACGCAGGTGCCTACAACGCCAGCCATGATGGCAATGCCAGCAAAAT
CACCAACGTTGCGGCTGGTGATCTCTCCACAACCAGTACCGATGCTGTTAACGGTTCCCAGTTAAACGCA
ACCAATATTCTGGTTACGCAAAATAGCCAAATGATTAACCAGCTTGCTGGTAACACTAGCGAAACCTACA
TCGAGGAAAACGGTGCGGGTATTAACTATGTACGTACCAACGACAGCGGCTTAGCGTTCAACGATGCCAG
CGCTTCAGGTATTGGCGCTACAGCTGTAGGTTATAACGCAGTTGCCTCTCATGCCAGCAGTGTAGCCATC
GGTCAGGACAGCATCAGCGAAGTTGATACGGGTATCGCTCTGGGTAGCAGTTCCGTTTCCAGCCGTGTAA
TAGTTAAAGGGACTCGTAACACCAGCGTATCGGAAGAAGGTGTTGTGATTGGTTATGACACCACGGATGG
CGAACTGCTTGGCGCGTTGTCGATTGGTGATGACGGTAAATATCGTCAAATCATCAACGTCGCGGATGGT
TCTGAAGCCCATGATGCGGTCACTGTTCGCCAGTTGCAAAACGCCATTGGTGCAGTCGCAACCACACCAA
CCAAATACTATCACGCCAACTCAACGGCTGAAGACTCACTGGCAGTCGGTGAAGACTCGCTGGCAATGGG
CGCGAAAACCATCGTTAATGGTAATGCGGGTATTGGTATCGGCCTGAACACGCTGGTTCTGGCTGATGCG
ATCAACGGTATTGCTATCGGTTCTAACGCACGCGCAAATCATGCCGACAGCATTGCAATGGGTAATGGTT
CTCAGACTACCCGTGGTGCGCAGACCAACTACACTGCCTACAACATGGATGCACCGCAGAACTCTGTGGG
TGAGTTCTCTGTCGGCAGTGAAGACGGTCAACGTCAGATCACCAACGTCGCAGCAGGTTCGGCGGATACC
GATGCGGTTAACGTGGGTCAGTTGAAAGTAACGGACGCGCAGGTTTCCCAGAATACCCAGAGCATTACTA
ACCTGAACACTCAGGTCACTAATCTGGATACTCGCGTGACCAATATCGAAAACGGCATTGGCGATATCGT
AACCACCGGTAGCACTAAGTACTTCAAGACCAACACCGATGGCGCAGATGCCAACGCGCAGGGTAAAGAC
AGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACAACAGCGTCGCACTGGGCACGGGTTCCGTAG
CAGACGAAGAAAACACCATCTCTGTGGGTTCTTCTACCAACCAGCGTCGTATCACCAACGTTGCTGCCGG
TGTTAATGCCACCGATGCGGTTAACGTTTCGCAACTGAAGTCTTCTGAAGCAGGCGGCGTTCGCTACGAC
ACCAAAGCTGATGGCTCTATCGACTACAGCAACATCACTCTCGGTGGCGGCAATAGCGGTACGACTCGCA
TCAGCAACGTTTCTGCTGGCGTGAACAACAACGACGCAGTGAACTATGCGCAGTTGAAGCAAAGTGTGCA
GGAAACGAAGCAATACACCGATCAGCGCATGGTTGAGATGGATAACAAACTGTCCAAAACTGAAAGCAAG
CTGAGTGGTGGTATCGCTTCTGCAATGGCAATGACCGGTCTGCCGCAGGCTTACACGCCGGGTGCCAGCA
TGGCCTCTATTGGTGGCGGTACTTACAACGGTGAATCGGCTGTTGCTTTAGGTGTGTCGATGGTGAGCGC
CAATGGTCGTTGGGTCTACAAATTACAAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGT
GCCGGTATTCAGTGGTA
Protein Information
Protein Name
adhesin
NCBI Protein GI
26250246
Protein Accession
NP_756286
Protein pI
4.25
Protein Weight
166959.01
Protein Length
1778
Protein Note
Escherichia coli O157:H7 ortholog: z5029
Protein Sequence
>NP_756286.1 adhesin [Escherichia coli CFT073]
MNKIFKVIWNPATGSYTVASETAKSRGKKSGRSKLLISALVAGGLLSSFGASADNYTGQPTDYGDGSAGD
GWVAIGKGAKANTFMNTSGASTALGYDAIAEGEYSSAIGSKTLATGGASMAFGVSAKAMGDRSVALGASS
VANGDRSMAFGRYAKTNGFTSLAIGDSSLADGEKTIALGNTAKAYEIMSIALGDNANASKEYAMALGASS
KAGGADSLAFGRKSTANSTGSLAIGADSSSSNDNAIAIGNKTQALGVNSMALGNASQASGESSIALGNTS
EASEQNAIALGQGSIASKVNSIALGSNSLSSGENAIALGEGSAAGGSNSLAFGSQSRANGNDSVAIGVGA
AAATDNSVAIGAGSTTDASNTVSVGNSATKRKIVNMAAGAISNTSTDAINGSQLYTISDSVAKRLGGGAT
VGSDGTVTAVSYALRSGTYNNVGDALSGIDNNTLQWNKTAGAFSANHGANATNKITNVAKGTVSATSTDV
VNGSQLYDLQQDALLWNGTAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIAT
NTTNITNLTDAVNGLGDDSLLWNKAAGAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVT
TNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVTAGNLTAGSTDAVNGSQL
KTTNDNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGST
DAVNGSQLKTTNDNVSTNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGS
TDAVNGSQLKTTNDNVSTNTTNITNLTDSVGDLKDDSLLWNKAAGAFSAAHGTEATSKITNLLAGKISSN
STDAINGSQLYGVADSFTSYLGGGADISDTGVLSGPTYTIGGTDYTNVGDALAAINTSFSTSLGDALLWD
ATAGKFSAKHGINNAPSVITDVANGAVSSTSSDAINGSQLYGVSDYIADALGGNAVVNTDGSITTPTYAI
AGGSYNNVGDALEAIDTTLDDALLWDTTANGGNGAFSAAHGKDKTASVITNVANGAVSATSNDAINGSQL
YSTNKYIADALGGDAEVNADGTITAPTYTIANTDYNNVGEALDALDNNALLWDEDAGAYNASHDGNASKI
TNVAAGDLSTTSTDAVNGSQLNATNILVTQNSQMINQLAGNTSETYIEENGAGINYVRTNDSGLAFNDAS
ASGIGATAVGYNAVASHASSVAIGQDSISEVDTGIALGSSSVSSRVIVKGTRNTSVSEEGVVIGYDTTDG
ELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQLQNAIGAVATTPTKYYHANSTAEDSLAVGEDSLAMG
AKTIVNGNAGIGIGLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVG
EFSVGSEDGQRQITNVAAGSADTDAVNVGQLKVTDAQVSQNTQSITNLNTQVTNLDTRVTNIENGIGDIV
TTGSTKYFKTNTDGADANAQGKDSVAIGSGSIAAADNSVALGTGSVADEENTISVGSSTNQRRITNVAAG
VNATDAVNVSQLKSSEAGGVRYDTKADGSIDYSNITLGGGNSGTTRISNVSAGVNNNDAVNYAQLKQSVQ
ETKQYTDQRMVEMDNKLSKTESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSA
NGRWVYKLQGSTNSQGEYSAALGAGIQW
Vaxign Prediction
Localization(Probability)
Outer Membrane (Prob.=0.995)
Adhesin Probability
0.939
Trans-membrane Helices
0
Detailed Vaxign Results
Vaxign Results
Epitope Information
IEDB Linear Epitope
-- Assay Type --
T Cell Epitope
B Cell Epitope
IEDB ID
Epitope
MHC restriction
Starting position
Ending position
IEDB ID
Epitope
Starting position
Ending position
144311
GNLTAG
529
535
179856
LSGID
446
451
MNKIFKVIWNPATGSYTVASETAKSRGKKSGRSKLLISALVAGGLLSSFGASADNYTGQPTDYGDGSAGDGWVAIGKGAKANTFMNTSGASTALGYDAIAEGEYSSAIGSKTLATGGASMAFGVSAKAMGDRSVALGASSVANGDRSMAFGRYAKTNGFTSLAIGDSSLADGEKTIALGNTAKAYEIMSIALGDNANASKEYAMALGASSKAGGADSLAFGRKSTANSTGSLAIGADSSSSNDNAIAIGNKTQALGVNSMALGNASQASGESSIALGNTSEASEQNAIALGQGSIASKVNSIALGSNSLSSGENAIALGEGSAAGGSNSLAFGSQSRANGNDSVAIGVGAAAATDNSVAIGAGSTTDASNTVSVGNSATKRKIVNMAAGAISNTSTDAINGSQLYTISDSVAKRLGGGATVGSDGTVTAVSYALRSGTYNNVGDALSGIDNNTLQWNKTAGAFSANHGANATNKITNVAKGTVSATSTDVVNGSQLYDLQQDALLWNGTAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKAAGAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGSTDAVNGSQLKTTNDNVSTNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGSTDAVNGSQLKTTNDNVSTNTTNITNLTDSVGDLKDDSLLWNKAAGAFSAAHGTEATSKITNLLAGKISSNSTDAINGSQLYGVADSFTSYLGGGADISDTGVLSGPTYTIGGTDYTNVGDALAAINTSFSTSLGDALLWDATAGKFSAKHGINNAPSVITDVANGAVSSTSSDAINGSQLYGVSDYIADALGGNAVVNTDGSITTPTYAIAGGSYNNVGDALEAIDTTLDDALLWDTTANGGNGAFSAAHGKDKTASVITNVANGAVSATSNDAINGSQLYSTNKYIADALGGDAEVNADGTITAPTYTIANTDYNNVGEALDALDNNALLWDEDAGAYNASHDGNASKITNVAAGDLSTTSTDAVNGSQLNATNILVTQNSQMINQLAGNTSETYIEENGAGINYVRTNDSGLAFNDASASGIGATAVGYNAVASHASSVAIGQDSISEVDTGIALGSSSVSSRVIVKGTRNTSVSEEGVVIGYDTTDGELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQLQNAIGAVATTPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGIGLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFSVGSEDGQRQITNVAAGSADTDAVNVGQLKVTDAQVSQNTQSITNLNTQVTNLDTRVTNIENGIGDIVTTGSTKYFKTNTDGADANAQGKDSVAIGSGSIAAADNSVALGTGSVADEENTISVGSSTNQRRITNVAAGVNATDAVNVSQLKSSEAGGVRYDTKADGSIDYSNITLGGGNSGTTRISNVSAGVNNNDAVNYAQLKQSVQETKQYTDQRMVEMDNKLSKTESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSANGRWVYKLQGSTNSQGEYSAALGAGIQW