C0393
General Information
Protegen ID
283
Sequence Strain (Species/Organism)
Escherichia coli CFT073
VO ID
VO_0010989
Taxonomy ID
199310
Molecule Role
Protective antigen
Molecule Role Annotation
Active immunization of BALB/c mice with C0393 antigen in Freund's adjuvant protects mice from lethal challenge with ExPEC strain S26 (Durant et al., 2007).
COG
COG3468 (categories MU). M: Cell wall/membrane/envelope biogenesis; U: Intracellular trafficking, secretion, and vesicular transport
Related Vaccine(s)
E. coli vaccine based on recombinant protein C0393
References
Durant et al., 2007: Durant L, Metais A, Soulama-Mouze C, Genevard JM, Nassif X, Escaich S. Identification of candidates for a subunit vaccine against extraintestinal pathogenic Escherichia coli. Infection and Immunity. 2007; 75(4): 1916-1925. [PubMed: 17145948].
Gene Information
Gene Name
C0393
NCBI Gene ID
1034958
Genbank Accession
AE014075
Locus Tag
c0393
Gene Starting Position
371876
Gene Ending Position
376006
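The start and end positions can be cross-checked against the protein length listed further down in this record. The following is a minimal sketch, assuming the positions are 1-based and inclusive (the usual convention for this kind of range, but not stated in the record); the function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive positions."""
    return end - start + 1


# Values copied from this record.
gene_start = 371876
gene_end = 376006
protein_length = 1376  # amino acids, from "Protein Length"

nt_length = span_length(gene_start, gene_end)
codons, remainder = divmod(nt_length, 3)

# A complete coding region is expected to hold one codon per residue
# plus one stop codon, with no nucleotides left over.
is_consistent = (remainder == 0) and (codons == protein_length + 1)
print(nt_length, codons, remainder, is_consistent)
```

Under that assumption the span covers 4131 nucleotides, that is 1377 codons: 1376 residues plus a stop codon.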
DNA Sequence
>NC_004431.1:371876-376006 Escherichia coli CFT073, complete genome
ATCAGAATGAATAACGAATATTAGCGTTTATCGCATCATCTGTGTTGTATTTACCGAATGCAGAGCGTTC
AACTTCCAGCCCCAGACGCGTATTGTCGCCAAACCGGGCATTTAACCCCACACCGTAAAGCATACGACCG
TCTTTTCTGCCATTAATCTGATGTTCTCCCGCTGCATCCTTCAGGTGAACGTCAGCACTGTCCGTCAGAT
CGAACTCATAATGCAGGCCGGCACGGGCTGTCAGACTCCAGTCCTTACCACTGAAGGTTTTACCGGAAAC
AACGCCGGTTCTGCCTACCAGAGGATTAACGCTGTTACGACGCATTGAGACATCCATTCCACTGTCGTTC
CAGTTAAATGTTTGGCCCTGCAGTCTTCCCCAGACCAGTTCCGCCTGAGGTTCAACAAACGTCGTATCTG
TCAGATGATAACGGTATCCGACTTCTGCACCTGCATACAGTGAATGGCTGCGGAAGTTCTGTTTACCAGC
TCCGGCAAAGTTCAGGTCATATTTGTTTTCATTGTGAATATATTTGGCAATCAAATCAAAGTAAGCGCCG
GACCGGAACAGACCACTGGCATAGAAACCACCACCCCATGATTTTGTTTTACCGCTGTACAGGCCTGCTG
ACGCATCTGTGTCAGTGTAGGTGGCCATCACGCCGGTAAACAGGTCCATACTTCCCAGTTCGTGCTTACG
GTCAGCCCCCATCTGCAGCAGGGTATAGTGGTCAGTGAAACCGCCATCAGCAGAGCCGGAACCGTTCAGC
AGACGCACCCACGTACCGGCTTCGCCGTTAATATCCCTCAAATCGCCCATGCGTTTGTTCAGGTTGTTAA
CTTCAGTGATGAAGTTGTTATAGCTGATGTGCATGAATGTGGCGGCAGCCTTACCCTGGCCGTCGTTACG
TGCAACCTGGTAACCATCGAGGACCCACTCTTTTTTCCCGTCCTCTTTTCTGACACTAAGGGTGGGGGTG
ACATCACTGAATCCCACAACCCGTGTTGATGCCCTGAACAGATTATCAGCTGTCGCTTCAGGTGCGCTGA
CCAGTGGAATATCAAGCGTGTCCTTGTCAGAGGGTTTTTTCAGGAAGTTAACCCAGATGCTGTTGTCATG
ACCTGTTGCCGACTTGTTTATCACCAGTTTGTCTGCCTTGTTAAGGTCTGTACGCATGACAAATGCTGAC
TGAACCGCGTCCAGATTATCTGTTGTCAGTGTCGTGAACGATGATGTTCCCCCGTTAAAACCGACTATTG
TCCGGTTAAGTTTCATATTTCCTGCCGTGGAGTTTCCGTTCATCGACCACTGGGTGTCTGTCATGCTGAC
GGTGGCATCCGGTGCATTCAGGCTCCCGCTCCAGGTATTGCGGTACCCGTTAAACAGGCTGTACAACATC
TGATTCTGAAGAGTCAGGTCAGGACTCAGTTCCCCTTCCCCTCCGAGGGTGACAGTCCCTTTATCCTGAA
CATTGATATTACCTGACAACATACTGTACGGCCCCACGTTCAGGCGGGCATCGTCTCCCTTCAGGTTCCA
TGAACCGGCATAATCGTATACAGGTAAAAGTGTGTGAGATACCTCATCAGGACGGCTGTTCAGACTCAGG
GTGGCATCAGAAATATTCACCGGACCGTCAGAAACAAAACTCTGACTGGCCAGAACATTTGCTCCCTTGT
TCAGATTCAGGGCGGTACTGGTCAGCGTTGAGTTCTCCAGAACGGCACTGTCTGAGGAGATATTCACGGT
ACTGTTGTTCGCCTGTATTCCGCCATTGAATATCTCATTGATATTCAGCACTGACTGATTATCCAGGTTG
ACGGTGCCGTTGAAGACGCTTTTATCTGCATCTTTAGTTGCAACAGATGTGCCTTCTTCAAGGGTAAATG
CTGTTCCCTGGCCATCTTTTTTGTCGATAAATACCCGACTGTCGCCCAGCGTGACGCTGGAGTTATCTGC
CTGGATGGTTGTGTTCAGTGTGGCATTGCGGCCCAGACCAAAGTCTGTATCTTTTAACACGAGCGAACCA
AAGCTGAACGTCCTGTTCTCCCAGTCATCCTGTGTAAATGAGGTGGGCTGTGTCAGAACGGAATTGTCGC
CCAGAGACGAGACTGTATTTGCAATACTCTGAGACGTTGAAGCATGGATAACCGGGTGGCCCTGAATGGT
CAGACGACCGTTTTCCTGAGTAAATGTACCGGACATATTCGCTGAGCCGTCCATAACCAATGCGCCGGTT
GTACCCGGGGTTGCTTTATTTGAGAAATTAATATTTCCCAGCAACTTGCCATGATACAGATACCCTTTAT
TATTAATTCTGTTTGCAAGCAGTGCCTGTGCACTGTTCTGGTCATGTCCGACATATTCCCAGTGCTCGTT
ACTGACCTGACCGGTAGGGAACCAGCCATAACTACTTGTTTTCAGGATAAAATAATCGACGGTATGAGTA
TAGGGATTATTATAAATATATAATGAACCTACTGTTCCCCTGTTTGATGATGACCATTCATTAACTTTTA
CGTCTGCCGGACGCGTCTGATAATCCAGAGTGATATTAGCCGTTTTATCACTGCTGTTACCGAGAGTTGC
GCCATAATCGGCGGCATTCAGCTTATGAAATGTCAGGTCATTCCCGTTAACATCCAGAACCCCCCCCCGG
TAGCCCCAGGATATATTGTCCGGATTAACCTGCTGGTTGTCTGCCAGCACGACTGTCGGGCGGCCGCTGG
CAATATTCACGCTACTGAATGCCTGAACGTGTCCTGAACTGTCAGCCTGCTGATTGAGGACAACGGTCCC
ATCCCCGACTTTCAGGCCGCCCTCATTAACACCGGTTCCCTGTACAACCAGGGTTCCTTCGCCGATTTTA
TGCAGGTTGTCACCTTTCACACCATTAACCTGCCAGTTTACGGAGGCATCCTTGTCCACAATAATACCGG
CCCCGGTCCAGGTACTTCCGTTTGAAGTGGTGACAGTGTAGTCATCAGTAAATGTCAGTGAACCGGCACC
CTGCGTGACAGAGTTTTCCAGGTCAATCTGACCATTATGTCCCAGGAATGTCAGATTTTTACCTGCGTTC
AGGTCAGAACCTTTTTGCCCGTGCATGGCATATTCATCGGAACCCTGTTTCAGAGAGCCAGTGCCGGTGC
TGCTGTCAAATTTCCATTGCAGGGGGGCGCCGGATGAGGCATTAAAAAAGACGGGAGCGTCATTATCCTC
TGAATAGATCTGTGAGAGAAAACTCTGAGGAATAAGAGAATATATCAAATTGGTCCCCCCTCCTACTCCC
GAGTAAACACCGACCAGTTCCCACTGCCCTTTGGCCGTATTCCAGCCAAATAATGGAGAACCACTGTCGC
CGGCCTCTCCAAAAGAGGGCAGGATGCTATGATCATGTATGTTGCCCCCCATATACAGCTGAATGCCGTC
TGAGCCGTGATAAAAGAATGATGTCGGGAGTATTCCTCCTGTCAGATAACCATACCCACCTGTTACCCAA
TGTCGCTTACCCTGACTATCCTGAATATACTGACTTCCCGAACCAGCCCTGTAGAATGCCGAGTATTTTG
AAGGGTTCAATATATCAGCTGTTGATGAGCTGGTTACGGTAGCCGGAGCAACCTCAGTTACGAGCTTATC
AAGTCTTGGTGTGTGGAGATCAGATGAACTGTGTTCATTACGATCCACAATATGGTAACTGTTCTGACCA
TCACCGAAGCTGACGCTCTGATATCCTTTATTATGTTTTACACTGGCTATATATTGCGGGTTAATTAATG
TTGCAACGCCGGGATTTGAGCTTACATTCACACTGCTAAAATCAACCATGGGCGCTTTATCAAGATGTCC
TACTAATTCCCCTTTATTATTAAAAATAGGAATGTTTGTTGCGCCAGCCTGAAACTGCCCTTTGTTTTCT
GCAAAGTCGCGGTATGTCTGGTAAGGATTGTTGCCACCAACCGTTGATGCACCAGCAACGGTTGGGAGTA
ATGCAGATAGTGCCAGAGAGGTAAGTACTGAAAGTCTTTTTCCTCTGCGGGTACTCCCTTTACATACCCT
TCGGGCTAGTTCAGAGACAACCTTTACTGTGTTAGTAATATAACAATATTTTAGAGCGTATATTTTATTC
A
Protein Information
Protein Name
hemoglobin protease
NCBI Protein GI
26246291
Protein Accession
NP_752330
Protein pI
5.92
Protein Weight
138492.32
Protein Length
1376
Protein Note
Residues 1 to 1376 of 1376 are 79.04% identical to residues 1 to 1377 of 1377 from GenPept.129: >emb|CAA11507.1| (AJ223631) haemoglobin protease [Escherichia coli]
Protein Sequence
>NP_752330.1 hemoglobin protease [Escherichia coli CFT073]
MNKIYALKYCYITNTVKVVSELARRVCKGSTRRGKRLSVLTSLALSALLPTVAGASTVGGNNPYQTYRDF
AENKGQFQAGATNIPIFNNKGELVGHLDKAPMVDFSSVNVSSNPGVATLINPQYIASVKHNKGYQSVSFG
DGQNSYHIVDRNEHSSSDLHTPRLDKLVTEVAPATVTSSSTADILNPSKYSAFYRAGSGSQYIQDSQGKR
HWVTGGYGYLTGGILPTSFFYHGSDGIQLYMGGNIHDHSILPSFGEAGDSGSPLFGWNTAKGQWELVGVY
SGVGGGTNLIYSLIPQSFLSQIYSEDNDAPVFFNASSGAPLQWKFDSSTGTGSLKQGSDEYAMHGQKGSD
LNAGKNLTFLGHNGQIDLENSVTQGAGSLTFTDDYTVTTSNGSTWTGAGIIVDKDASVNWQVNGVKGDNL
HKIGEGTLVVQGTGVNEGGLKVGDGTVVLNQQADSSGHVQAFSSVNIASGRPTVVLADNQQVNPDNISWG
YRGGVLDVNGNDLTFHKLNAADYGATLGNSSDKTANITLDYQTRPADVKVNEWSSSNRGTVGSLYIYNNP
YTHTVDYFILKTSSYGWFPTGQVSNEHWEYVGHDQNSAQALLANRINNKGYLYHGKLLGNINFSNKATPG
TTGALVMDGSANMSGTFTQENGRLTIQGHPVIHASTSQSIANTVSSLGDNSVLTQPTSFTQDDWENRTFS
FGSLVLKDTDFGLGRNATLNTTIQADNSSVTLGDSRVFIDKKDGQGTAFTLEEGTSVATKDADKSVFNGT
VNLDNQSVLNINEIFNGGIQANNSTVNISSDSAVLENSTLTSTALNLNKGANVLASQSFVSDGPVNISDA
TLSLNSRPDEVSHTLLPVYDYAGSWNLKGDDARLNVGPYSMLSGNINVQDKGTVTLGGEGELSPDLTLQN
QMLYSLFNGYRNTWSGSLNAPDATVSMTDTQWSMNGNSTAGNMKLNRTIVGFNGGTSSFTTLTTDNLDAV
QSAFVMRTDLNKADKLVINKSATGHDNSIWVNFLKKPSDKDTLDIPLVSAPEATADNLFRASTRVVGFSD
VTPTLSVRKEDGKKEWVLDGYQVARNDGQGKAAATFMHISYNNFITEVNNLNKRMGDLRDINGEAGTWVR
LLNGSGSADGGFTDHYTLLQMGADRKHELGSMDLFTGVMATYTDTDASAGLYSGKTKSWGGGFYASGLFR
SGAYFDLIAKYIHNENKYDLNFAGAGKQNFRSHSLYAGAEVGYRYHLTDTTFVEPQAELVWGRLQGQTFN
WNDSGMDVSMRRNSVNPLVGRTGVVSGKTFSGKDWSLTARAGLHYEFDLTDSADVHLKDAAGEHQINGRK
DGRMLYGVGLNARFGDNTRLGLEVERSAFGKYNTDDAINANIRYSF
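A reader who copies the FASTA block above into a script may want to confirm that the joined sequence has the declared "Protein Length". The following is a minimal sketch using only the standard library; the helper names `read_fasta_record` and `matches_declared_length` are hypothetical, and the demo record is a shortened toy input built from the first 20 residues, not the full sequence.

```python
from typing import Iterable, Tuple


def read_fasta_record(lines: Iterable[str]) -> Tuple[str, str]:
    """Return (header, sequence) for the first FASTA record in `lines`."""
    header = ""
    chunks = []
    seen_header = False
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if seen_header:
                break  # a second header starts the next record
            seen_header = True
            header = line[1:]
        else:
            chunks.append(line)
    return header, "".join(chunks)


def matches_declared_length(sequence: str, declared: int) -> bool:
    """Compare the residue count with the length stated in the record."""
    return len(sequence) == declared


# Toy input: the real header with only the first 20 residues, split over two lines.
demo_lines = [
    ">NP_752330.1 hemoglobin protease [Escherichia coli CFT073]",
    "MNKIYALKYC",
    "YITNTVKVVS",
]
demo_header, demo_sequence = read_fasta_record(demo_lines)
print(demo_header)
print(len(demo_sequence), matches_declared_length(demo_sequence, 20))
```

For the full record, the same call would be made with all sequence lines and a declared length of 1376.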