C0393

General Information
Protegen ID: 283
Sequence Strain (Species/Organism): Escherichia coli CFT073
VO ID: VO_0010989
Taxonomy ID: 199310
Molecule Role: Protective antigen
Molecule Role Annotation: Active immunization of BALB/c mice with C0393 antigen in Freund's adjuvant protects mice from lethal challenge with ExPEC strain S26 (Durant et al., 2007).
COG: COG3468 (functional categories M and U). M: Cell wall/membrane/envelope biogenesis; U: Intracellular trafficking, secretion, and vesicular transport
Related Vaccine(s): E. coli vaccine based on recombinant protein C0393
References
Durant et al., 2007: Durant L, Metais A, Soulama-Mouze C, Genevard JM, Nassif X, Escaich S. Identification of candidates for a subunit vaccine against extraintestinal pathogenic Escherichia coli. Infection and Immunity. 2007; 75(4): 1916-1925. [PubMed: 17145948].
Gene Information
Gene Name: C0393
NCBI Gene ID: 1034958
Genbank Accession: AE014075
Locus Tag: c0393
Gene Starting Position: 371876
Gene Ending Position: 376006
DNA Sequence
>NC_004431.1:371876-376006 Escherichia coli CFT073, complete genome
ATCAGAATGAATAACGAATATTAGCGTTTATCGCATCATCTGTGTTGTATTTACCGAATGCAGAGCGTTC
AACTTCCAGCCCCAGACGCGTATTGTCGCCAAACCGGGCATTTAACCCCACACCGTAAAGCATACGACCG
TCTTTTCTGCCATTAATCTGATGTTCTCCCGCTGCATCCTTCAGGTGAACGTCAGCACTGTCCGTCAGAT
CGAACTCATAATGCAGGCCGGCACGGGCTGTCAGACTCCAGTCCTTACCACTGAAGGTTTTACCGGAAAC
AACGCCGGTTCTGCCTACCAGAGGATTAACGCTGTTACGACGCATTGAGACATCCATTCCACTGTCGTTC
CAGTTAAATGTTTGGCCCTGCAGTCTTCCCCAGACCAGTTCCGCCTGAGGTTCAACAAACGTCGTATCTG
TCAGATGATAACGGTATCCGACTTCTGCACCTGCATACAGTGAATGGCTGCGGAAGTTCTGTTTACCAGC
TCCGGCAAAGTTCAGGTCATATTTGTTTTCATTGTGAATATATTTGGCAATCAAATCAAAGTAAGCGCCG
GACCGGAACAGACCACTGGCATAGAAACCACCACCCCATGATTTTGTTTTACCGCTGTACAGGCCTGCTG
ACGCATCTGTGTCAGTGTAGGTGGCCATCACGCCGGTAAACAGGTCCATACTTCCCAGTTCGTGCTTACG
GTCAGCCCCCATCTGCAGCAGGGTATAGTGGTCAGTGAAACCGCCATCAGCAGAGCCGGAACCGTTCAGC
AGACGCACCCACGTACCGGCTTCGCCGTTAATATCCCTCAAATCGCCCATGCGTTTGTTCAGGTTGTTAA
CTTCAGTGATGAAGTTGTTATAGCTGATGTGCATGAATGTGGCGGCAGCCTTACCCTGGCCGTCGTTACG
TGCAACCTGGTAACCATCGAGGACCCACTCTTTTTTCCCGTCCTCTTTTCTGACACTAAGGGTGGGGGTG
ACATCACTGAATCCCACAACCCGTGTTGATGCCCTGAACAGATTATCAGCTGTCGCTTCAGGTGCGCTGA
CCAGTGGAATATCAAGCGTGTCCTTGTCAGAGGGTTTTTTCAGGAAGTTAACCCAGATGCTGTTGTCATG
ACCTGTTGCCGACTTGTTTATCACCAGTTTGTCTGCCTTGTTAAGGTCTGTACGCATGACAAATGCTGAC
TGAACCGCGTCCAGATTATCTGTTGTCAGTGTCGTGAACGATGATGTTCCCCCGTTAAAACCGACTATTG
TCCGGTTAAGTTTCATATTTCCTGCCGTGGAGTTTCCGTTCATCGACCACTGGGTGTCTGTCATGCTGAC
GGTGGCATCCGGTGCATTCAGGCTCCCGCTCCAGGTATTGCGGTACCCGTTAAACAGGCTGTACAACATC
TGATTCTGAAGAGTCAGGTCAGGACTCAGTTCCCCTTCCCCTCCGAGGGTGACAGTCCCTTTATCCTGAA
CATTGATATTACCTGACAACATACTGTACGGCCCCACGTTCAGGCGGGCATCGTCTCCCTTCAGGTTCCA
TGAACCGGCATAATCGTATACAGGTAAAAGTGTGTGAGATACCTCATCAGGACGGCTGTTCAGACTCAGG
GTGGCATCAGAAATATTCACCGGACCGTCAGAAACAAAACTCTGACTGGCCAGAACATTTGCTCCCTTGT
TCAGATTCAGGGCGGTACTGGTCAGCGTTGAGTTCTCCAGAACGGCACTGTCTGAGGAGATATTCACGGT
ACTGTTGTTCGCCTGTATTCCGCCATTGAATATCTCATTGATATTCAGCACTGACTGATTATCCAGGTTG
ACGGTGCCGTTGAAGACGCTTTTATCTGCATCTTTAGTTGCAACAGATGTGCCTTCTTCAAGGGTAAATG
CTGTTCCCTGGCCATCTTTTTTGTCGATAAATACCCGACTGTCGCCCAGCGTGACGCTGGAGTTATCTGC
CTGGATGGTTGTGTTCAGTGTGGCATTGCGGCCCAGACCAAAGTCTGTATCTTTTAACACGAGCGAACCA
AAGCTGAACGTCCTGTTCTCCCAGTCATCCTGTGTAAATGAGGTGGGCTGTGTCAGAACGGAATTGTCGC
CCAGAGACGAGACTGTATTTGCAATACTCTGAGACGTTGAAGCATGGATAACCGGGTGGCCCTGAATGGT
CAGACGACCGTTTTCCTGAGTAAATGTACCGGACATATTCGCTGAGCCGTCCATAACCAATGCGCCGGTT
GTACCCGGGGTTGCTTTATTTGAGAAATTAATATTTCCCAGCAACTTGCCATGATACAGATACCCTTTAT
TATTAATTCTGTTTGCAAGCAGTGCCTGTGCACTGTTCTGGTCATGTCCGACATATTCCCAGTGCTCGTT
ACTGACCTGACCGGTAGGGAACCAGCCATAACTACTTGTTTTCAGGATAAAATAATCGACGGTATGAGTA
TAGGGATTATTATAAATATATAATGAACCTACTGTTCCCCTGTTTGATGATGACCATTCATTAACTTTTA
CGTCTGCCGGACGCGTCTGATAATCCAGAGTGATATTAGCCGTTTTATCACTGCTGTTACCGAGAGTTGC
GCCATAATCGGCGGCATTCAGCTTATGAAATGTCAGGTCATTCCCGTTAACATCCAGAACCCCCCCCCGG
TAGCCCCAGGATATATTGTCCGGATTAACCTGCTGGTTGTCTGCCAGCACGACTGTCGGGCGGCCGCTGG
CAATATTCACGCTACTGAATGCCTGAACGTGTCCTGAACTGTCAGCCTGCTGATTGAGGACAACGGTCCC
ATCCCCGACTTTCAGGCCGCCCTCATTAACACCGGTTCCCTGTACAACCAGGGTTCCTTCGCCGATTTTA
TGCAGGTTGTCACCTTTCACACCATTAACCTGCCAGTTTACGGAGGCATCCTTGTCCACAATAATACCGG
CCCCGGTCCAGGTACTTCCGTTTGAAGTGGTGACAGTGTAGTCATCAGTAAATGTCAGTGAACCGGCACC
CTGCGTGACAGAGTTTTCCAGGTCAATCTGACCATTATGTCCCAGGAATGTCAGATTTTTACCTGCGTTC
AGGTCAGAACCTTTTTGCCCGTGCATGGCATATTCATCGGAACCCTGTTTCAGAGAGCCAGTGCCGGTGC
TGCTGTCAAATTTCCATTGCAGGGGGGCGCCGGATGAGGCATTAAAAAAGACGGGAGCGTCATTATCCTC
TGAATAGATCTGTGAGAGAAAACTCTGAGGAATAAGAGAATATATCAAATTGGTCCCCCCTCCTACTCCC
GAGTAAACACCGACCAGTTCCCACTGCCCTTTGGCCGTATTCCAGCCAAATAATGGAGAACCACTGTCGC
CGGCCTCTCCAAAAGAGGGCAGGATGCTATGATCATGTATGTTGCCCCCCATATACAGCTGAATGCCGTC
TGAGCCGTGATAAAAGAATGATGTCGGGAGTATTCCTCCTGTCAGATAACCATACCCACCTGTTACCCAA
TGTCGCTTACCCTGACTATCCTGAATATACTGACTTCCCGAACCAGCCCTGTAGAATGCCGAGTATTTTG
AAGGGTTCAATATATCAGCTGTTGATGAGCTGGTTACGGTAGCCGGAGCAACCTCAGTTACGAGCTTATC
AAGTCTTGGTGTGTGGAGATCAGATGAACTGTGTTCATTACGATCCACAATATGGTAACTGTTCTGACCA
TCACCGAAGCTGACGCTCTGATATCCTTTATTATGTTTTACACTGGCTATATATTGCGGGTTAATTAATG
TTGCAACGCCGGGATTTGAGCTTACATTCACACTGCTAAAATCAACCATGGGCGCTTTATCAAGATGTCC
TACTAATTCCCCTTTATTATTAAAAATAGGAATGTTTGTTGCGCCAGCCTGAAACTGCCCTTTGTTTTCT
GCAAAGTCGCGGTATGTCTGGTAAGGATTGTTGCCACCAACCGTTGATGCACCAGCAACGGTTGGGAGTA
ATGCAGATAGTGCCAGAGAGGTAAGTACTGAAAGTCTTTTTCCTCTGCGGGTACTCCCTTTACATACCCT
TCGGGCTAGTTCAGAGACAACCTTTACTGTGTTAGTAATATAACAATATTTTAGAGCGTATATTTTATTC
A
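
The FASTA header of the DNA sequence repeats the coordinates given in the Gene Starting Position and Gene Ending Position fields. The following is a minimal Python sketch of how such a header could be split into accession, coordinates, and description so the two can be cross-checked. The helper name `parse_region_header` and the regular expression are illustrative assumptions, not part of Protegen or NCBI tooling, and the pattern assumes the ">accession:start-end description" layout seen in this record.

```python
import re

# Header line copied from the record's DNA Sequence field.
HEADER = ">NC_004431.1:371876-376006 Escherichia coli CFT073, complete genome"

# Assumed layout: ">accession:start-end description" (as seen in this record,
# not a general FASTA rule).
HEADER_PATTERN = re.compile(
    r"^>(?P<accession>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\s+(?P<description>.+)$"
)


def parse_region_header(header: str) -> dict:
    """Split a region-style FASTA header into its parts (hypothetical helper)."""
    match = HEADER_PATTERN.match(header)
    if match is None:
        raise ValueError(f"unrecognized header: {header!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "accession": match["accession"],
        "start": start,
        "end": end,
        "length": end - start + 1,  # coordinates treated as an inclusive span
        "description": match["description"],
    }


region = parse_region_header(HEADER)
print(region["accession"], region["start"], region["end"], region["length"])
```

For this record the parsed start and end match the Gene Starting Position and Gene Ending Position fields, and the inclusive span works out to 4131 bases.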

Protein Information
Protein Name: hemoglobin protease
NCBI Protein GI: 26246291
Protein Accession: NP_752330
Protein pI: 5.92
Protein Weight: 138492.32
Protein Length: 1376
Protein Note: Residues 1 to 1376 of 1376 are 79.04 pct identical to residues 1 to 1377 of 1377 from GenPept.129 : >emb|CAA11507.1| (AJ223631) haemoglobin protease [Escherichia coli]
Protein Sequence
>NP_752330.1 hemoglobin protease [Escherichia coli CFT073]
MNKIYALKYCYITNTVKVVSELARRVCKGSTRRGKRLSVLTSLALSALLPTVAGASTVGGNNPYQTYRDF
AENKGQFQAGATNIPIFNNKGELVGHLDKAPMVDFSSVNVSSNPGVATLINPQYIASVKHNKGYQSVSFG
DGQNSYHIVDRNEHSSSDLHTPRLDKLVTEVAPATVTSSSTADILNPSKYSAFYRAGSGSQYIQDSQGKR
HWVTGGYGYLTGGILPTSFFYHGSDGIQLYMGGNIHDHSILPSFGEAGDSGSPLFGWNTAKGQWELVGVY
SGVGGGTNLIYSLIPQSFLSQIYSEDNDAPVFFNASSGAPLQWKFDSSTGTGSLKQGSDEYAMHGQKGSD
LNAGKNLTFLGHNGQIDLENSVTQGAGSLTFTDDYTVTTSNGSTWTGAGIIVDKDASVNWQVNGVKGDNL
HKIGEGTLVVQGTGVNEGGLKVGDGTVVLNQQADSSGHVQAFSSVNIASGRPTVVLADNQQVNPDNISWG
YRGGVLDVNGNDLTFHKLNAADYGATLGNSSDKTANITLDYQTRPADVKVNEWSSSNRGTVGSLYIYNNP
YTHTVDYFILKTSSYGWFPTGQVSNEHWEYVGHDQNSAQALLANRINNKGYLYHGKLLGNINFSNKATPG
TTGALVMDGSANMSGTFTQENGRLTIQGHPVIHASTSQSIANTVSSLGDNSVLTQPTSFTQDDWENRTFS
FGSLVLKDTDFGLGRNATLNTTIQADNSSVTLGDSRVFIDKKDGQGTAFTLEEGTSVATKDADKSVFNGT
VNLDNQSVLNINEIFNGGIQANNSTVNISSDSAVLENSTLTSTALNLNKGANVLASQSFVSDGPVNISDA
TLSLNSRPDEVSHTLLPVYDYAGSWNLKGDDARLNVGPYSMLSGNINVQDKGTVTLGGEGELSPDLTLQN
QMLYSLFNGYRNTWSGSLNAPDATVSMTDTQWSMNGNSTAGNMKLNRTIVGFNGGTSSFTTLTTDNLDAV
QSAFVMRTDLNKADKLVINKSATGHDNSIWVNFLKKPSDKDTLDIPLVSAPEATADNLFRASTRVVGFSD
VTPTLSVRKEDGKKEWVLDGYQVARNDGQGKAAATFMHISYNNFITEVNNLNKRMGDLRDINGEAGTWVR
LLNGSGSADGGFTDHYTLLQMGADRKHELGSMDLFTGVMATYTDTDASAGLYSGKTKSWGGGFYASGLFR
SGAYFDLIAKYIHNENKYDLNFAGAGKQNFRSHSLYAGAEVGYRYHLTDTTFVEPQAELVWGRLQGQTFN
WNDSGMDVSMRRNSVNPLVGRTGVVSGKTFSGKDWSLTARAGLHYEFDLTDSADVHLKDAAGEHQINGRK
DGRMLYGVGLNARFGDNTRLGLEVERSAFGKYNTDDAINANIRYSF
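
As a quick consistency check, the gene coordinates and the protein length in this record can be compared: a coding sequence for an N-residue protein is expected to span 3 × (N + 1) bases when the stop codon is included. The Python sketch below uses only values copied from the record. The function names are hypothetical, and it assumes the gene coordinates are inclusive and that the Protein Weight is given in daltons, which the record does not state.

```python
# Values copied from the Gene Information and Protein Information fields.
GENE_START = 371876
GENE_END = 376006
PROTEIN_LENGTH = 1376
PROTEIN_WEIGHT = 138492.32  # assumed to be in daltons; the record gives no unit


def span_length(start: int, end: int) -> int:
    """Number of bases in an inclusive coordinate span."""
    return end - start + 1


def expected_cds_length(protein_length: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting the stop codon."""
    codons = protein_length + (1 if include_stop else 0)
    return 3 * codons


gene_length = span_length(GENE_START, GENE_END)
cds_length = expected_cds_length(PROTEIN_LENGTH)
mean_residue_mass = PROTEIN_WEIGHT / PROTEIN_LENGTH

print("gene span:", gene_length)
print("expected CDS length:", cds_length)
print("consistent:", gene_length == cds_length)
print("mean residue mass:", round(mean_residue_mass, 2))
```

Under these assumptions the gene span and the expected coding length agree at 4131 bases, and the mean mass per residue comes out at roughly 100.65.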

Epitope Information
IEDB Linear Epitope