gag-pol SIV

General Information
Protegen ID: 1255
Sequence Strain (Species/Organism): Simian immunodeficiency virus
Taxonomy ID: 11723
Molecule Role: Protective antigen
Related Vaccine(s): NYVAC-SIV
References  
Gene Information
Gene Name: gag-pol SIV
NCBI Gene ID: 1490009
GenBank Accession: M29973
Locus Tag: SIVgp1
Gene Starting Position: 896
Gene Ending Position: 5313
DNA Sequence
>NC_001549.1:896-5313 Simian immunodeficiency virus, complete genome
AATGGGCGGGGGTCACTCAGCACTGTCAGGGAGAAGCCTCGACACGTTCGAGAAGATTAGGCTACGTCCG
AACGGGAAAAAGAAGTACCAAATTAAACATTTAATATGGGCAGGAAAAGAAATGGAACGATTTGGGTTAC
ATGAGAAACTTTTAGAAACAAAAGAAGGCTGTCAAAAAATCATAGAAGTTTTAACCCCGTTGGAACCGAC
AGGCTCCGAGGGGCTAAAAGCTCTGTTTAATTTGTGCTGCGTCATTTGGTGCATTCACGCAGAACAGAAA
GTGAAAGACACAGAGGAAGCTGTAGTAACAGTTAAGCAACACTACCATCTAGTGGACAAAAATGAGAAAG
CAGCTAAAAAGAAAAATGAGACAACAGCGCCACCTGGTGGCGAATCAAGAAATTACCCAGTAGTAAATCA
GAATAATGCCTGGGTACACCAGCCTTTGTCTCCGCGCACGTTAAATGCGTGGGTCAAATGCGTGGAGGAA
AAAAGGTGGGGAGCAGAAGTAGTCCCCATGTTCCAAGCACTCTCAGAGGGATGTCTCTCCTATGATGTAA
ATCAGATGCTCAATGTAATAGGAGACCATCAGGGGGCATTACAAATTCTTAAGGAAGTCATTAATGAAGA
AGCAGCAGAGTGGGACAGGACACACAGACCACCAGCTGGCCCGTTACCAGCAGGGCAGCTAAGAGACCCG
ACAGGGTCAGATATAGCAGGAACTACCAGCTCAATTCAGGAACAAATAGAGTGGACCTTCAATGCCAATC
CAAGAATAGACGTAGGGGCACAATACAGAAAATGGGTTATTTTGGGCTTACAAAAGGTAGTGCAGATGTA
CAATCCCCAAAAGGTCCTAGACATTCGACAGGGACCTAAAGAACCCTTCCAGGACTATGTAGACAGATTC
TATAAAGCCCTGAGAGCAGAACAAGCACCACAGGATGTTAAAAATTGGATGACACAAACTTTGCTTATCC
AGAATGCCAATCCGGATTGTAAATTGATTCTGAAAGGATTGGGAATGAATCCAACCTTGGAGGAAATGCT
AATAGCTTGCCAGGGAGTAGGAGGGCCACAACATAAGGCTAAGCTAATGGTAGAAATGATGAGTAATGGA
CAGAATATGGTCCAAGTGGGACCTCAGAAAAAGGGCCCCCGAGGGCCGCTAAAATGCTTTAATTGTGGCA
AATTTGGACATATGCAAAGGGAATGCAAGGCACCAAGACAGATCAAATGCTTTAAGTGCGGCAAAATTGG
CCATATGGCAAAAGACTGCAAGAATGGACAGGCAAATTTTTTAGGGTATGGCCATTGGGGAGGAGCGAAA
CCAAGAAATTTTGTGCAATACAGAGGAGACACAGTTGGTCTGGAACCAACAGCCCCCCCAATGGAAACAG
CTTACGATCCAGCAAAGAAGCTCCTCCAGCAGTATGCAGAGAAGGGACAGCGCCTGAGAGAGGAGAGAGA
ACAGACAAGGAAACAGAAGGAGAAAGAAGTGGAGGATGTTTCCTTGAGCTCCCTCTTTGGAGGAGACCAA
TGAAACGAGTCATCATAGAAGGAACGCCAGTGCAAGCCTTGTTAGATACAGGAGCAGATGACACTATAAT
TCAAGAAAAGGACTTGCACTTTCCCCCACATAAACCATGGCGTTCCAAGGTAGTAGGAGGTATAGGAGGA
GGGATTCATGTCAAAGAATATCAGGGGGTACAAGTACAATTGGAGGATAAAATCATCACCGGCTCAATTC
TAATAGGAAGTACACCAATCAATATTATAGGAAGAAATATTTTAGCTCAGGCAGGCATGAAATTAGTTAT
GGGAGTTCTATCTAGTCAGATTGAGGAAACAAAAGTACAACTAAAAGAAGGGAAAGATGGACCTAAATTG
AAACAATGGCCCTTATCAAGAGAAAAAATTGAAGCTTTAACAGAAATATGCAAACAAATGGAAGAGGAGG
GAAAATTATCTAGGATAGGAGGAGAAAATCCTTATAATACACCAGTGTTTGCCATAAAGAAAAAGGATAA
AACACAATGGAGAATGCTTGTAGATTTCAGGGAACTAAACAAAGCTACTCAAGACTTTTTTGAGGTTCAG
CTGGGAATTCCTCACCCAGCGGGCCTTCAGAAAAAGAAGCAAATCACAGTAATAGACATAGGGGATGCCT
ATTATTCAATACCATTATGCAAGGAATTCAGAAAATATACAGCATTTACCATCCCCTCAGTAAATAATAC
AGGGCCAGGGATAAGGTATCAGTTCAATTGTCTGCCTCAGGGATGGAAAGGATCTCCTACAATTTTCCAG
AATACGGCAGCAAACATTTTAGAGGAGATCAAAAGGCACACTCCTGGGTTAGAAATTGTCCAATACATGG
ACGATTTGTGGTTGGCGTCAGACCATGATGAGACTAGACATAATCAACAGGTAGACATAGTAAGAAAGAT
GCTGCTAGAAAAAGGTCTAGAAACCCCAGACAAGAAAGTCCAAAGAGAACCGCCATGGGAATGGATGGGG
TATAAATTGCATCCGAATAAATGGACCATTAACAAAATAGAATTACCCCCCTTAGAAGGAGAATGGACAG
TAAACAAAATACAGAAGGTAGTAGGAGTTCTAAATTGGGCAAGTCAAATTTATCCAGGAATTAAAACCAA
ACATACCTGTGCCATGTTGAGAGGGAAAAAGAACCTCCTAGAAGAAATAGTATGGACAGAAGAGGCAGAG
GCAGAATATAAGAACAATCAAGGGATAGTGCAGGAAACACAAGAAGGAACATACTATGACCCTCTCAAAG
AATTAATAGCAACAGTTCAAAAGCAAGGAGAAGGGCAATGGACATACCAATTCACCCAAGAAGGGGCAGT
ATTAAAGGTGGGAAGATATGCCAAGCAAAGAGAAACTCATACTAATGATCTAAGGACTCTAGCACACCTT
GTCCAAAAAATCTGTAAGGAAGCACTTACCATTTGGGGAAGACTTCCACGAGTACAACTCCCAGTAGACA
AGAAAACATGGGATATGTGGTGGCAGGACTATTGGCAAGTATCCTGGATACCAGAATGGGAGTTTGTTAG
CACACCACTCCTAGTAAAACTGTGGTATTCCTTAGTAAAAGAACCAATCAAAGGAGAAGATGTTTATTAT
GTGGATGGGGCAGCATCCAAAGTGACCAAATTAGGTAAGGCAGGATATCTGTCAGAGAGAGGAAAAAGTA
GAATTAGGGAATTAGAAAACACCACTAACCAACAAGCAGAATTAACAGCAGTTAAGATGGCATTGGAGGA
CAGTGGAGAAAATGTAAATATAGTCACAGATTCTCAATATGTAATGAACATCTTGACAGCATGTCCACAG
GAAAGTAACTCACCCTTAGTGGAACAGATAATACAAGCCCTAATGAAAAAGAGGCAGGTCTACTTACAAT
GGGTACCAGCTCATAAGGGGATAGGAGGCAATACAGAAATAGATAAATTAGTAAGCAAAGGAATAAGACA
GATCCTCTTCTTAGATAGAATAGAAGAAGCACAAGATGACCATGCAAAGTACCATAACAATTGGAGAAGT
ATGGTACAGGAATTTGGATTACCTAATATAGTAGCAAAAGAGATAGTAGCGGCATGTCCCAAATGCCAAA
TAAGAGGAGAACCTAAGCATGGACAGGTAGACGCCTCCATTGAAACTTGGCAGATGGACTGCACCCATTT
AGAAGGAAAAGTTATAATAGTAGCAGTACATGTAGCCAGTGGATTCATAGAAGCAGAGGTGATCCCAAGA
GAAACTGGGAAGGAGACAGCACACTTTCTGCTGAAACTGTTAGCAAGATGGCCAGTGAAACATCTACACA
CTGATAATGGCCCAAACTTTACCTCTCAGAATGTGGCAGCGGTGTGCTGGTGGGGTAATATAGAGCACAC
CACTGGAATACCTTATAACCCACAGTCACAGGGTAGTGTAGAAAGCATGAACAGACAGCTCAAGGAAATC
ATCTCTCAAATAAGAGATGATTGTGAGAGATTGGAGACAGCAGTGCAAATGGCTACGCATATCCACAATT
TTAAAAGAAAGGGAGGAATAGGGGGTATCTCTAGTGCAGAAAGATTGGTTAATATGCTAACAACACAACT
AGAACTAAATACTCTACAAAACCAAATCCAAAAAATTTTGAATTTTAAGGTCTACTACAGAGAAGGTAGA
GATCCAGTGTGGAAAGGACCAGCGCGACTCATCTGGAAAGGAGAAGGCGCGGTGGTAATTAAAGAGGGGG
AAGACATCAAGGTAGTCCCCAGGAGAAAGGCTAAGATTATCAAAGATTATGGAGAGAGAAAAACAATGGA
TAGTGAGGGTAGTATGGAGGGTGTCAGAGAGGCAAATAAGCAGATGGAGGGGGATAGTGACTTACAAGAT
CAGGAATA
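
As a quick consistency check on the record above, the coordinate range in the FASTA header can be compared with the Gene Starting Position and Gene Ending Position fields. The following is a minimal sketch, assuming the header follows the `accession:start-end` pattern shown here and that coordinates are 1-based and inclusive; the function name `span_length` is hypothetical and not part of any Protegen or NCBI tool.

```python
import re


def span_length(header: str) -> int:
    """Return the inclusive length of the 'accession:start-end' range in a FASTA header.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.match(r">(\S+?):(\d+)-(\d+)", header)
    if match is None:
        raise ValueError(f"no coordinate range found in header: {header!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


header = ">NC_001549.1:896-5313 Simian immunodeficiency virus, complete genome"
print(span_length(header))  # 4418
```

Under these assumptions the range spans 4418 nucleotides, which can be checked against the number of bases in the sequence listed above.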

Protein Information
Protein Name: Gag-Pol
NCBI Protein GI: 22535297
Protein Accession: NP_687035
Protein pI: 8.66
Protein Weight: 157171.22
Protein Length: 1472
Protein Sequence
>NP_687035.1 Gag-Pol [Simian immunodeficiency virus]
MGGGHSALSGRSLDTFEKIRLRPNGKKKYQIKHLIWAGKEMERFGLHEKLLETKEGCQKIIEVLTPLEPT
GSEGLKALFNLCCVIWCIHAEQKVKDTEEAVVTVKQHYHLVDKNEKAAKKKNETTAPPGGESRNYPVVNQ
NNAWVHQPLSPRTLNAWVKCVEEKRWGAEVVPMFQALSEGCLSYDVNQMLNVIGDHQGALQILKEVINEE
AAEWDRTHRPPAGPLPAGQLRDPTGSDIAGTTSSIQEQIEWTFNANPRIDVGAQYRKWVILGLQKVVQMY
NPQKVLDIRQGPKEPFQDYVDRFYKALRAEQAPQDVKNWMTQTLLIQNANPDCKLILKGLGMNPTLEEML
IACQGVGGPQHKAKLMVEMMSNGQNMVQVGPQKKGPRGPLKCFNCGKFGHMQRECKAPRQIKCFKCGKIG
HMAKDCKNGQANFFRVWPLGRSETKKFCAIQRRHSWSGTNSPPNGNSLRSSKEAPPAVCREGTAPERGER
TDKETEGERSGGCFLELPLWRRPMKRVIIEGTPVQALLDTGADDTIIQEKDLHFPPHKPWRSKVVGGIGG
GIHVKEYQGVQVQLEDKIITGSILIGSTPINIIGRNILAQAGMKLVMGVLSSQIEETKVQLKEGKDGPKL
KQWPLSREKIEALTEICKQMEEEGKLSRIGGENPYNTPVFAIKKKDKTQWRMLVDFRELNKATQDFFEVQ
LGIPHPAGLQKKKQITVIDIGDAYYSIPLCKEFRKYTAFTIPSVNNTGPGIRYQFNCLPQGWKGSPTIFQ
NTAANILEEIKRHTPGLEIVQYMDDLWLASDHDETRHNQQVDIVRKMLLEKGLETPDKKVQREPPWEWMG
YKLHPNKWTINKIELPPLEGEWTVNKIQKVVGVLNWASQIYPGIKTKHTCAMLRGKKNLLEEIVWTEEAE
AEYKNNQGIVQETQEGTYYDPLKELIATVQKQGEGQWTYQFTQEGAVLKVGRYAKQRETHTNDLRTLAHL
VQKICKEALTIWGRLPRVQLPVDKKTWDMWWQDYWQVSWIPEWEFVSTPLLVKLWYSLVKEPIKGEDVYY
VDGAASKVTKLGKAGYLSERGKSRIRELENTTNQQAELTAVKMALEDSGENVNIVTDSQYVMNILTACPQ
ESNSPLVEQIIQALMKKRQVYLQWVPAHKGIGGNTEIDKLVSKGIRQILFLDRIEEAQDDHAKYHNNWRS
MVQEFGLPNIVAKEIVAACPKCQIRGEPKHGQVDASIETWQMDCTHLEGKVIIVAVHVASGFIEAEVIPR
ETGKETAHFLLKLLARWPVKHLHTDNGPNFTSQNVAAVCWWGNIEHTTGIPYNPQSQGSVESMNRQLKEI
ISQIRDDCERLETAVQMATHIHNFKRKGGIGGISSAERLVNMLTTQLELNTLQNQIQKILNFKVYYREGR
DPVWKGPARLIWKGEGAVVIKEGEDIKVVPRRKAKIIKDYGERKTMDSEGSMEGVREANKQMEGDSDLQD
QE
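
The Protein Length field can be checked against the sequence itself by counting residues in the FASTA record. The sketch below is illustrative only: `fasta_residue_count` is a hypothetical helper, it assumes a single-record FASTA string, and the short example record is just the first 15 residues of the sequence above rather than the full entry.

```python
def fasta_residue_count(record: str) -> int:
    """Count sequence characters in a single-record FASTA string.

    The first line must be the '>' header; all remaining lines are treated
    as sequence, with surrounding whitespace ignored.
    """
    lines = record.strip().splitlines()
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA record starting with '>'")
    return sum(len(line.strip()) for line in lines[1:])


# Toy record: the first 15 residues of the entry, split over two lines.
example = ">example fragment\nMGGGHSALSG\nRSLDT\n"
print(fasta_residue_count(example))  # 15
```

Applied to the full record, the count would be expected to equal the listed Protein Length of 1472.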

Vaxign Prediction
Localization (Probability): (Prob.=0)
Adhesin Probability: 0.138
Trans-membrane Helices: 0