LOCUS HPV49 7560 bp ds-DNA VRL 04-OCT-1993 DEFINITION Human papillomavirus type 49 (HPV-49), complete genome. ACCESSION X74480 SOURCE Human papillomavirus type 49 DNA. ORGANISM Human papillomavirus type 49 Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 7560) AUTHORS Egawa,K., Delius,H., Matsukura,T., Kawashima,M. and De Villiers,E.M. TITLE Two novel types of human papillomavirus, HPV63 and HPV65 comparisons of their distinct clinical and histological features and their DNA sequences to other HPV types JOURNAL Virology 194, 789-799 (1993) STANDARD full staff_review COMMENT Submitted (27-JAN-1993) on tape to the EMBL Data Library by: H. Delius, Deutsches Krebsforschungszentrum, Abt ATV, Im Neuenheimer Feld 506, W -6900 Heidelberg, FRG. Clone = insert in EcoRI site of pGEM-4. FEATURES Location/Qualifiers CDS 200..616 /note="putative" /note="ORF E6 from bp 131 to 616" /product="transforming protein" /gene="E6" /note="putative" /codon_start=1 /translation="MARPVKVCELAHHLNIPIWEVLLPCNFCTGFLTYQELLEFDYKD FNLLWKDGFVFGCCAACAYRSAYHEFTNYHQEIVVGIEIEGRAAANIAEIVVRCLICL KRLDLLEKLDICAQHREFHRVRNRWKGVCRHCRVIE" CDS 613..924 /note="putative" /note="ORF E7 from bp 595 to 924" /product="transforming protein" /gene="E7" /note="putative" /codon_start=1 /translation="MIGKEVTIPDIILQEEFGQPIDLQCYENLTAEAPAEQELEAEEE LIQGIPYKVIATCGGGCGARLRVFVLATDAAIRSFQELLLEELQFLCPQCREEIRNGG R" CDS 911..2740 /note="putative" /note="ORF E1 from bp 824 to 2740" /product="replication protein" /gene="E1" /note="putative" /codon_start=1 /translation="MADDKGTDPKEGCSEWFIDNEADCSDLENDLEQLFDESPKSNIS NLLNDEEDVEQGNSRDLLRQQEFEESAEQVQKLKRKYFSPKAVQQLSPRLQSMSISPR QKSKRRLFEEDSGLELSGLEQSLTNEIEDTPAELEVPAATPAEQGGQGEGNLHYKELM RCNNSRAKLLSKVKEYFGVGFYELARQYKSDKTCCKDWVIAAYGVREELVESAKQLLL NHCSYVWININGIMTLYLLCFNHAKSRETVGRLLMSILDVQLLQLICEPPKLRSVVSA LYWYKGSMDSSVYAHGAYPDWIVNQTMISHQAAADAMQFDLSEMIQWAYDSDLTDEAD IAYLYAKMANSDSNARAWLAHNNQARYLRECAQMVRHYRRGEMRDMSMSEWIHHRIQQ VEGEGHWSEIVKFIRFQEINFIIFLDAFKQFIHGKPKKSCLLIHGPPDCGKSMFAMSL LKVLKGKVISFVNAKSQFWLSPLSECKIGLLDDATDPCWQYIDTYLRNGLDGNVVSVD CKHKTPMQIRFPPLLITSNYNIKANDKYKFLYSRIAIFEFKHKFPFKEDGTPVFQLTD QSWKSFFERLWTQLELSDPEDEADNGGTQRSFQCTTRDVNGHL" CDS 2682..4148 /note="putative" /note="ORF E2 from bp 2652 to 4148" /product="regulatory protein" /gene="E2" /note="putative" /codon_start=1 /translation="MEALNARFNVLQEMLMDIYESGKEDLETQIEHWKLLRQEQALLF FARKHSIMRLGYQPVPPMAVSETKAKQAIGMMLTLQSLQKSPFGKEKWTLVNTSLETY NAPPAQCFKKGPYNIEVIFDGDPENLMVYTAWKEIYFVDSDDMWQKVQGEVDYAGAYY KDGTIKQYYVTFADDAVRYGTSGQYEVRINNETVFAPVTSSTPPSTGLRESSNASPVH DTVDETPTSTTATTTTFSTTTATATATGAPELSSKTGTRKGRYGRKDSSPTAASNSRK EVSRRRSRSRTRTRRREASTSRSQKASRSRSRSRSTSRGSRGSGGSVTTSRDSSPKRT RRGRGRGGRSRRSPTPTSTSKRERRRSRSRGGEPVSGGVGISPDKVGSRVQTVSGRHL GRLGRLLEEASDPPVILLRGDPNILKCYRYRDKKRKLGLVKHYSTTWSWVGVDGNERI GRSRMLLSFTSNSTRSQYVKIMKLPKGVEWSFGNFDKL" CDS 3109..3903 /note="putative" /note="ORF E4 from bp 3103 to 3903" /gene="E4" /note="putative" /codon_start=1 /translation="MICGKRCKVRWIMQVHIIRMELSNSIMLPSLMMLLDMGHLDNMK SALTTKLCLLLLLAPPHHPRGYENPPTPAPFTTPSTRHPPAPQQPPPPSAPPQPQPQP QEHLNSHPKPVPGKEGTGEKTLVLQQPPTPGKRSRDDDPGLEPGPADGKRAPQGPKKP AVPDPDPDPLPEDPEGPEDLSQPPEIPAPREPAGAEGGEGEVEGHPPPPPPVNGKEGA AGQGGESLFLEGLASRLTRWDQEYKQLVDDILDDLEGYWRRLAILQ" CDS 4233..5798 /note="putative" /note="ORF L2 from bp 4194 to 5798" /product="minor capsid protein" /gene="L2" /note="putative" /codon_start=1 /translation="MVRARRTKRDSVTNIYRTCKQAGNCPPDVVNKVEQTTIADQILK FGSTGVFFGGLGIGTGRGTGGSTGYVPIGEGPAIRVGGTPSVVRPGILPEAIGPADII PIDTVNPIDPNASSVVPLTDTGPDLLPGTIETIAEVNPAPDIPRVDTSVVTTSRGSSA VLEVASEPTPPTRTRISRTQYHNPSFQILTESTPSLGESALTDHVVVTSGSGGQPIGG VTPVEIELQELPSRYTFEIEEPTPPRRSSTPLRNITQAVGNLRRSLYNRRLTQQVNVQ DPLFLQQPSRLVRFAFDNPVFEEEVTQIFERDVAAVEEPPDRDFLDIAKLSRPLYSET PQGYVRVSRLGNRASIRTRSGATVGAQVHFYTDLSTIDAEESIELSLLGEHSGDATIV QGPVESSFVDLNVQELPQVIEVDPEPTFHSDDLLLDEQNEDFSGSQLVYGSGRRSTTF TVPRFSTPRSDTFYVQDLEGYAVSYPERRNYPEIIYPQPDLPTVIIHTADTSGDFYLH PSLRRRKRKRTYL" CDS 5811..7340 /note="putative" /note="ORF L1 from bp 5799 to 7340" /product="major capsid protein" /gene="L1" /note="putative" /codon_start=1 /translation="MTSLWLPATGKVYLPPSTPVARVQSTDEYIQRTDIYYHANSDRL LTVGHPYFDVRDTADNSKILVPKVSGNQYRAFRLLLPDPNRFALVDMNIYNPEKERLV WACRGLEIGRGQPLGVGTTGHPLFNKVKDTENANNYIVTSKDDRQDTSFDPKQVQMFI IGCTPCMGEYWDAAKPCDADAGQGKCPPLELINSVIQDGDMIDIGFGNINNKTLSVNR SDVSLDIVNDICKYPDFLKMANDIYGDACFFYARREQCYARHFFVRGGNVGDAIPNTA VGQDNNYILPAASQQAQNTLGSSIYFPTVSGSLVSTDAQLFNRPFWLQRAQGHNNGIC WENQLFITVADNTRNTNFTISVSTDGQTPTEYDSTKVREFLRHVEEYEISIILQLCKV PLEPEVLAQINAMNSSILENWQLGFVPTPDNPIHDTYRYLTSQATRCPDKQPAPERKD PYEQYNFWTVDLTEKLSLDLDQYSLGRKFLFQAGLQRASRVSKSSAARASTRGIKRKR R" source 1..7560 /organism="Human papillomavirus type 49" /sequenced_mol="DNA" BASE COUNT 2366 a 1436 c 1672 g 2086 t ORIGIN 199 bp upstream from beginning of E6 cds 1 ccacattcgt tccagctaca ttttggcgcc aactctttgg cagcaacacc agaacgataa 61 cggtaagttt caatcgggcg cggtcacatt atacttagtc atctcttgtg gttgttaaca 121 acaatcttga aacagatata catgtaaccg cttgcgtgct gtactttctt tattcttgga 181 aagaatacag acaggacaca tggctagacc tgttaaggta tgtgagctag cccaccactt 241 aaatatacct atttgggaag ttttgcttcc ttgtaatttt tgcacggggt ttctaacata 301 tcaggagttg ttagaatttg actataaaga ctttaatttg ctgtggaaag acggatttgt 361 ctttggttgt tgtgcagctt gtgcctatag atcagcatat cacgagttta ctaattatca 421 ccaagaaatt gtcgtaggca tcgaaataga aggacgagca gcggctaata ttgctgagat 481 agtagtcaga tgtctcattt gccttaagag gctagatttg ttggaaaagc ttgatatttg 541 tgcacagcac agagagtttc acagagttag aaataggtgg aaaggggtgt gtagacattg 601 cagagttata gaatgattgg gaaagaagtt acaataccag atataatact acaagaagag 661 tttggccagc ccattgacct gcaatgctac gagaatctaa cagctgaagc gccagctgaa 721 caagagttgg aggcagagga ggagcttatc caaggcatcc cttacaaagt tattgctact 781 tgtggcggcg gatgcggtgc cagactgcga gtcttcgtgt tagccactga cgctgctatt 841 agaagtttcc aagaactgct tctggaggaa ctgcaattct tgtgtcctca gtgtcgtgaa 901 gaaattcgga atggcggacg ataaaggtac tgatcccaaa gaagggtgta gcgagtggtt 961 tatagataat gaagcagact gtagtgattt agaaaatgat ttggaacaat tatttgatga 1021 aagcccaaag tccaatattt caaatttgtt aaatgatgag gaggatgtgg agcagggaaa 1081 ttcgcgagat ctgcttcgcc agcaggaatt tgaggagagc gcggagcaag tacaaaagtt 1141 aaaacgaaag tatttcagtc ctaaagcagt tcaacaactt agcccacggt tgcagtctat 1201 gtcaatatct ccgcgacaaa agtctaaacg aaggctattt gaggaggaca gcgggctgga 1261 attatcgggg ctcgaacagt ctttgactaa tgaaattgaa gatactcctg cggagctgga 1321 ggtaccggcg gcaacgccgg cagagcaggg tggtcaggga gagggcaatt tgcattataa 1381 agagttaatg cgatgcaata atagtcgtgc aaaattatta agtaaagtca aggaatattt 1441 tggtgtgggt ttttatgagt tagctagaca gtataaaagt gataaaacat gctgtaaaga 1501 ttgggtaatt gcagcctatg gcgtgcgaga agagctggta gaaagtgcaa aacaattact 1561 tttaaatcat tgttcctatg tgtggataaa tataaatggg attatgactt tatatttact 1621 gtgttttaat catgcaaaga gtagagaaac tgttggtaga ttgcttatgt caatactgga 1681 tgtacaatta ttgcaattaa tttgtgaacc accaaaacta agaagtgtgg tgtcagcact 1741 atactggtac aaaggcagta tggactcatc tgtgtatgct catggagcct atcctgattg 1801 gattgtaaat cagaccatga taagtcatca ggcagcagca gatgctatgc aatttgacct 1861 ttctgaaatg atacaatggg cctatgatag cgatctcaca gatgaagctg acattgcata 1921 tctttatgct aaaatggcaa atagtgactc taatgcaaga gcttggttag cacataataa 1981 tcaggcaagg tacttaagag aatgtgctca aatggttaga cattacagac ggggagaaat 2041 gagggatatg agtatgtctg agtggataca tcacagaata caacaagtag aaggggaagg 2101 ccattggtct gaaatagtta agtttataag atttcaagaa ataaacttta taatatttct 2161 ggatgcattt aaacagttta tacatggcaa acctaaaaaa agctgtttat taatacatgg 2221 gccgccggac tgtggcaagt caatgtttgc tatgtcatta ttaaaagttt taaaaggcaa 2281 ggtaatttca tttgtaaatg caaaaagtca attttggctg tctccacttt cagaatgtaa 2341 aatagggctg ttggatgatg ctaccgatcc ttgttggcaa tatatagata catatttaag 2401 aaatggtctc gatggaaatg ttgtaagtgt ggattgcaaa cataaaaccc ctatgcaaat 2461 taggttccca ccattgttaa taacttcaaa ttataatatt aaagctaatg ataaatataa 2521 gtttttgtac agtagaattg caatatttga atttaaacat aagttcccat tcaaagagga 2581 tggtacccct gtatttcaac ttactgacca aagctggaaa tctttttttg aaaggctttg 2641 gacacaatta gagctcagtg acccagaaga cgaggcagac aatggaggca ctcaacgctc 2701 gtttcaatgt actacaagag atgttaatgg acatttatga atcagggaaa gaggatcttg 2761 aaacacaaat agaacattgg aaactgttaa gacaggaaca agctttatta ttttttgcac 2821 gtaaacacag cataatgaga ctggggtatc aacccgtacc tccgatggca gtatctgaaa 2881 ccaaagccaa acaagctatt ggcatgatgc taactttgca aagcttgcaa aagtctcctt 2941 ttggaaaaga aaagtggact ttagtaaaca caagtcttga aacatacaat gcaccaccag 3001 cacagtgctt taaaaaaggt ccttataata tagaagttat atttgatgga gatcctgaaa 3061 atctaatggt atatactgct tggaaagaga tttattttgt agactcagat gatatgtggc 3121 aaaaggtgca aggtgaggtg gattatgcag gtgcatatta taaggatgga actatcaaac 3181 agtattatgt taccttcgct gatgatgctg ttagatatgg gacatctgga caatatgaag 3241 tccgcattaa caacgaaact gtgtttgctc ctgttactag ctccacccca ccatccacgg 3301 ggctacgaga atcctccaac gccagccccg ttcacgacac cgtcgacgag acacccacca 3361 gcaccacagc aaccaccacc accttcagca ccaccacagc cacagccaca gccacaggag 3421 cacctgaact ctcatccaaa accggtacca ggaaaggaag gtacgggcga aaagactcta 3481 gtcctacagc agcctccaac tccaggaaag aggtctcgcg acgacgatcc aggtctagaa 3541 ccaggacccg cagacgggaa gcgagcacct caaggtccca aaaagccagc cgttccagat 3601 ccagatccag atccacttcc agaggatcca gagggtccgg aggatctgtc acaacctcca 3661 gagattccag ccccaagaga acccgcaggg gcagagggag gggagggaga agtagaaggt 3721 cacccacccc cacctccacc agtaaacggg aaagaaggcg cagccggtca agggggggag 3781 agcctgtttc tggaggggtt ggcatctcgc ctgacaaggt gggatcaaga gtacaaacag 3841 ttagtggacg acatcttgga cgacttggaa ggttactgga ggaggctagc gatcctccag 3901 taatactttt gcgaggagac ccaaatattt taaaatgtta cagatacaga gataagaagc 3961 gtaaattagg tttagtaaaa cattatagta ccacctggtc atgggttggt gtagatggca 4021 atgaaagaat aggtagatca cgtatgcttt taagttttac ttcaaacagc actagatcac 4081 agtatgttaa aattatgaag ctccctaaag gtgtggaatg gtcttttggt aattttgata 4141 agctttaaca ttttgctaac atactaacgg tgcttgcact actaacacat taatctttta 4201 acatttttat attgcttttt tatttttata taatggtgcg tgctcgcaga acaaagcgag 4261 attctgtaac aaacatttac agaacctgca aacaggcagg aaactgtcct ccggatgttg 4321 ttaataaagt ggaacaaact acaattgctg accaaatatt aaaatttggc agcactggtg 4381 tgttttttgg tggtttggga ataggtacag gccgtggtac cggtggcagt actggctatg 4441 tacctatagg tgaaggccca gcaatacgtg ttgggggcac tccaagtgtt gttcgtccag 4501 gtatactccc tgaggctatt ggtccggcgg atatcattcc tattgatact gtcaatccaa 4561 ttgatccaaa tgcatcatct gtggtcccac tcactgacac aggacctgat ttgctacctg 4621 ggacaattga gactattgca gaagtgaacc ctgccccaga tattcctaga gttgacacat 4681 ctgttgtcac aacaagcaga ggctccagtg ctgtattgga ggttgcctct gaacccacac 4741 cacccactcg caccagaatt tccagaacac agtaccataa tccctctttt caaatattaa 4801 ctgaatctac accctctttg ggagaatctg cattaactga tcatgttgtt gttactagtg 4861 gttctggtgg tcaaccaata ggtggagtta caccagttga aatagaatta caagaacttc 4921 ctagcagata tacttttgaa atagaggaac ctacaccacc aagacgctct agtaccccac 4981 tacgcaacat cacacaagct gtaggaaatt taagaagatc actatataat aggcgactta 5041 ctcaacaagt aaatgtccag gatccattat tcttacaaca gccctcacgt ttagttcgct 5101 ttgcctttga taatcctgtg tttgaagaag aagttacaca aatatttgaa agggacgtag 5161 cagctgtaga agaacctcca gacagagact ttttagatat agcaaaatta agccgccctc 5221 tttactctga aacaccacag ggatatgtca gggtaagccg cttaggtaat agggcttcta 5281 ttagaacacg tagtggagct acagtagggg ctcaagtgca tttttataca gatcttagca 5341 caatcgatgc agaggagtct atagagttat cactattagg ggaacattct ggtgatgcta 5401 ctattgtcca aggcccagta gaaagctcat ttgtagattt aaatgttcag gaactgcctc 5461 aagtaataga agtagaccca gaacctactt tccactctga tgatttgcta ctggatgagc 5521 aaaatgaaga tttttctggc tcccagttag tttatggtag tggcaggcgt tctaccacat 5581 ttactgtacc ccgcttctct actcccagat ctgatacctt ttatgtacaa gatttggaag 5641 gttatgctgt gtcatatcct gaacgaagga attatccaga aattatttat cctcaacccg 5701 atttgccaac tgtaataatt catactgcag atacctctgg ggacttctat ttacatccaa 5761 gccttcgcag gcgaaaacgt aaacgcactt atttatgata tttctttcag atgacctcgc 5821 tatggttacc tgcaactggt aaggtatatc taccaccttc aacacctgtg gcaagggtac 5881 aaagcacgga tgaatacatt cagaggacag acatctacta tcatgctaat agtgatcgat 5941 tgttaactgt aggacatcca tattttgatg tgagagatac agcagacaat tctaaaattt 6001 tagtaccaaa ggtttcaggt aatcaatatc gagcctttag attactatta ccagatccca 6061 acagatttgc actagtagat atgaatatat ataacccaga aaaggaaaga ttagtatggg 6121 cctgtagagg cttagaaatt ggtcgtggcc agcctttagg tgttggtaca acaggacatc 6181 cattgtttaa caaagtcaaa gatactgaaa atgctaataa ctatatagta acttctaaag 6241 atgatagaca ggatacttca tttgacccta aacaggtaca aatgtttatc ataggttgta 6301 ctccttgtat gggtgagtac tgggacgctg ctaaaccttg tgatgcagat gctggtcagg 6361 gtaaatgccc tccattagaa ttaatcaatt cagttataca agatggtgat atgattgata 6421 taggttttgg taatatcaat aataagacat tatctgttaa cagatctgat gtcagtttgg 6481 atatagtaaa tgacatttgc aagtatcctg attttttaaa gatggcaaat gacatatatg 6541 gggatgcttg tttcttctat gctagacgtg aacaatgtta tgccaggcac ttctttgtta 6601 gaggtggtaa tgtaggggat gcgataccca atactgctgt aggtcaggat aacaattaca 6661 tattacctgc agcaagtcaa caggcccaaa atactcttgg cagctccatc tatttcccta 6721 ccgtcagtgg ctctttggta tctactgatg cgcagctatt caatagacct ttttggttac 6781 aaagagcaca gggtcacaac aatggaattt gctgggagaa tcagcttttt ataacagtgg 6841 ctgataatac cagaaatacc aattttacta ttagtgtaag tacggatggc cagacaccta 6901 cagaatatga cagtaccaag gttagagaat ttttaagaca tgtagaggaa tatgaaattt 6961 caattatatt acaattgtgt aaggtacctt tagaaccgga agtcctggca caaatcaatg 7021 ctatgaattc ttctatattg gaaaattggc aattgggatt tgttcctacc cctgataatc 7081 ctatacatga cacatatagg tatcttacat cacaggcaac acgatgccct gacaaacaac 7141 ctgctccaga aaggaaagat ccatatgagc agtataactt ttggactgta gatttaacag 7201 aaaaactgtc tttggatttg gatcaatatt ctttaggaag aaagttttta tttcaagctg 7261 ggctacaacg ggcttctaga gtgtctaaat cctctgctgc tagagcttcc acacggggta 7321 ttaaacgaaa acggagatga ccgttttcgg ttgctgggtc ttataataaa atattttata 7381 aactgttttg gtatgtgagg catgttttaa ccgagttcgt gactaagatt gattaaccca 7441 cctgcaaccg cacccggtta atcagattat aaaggtgcgc cggtgttcac ctctggctac 7501 ttggcagtta caagttcacc tctgccagaa gtgtgttttt gccaagacat ttgccaagta