ID HPV20 STANDARD; ds-DNA; VRL; 7757 bp. XX DE Human papillomavirus type 20 (HPV20), complete genome. XX AC U31778 XX DT 04-JUL-1995 XX OS Human papillomavirus type 20 DNA. OC Human papillomavirus type 20;Viridae; ds-DNA nonenveloped viruses; OC Papovaviridae;Papillomavirus. XX RN [1] RP 1 - 7757 RA Delius,H; RT "Direct Submission;" RL Unpublished. XX RN [2] RP 1 - 7757 RA Kremsdorf,D., Favre,M., Jablonska,S., Obalek,S., RA Rueda,L.A., Lutzner,M.A., Blanchet-Bardon,C., Van Voorst Vader,P.C., and RA Orth,G; RT "Molecular cloning and characterization of the genomes of nine newly RT recognized human papillomavirus types associated with RT epidermodysplasia verruciformis;" RL Journal of Virology 52(3), 1013-1018 (1984). XX RN [3] RP 1 - 7757 RA Gassenmaier,A., Lammel,M., and Pfister,H; RT "Molecular cloning and characterization of the DNAs of human RT papillomaviruses 19, 20, and 25 from a patient with epidermodysplasia RT verruciformis;" RL Journal of Virology 52(3), 1019-1023 (1984). XX RN [4] RP 1 - 7757 RA Kiyono,T., Hiraiwa,A., and Ishibashi,M; RT "Differences in transforming activity and coded amino acid sequence RT among E6 genes of several papillomaviruses associated with RT epidermodysplasia verruciformis;" RL Virology 186(2), 628-639 (1992). XX XX Created by HIV database on 26-JAN-1996 from GenBank: U31778. XX XX XX HPV20 was originally isolated from skin warts of XX epidermodysplasia verruciformis (EV) patients [2,3]. It has XX additionally been detected in a squamous cell carcinoma from XX another EV patient, although the association is not frequent. XX Cloned HPV20 DNA was obtained from the Papillomavirus XX Reference Center, Heidelberg and subsequently sequenced by XX Dr. H. Delius. Hybridization assays and phylogenetic XX reconstructions based on DNA sequences indicate that HPV20 is XX most closely related to HPV21 and HPV14, and then to HPV19 and XX HPV25. 
This grouping agrees with assays of the degree of XX transforming activity of the E6 protein (these related HPV XX types had relatively low transforming activity as compared to XX HPVs 5, 8, and 47), and with the clustering of amino acid XX similarity in the second zinc finger domain of E6 [4]. The E6 gene XX of HPVs 14, 21, and 25 can enhance the induction of anchorage XX independent growth of 3Y1 cells by the HPV16 E7 gene, although XX again less effectively than that of HPVs 5, 8, and 47. XX XX HPV20 was cloned via AvaI restriction. Contrary to the XX assumption that type 20 had only one AvaI site (Kremsdorf et al., XX 1984), sequence analysis of the clone showed the presence of XX two additional AvaI fragments of 16 and 176 nt, respectively, at XX the cloning site (position 1158 in the final sequence) in opposite XX orientation relative to the large AvaI fragment containing the XX major part of the viral genome. The segment between the AvaI sites XX at positions 1142 and 1334 is inverted in the pBR322 clone. XX This inversion leads to disrupted E7 and E1 ORFs in the clone. XX The sequence has been corrected to yield colinearity with the closely XX related HPV types. 
XX XX FT KEY Location/Qualifiers FT CDS 200..697 FT /note="ORF E6 from bp 182 to 697" FT /product="transforming protein" FT /gene="E6" FT /note="putative" FT /codon_start=1 FT /translation="MATPPSSEDSADEGPSNIGEAKPPILEPPLPATICGLAKLLEIP FT LDDCLIPCNFCGNFLTHLEVCEFDEKKLTLIWKDHLVFACCRVCCSATATYEFNQFYE FT STVLGRDIEQVTGKSVFDIDVRCYTCMKFLDSIEKLDICGRKRPFYLVRGSWKGICRL FT CKHFQ" FT CDS 697..1005 FT /note="ORF E7 from bp 685 to 1005" FT /product="transforming protein" FT /gene="E7" FT /note="putative" FT /codon_start=1 FT /translation="MIGKEVTLQDIVLELNELQPEVQPVDLFCEEELPNEQQEREEEP FT QIERASYKVVAPCGCCKVKLRIFISATEFAIRSFQQLLIDELQLLCPDCRGNCKHGGS FT " FT CDS 992..2809 FT /note="ORF E1 from bp 944 to 2809" FT /product="replication protein" FT /gene="E1" FT /note="putative" FT /codon_start=1 FT /translation="MADPKGSTSKDGLDDWCIVEAECSDVDNDLEELFDRDTDSDISE FT LLDDNDLEQGNSRELFHQQECKDSEEQLQKLKRKYISPKAIAQLSPRLESISLSPQQK FT SKRRLFAEQDSGLELTLTNEAEDVSSEVEEVPALDSQPVAEGHLGTVDIHYTELLRAS FT NHKAILLAKFKEAFGIGFNDLTRQFKSYKTCCNDWVLSVYAVHEDLLESSKQLLQQHC FT DYIWIRGIAAMSLFLLCFKAGKNRGTVHKLMTSMLNVHEKQILSEPPKLRNVAAALFW FT YKGAMGSGAFSHGPYPNWMAQQTIVGHQSTEASAFDLSEMIQWAFDHNYLDEADIAFQ FT YAKLAPENSNAVAWLAHNNQARFVRECASMVRFYKKGQMKEMSMSEWIYARINEVEGE FT GHWSSIAKFLRYQQVNVIMFLAALKDMLHSVPKHNCILIHGPPNTGKSAFTMSLIHVL FT KGRVLSFVNSKSQFWLQPMSETKIALIDDVTDPCWVYMDTYLRNGLDGHYVSLDCKHK FT APIQTKFPALLLTSNINVHNEVNYRYLHSRIKGFEFPNPFPMKPDNTPEFELTDQSWK FT SFFTRLWKQLELSDQEDEGENGESQQAFQCSARSANEHL" FT CDS 2751..4244 FT /note="ORF E2 from bp 2727 to 4244" FT /product="regulatory protein" FT /gene="E2" FT /note="putative" FT /codon_start=1 FT /translation="MENLSKRFNALQDQLMNIYESAPDTLESQIEHWQTLRKEAVLLY FT FARQHGISRVGYQPVPVLAVSEAKAKQAIGMVLRLQSLQKSEYGSEPWSLVDASAETF FT RSPPENHFKKGPISVEVIYDKDKDNANAYTMWRFVYYQDDDDKWHKSASGVNQTGIYF FT MQGTFRHYYVLFADDASRYSTTGQWEVKVNKETVFAPVTSSTPPDSPGGQADSNASSQ FT TPATTTDSTTRQSPRKQSQQTNTKGRRYGRRPSSRTRRTTQTRQRRRSRSKSKSKSRS FT RSRSRHRSRSRSRSESPRRRSRYRSRSGSRGRVALRAITTTTTTTTRRAGGGSPTSTS FT 
STTSQRSRQLRGGGRGGSRQRARGRRSSSTSPTPSKRSRGESESVRQHGISPSDVGTA FT VYTVSSRHTGRLGRLLDEALDPPVILVRGEPNTLKCFRNRAKQRYTGLYKSFSTAWSW FT VAGDGTERLGRSRMLISFISFSQRKDFDETVKYPKGVDRSFGSFDSL" FT CDS <3313..3999 FT /note="ORF E4 from bp 3313 to 3999" FT /gene="E4" FT /note="putative" FT /codon_start=1 FT /translation="KLIRKLCLLLSPAPPPPTHQEDKQTQTPPPRPPPPPLTPRPDSR FT PENSHNKPTPKGEGTDGDLPVGQGEQPKRARGDGPGQSPSPSPGRGRGRGTGLGLGLG FT LNRRAGGLGTDHDPDPEGESPSAPLPPPPQPPPDGQVEGHPPPPPPPPHNGRDSCGEG FT AVGGADKEQGEGDHHPPPPPPQNGHEGSQSLLGNMASLLLTWEQQFTQLVQDIQEDLE FT DYWMKLSIPQ" FT CDS 4321..5877 FT /note="ORF L2 from bp 4306 to 5877" FT /product="minor capsid protein" FT /gene="L2" FT /note="putative" FT /codon_start=1 FT /translation="MARAKRVKRDSATNIYRTCKQAGTCPPDVINKVESTTIADKILQ FT YGSAGVFFGGLGISTGKGTGGTTGYVPLGEGPSVRVGGTPTVIRPALVPDTIGPSDII FT PVDTLNPVEPSTSSIVPLTESTGPDLLPGEVETIAEIHPGPSRPPTDTPVTSTTSGSS FT AVLEVAPEPTPPARVRVSRTQYHNPSFQIITESTPTLGESSLADHIVVTSGSGGQAIG FT GMTPELIELQDFPSRYSFEIEEPTPPRRTSTPMQRLQNVFRRRGGLTNRRLVQQVPVD FT NPLFLTQPSRLVRFQFDNPVFEEEVTQIFEQDLDTFNEPPDRDFLDVQSLGRPQYSET FT PAGYVRVSRAGQRRTIRTRSGAQIGSQVHFYRDLSSIDTEDPIELQLLGQHSGDATIV FT QGPVESTFVDINVDENPLSEISAYSDDLLLDEANEDFSGSQLVVGGRRSTSTYTVPHF FT ETTRSSSYYVQDTKGYYVAYPEDRDVSKDIIYPNPDLPVVIIHTYDTSGDFYLHPSLT FT KRLKRKRKYL" FT CDS 5893..7443 FT /note="ORF L1 from bp 5878 to 7443" FT /product="major capsid protein" FT /gene="L1" FT /note="putative" FT /codon_start=1 FT /translation="MAVWQAASGKVYLPPSTPVARVQSTDEYVQRTNIYYHAYSDRLL FT TVGHPYFNIYDIQGTKIKVPKVSGNQHRVFRLKLPDPNRFALADMSVYNPDKERLVWG FT CRGIEIGRGQPLGVGSVGHPLFNKLGDTENPNSYKGNSTDDRQNVSFDPKQLQMFIIG FT CAPCLGEHWDRALPCADDVPNPGSCPPIELKNTAIQDGDMADIGYGNLNFKALQENRA FT DVSLDIVNETCKYPDFLKMQNDVYGDSCFFYARREQCYARHFFVRGGKTGDDIPAGQI FT DEGSMKNAFYIPPVNNQAQNNLGNSMYFPTVSGSLVSSDAQLFNRPFWLQRAQGHNNG FT ICWFNQLFVTVVDNTRNTNFSISVHSENTDVSKIQNYDSQKFQEYLRHVEEYEISLIL FT QLCKVPLTAEVLAQINAMNSNILEEWQLGFVPAPDNPIHDTYRYINSAATRCPDKNPP FT KEREDPYKDLNFWNVDLSERLSLELDQYSLGRKFLFQAGLQQATVNGTKTVSSKLSTR FT GVKRKRKQ" FT source 1..7757 FT 
/organism="Human papillomavirus type 20" XX SQ SEQUENCE 7757 bp; 2431 a; 1510 c; 1698 g; 2118 t; tcgggcgcgg tcatacatta ctcatttggt agttgttgtt gccagctacc atcaagcata 60 gcatgttttt gcctgtaacg ttatcggcac agtgattaat atatatatat atatatatat 120 atatatatat atatatatat atatatatat agatacatat agacagatat catagagcta 180 atgcagagag tgcaggcaca tggctacacc tccttcttca gaagacagcg ctgatgaagg 240 accatctaat attggagagg caaaacctcc aatcttagag ccaccattgc ctgcaacaat 300 ctgtggccta gcaaaacttt tagaaatacc gctagatgat tgtttgatac cttgtaactt 360 ctgcggtaat ttccttacac atttagaagt ttgtgagttt gatgagaaga agcttacttt 420 aatttggaaa gatcatttgg tttttgcatg ctgtcgtgtt tgctgctcgg caacagcgac 480 atatgagttt aatcaatttt atgagagtac tgttttaggc agagacatag agcaagtaac 540 aggcaaatct gtttttgata tagatgtcag gtgctacacc tgtatgaaat ttttagactc 600 aattgaaaag ctagacatct gtggcagaaa gcgtccattt tatttagtga gaggctcttg 660 gaaaggaatc tgtaggctgt gtaagcattt tcaataatga ttggtaaaga ggtcacattg 720 caagatattg tgctggagtt aaatgaattg cagcctgagg ttcaaccagt tgacctgttt 780 tgtgaagagg agttaccgaa cgagcagcag gagagagagg aggagcctca gattgaaaga 840 gcctcataca aagttgttgc accttgcggc tgctgcaagg tgaaacttcg catctttata 900 agcgctacag aatttgctat tagaagcttt caacaattgc tgattgacga gctgcagctg 960 ttgtgtcctg actgtcgcgg gaactgcaaa catggcggat cctaaaggta gtacatctaa 1020 agacgggttg gatgattggt gtattgttga agctgaatgt agcgatgtag acaatgattt 1080 ggaagaatta tttgacagag atacagactc agatatttca gaattattag atgataatga 1140 cctcgagcag ggcaattctc gggaactatt tcatcaacaa gagtgtaagg acagcgagga 1200 gcaattacaa aaactaaaac gaaagtacat aagtccaaaa gctattgcac agcttagtcc 1260 gcgacttgaa agtatttcac tgtcaccaca gcagaagtca aaacgaaggc tttttgcaga 1320 gcaggacagc gggctcgagt taactcttac aaatgaagct gaagatgttt cttctgaggt 1380 ggaggaggta ccggccctag actctcagcc ggttgctgag ggacacttag gaacagtaga 1440 cattcattat acagaattat tgcgtgccag taaccataag gcaattttgt tggcaaaatt 1500 taaggaggct tttgggatag ggtttaatga tttgacacgt caatttaaaa gttacaaaac 1560 ctgctgtaat gattgggttc tatctgtgta tgcagttcat gaggatcttc ttgaaagctc 1620 
aaagcagtta ttgcaacagc attgtgatta tatatggatc cgtgggatag cagcaatgtc 1680 attgtttcta ttgtgtttta aagcaggaaa aaatcgtggg actgtgcata aattaatgac 1740 atcaatgttg aatgtgcatg aaaagcaaat attgtctgag cctccaaaat taagaaatgt 1800 tgctgctgct ttattttggt ataaaggtgc aatggggtcc ggagcatttt ctcatggtcc 1860 atatcctaac tggatggcac agcaaactat tgttggtcat cagagcacag aagccagtgc 1920 ttttgacttg tctgaaatga ttcagtgggc atttgaccat aattatctag atgaggctga 1980 tatagccttt cagtatgcta agctagcacc agaaaatagt aatgctgtag catggcttgc 2040 acataataac caagcaaggt ttgttagaga atgtgcatca atggtcaggt tttataaaaa 2100 aggtcaaatg aaagaaatga gcatgtcaga atggatttat gccagaatta atgaagtaga 2160 aggcgaagga cattggtcat ctattgctaa atttcttaga tatcagcaag taaatgttat 2220 aatgttttta gctgctttga aagatatgct gcattctgta cctaaacata actgtatatt 2280 aatacatggc ccacctaata ctggaaaatc tgcattcact atgtcattga tacatgtgtt 2340 aaagggaagg gtattgtcct ttgtaaattc taaaagccaa ttctggttac aaccaatgtc 2400 agaaactaaa atagcattaa ttgatgacgt aactgatcct tgctgggttt atatggatac 2460 atatttaaga aatggcttag atggacatta tgtctcacta gattgcaagc ataaagcacc 2520 aattcaaaca aaatttcctg cattactgct tacctctaat attaatgttc ataatgaagt 2580 taactataga tatttacata gtagaattaa aggatttgaa tttccaaatc catttccaat 2640 gaaaccagac aatacccctg agtttgagct tactgaccaa agctggaaat ctttttttac 2700 aaggctttgg aagcaattag agctgagtga ccaagaagac gagggagaaa atggagaatc 2760 tcagcaagcg tttcaatgct ctgcaagatc agctaatgaa catttatgag tctgcaccag 2820 acactcttga gtcgcaaatt gagcactggc aaaccctgcg aaaagaagct gtgctactat 2880 attttgctag gcaacatggt atcagcaggg ttggatatca acctgtgcct gtattagctg 2940 tgtcagaagc caaagctaaa caggctatag gaatggtatt aaggttacaa tcattgcaaa 3000 aatctgaata tggaagtgaa ccatggtctt tggtagatgc aagtgcagag acatttagaa 3060 gcccgccaga aaatcacttt aaaaaaggtc cgatttcagt agaggtcata tatgacaaag 3120 ataaagacaa tgccaatgct tataccatgt ggagatttgt ttattaccaa gatgatgacg 3180 acaagtggca caaaagtgct agtggtgtta accaaacagg catatatttt atgcaaggaa 3240 catttagaca ctactatgtt ttgtttgctg atgatgcgag tagatatagt acaactggac 3300 aatgggaagt 
gaaagttaat aaggaaactg tgtttgctcc tgtcaccagc tccacccccc 3360 ccgactcacc aggaggacaa gcagactcaa acgcctcctc ccagaccccc gccaccacca 3420 ctgactccac gaccagacag tcgcccagaa aacagtcaca acaaaccaac accaaaggga 3480 gaaggtacgg acggagacct tccagtagga caaggcgaac aacccaaacg cgccagaggc 3540 gacggtccag gtcaaagtcc aagtccaagt ccaggtcgcg gtcgaggtcg cggcaccggt 3600 ctcggtctcg gtctcggtct gaatcgccgc gccggcggtc tcggtaccga tcacgatccg 3660 gatccagagg gagagtcgcc ctccgcgcca ttaccaccac caccacaacc accaccagac 3720 gggcaggtgg agggtcaccc acctccacct cctccaccac ctcacaacgg tcgcgacagc 3780 tgcggggagg gggccgtggg gggagcagac aaagagcaag gggaaggcga tcatcatcca 3840 cctcccccac cccctcaaaa cggtcacgag gggagtcaga gtctgttagg caacatggca 3900 tctctccttc tgacgtggga acagcagttt acacagttag ttcaagacat acaggaagac 3960 ttggaagatt actggatgaa gctctcgatc ccccagtgat tttagttagg ggagagccta 4020 atacgcttaa gtgctttcgc aatagggcca aacaaagata tacagggctg tataagtctt 4080 ttagcacggc ctggtcgtgg gtggctggag atggcacgga gcgtctaggc aggtccagaa 4140 tgctcattag ctttatatcc ttcagtcaaa gaaaagattt tgatgagact gtgaaatatc 4200 cgaagggggt tgaccggtcg tttggttcat ttgacagctt atagcaacct aaccttctaa 4260 ccactgcatg ctactaacac actaacattt tttaattttt attaatattt tttatttgct 4320 atggcgcgcg ctaagcgagt caagcgggac tctgctacta acatatacag aacctgcaaa 4380 caagcaggta cttgtcctcc tgatgttata aataaagtgg aaagcacaac tattgctgat 4440 aaaattttgc agtatggtag tgctggtgtt ttttttgggg gattaggcat aagcactgga 4500 aaaggtacag gaggaaccac aggttatgtg cctttgggag aaggcccatc ggtgcgtgtt 4560 ggtggtacac ctacagtcat acgacctgct ttggtcccag acaccatcgg cccctccgat 4620 attatacctg tggacacctt aaatccggtg gagccttcta cctcttctat tgttccactt 4680 acagaatcca caggaccaga tcttttacct ggtgaagtgg aaactattgc agaaatacat 4740 ccaggcccct caaggccacc aactgataca ccagttacat ctactaccag tggttctagt 4800 gcagttctag aggtagcacc agaaccaaca cctccagctc gtgtcagagt cagccgcacc 4860 cagtatcata acccatcatt tcaaataata actgaatcaa caccaacatt gggggaaagc 4920 tcattagcgg atcatatagt agtgacatct ggttctgggg gccaagcaat tggggggatg 4980 acacctgaac ttatagagct 
tcaggatttc ccatcaaggt attcatttga aatagaagag 5040 ccaacccctc ctagaagaac tagcacacct atgcaaagac ttcaaaatgt gttcaggcgt 5100 agaggaggcc ttactaacag aagattagtt caacaagtgc ctgtagacaa tccattattt 5160 ttgacacaac cttctagatt ggtccggttt cagtttgata acccggtttt tgaggaagaa 5220 gttactcaaa tatttgaaca agatttagac acttttaatg agcccccaga cagagacttt 5280 ttggatgttc agagtttagg caggcctcaa tactcagaaa ctcctgcagg ttatgtgcgg 5340 gtcagccgtg caggtcaacg aaggactatc agaactcgtt ctggagcaca aatagggtct 5400 caagtgcact tttatagaga tctcagtagt attgatacag aagatcctat tgaactgcag 5460 ttgttgggtc agcattctgg cgatgcaact attgtccaag gtccagtaga aagcactttt 5520 gttgatatca atgtagatga aaacccactt tcagaaatca gtgcatattc tgatgattta 5580 cttttagatg aagctaatga agactttagt ggctctcagt tagttgtagg gggaaggcgt 5640 tctacatcta catacactgt tcctcacttt gaaactacta gatctagctc ttactatgta 5700 caagatacaa aggggtatta tgtagcatat cctgaagata gagatgttag taaggacatt 5760 atttatccta atccagattt accagtggtc attattcaca catatgacac aagtggagat 5820 ttttatttac atccaagtct tactaaaaga ttaaaaagaa aaaggaaata tttgtaactt 5880 tttcttttgc agatggcagt ttggcaagca gctagtggta aggtgtacct tccaccatct 5940 acaccagttg ccagggtcca aagtacggat gaatatgtgc aaaggactaa catatactat 6000 catgcataca gtgatcgcct actaactgtt ggtcatccat attttaatat atatgacatc 6060 caaggcacta agataaaagt ccctaaggtt tctggaaatc agcacagagt gtttaggtta 6120 aaactaccag atcccaacag atttgcatta gcagatatgt ctgtgtataa cccagataaa 6180 gaaagattgg tctggggctg tagaggtata gaaataggta gaggacagcc attaggcgtt 6240 ggaagtgtag gtcatccatt atttaataaa cttggtgaca cagaaaaccc taattcatat 6300 aaagggaatt caactgatga tagacaaaat gtatcttttg accctaaaca actacaaatg 6360 tttataatag gctgtgcccc atgtttagga gaacattggg acagggcttt accatgtgca 6420 gacgacgttc caaacccagg ttcatgccct ccaatagaat taaaaaatac agcaatacaa 6480 gatggcgata tggcagatat aggatatggc aacctaaatt ttaaagcatt acaagaaaac 6540 agagcagatg taagtttgga tattgttaat gagacctgta aatatccaga ctttttaaaa 6600 atgcagaatg atgtttatgg agattcctgc tttttttatg ctcggcggga acaatgttat 6660 gctagacact tttttgtacg tgggggcaaa 
acaggagatg atatacctgc aggacaaatt 6720 gatgaaggta gcatgaagaa tgcattctac attccacctg tgaataatca ggcacagaac 6780 aacctaggta attcaatgta tttcccaact gtcagtggct cattggtgtc tagtgatgct 6840 caattgttta ataggccatt ttggctgcag cgcgcacagg gccacaacaa tggcatctgc 6900 tggttcaatc aactatttgt tactgtagta gataatactc gaaatacaaa ttttagcata 6960 tcagttcatt cagaaaacac tgatgtttct aaaattcaaa attatgattc tcagaaattt 7020 caagaatatt taagacacgt agaagaatat gaaatttcat taattttaca gctctgtaaa 7080 gttcctttaa cagctgaagt tttagctcaa attaatgcta tgaattcaaa tatattagag 7140 gagtggcagt taggattcgt tcctgcaccg gataatccta tccacgatac atacagatat 7200 attaattctg cagctactag atgtcctgat aaaaatcctc caaaagaaag agaagatcct 7260 tacaaggatc taaacttttg gaatgttgac ctatcagaaa gattatcctt agaattggat 7320 caatattctt taggacgcaa attcttattt caagcaggtt tacaacaagc gaccgtaaac 7380 ggtacaaaaa ctgtatcttc aaagttatct actaggggcg tcaaacgaaa acgcaaacaa 7440 taaacccgac cgttttcggt acaataaagt caacttttac acggtattca aggaatgttt 7500 atttactctg actaactaag ataccaaccg cacccgacac ataaaggtga gttgtgtgcc 7560 aaatgaggtg agttgtgagc cagaagagat cacagccaag tcaggcttga gccagatcag 7620 atacactgcg tgccagagtt ggctcaaact tcatcgtccc aacacgttcg gaacaggagg 7680 aaatgtaagg ctgccaacgc ttttggctct tctttttggc acagcagaag accgttaacg 7740 gtaagttttt atttgta 7757