LOCUS HPV20 7757 bp ds-DNA VRL 04-JUL-1995 DEFINITION Human papillomavirus type 20 (HPV20), complete genome. ACCESSION U31778 SOURCE Human papillomavirus type 20 DNA. ORGANISM Human papillomavirus type 20 Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 7757) AUTHORS Delius,H. TITLE Direct Submission JOURNAL Unpublished REFERENCE 2 (bases 1 to 7757) AUTHORS Kremsdorf,D., Favre,M., Jablonska,S., Obalek,S., Rueda,L.A., Lutzner,M.A., Blanchet-Bardon,C., Van Voorst Vader,P.C., and Orth,G. TITLE Molecular cloning and characterization of the genomes of nine newly recognized human papillomavirus types associated with epidermodysplasia verruciformis JOURNAL Journal of Virology 52(3), 1013-1018 (1984) REFERENCE 3 (bases 1 to 7757) AUTHORS Gassenmaier,A., Lammel,M., and Pfister,H. TITLE Molecular cloning and characterization of the DNAs of human papillomaviruses 19, 20, and 25 from a patient with epidermo- dysplasia verruciformis JOURNAL Journal of Virology 52(3), 1019-1023 (1984) REFERENCE 4 (bases 1 to 7757) AUTHORS Kiyono,T., Hiraiwa,A., and Ishibashi,M. TITLE Differences in transforming activity and coded amino acid sequence among E6 genes of several papillomaviruses associated with epidermodysplasia verruciformis JOURNAL Virology 186(2), 628-639 (1992) COMMENT HPV20 was originally isolated from skin warts of epidermodysplasia verruciformis (EV) patients [2,3]. It has additionally been detected in a squamous cell carcinoma from another EV patient, although the association is not frequent. Cloned HPV20 DNA was obtained from the Papillomavirus Reference Center, Heidelberg and subsequently sequenced by Dr. H. Delius. Hybridization assays and phylogenetic reconstructions based on DNA sequences indicate that HPV20 is most closely related to HPV21 and HPV14, and then to HPV19 and HPV25. This grouping agrees with assays of the degree of transforming activity of the E6 protein (these related HPV types had relatively low transforming activity as compared to HPVs 5, 8, and 47), and clustering of similarity of amino acids in the second zinc finger domain of E6 [4]. The E6 gene of HPVs 14, 21, and 25 can enhance the induction of anchorage independent growth of 3Y1 cells by the HPV16 E7 gene, although again less effectively than that of HPVs 5, 8, and 47. HPV20 was cloned via AvaI restriction. But contrary to the assumption that type 20 had only one AvaI site (Kremsdorf et al., 1984) the sequence analysis of the clone showed the presence of two additional AvaI fragments of 16 and 176 ntd, respectively, at the cloning site (position 1158 in the final sequence) in opposite orientation relative to the large AvaI fragment containing the major part of the viral genome. The segment between the AvaI sites at position 1142 and 1334 is inverted in the pBR322 clone. This inversion leads to disrupted E7 and E1 ORFs in the clone. The sequence has been fixed to yield colinearity with the closely related HPV types. FEATURES Location/Qualifiers CDS 200..697 /note="ORF E6 from bp 182 to 697" /product="transforming protein" /gene="E6" /note="putative" /codon_start=1 /translation="MATPPSSEDSADEGPSNIGEAKPPILEPPLPATICGLAKLLEIP LDDCLIPCNFCGNFLTHLEVCEFDEKKLTLIWKDHLVFACCRVCCSATATYEFNQFYE STVLGRDIEQVTGKSVFDIDVRCYTCMKFLDSIEKLDICGRKRPFYLVRGSWKGICRL CKHFQ" CDS 697..1005 /note="ORF E7 from bp 685 to 1005" /product="transforming protein" /gene="E7" /note="putative" /codon_start=1 /translation="MIGKEVTLQDIVLELNELQPEVQPVDLFCEEELPNEQQEREEEP QIERASYKVVAPCGCCKVKLRIFISATEFAIRSFQQLLIDELQLLCPDCRGNCKHGGS " CDS 992..2809 /note="ORF E1 from bp 944 to 2809" /product="replication protein" /gene="E1" /note="putative" /codon_start=1 /translation="MADPKGSTSKDGLDDWCIVEAECSDVDNDLEELFDRDTDSDISE LLDDNDLEQGNSRELFHQQECKDSEEQLQKLKRKYISPKAIAQLSPRLESISLSPQQK SKRRLFAEQDSGLELTLTNEAEDVSSEVEEVPALDSQPVAEGHLGTVDIHYTELLRAS NHKAILLAKFKEAFGIGFNDLTRQFKSYKTCCNDWVLSVYAVHEDLLESSKQLLQQHC DYIWIRGIAAMSLFLLCFKAGKNRGTVHKLMTSMLNVHEKQILSEPPKLRNVAAALFW YKGAMGSGAFSHGPYPNWMAQQTIVGHQSTEASAFDLSEMIQWAFDHNYLDEADIAFQ YAKLAPENSNAVAWLAHNNQARFVRECASMVRFYKKGQMKEMSMSEWIYARINEVEGE GHWSSIAKFLRYQQVNVIMFLAALKDMLHSVPKHNCILIHGPPNTGKSAFTMSLIHVL KGRVLSFVNSKSQFWLQPMSETKIALIDDVTDPCWVYMDTYLRNGLDGHYVSLDCKHK APIQTKFPALLLTSNINVHNEVNYRYLHSRIKGFEFPNPFPMKPDNTPEFELTDQSWK SFFTRLWKQLELSDQEDEGENGESQQAFQCSARSANEHL" CDS 2751..4244 /note="ORF E2 from bp 2727 to 4244" /product="regulatory protein" /gene="E2" /note="putative" /codon_start=1 /translation="MENLSKRFNALQDQLMNIYESAPDTLESQIEHWQTLRKEAVLLY FARQHGISRVGYQPVPVLAVSEAKAKQAIGMVLRLQSLQKSEYGSEPWSLVDASAETF RSPPENHFKKGPISVEVIYDKDKDNANAYTMWRFVYYQDDDDKWHKSASGVNQTGIYF MQGTFRHYYVLFADDASRYSTTGQWEVKVNKETVFAPVTSSTPPDSPGGQADSNASSQ TPATTTDSTTRQSPRKQSQQTNTKGRRYGRRPSSRTRRTTQTRQRRRSRSKSKSKSRS RSRSRHRSRSRSRSESPRRRSRYRSRSGSRGRVALRAITTTTTTTTRRAGGGSPTSTS STTSQRSRQLRGGGRGGSRQRARGRRSSSTSPTPSKRSRGESESVRQHGISPSDVGTA VYTVSSRHTGRLGRLLDEALDPPVILVRGEPNTLKCFRNRAKQRYTGLYKSFSTAWSW VAGDGTERLGRSRMLISFISFSQRKDFDETVKYPKGVDRSFGSFDSL" CDS <3313..3999 /note="ORF E4 from bp 3313 to 3999" /gene="E4" /note="putative" /codon_start=1 /translation="KLIRKLCLLLSPAPPPPTHQEDKQTQTPPPRPPPPPLTPRPDSR PENSHNKPTPKGEGTDGDLPVGQGEQPKRARGDGPGQSPSPSPGRGRGRGTGLGLGLG LNRRAGGLGTDHDPDPEGESPSAPLPPPPQPPPDGQVEGHPPPPPPPPHNGRDSCGEG AVGGADKEQGEGDHHPPPPPPQNGHEGSQSLLGNMASLLLTWEQQFTQLVQDIQEDLE DYWMKLSIPQ" CDS 4321..5877 /note="ORF L2 from bp 4306 to 5877" /product="minor capsid protein" /gene="L2" /note="putative" /codon_start=1 /translation="MARAKRVKRDSATNIYRTCKQAGTCPPDVINKVESTTIADKILQ YGSAGVFFGGLGISTGKGTGGTTGYVPLGEGPSVRVGGTPTVIRPALVPDTIGPSDII PVDTLNPVEPSTSSIVPLTESTGPDLLPGEVETIAEIHPGPSRPPTDTPVTSTTSGSS AVLEVAPEPTPPARVRVSRTQYHNPSFQIITESTPTLGESSLADHIVVTSGSGGQAIG GMTPELIELQDFPSRYSFEIEEPTPPRRTSTPMQRLQNVFRRRGGLTNRRLVQQVPVD NPLFLTQPSRLVRFQFDNPVFEEEVTQIFEQDLDTFNEPPDRDFLDVQSLGRPQYSET PAGYVRVSRAGQRRTIRTRSGAQIGSQVHFYRDLSSIDTEDPIELQLLGQHSGDATIV QGPVESTFVDINVDENPLSEISAYSDDLLLDEANEDFSGSQLVVGGRRSTSTYTVPHF ETTRSSSYYVQDTKGYYVAYPEDRDVSKDIIYPNPDLPVVIIHTYDTSGDFYLHPSLT KRLKRKRKYL" CDS 5893..7443 /note="ORF L1 from bp 5878 to 7443" /product="major capsid protein" /gene="L1" /note="putative" /codon_start=1 /translation="MAVWQAASGKVYLPPSTPVARVQSTDEYVQRTNIYYHAYSDRLL TVGHPYFNIYDIQGTKIKVPKVSGNQHRVFRLKLPDPNRFALADMSVYNPDKERLVWG CRGIEIGRGQPLGVGSVGHPLFNKLGDTENPNSYKGNSTDDRQNVSFDPKQLQMFIIG CAPCLGEHWDRALPCADDVPNPGSCPPIELKNTAIQDGDMADIGYGNLNFKALQENRA DVSLDIVNETCKYPDFLKMQNDVYGDSCFFYARREQCYARHFFVRGGKTGDDIPAGQI DEGSMKNAFYIPPVNNQAQNNLGNSMYFPTVSGSLVSSDAQLFNRPFWLQRAQGHNNG ICWFNQLFVTVVDNTRNTNFSISVHSENTDVSKIQNYDSQKFQEYLRHVEEYEISLIL QLCKVPLTAEVLAQINAMNSNILEEWQLGFVPAPDNPIHDTYRYINSAATRCPDKNPP KEREDPYKDLNFWNVDLSERLSLELDQYSLGRKFLFQAGLQQATVNGTKTVSSKLSTR GVKRKRKQ" source 1..7757 /organism="Human papillomavirus type 20" BASE COUNT 2431 a 1510 c 1698 g 2118 t ORIGIN 1 tcgggcgcgg tcatacatta ctcatttggt agttgttgtt gccagctacc atcaagcata 61 gcatgttttt gcctgtaacg ttatcggcac agtgattaat atatatatat atatatatat 121 atatatatat atatatatat atatatatat agatacatat agacagatat catagagcta 181 atgcagagag tgcaggcaca tggctacacc tccttcttca gaagacagcg ctgatgaagg 241 accatctaat attggagagg caaaacctcc aatcttagag ccaccattgc ctgcaacaat 301 ctgtggccta gcaaaacttt tagaaatacc gctagatgat tgtttgatac cttgtaactt 361 ctgcggtaat ttccttacac atttagaagt ttgtgagttt gatgagaaga agcttacttt 421 aatttggaaa gatcatttgg tttttgcatg ctgtcgtgtt tgctgctcgg caacagcgac 481 atatgagttt aatcaatttt atgagagtac tgttttaggc agagacatag agcaagtaac 541 aggcaaatct gtttttgata tagatgtcag gtgctacacc tgtatgaaat ttttagactc 601 aattgaaaag ctagacatct gtggcagaaa gcgtccattt tatttagtga gaggctcttg 661 gaaaggaatc tgtaggctgt gtaagcattt tcaataatga ttggtaaaga ggtcacattg 721 caagatattg tgctggagtt aaatgaattg cagcctgagg ttcaaccagt tgacctgttt 781 tgtgaagagg agttaccgaa cgagcagcag gagagagagg aggagcctca gattgaaaga 841 gcctcataca aagttgttgc accttgcggc tgctgcaagg tgaaacttcg catctttata 901 agcgctacag aatttgctat tagaagcttt caacaattgc tgattgacga gctgcagctg 961 ttgtgtcctg actgtcgcgg gaactgcaaa catggcggat cctaaaggta gtacatctaa 1021 agacgggttg gatgattggt gtattgttga agctgaatgt agcgatgtag acaatgattt 1081 ggaagaatta tttgacagag atacagactc agatatttca gaattattag atgataatga 1141 cctcgagcag ggcaattctc gggaactatt tcatcaacaa gagtgtaagg acagcgagga 1201 gcaattacaa aaactaaaac gaaagtacat aagtccaaaa gctattgcac agcttagtcc 1261 gcgacttgaa agtatttcac tgtcaccaca gcagaagtca aaacgaaggc tttttgcaga 1321 gcaggacagc gggctcgagt taactcttac aaatgaagct gaagatgttt cttctgaggt 1381 ggaggaggta ccggccctag actctcagcc ggttgctgag ggacacttag gaacagtaga 1441 cattcattat acagaattat tgcgtgccag taaccataag gcaattttgt tggcaaaatt 1501 taaggaggct tttgggatag ggtttaatga tttgacacgt caatttaaaa gttacaaaac 1561 ctgctgtaat gattgggttc tatctgtgta tgcagttcat gaggatcttc ttgaaagctc 1621 aaagcagtta ttgcaacagc attgtgatta tatatggatc cgtgggatag cagcaatgtc 1681 attgtttcta ttgtgtttta aagcaggaaa aaatcgtggg actgtgcata aattaatgac 1741 atcaatgttg aatgtgcatg aaaagcaaat attgtctgag cctccaaaat taagaaatgt 1801 tgctgctgct ttattttggt ataaaggtgc aatggggtcc ggagcatttt ctcatggtcc 1861 atatcctaac tggatggcac agcaaactat tgttggtcat cagagcacag aagccagtgc 1921 ttttgacttg tctgaaatga ttcagtgggc atttgaccat aattatctag atgaggctga 1981 tatagccttt cagtatgcta agctagcacc agaaaatagt aatgctgtag catggcttgc 2041 acataataac caagcaaggt ttgttagaga atgtgcatca atggtcaggt tttataaaaa 2101 aggtcaaatg aaagaaatga gcatgtcaga atggatttat gccagaatta atgaagtaga 2161 aggcgaagga cattggtcat ctattgctaa atttcttaga tatcagcaag taaatgttat 2221 aatgttttta gctgctttga aagatatgct gcattctgta cctaaacata actgtatatt 2281 aatacatggc ccacctaata ctggaaaatc tgcattcact atgtcattga tacatgtgtt 2341 aaagggaagg gtattgtcct ttgtaaattc taaaagccaa ttctggttac aaccaatgtc 2401 agaaactaaa atagcattaa ttgatgacgt aactgatcct tgctgggttt atatggatac 2461 atatttaaga aatggcttag atggacatta tgtctcacta gattgcaagc ataaagcacc 2521 aattcaaaca aaatttcctg cattactgct tacctctaat attaatgttc ataatgaagt 2581 taactataga tatttacata gtagaattaa aggatttgaa tttccaaatc catttccaat 2641 gaaaccagac aatacccctg agtttgagct tactgaccaa agctggaaat ctttttttac 2701 aaggctttgg aagcaattag agctgagtga ccaagaagac gagggagaaa atggagaatc 2761 tcagcaagcg tttcaatgct ctgcaagatc agctaatgaa catttatgag tctgcaccag 2821 acactcttga gtcgcaaatt gagcactggc aaaccctgcg aaaagaagct gtgctactat 2881 attttgctag gcaacatggt atcagcaggg ttggatatca acctgtgcct gtattagctg 2941 tgtcagaagc caaagctaaa caggctatag gaatggtatt aaggttacaa tcattgcaaa 3001 aatctgaata tggaagtgaa ccatggtctt tggtagatgc aagtgcagag acatttagaa 3061 gcccgccaga aaatcacttt aaaaaaggtc cgatttcagt agaggtcata tatgacaaag 3121 ataaagacaa tgccaatgct tataccatgt ggagatttgt ttattaccaa gatgatgacg 3181 acaagtggca caaaagtgct agtggtgtta accaaacagg catatatttt atgcaaggaa 3241 catttagaca ctactatgtt ttgtttgctg atgatgcgag tagatatagt acaactggac 3301 aatgggaagt gaaagttaat aaggaaactg tgtttgctcc tgtcaccagc tccacccccc 3361 ccgactcacc aggaggacaa gcagactcaa acgcctcctc ccagaccccc gccaccacca 3421 ctgactccac gaccagacag tcgcccagaa aacagtcaca acaaaccaac accaaaggga 3481 gaaggtacgg acggagacct tccagtagga caaggcgaac aacccaaacg cgccagaggc 3541 gacggtccag gtcaaagtcc aagtccaagt ccaggtcgcg gtcgaggtcg cggcaccggt 3601 ctcggtctcg gtctcggtct gaatcgccgc gccggcggtc tcggtaccga tcacgatccg 3661 gatccagagg gagagtcgcc ctccgcgcca ttaccaccac caccacaacc accaccagac 3721 gggcaggtgg agggtcaccc acctccacct cctccaccac ctcacaacgg tcgcgacagc 3781 tgcggggagg gggccgtggg gggagcagac aaagagcaag gggaaggcga tcatcatcca 3841 cctcccccac cccctcaaaa cggtcacgag gggagtcaga gtctgttagg caacatggca 3901 tctctccttc tgacgtggga acagcagttt acacagttag ttcaagacat acaggaagac 3961 ttggaagatt actggatgaa gctctcgatc ccccagtgat tttagttagg ggagagccta 4021 atacgcttaa gtgctttcgc aatagggcca aacaaagata tacagggctg tataagtctt 4081 ttagcacggc ctggtcgtgg gtggctggag atggcacgga gcgtctaggc aggtccagaa 4141 tgctcattag ctttatatcc ttcagtcaaa gaaaagattt tgatgagact gtgaaatatc 4201 cgaagggggt tgaccggtcg tttggttcat ttgacagctt atagcaacct aaccttctaa 4261 ccactgcatg ctactaacac actaacattt tttaattttt attaatattt tttatttgct 4321 atggcgcgcg ctaagcgagt caagcgggac tctgctacta acatatacag aacctgcaaa 4381 caagcaggta cttgtcctcc tgatgttata aataaagtgg aaagcacaac tattgctgat 4441 aaaattttgc agtatggtag tgctggtgtt ttttttgggg gattaggcat aagcactgga 4501 aaaggtacag gaggaaccac aggttatgtg cctttgggag aaggcccatc ggtgcgtgtt 4561 ggtggtacac ctacagtcat acgacctgct ttggtcccag acaccatcgg cccctccgat 4621 attatacctg tggacacctt aaatccggtg gagccttcta cctcttctat tgttccactt 4681 acagaatcca caggaccaga tcttttacct ggtgaagtgg aaactattgc agaaatacat 4741 ccaggcccct caaggccacc aactgataca ccagttacat ctactaccag tggttctagt 4801 gcagttctag aggtagcacc agaaccaaca cctccagctc gtgtcagagt cagccgcacc 4861 cagtatcata acccatcatt tcaaataata actgaatcaa caccaacatt gggggaaagc 4921 tcattagcgg atcatatagt agtgacatct ggttctgggg gccaagcaat tggggggatg 4981 acacctgaac ttatagagct tcaggatttc ccatcaaggt attcatttga aatagaagag 5041 ccaacccctc ctagaagaac tagcacacct atgcaaagac ttcaaaatgt gttcaggcgt 5101 agaggaggcc ttactaacag aagattagtt caacaagtgc ctgtagacaa tccattattt 5161 ttgacacaac cttctagatt ggtccggttt cagtttgata acccggtttt tgaggaagaa 5221 gttactcaaa tatttgaaca agatttagac acttttaatg agcccccaga cagagacttt 5281 ttggatgttc agagtttagg caggcctcaa tactcagaaa ctcctgcagg ttatgtgcgg 5341 gtcagccgtg caggtcaacg aaggactatc agaactcgtt ctggagcaca aatagggtct 5401 caagtgcact tttatagaga tctcagtagt attgatacag aagatcctat tgaactgcag 5461 ttgttgggtc agcattctgg cgatgcaact attgtccaag gtccagtaga aagcactttt 5521 gttgatatca atgtagatga aaacccactt tcagaaatca gtgcatattc tgatgattta 5581 cttttagatg aagctaatga agactttagt ggctctcagt tagttgtagg gggaaggcgt 5641 tctacatcta catacactgt tcctcacttt gaaactacta gatctagctc ttactatgta 5701 caagatacaa aggggtatta tgtagcatat cctgaagata gagatgttag taaggacatt 5761 atttatccta atccagattt accagtggtc attattcaca catatgacac aagtggagat 5821 ttttatttac atccaagtct tactaaaaga ttaaaaagaa aaaggaaata tttgtaactt 5881 tttcttttgc agatggcagt ttggcaagca gctagtggta aggtgtacct tccaccatct 5941 acaccagttg ccagggtcca aagtacggat gaatatgtgc aaaggactaa catatactat 6001 catgcataca gtgatcgcct actaactgtt ggtcatccat attttaatat atatgacatc 6061 caaggcacta agataaaagt ccctaaggtt tctggaaatc agcacagagt gtttaggtta 6121 aaactaccag atcccaacag atttgcatta gcagatatgt ctgtgtataa cccagataaa 6181 gaaagattgg tctggggctg tagaggtata gaaataggta gaggacagcc attaggcgtt 6241 ggaagtgtag gtcatccatt atttaataaa cttggtgaca cagaaaaccc taattcatat 6301 aaagggaatt caactgatga tagacaaaat gtatcttttg accctaaaca actacaaatg 6361 tttataatag gctgtgcccc atgtttagga gaacattggg acagggcttt accatgtgca 6421 gacgacgttc caaacccagg ttcatgccct ccaatagaat taaaaaatac agcaatacaa 6481 gatggcgata tggcagatat aggatatggc aacctaaatt ttaaagcatt acaagaaaac 6541 agagcagatg taagtttgga tattgttaat gagacctgta aatatccaga ctttttaaaa 6601 atgcagaatg atgtttatgg agattcctgc tttttttatg ctcggcggga acaatgttat 6661 gctagacact tttttgtacg tgggggcaaa acaggagatg atatacctgc aggacaaatt 6721 gatgaaggta gcatgaagaa tgcattctac attccacctg tgaataatca ggcacagaac 6781 aacctaggta attcaatgta tttcccaact gtcagtggct cattggtgtc tagtgatgct 6841 caattgttta ataggccatt ttggctgcag cgcgcacagg gccacaacaa tggcatctgc 6901 tggttcaatc aactatttgt tactgtagta gataatactc gaaatacaaa ttttagcata 6961 tcagttcatt cagaaaacac tgatgtttct aaaattcaaa attatgattc tcagaaattt 7021 caagaatatt taagacacgt agaagaatat gaaatttcat taattttaca gctctgtaaa 7081 gttcctttaa cagctgaagt tttagctcaa attaatgcta tgaattcaaa tatattagag 7141 gagtggcagt taggattcgt tcctgcaccg gataatccta tccacgatac atacagatat 7201 attaattctg cagctactag atgtcctgat aaaaatcctc caaaagaaag agaagatcct 7261 tacaaggatc taaacttttg gaatgttgac ctatcagaaa gattatcctt agaattggat 7321 caatattctt taggacgcaa attcttattt caagcaggtt tacaacaagc gaccgtaaac 7381 ggtacaaaaa ctgtatcttc aaagttatct actaggggcg tcaaacgaaa acgcaaacaa 7441 taaacccgac cgttttcggt acaataaagt caacttttac acggtattca aggaatgttt 7501 atttactctg actaactaag ataccaaccg cacccgacac ataaaggtga gttgtgtgcc 7561 aaatgaggtg agttgtgagc cagaagagat cacagccaag tcaggcttga gccagatcag 7621 atacactgcg tgccagagtt ggctcaaact tcatcgtccc aacacgttcg gaacaggagg 7681 aaatgtaagg ctgccaacgc ttttggctct tctttttggc acagcagaag accgttaacg 7741 gtaagttttt atttgta