LOCUS       HPV37        7421 bp ds-DNA             VRL       04-JUL-1995
DEFINITION  Human papillomavirus type 37 (HPV37), complete genome.
ACCESSION   U31786
SOURCE      Human papillomavirus type 37 DNA.
  ORGANISM  Human papillomavirus type 37
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
            Papillomavirus.
REFERENCE   1  (bases 1 to 7421)
  AUTHORS   Delius,H.
  TITLE     Direct Submission
  JOURNAL   Unpublished
REFERENCE   2
  AUTHORS   Scheurlen,W., Gissmann,L., Gross,G., and zur Hausen,H.
  TITLE     Molecular cloning of two new HPV types (HPV 37 and HPV 38) from
            a keratoacanthoma and a malignant melanoma.
  JOURNAL   International Journal of Cancer 37(4), 505-510 (1986)
COMMENT     Cloned HPV37 DNA was obtained from the Papillomavirus Reference
            Center, Heidelberg and subsequently sequenced by Dr. H. Delius.
            HPV37, as well as HPV9, was found in a keratoacanthoma in a
            patient who also had a basaloma. The HPV37 DNA was present as a
            circular monomeric episome, with approximately 10 copies per
            diploid cell. HPV37 was not detected in the basaloma.
            Keratoacanthoma is "a rapidly growing benign skin tumor which
            originates from the hair follicle and may resolve spontaneously
            after a period of months. It grows invasively and occurs as a
            single lesion or as multiple tumors... Tumors are found
            preferentially in areas exposed to sunlight. This proliferation
            has been suspected to be of viral origin, but there exists as
            yet no evidence to support this concept." [2] HPV37 was not
            found in 231 other tumor DNAs originating from different
            tissues (including 6 keratoacanthomas and 35 malignant
            melanomas, as well as 190 other tumors); thus no correlation
            has been found so far between HPV37 and any tumors of the skin
            or other tissues. HPV37 is closely related to HPV 9, 15, 17,
            22, 23, 38 by cross-hybridization as well as phylogenetic
            analysis. These types also have in common a relatively short
            (7.4 kb) genome.
FEATURES Location/Qualifiers CDS 200..625 /note="ORF E6 from bp 131 to 625" /product="transforming protein" /gene="E6" /note="putative" /codon_start=1 /translation="MARPKPQSVQQLADTLCIPLVDVLLPCRFCYRFLAYIELIAFDR KGLQLIWTEEDLVYACCTSCAYATAQFEFTSFYEHSVSGREIEEIEQKPIGEIAIRCK FCLKLLDLLEKLETCYTQQQFHKVRRNWKGLCRHCGSIG" CDS 622..906 /note="ORF E7 from bp 604 to 906" /product="transforming protein" /gene="E7" /note="putative" /codon_start=1 /translation="MIGKEATIPEIVLELQELVQPTADLHCYEELSEEETEEERPHIP YKIVAPCCFCGSKLRLIVVATPIGIRSQEELLLGEVQLVCPNCRGKLRHD" CDS 899..2728 /note="ORF E1 from bp 806 to 2728" /product="replication protein" /gene="E1" /note="putative" /codon_start=1 /translation="MTDDTKGTKFDPKEGCSDWFVLEAECSDNSLDGDLEKLFEEGND TDISDLIDDEDTVQGNSRELLCQQQSEESEQQIHLLKRKYFSSQEILQLSPRLQSITI SPQHKSKRRLFEGDSGLELSFNEAEDFTQQTLEVQEVSASGSEPADQGAKGLGIVKDL LKCSNVKAMLLAKFKEAFGVGFMELTRQYKSCKTCCRDWVVTLYAVQDELIESSKQLL LQHCAYIWLQHMPPMCLYLLCFNVGKSRETVFRLLMNLLQVAEVQILAEPPKLRSTLS ALFWYKGSMNPNVYAHGEYPEWIMTQTMINHQSAEATQFDLSTMIQYAYDNDLINEDE IAYNYAKLADTDANARAFLQHNSQARFVRECALMVRYYKRGEMKDMSISAWIHNKMLV VEGEGHWSDIVKFVRFQDINFIRFLDVFKSFLHNTPKKNCLLFYGPPDTGKSMFTMSL IKVLKGKVLSFANYKSNFWLQPLADTKIALIDDVTHVCWDYIDQYLRNGLDGNFVCLD LKHRAPCQIKFPPLLLTSNMDIMKEERYRYLHSRVHAFAFPNKFPFDSNNKPQFRLTD QSWKSFFERLWKQLDLSDQEDEGDDGHTQRSFQCTAREPNGHL" CDS 2670..4034 /note="ORF E2 from bp 2640 to 4034" /product="regulatory protein" /gene="E2" /note="putative" /codon_start=1 /translation="MDTLSDRFNALQENLMDIYESGRDDLETQIMHWQLLRQEQILFH YARKNGVMRLGYQPVPPLATSEAKAKDAIGMVILLESLQQSAYGKESWTLTQTSLETV RSPPANCFKKGPQNIEVMFDNDPENLMVYTAWSFIYYQTVDDTWNKVEGHVDYYGAYY FEGDLKVYYIQFEGDAARFSKTGRWEVHVNKDTIFAPVTSSSPAAGEGTDGAASVHTV SGSPLARGFSTTSVSTRKRTPPRRYRRKASSPTTTAARQKRQGEDTATRRSRSTSRGK QATSRGGDRRRRRRERSYSRDTSSSPDRGRGGRSRGGPETRSQSRSLSRSRSRSRSRG SSSRGGVAPDAVGKSVRTVGRDHSGRLKRLLDEARDPPVIVLRGDANKLKCYRYRAKK KHGNLVKYYSTTWSWVGGSTNDRIGRSRMLLAFQSNTERELFLKTMKLPPGVDWSLGH LDEL" CDS <3166..3789 /note="ORF E4 from bp 3166 to 3789" /gene="E4" /note="putative" /codon_start=1 
/translation="KSIIYNLKVMLPGLAKLDAGKYMLTRTLSLLLLLALRRQLEKGQ TGQPPSTPYPGRRSHGDSLPPPCPPENGHHHGDTEEKHLALQPPPPGKKDKEKTPQQG DQGPPPGGNKQPPGEGTDADGDENAPTPETPPVPPTGEGEGEVEGGPRHDPNQGPSHD PGRGRDPEGLLPGVALRLTQWESQFEQLVETIVDDLKDYWTKLGIPQ" CDS 4099..5703 /note="ORF L2 from bp 4084 to 5703" /product="minor capsid protein" /gene="L2" /note="putative" /codon_start=1 /translation="MARARRTKRASVTDIYRGCKQAGTCPPDVINKVEQTTIADKILK YGGAGVFFGGLGISTGRGTGGATGYVPLGEGPGVRVGGAPTIVRPGVIPELIGPADVI PIDTVTPIDPAAPSIVTITDSSAVDLLPNEIETIAEVHPVPTDNLDIDTPVVTGGRDS SAVLEVADPSPPVRTRVSRTQYHNPSFQIITESTPLAGESALADHVIVFEGTGGQNIG GSRNATIETAQESFEMQSWPSRYSFEIEEGTPPRSSTPVQRAVQSLSSLRRALYNRRL TEQVAVTDPLFLSRPSQLVQFQFDNPAFEEEVTQIFERDLEAVEEPPDRQFLDVIRLG RPTVAETPQAYLRVSRLGRRATIRTRSGAQVGAQVHFYRDLSTIDSDALEMQLLGEHS GDTTIVQGPVESSFVDINIDEPGPLNIGQQESTMADDTDFNSADLLLEDAVEDFSGSQ LVFGTSRRSTNSITIPRFETPRDTGFYIQDIQGYNVAYPESRDTTQVILPQPETPTVV IRFGEAGTDYYLHPSLKKKKRKRKYL" CDS 5716..7239 /note="ORF L1 from bp 5704 to 7239" /product="major capsid protein" /gene="L1" /note="putative" /codon_start=1 /translation="MTLWLPATGKVYLPPTPPVARVQSTDDYVERTNVFYHAMSDRLL TVGHPYYDVRSSDGLKIEVPKVSGNQYRAFRVRLPDPNKFALADMSVYNPEKERLVWA CAGLEIGRGQPLGVGTTGHPLFNKLRDTENNSNYQGGSRDDRQNTSFDPKQVQMFVVG CVPCMGEHWDKAPVCASEENNQTGQCPPLELKNTVIEDGDMFDIGFGNINNKVLSTNK SDVSLDIVNEICKYPDFLTMANDVYGDACFFFARREQCYARHYFVRGGNVGDAIPDGT VNQDHKYYLPAKSDQQQYLLGNSTYFPTVSGSLVTSDAQLFNRPFWLRRAQGHNNGIL WGNQMFITVADNTRNTNFSISVSTDNGEVTEYNSQTLREYLRHVEEYQLSIILQLCKV PLKAEVLTQINAMNSGILEEWQLGFVPTPDNSVHDLYRYINSKATKCPDAVVEKEKED PFAKYTFWNVDLTEKLSLDLDQYPLGRKFIFQSGLQSRPRIVRSSVKVSKGTKRKRS" source 1..7421 /organism="Human papillomavirus type 37" BASE COUNT 2345 a 1343 c 1666 g 2067 t ORIGIN 1 agccaagaat atttggcaga acattttctt ggaagacaac cgataacggt aagattgtaa 61 tctttcaacc gtaggcggta ctttctgatt ggtttggccg attgtagcta acaacaatct 121 ttcttcataa atacatgtaa ccgcctgcgt taacttacat gatctaaata aatatgatga 181 gcaatactta agagaatata tggctaggcc taagcctcaa tctgttcaac agcttgcaga 241 tactttatgt atacctttag tagatgtttt actgccttgc 
agattttgtt atagattctt 301 agcatatata gaattgatcg catttgatcg aaaaggtctt caactaattt ggaccgaaga 361 agatttagtg tatgcgtgct gtactagctg tgcctatgct acagcacagt ttgaatttac 421 cagtttctat gagcactcag ttagtgggag ggagatagaa gagatagagc aaaagccaat 481 aggagaaata gccatacgct gcaaattttg cttaaagtta ttggatttgt tagagaagtt 541 ggagacttgc tatactcagc aacaatttca caaggttagg cgcaattgga aaggcttgtg 601 tagacattgt gggtcgatag gatgattggg aaagaagcta caataccaga aatagtgctt 661 gagctgcaag agcttgtcca gcccactgct gacctgcatt gttacgaaga gttgagtgaa 721 gaagagacag aggaggagcg tcctcacatc ccttacaaga ttgtagctcc gtgctgcttt 781 tgtggttcta aactacgact gatagttgtt gcaacgccta ttggaattag atcacaagaa 841 gagctattac ttggtgaagt gcagctggtt tgtccaaact gtcgggggaa gcttcgccat 901 gactgacgac acgaaaggta caaaatttga tcctaaagaa ggatgtagtg attggtttgt 961 gctagaagca gaatgctctg acaatagttt agatggtgat ttggaaaagt tatttgaaga 1021 agggaatgat actgacattt ctgatttaat agatgatgag gacactgttc agggaaattc 1081 ccgcgaattg ttatgccagc aacaaagtga ggaaagcgag caacaaatac atttgctaaa 1141 acgaaagtat ttcagttcac aagagattct gcagttaagt cctcgtctgc agtctattac 1201 tatttcgcca cagcataagt ctaaaaggag attatttgaa ggagacagcg gactagaact 1261 gtcatttaat gaagctgaag attttactca gcagactttg gaggtgcagg aggtatcggc 1321 atccggctct gagccggcag accagggtgc caagggactg ggcattgtta aagaccttct 1381 taaatgtagt aatgttaaag ctatgttgtt agcaaaattt aaagaagcat ttggagttgg 1441 ctttatggaa cttactaggc aatataaaag ttgtaaaaca tgttgcagag attgggttgt 1501 aacgttgtat gcagttcaag atgaactgat agaaagctcc aaacagctgt tgcttcaaca 1561 ctgtgcttat atatggttgc agcatatgcc tccaatgtgt ttatatttat tgtgttttaa 1621 tgtgggtaaa agtagagaaa ctgtttttag actgctaatg aatttattgc aagtagcaga 1681 agtacaaata ttggctgaac ctccaaagct tcggagcaca ttatctgcac tgttttggta 1741 taaaggtagc atgaatccaa atgtctatgc acatggtgaa tatcctgagt ggattatgac 1801 acaaaccatg atcaatcacc aatcagcaga agctacacaa tttgatttat ccactatgat 1861 acaatatgca tatgacaatg atttaataaa tgaagatgaa attgcttata attatgccaa 1921 attagcagat acagacgcta atgccagagc ttttttacag cacaatagtc aagccagatt 
1981 cgttagagaa tgtgcactaa tggttagata ttacaaacga ggtgaaatga aagatatgag 2041 catatctgcc tggatacata ataaaatgtt agttgtggaa ggcgaaggac attggtctga 2101 tattgtaaag tttgtaagat tccaagatat caattttata aggtttctag atgtctttaa 2161 atcatttttg cataacactc ctaaaaagaa ttgtctttta ttttatggtc cacctgatac 2221 aggcaaatca atgtttacta tgtctttaat taaagtgtta aaaggaaaag ttttatcctt 2281 tgcaaattat aaaagtaatt tttggttgca gccgttggca gatactaaaa ttgctttaat 2341 agatgacgtc acgcatgtgt gttgggatta catagatcaa tatttaagaa atggattgga 2401 tggtaatttt gtttgtttag acctaaaaca tagagcgcca tgtcaaatta agtttccacc 2461 attattactg acttccaata tggatattat gaaggaagaa aggtatagat atttacatag 2521 cagggtgcat gcttttgcat ttccaaataa gtttcctttt gatagtaaca ataagccaca 2581 atttcgactt actgaccaaa gctggaaatc tttttttgaa aggctttgga aacagttaga 2641 tctcagtgac caagaagacg agggagacga tggacacact cagcgatcgt ttcaatgcac 2701 tgcaagagaa cctaatggac atttatgagt caggtcgaga tgacctagag acccaaatta 2761 tgcattggca acttctaagg caggagcaga tcctgtttca ttatgccaga aaaaatggag 2821 tcatgcgttt aggatatcaa cctgtacctc ctttagccac cagtgaagct aaagcaaaag 2881 atgcaattgg catggttata ttattagaaa gtttacaaca gtctgcttat ggtaaagagt 2941 cctggacact tacacaaact agtttggaga ccgtgaggag tccacctgca aattgtttta 3001 aaaagggccc tcagaacatt gaagtgatgt ttgacaatga ccctgaaaat ctaatggtgt 3061 atactgcctg gtcatttatt tattatcaga ctgtagatga cacgtggaac aaggttgagg 3121 gacatgttga ctactatggt gcatattatt ttgaaggaga tttaaaagtc tattatatac 3181 aatttgaagg tgatgctgcc aggtttagca aaactggacg ctgggaagta catgttaaca 3241 aggacactat ctttgctcct gttactagct cttcgccggc agctggagaa gggacagacg 3301 gggcagcctc cgtccacacc gtatccgggt cgccgctcgc acggggattc tctaccacct 3361 ccgtgtccac cagaaaacgg acaccaccac ggcgatacag aagaaaagca tctagcccta 3421 caaccaccgc cgcccggcaa aaaagacaag gagaagacac cgcaacaagg cgatcaaggt 3481 ccacctcccg ggggaaacaa gcaacctcca ggggagggga ccgacgcaga cggagacgag 3541 aacgctccta ctcccgagac acctccagtt cccccgacag gggaagggga gggagaagta 3601 gaggggggcc cgagacacga tcccaatcaa ggtccctctc acgatcccgg tcgcggtcgc 3661 
gatccagagg gtcttcttcc aggggtggcg ttgcgcctga cgcagtggga aagtcagttc 3721 gaacagttgg tagagaccat agtggacgac ttaaaagatt actggacgaa gctagggatc 3781 ccccagtaat tgtgctgcgt ggtgatgcta acaaattaaa atgctatcgc tatagagcta 3841 agaaaaagca tggaaaccta gttaagtact acagtaccac gtggtcatgg gttgggggca 3901 gcaccaatga tagaattgga aggtcacgca tgttacttgc atttcaatcc aatacagaaa 3961 gagagttgtt tttaaaaact atgaaattac caccaggagt tgattggtca ctgggtcatt 4021 tagatgaatt gtgaaaacag cttttttata acaaactaac attgcttttg cttttgctac 4081 taacctacta acgttccaat ggctcgcgca cgtcgtacca aacgtgcgtc tgtaactgac 4141 atttacaggg gttgcaagca ggccggcact tgcccccccg atgtaattaa taaagtggaa 4201 caaacaacaa ttgcagacaa aattttgaag tatggtggtg ctggtgtttt ttttggtggg 4261 cttgggatta gcaccggccg aggaacaggt ggtgctacag gatatgtccc tttgggggaa 4321 ggccctggag tgcgcgtagg aggcgcaccc accattgttc gccctggggt catacctgaa 4381 ttgattgggc cagcagatgt aatacctatt gacacagtca ctccaattga ccccgcagca 4441 cccagtattg tcacaattac agacagtagt gctgttgacc ttttacctaa tgaaatagaa 4501 acaattgcag aagtgcatcc tgtgccaaca gacaatttgg atattgatac tcctgtagtt 4561 acaggaggcc gggattccag cgctgttttg gaagttgctg atcctagtcc ccctgtgcga 4621 acaagagttt ccagaacaca atatcataat ccttcttttc aaataataac tgaatctaca 4681 cctttagcag gagaatctgc tttagctgac catgttattg tttttgaagg cactggagga 4741 caaaatatag gtggttctcg aaatgcaact atagaaacag ctcaagaaag ttttgaaatg 4801 caaagttggc cgagtaggta tagttttgaa atagaagaag gaacacctcc tagatctagc 4861 acaccagtac aaagagcagt acaatcactc tctagtttaa gacgggcatt gtataatagg 4921 agattaacag aacaggtagc agtcacggat cctttattct tgagtagacc ctcacaatta 4981 gtacagtttc agtttgacaa tcctgcattt gaagaagaag taactcaaat atttgagagg 5041 gatttagagg ctgtagaaga acctccagat agacagtttt tggatgttat tcgcttaggt 5101 agacctactg ttgctgaaac accacaagcg tatttaagag taagcagatt aggacgtcgt 5161 gctaccatcc gtactcgtag tggagcacag gtgggggctc aggtacattt ttatagagat 5221 ttaagtacta tagattctga tgccctagaa atgcaattat taggagaaca ttcaggtgat 5281 actactatag tacaaggacc tgtagaaagt tcatttgttg atataaatat tgatgaacca 5341 ggtcccttaa 
atatagggca acaagagtct actatggcag atgacacaga ttttaattct 5401 gcagatttat tgttagagga tgctgtagaa gacttctcag gatctcagtt ggtttttgga 5461 acctcacgcc gcagtacaaa ttctatcaca atacctagat ttgaaactcc aagagatact 5521 ggattttata tacaagatat tcaaggttac aatgtagcct atcctgagtc acgtgacaca 5581 acacaagtta tcttgccaca acctgaaaca ccaactgtag ttattagatt tggagaggca 5641 ggtacagact attatttaca tcctagctta aaaaagaaaa agagaaaacg caaatattta 5701 taattgtttt tacagatgac tttgtggctg ccagcgacgg gtaaagtata cttgcctcca 5761 acaccaccag tagcccgggt gcaaagcacg gatgattatg tggaaagaac aaatgtattc 5821 tatcatgcca tgagcgatcg tctcctaact gtaggacacc catattatga tgtaagatct 5881 agtgatggct taaaaatcga ggttcctaaa gtatctggaa atcaatacag agcttttagg 5941 gttaggttgc cagatccaaa taaatttgct ttagcagata tgtcagtata taatccagaa 6001 aaggaaaggt tggtgtgggc ctgtgcgggc ttggagatag gccgagggca accacttgga 6061 gtaggaacga caggtcaccc tttatttaat aaattaaggg acactgagaa taatagtaat 6121 taccaagggg ggtcacggga tgatagacaa aacacatcat ttgatccaaa acaagtacag 6181 atgtttgtgg ttggatgtgt gccatgcatg ggtgaacatt gggataaagc accagtttgt 6241 gcatcagagg aaaataatca gacaggacag tgtccaccac ttgaattaaa aaacacagtg 6301 attgaagatg gggacatgtt tgatataggg ttcggaaata ttaacaataa ggttctctct 6361 actaataaat cagatgttag tttagatata gtaaatgaaa tatgcaaata ccctgatttt 6421 ttaacaatgg ctaatgatgt ttatggggat gcatgtttct tttttgctag gagagaacaa 6481 tgttatgcca gacattattt tgtaagaggg ggaaatgtag gtgatgctat tcccgatggc 6541 actgttaatc aggaccacaa atattactta cctgccaaat cagaccagca gcagtatctg 6601 ttaggcaatt ctacctattt tcccactgtt agtggatctt tagtaacatc tgatgctcag 6661 ctctttaaca ggcctttttg gttacgcaga gctcaaggtc acaacaatgg cattttatgg 6721 ggtaatcaaa tgtttatcac agttgctgat aatacacgga acacaaactt ttctattagt 6781 gtgtctactg acaatggcga agttacagaa tataattctc aaacactcag agaataccta 6841 agacatgttg aagaatacca gctttcaatt attttacaac tttgtaaagt tcctttaaag 6901 gctgaggttt taactcagat aaatgcaatg aattctggta tattggaaga gtggcaatta 6961 ggatttgtac ctactccaga taattcagta catgaccttt ataggtacat taattcaaag 7021 gctaccaagt gtcctgatgc 
agttgttgaa aaagaaaagg aagatccctt tgcaaaatat 7081 acattttgga atgtagattt aactgaaaaa ttatcattgg atttagatca atatccttta 7141 gggaggaaat tcatctttca gtcgggattg caaagtagac ctagaattgt tcgatcgtct 7201 gtaaaagtgt ctaaaggtac aaagcgtaaa cggtcgtgac cgttttcggt ttccaataaa 7261 caaataaacc aataaggtat gtgaagcatt ttttaccatg ttcgtgacta aaccatataa 7321 gtcaacgcca acaaccgcac ccggtttaat cagatataaa acacctggtg cgattttatc 7381 agagcttttg tggaagcacc tgaggcgacc gccagaactg c