LOCUS       HPV42        7917 bp ds-DNA             VRL       21-JAN-1992
DEFINITION  Human papillomavirus ORF E6, ORF E7, ORF E1, ORF E2, ORF E4, ORF
            E5, ORF L2, and ORF L1 genes, complete cds.
ACCESSION   M73236
KEYWORDS    vacular papilloma.
SOURCE      Human papillomavirus type 42 DNA.
  ORGANISM  Human papillomavirus type 42
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae.
REFERENCE   1  (bases 1 to 7917)
  AUTHORS   Philipp,W., Honore,N., Sapp,M., Cole,S.T. and Streeck,R.E.
  TITLE     Human papillomavirus type 42: New sequence, conserved genome
            organization
  JOURNAL   Virology 186, 331-334 (1992)
  STANDARD  full staff_review
FEATURES             Location/Qualifiers
     CDS             <3282..3644
                     /note="ORF E4 from bp 3282 to 3644"
                     /gene="E4"
                     /note="putative"
                     /codon_start=1
                     /translation="LFLLHPYLAPHPPTQRYPLLDLLSWYNKCAPQTHCTPQRPLTTT
                     TQTVQTEQHTTCPSKPHRHENDTDSVDSRHHSTCSTQTPASPASPAHPWTLDCVGSEL
                     TVKTVTSDGTTVEVRLRL"
     CDS             <3919..4206
                     /note="ORF E5 from bp 3919 to 4206"
                     /gene="E5"
                     /note="putative"
                     /codon_start=1
                     /translation="TVIGLQYCDSTTCGTTGQKLLLLLFIVVGACVVCVWISLQNYPY
                     PVWASCLASYLTLVLLSWLQVLTYFDYFFLCLIILGIPSVLLTLLIHLAIQ"
     TATA_signal     74..81
     CDS             114..566
                     /note="ORF E6 from bp 108 to 566"
                     /product="transforming protein"
                     /gene="E6"
                     /note="putative"
                     /codon_start=1
                     /translation="MSGTSASSQPRTLYQLCKEFGLTLRNLQISCIWCKKHLTGAEVL
                     AYHFKDLVVVWRKDFPYAACAFCLEFNSKICALRHYERSAFWYTVEKETGLLLEEQQI
                     RCALCQKPLSQSEKNHHIDTGTRFQFILCQWTGRCTHCRGQCVERRLP"
     CDS             542..823
                     /note="ORF E7 from bp 476 to 823"
                     /product="transforming protein"
                     /gene="E7"
                     /note="putative"
                     /codon_start=1
                     /translation="MRGETPTLKDIVLFDIPTCETPIDLYCYEQLDSSDEDDQAKQDI
                     QRYRILCVCTQCYKSVKLVVQCTEADIRNLQQMLLGTLDIVCPLCARVE"
     CDS             829..2760
                     /note="ORF E1 from bp 724 to 2760"
                     /product="replication protein"
                     /gene="E1"
                     /note="putative"
                     /codon_start=1
                     /translation="MADDTGTEEGLGCSGWFCVEAIVDKTTENAISDDEDENVDDSGL
                     DLVDFVDNSTVIHTKQVHAQALLNKQQAHADQEAVQALKRKLLGSPYESPVSDSQHSI
                     DNELSPRLGGLTLCRGSQGAKRRLFQSLENRDSGYGYSEVEVQQTQVEHGHGAVHGTM
                     GNGGAVGSELGVQENEEGSTTSTPTTRVVELLKCKNLHATLLGKFKELFGVSFGDLVR
                     QFKSDKSSCTDWVIAAFGVNHSIAEGFNTLIKADSLYTHIQWLTCTWGMVLLMLIRFK
                     CGKNRTTVSKGLSKLLNIPTNQLLIEPPRLQSVAAAIYWFRSGISNASIVTGDTPEWI
                     QRQTILEHCFADAQFNLTEMVQWAYDNDITEDSDIAYEYAQRADRDSNAAAFLKSNCQ
                     AKYVKDCGVMCRHYKKAQMRRMSMGAWIKHRSAKIGDSGDWKPIVKFIRYQQIDFLAF
                     MSAFKKFLHNIPKKSCLVLIGPPNTGKSQFGMSLINFLAGTVISFVNSHSHFWLQPLD
                     SAKIAMLDDATPPCWTYLDIYLRNLLDGNPCSIDRKHKALTVVKCPPLLITSNTDIRT
                     NDKWKYLYSRVSLFEFPNPFPLDTNGNPVYELNDKNWKSFFQRLWSSLEFQESEDEED
                     YGETGQTFRCVPGTVVRTV"
     polyA_signal    1029..1035
     polyA_signal    2280..2285
     CDS             2702..3898
                     /note="ORF E2 from bp 2672 to 3898"
                     /product="regulatory protein"
                     /gene="E2"
                     /note="putative"
                     /codon_start=1
                     /translation="MERLAKRLDACQEQLLELYEENSRDLQKHIEHWKCLRMEAVVLY
                     KAREMGFANIGHQIVPTLETCRAKAHMAIEIHLALETLLQSSYGKEPWTLQETSNELW
                     LTNPKKCFKKQGRTVEVIFDGKQDNAMHYTAWTYIYIQTVQGTWCKVQGHVCHAGLYY
                     IVENMKQFYCNFKEEAKKYGVTDQWEVHDGNQVIVSPAPISSTTSTDAEIPSTGSTKL
                     VQQVCTTNPLHTTTSIDNHHADCTDGTAYNVPIQTSPPRKRYRQCGQSPSQHLQHSNP
                     SIPSIPSASVDPGLCGVRTNSENCNKRRNHCGSQATPVIHLQGDPNCLKCLRFRLKRN
                     CSHLFTQVSSTWHLTENDCTRDTKTGIITIHYYDEAQRNLFLNTVKIPSGIKSCIGYM
                     SMLQFI"
     TATA_signal     3892..3895
     polyA_signal    4378..4383
     TATA_signal     4391..4394
     CDS             4423..5856
                     /note="ORF L2 from bp 4348 to 5856"
                     /product="minor capsid protein"
                     /gene="L2"
                     /note="putative"
                     /codon_start=1
                     /translation="MPPQRSRRRKRASATQLYQTCKASGTCPPDVIPKVEGTTLADKI
                     LQWGSLGVFFGGLGIGTGAGTGGRTGYVPLGTRPPVIAEPGPAVRPPIAVDTVGPSDP
                     SIVSLLEESSVIDAGITVPDITSHGGFNITTSTGGPASTPAILDISPPTNTIRVTTTT
                     STNPLYIDPFTLQPPLPAEVNGRLLISTPTITPHSYEEIPMDTFVVSTDTTNTFTSTP
                     IPGPRSSARLGLYSRATQQRPVTTSAFLTSPARLVTYDNPAYEGLTEDTLVFEHPSIH
                     TAPDPDFMDIVALHRPMLSSKQGSVRVSRIGQRLSMQTRRGTRFGSRVHFFHDLSPIT
                     HSSETIELQPLSASSVSAASNINDGLFDIYVDTSDVNVTNTTSSIPMHGFATPRLSTT
                     SFPTLPSMSTHSANTTIPFSFPATVHVGPDLSVVDHPWDSTPTSVMPQGNFVMVSGWD
                     FILHPSYFWRRRRKPVPYFFADVRVAA"
     CDS             5837..7345
                     /note=" ORF L1 from bp 5756 to 7345"
                     /product="major capsid protein"
                     /gene="L1"
                     /note="putative"
                     /codon_start=1
                     /translation="MSVWRPSDNKVYLPPPPVSKVVSTDEYVQRTNYFYHASSSRLLV
                     VGHPYYSITKRPNKTSIPKVSGLQYRVFRVRLPDPNKFTLPETNLYNPETQRMVWACV
                     GLEVGRGQPLGVGISGHPLLNKLDDTENAPTYGGGPGTDNRENVSMDYKQTQLCLVGC
                     KPAIGEHWGKGTACTPQSNGDCPPLELKNSFIQDGDMVDVGFGALDFGALQSSKAEVP
                     LDIVNSITKYPDYLKMSAEAYGDSMFFFLRREQMFVRHLFNRAGAIGEPVPDELYTKA
                     ANNASGRHNLGSSIYYPTPSGSMVTSDAQLFNKPYWLQQAQGHNNGICWGNQLFLTVV
                     DTTRSTNMTLCATATSGDTYTAANFKEYLRHAEEYDVQFIFQLCKITLTVEVMSYIHN
                     MNPNILEEWNVGVAPPPSGTLEDSYRYVQSEAIRCQAKVTTPEKKDPYSDFWFWEVNL
                     SEKFSTDLDQFPLGRKFLLQAGLRARPKLSVGKRKASTAKSVSSAKRKKTHK"
     polyA_signal    6758..6763
     polyA_signal    7401..7406
     polyA_signal    7745..7753
     source          1..7917
                     /organism="Human papillomavirus type 42"
                     /sequenced_mol="DNA"
BASE COUNT     2433 a   1478 c   1647 g   2359 t
ORIGIN
        1 cttattataa actacaatcc tggctttgaa aaataaggga gtaaccgaat tcggttcaac
       61 cgaaaccggt acatatataa accacccaaa gtagtggtcc cagttaaggc agaatgtcag
      121 gtacatctgc ctcatcacag ccacgcacat tataccaatt gtgtaaggaa tttgggctga
      181 cattgcggaa tttacagatt tcctgcattt ggtgcaaaaa gcacttaaca ggcgcagagg
      241 tgctcgcgta ccattttaaa gatttggtag tggtgtggag gaaggacttt ccatatgctg
      301 catgtgcatt ttgtttagaa tttaattcta aaatttgtgc actgcgacac tacgaaagat
      361 cagcattttg gtatacagtg gagaaagaaa ctggactact tttagaagaa caacaaatta
      421 gatgtgcctt gtgtcaaaag ccgttatcac agagcgaaaa aaaccatcat attgatacag
      481 gtacaagatt tcaatttata ttgtgtcagt ggacgggtcg gtgtacgcat tgcagaggac
      541 aatgcgtgga gagacgccta ccctaaagga cattgttttg tttgacatac caacgtgtga
      601 gacacccatt gacctgtatt gctatgaaca attggacagc tcagatgaag atgaccaagc
      661 caaacaggac atacagcgtt acagaatact gtgtgtgtgt acacagtgtt acaagtctgt
      721 taaactcgtt gtgcagtgta cagaggcgga cataagaaac ctgcaacaga tgcttttggg
      781 cacactggat attgtgtgtc ctttgtgtgc ccgcgtggag taactgcaat ggcggatgat
      841 acaggtacag aggaggggct agggtgttct ggatggtttt gtgtagaagc tatagtagac
      901 aaaacaacag aaaatgctat ttcagatgac gaggacgaaa atgtagacga tagtgggtta
      961 gatcttgtgg attttgtaga taatagtaca gtaatacata caaagcaggt acatgcacaa
     1021 gccttattaa ataaacaaca agcacatgca gatcaggagg cagtacaggc actaaaacga
     1081 aagctattag gcagtccata tgaaagccct gtcagtgatt cacagcacag catagacaac
     1141 gaactaagtc ctaggcttgg cggtttaacg ctatgtcggg ggtcccaagg ggccaaacga
     1201 cgattattcc agtcactgga aaatcgagac agtggatatg gctattctga agtggaagta
     1261 cagcagacac aggtagaaca cggacatggc gccgtacatg ggactatggg taacgggggg
     1321 gcagtgggta gtgaacttgg ggtgcaggaa aatgaagaag gtagtactac aagtacgcct
     1381 acaacaaggg tggtagaatt acttaagtgt aagaacctgc atgcaacatt gttaggtaag
     1441 tttaaagaat tgtttggagt gtcatttggc gatttagtaa gacagtttaa aagtgacaaa
     1501 agcagttgta cagactgggt tattgcagca tttggggtta atcatagtat tgcagaaggg
     1561 tttaatacat taattaaagc agattcacta tatacacata tacaatggct aacctgtacg
     1621 tggggcatgg tgttattaat gctaattaga tttaaatgtg gaaaaaatcg tactacagtg
     1681 tccaaaggcc ttagtaaatt attaaacata cctacaaatc aattattaat agagccacct
     1741 cggttacaaa gtgtggctgc cgccatatac tggtttagat caggaatatc taatgctagc
     1801 attgtaaccg gagacacacc agagtggatt caaagacaaa caattttaga acattgtttt
     1861 gcagatgccc aatttaattt aacagaaatg gtgcaatggg catatgataa tgatattact
     1921 gaagacagtg acattgcata tgaatatgca caacgggcag acagggatag caatgctgct
     1981 gcatttttaa aaagtaactg ccaggcaaaa tatgtaaaag attgtggcgt catgtgcaga
     2041 cattataaaa aagcacaaat gagacgtatg tctatgggtg catggataaa acatagaagt
     2101 gccaagatag gggatagtgg agattggaaa cctatagtaa aatttattag atatcaacaa
     2161 attgattttt tagcatttat gtctgcattt aaaaagtttt tacataatat acctaaaaaa
     2221 agttgtttag tgttaattgg tcctccaaat acaggaaaat cacagtttgg aatgagttta
     2281 ataaacttct tagcaggaac tgtaatatca tttgtaaatt cacatagcca tttttggctg
     2341 cagccattgg acagtgcaaa aatagctatg ctggatgatg caactccacc atgttggaca
     2401 tatttagata tatatttaag aaatttatta gatggcaatc catgcagtat agatagaaaa
     2461 cataaagcat taacagttgt taagtgccca ccattactta taacatcaaa tacagatatt
     2521 agaacaaatg acaaatggaa atacctatac agcagagtta gtttatttga atttccaaat
     2581 ccatttccat tagatacaaa tggaaatcct gtatatgaat taaatgacaa aaattggaaa
     2641 tcattttttc aaaggttgtg gtccagctta gaatttcaag aatcagagga cgaggaagac
     2701 tatggagaga ctggccaaac gtttagatgc gtgccaggaa cagttgttag aactgtatga
     2761 ggaaaatagt agggatttac aaaaacatat tgaacattgg aaatgtttac gtatggaggc
     2821 agtggtattg tataaggccc gtgaaatggg ctttgcaaat ataggacatc aaatagtacc
     2881 aacattggaa acatgtagag ccaaggccca catggcaatt gaaatacact tggcattaga
     2941 gacattattg cagtcctcgt atggtaaaga accatggaca ttgcaagaaa caagtaatga
     3001 actgtggctt acgaatccta aaaaatgttt taaaaaacaa ggacgtaccg tggaggttat
     3061 atttgatgga aaacaggaca atgcaatgca ttatacagca tggacatata tatatataca
     3121 aactgtgcaa ggtacatggt gtaaagtaca aggacacgtt tgccatgcag gactatatta
     3181 tattgtggaa aatatgaaac agttttattg taattttaaa gaggaggcaa aaaaatatgg
     3241 ggtaacagac caatgggagg tacatgatgg caatcaggtg attgtttctc ctgcacccat
     3301 atctagcacc acatccaccg acgcagagat accctctact ggatctacta agttggtaca
     3361 acaagtgtgc accacaaacc cattgcacac cacaacgtcc attgacaacc accacgcaga
     3421 ctgtacagac ggaacagcat acaacgtgcc catccaaacc tcaccgccac gaaaacgata
     3481 cagacagtgt ggacagtcgc catcacagca cctgcagcac tcaaacccca gcatccccag
     3541 catccccagc gcatccgtgg accctggatt gtgtggggtc agaactaaca gtgaaaactg
     3601 taacaagcga cggaaccact gtggaagtca ggctacgcct gtaattcatt tacaaggtga
     3661 ccctaattgc ctaaaatgcc tacgatttag gctaaaaaga aattgttcac atttatttac
     3721 acaggtgtca tctacatggc atttaacaga aaatgattgt acacgtgaca ctaaaactgg
     3781 tataataaca atacattatt atgatgaagc acaaagaaat ttatttttaa atactgtaaa
     3841 aataccttct gggataaaat cctgtattgg atatatgtct atgttacagt ttatatgatt
     3901 agttgtatat gtgtataaac agttatagga cttcaatact gtgactccac aacgtgtggg
     3961 acaaccggcc agaaactgct gcttttattg tttatagttg ttggtgcgtg tgttgtgtgt
     4021 gtgtggatta gtttacaaaa ttatccatat cctgtatggg cctcttgcct tgctagctac
     4081 ctaacattgg tgctattatc atggttgcag gtactaacat actttgacta tttttttcta
     4141 tgtttaatca ttcttggtat tccttctgtc ttactaacat tactaataca tttagcaata
     4201 caataacaca tattagttta ggtgtgtgtg tgtggtgtgc atgtgatttg tacatggttg
     4261 tacatatata ataccaatta ttgtttggct actattttca tttatagcca cactgctgtt
     4321 ttgcatattg gtattacaaa catataaact gttaccatac gtatatacag tgctgtaaat
     4381 aaacttttgt tatattgtgt gtacttcttt tgtgctatta caatgccacc acaacggtcc
     4441 cgcagacgaa agcgggcctc tgccacacaa ttatatcaaa cgtgtaaggc ctcagggaca
     4501 tgtcctccag atgttattcc caaagttgaa ggaaccacat tggcagataa aattttacaa
     4561 tggggtagtt taggcgtgtt ttttgggggg ttgggaattg gcactggtgc aggtacgggt
     4621 gggcgcacgg gctatgtgcc tctgggaaca aggcctcctg taattgctga accaggacct
     4681 gcagtacgcc caccaatagc tgttgacacc gtggggccat ctgatccttc tattgtttcc
     4741 ttattagaag agtcatcagt tattgatgca ggaataacag tacctgatat tacttctcat
     4801 ggaggtttta atattactac atctactggt gggcctgcct caacgcctgc tatattagat
     4861 atctcccctc ccactaatac tatacgtgtc acaacaacta catctaccaa tcctttatat
     4921 attgatcctt ttacattgca gccgccattg ccagcagagg ttaatgggcg cctattaata
     4981 tctactccta ccatcacacc ccactcatat gaagaaatac caatggacac gtttgttgta
     5041 tctacagata caactaacac atttactagt actcccattc ctggccctcg gtcgtctgca
     5101 cgcctggggt tatattctag agcaacgcaa caacgtccag ttactaccag tgcattttta
     5161 acatctcctg cacggttggt tacttatgac aatccagcct atgaaggact tacggaggat
     5221 acattagtat ttgaacatcc atccattcat actgcacctg accctgattt catggatata
     5281 gttgcattgc atcgtcctat gttatcatcc aaacagggta gtgtacgtgt tagtagaatt
     5341 ggacaaaggc tgtctatgca gacacgtcgc gggacccgtt ttgggtcacg tgtacacttt
     5401 tttcatgacc ttagccctat tacacactct tcagaaacta ttgaattaca gcctttatct
     5461 gcttcttcag tatctgcagc ctccaatatt aatgatgggt tatttgatat ttatgttgat
     5521 actagtgatg taaatgttac aaataccact tcctctatac ctatgcatgg ttttgctacc
     5581 ccccgtttgt ccactacatc tttccctaca ttacctagca tgtctacaca ttctgccaat
     5641 accaccatac ctttttcgtt tcctgccact gtgcatgtgg gccctgattt atctgttgtg
     5701 gaccacccat gggacagtac cccaacgtct gtaatgcctc agggtaactt tgtaatggta
     5761 tcaggatggg attttatatt gcatcctagt tatttttggc gtaggcgccg taaacctgta
     5821 ccatattttt ttgcagatgt ccgtgtggcg gcctagtgac aacaaggttt atctacctcc
     5881 tcctcctgtt tccaaggtgg tcagcactga tgaatatgtg caacgcacca actactttta
     5941 ccatgccagc agttctaggc tattggttgt tggtcaccct tattactcta ttacaaaaag
     6001 gccaaataag acatctatcc ccaaagtgtc tggtttacag tacagagtat ttagagttag
     6061 gctccctgat cctaataagt ttacattgcc tgaaactaat ttatataacc cagagacaca
     6121 gcgcatggtg tgggcctgtg tggggctaga agtaggtcgt ggacagcctt tgggcgttgg
     6181 tattagtggc catccattat tgaataagtt ggatgatact gaaaatgcgc ctacatatgg
     6241 tggaggccct ggtacagaca atagggaaaa tgtttctatg gattataaac aaacacagtt
     6301 gtgtttagtt ggctgtaaac ctgccatagg ggagcactgg ggtaaaggta ctgcctgtac
     6361 accacagtcc aatggtgact gcccaccatt agaattaaaa aatagtttta ttcaggatgg
     6421 ggatatggtg gatgtagggt ttggggcact agattttggt gctttacaat cctccaaagc
     6481 tgaggtacct ttggatattg taaattcaat tactaaatat cctgattact taaaaatgtc
     6541 tgctgaggcc tatggtgaca gtatgttttt ctttttaagg cgagaacaaa tgtttgttcg
     6601 tcatttgttt aatagggctg gcgcaattgg tgaacctgta cctgatgaac tgtataccaa
     6661 ggctgctaat aatgcatctg gcagacataa tttaggtagt agtatttatt atcctacccc
     6721 tagtggttct atggtaacat ctgatgcaca actatttaat aaaccatatt ggttacaaca
     6781 agcacaagga cacaataatg gtatatgttg gggaaatcag ctatttttaa ctgtggttga
     6841 tactacccgt agtactaaca tgactttgtg tgccactgca acatctggtg atacatatac
     6901 agctgctaat tttaaggaat atttaagaca tgctgaagaa tatgatgtgc aatttatatt
     6961 tcaattgtgt aaaataacat taactgttga agttatgtca tatatacaca atatgaatcc
     7021 taacatatta gaggagtgga atgttggtgt tgcaccacca ccttcaggaa ctttagaaga
     7081 tagttatagg tatgtacaat cagaagctat tcgctgtcag gctaaggtaa caacgccaga
     7141 aaaaaaggat ccttattcag acttttggtt ttgggaggta aatttatctg aaaagttttc
     7201 tactgattta gatcaatttc ctttaggtag aaagttttta ctgcaggccg ggttgcgtgc
     7261 aaggcctaaa ctgtctgtag gtaaacgaaa ggcgtctaca gctaaatctg tttcttcagc
     7321 taaacgtaag aaaacacaca aatagatgta tgtagtaatg ttatgataca tatttatgtt
     7381 atttatttgt gtactgtgtt aataaactac tttttatatg ttgtgtgttc tccattttgt
     7441 tttttgtact ccattttgtt tctagaccga tttcggttgt atctggcctg ttaccaggtg
     7501 cattggccat gtttcctaac attttgcaaa cctattcact ttttaaattt ataaatgcaa
     7561 tatgtgctgc caactgtttt atggcacgta tgttctgcca acgtacactc cctaattcct
     7621 ttacataaca cacacgcctt tgcacaggca tgtgcacaaa ggttggcaaa ggttagcata
     7681 tctctgcagt tacccatttc ctttttcctt ttttttatgt atgagtaact taattgttat
     7741 atgtaataaa aaagctttta ggcacatatt ttcagtgttg gcatacacat ttacaagtta
     7801 ccttggctta aacaagtaaa gttatttgtc actgttgaca cattactcat atatataatt
     7861 tgtttttaac atgcaggtgg caaccgaaac cggtacataa atccttctta ttctttt