LOCUS HPV5 7746 bp ds-DNA VRL 30-SEP-1988 DEFINITION Human papillomavirus type 5 (HPV-5), complete genome. ACCESSION M17463 KEYWORDS complete genome. SOURCE Human papillomavirus type 5 DNA recovered from a benign flat wart from an EV patient. ORGANISM Human papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 7746) AUTHORS Zachow,K.R., Ostrow,R.S. and Faras,A.J. TITLE Nucleotide sequence and genome organization of human papillomavirus type 5 JOURNAL Virology 158, 251-254 (1987) STANDARD full staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by R.S.Ostrow, 10/23/87. HPV-5 is most often isolated from the pityriasis versicolor-like macular lesions of epidermodysplasia verruciformis (EV). HPV-5 has been found in both benign lesions and squamous cell carcinomas. EV is a rare genetic disease which is thought to be autosomal recessive, although a few single cases of X-linked inheritance have been reported. This genetic defect results in a patient who is immunotolerant. These patients are then infected by HPV types which are rarely seen in the general population and which often persist their entire life. This infection is characterized by either flat warts or pigmented (red or reddish-brown) macular plaques. Most often these warts are benign but occasionally they progress to malignancy. Ultraviolet light may be a cofactor in this progression. Benign lesions are associated with diverse EV HPVs, while in about 90% of cases cancers harbor HPV-5 or HPV-8. Studies indicate that carcinoma develops in approximately one-third of all EV patients. It is also interesting to note that other immunocompromised patients, such as renal allograft recipients and HIV positive patients, suffer from infection of EV HPV types and these infections have been known to progress to malignancy. FEATURES Location/Qualifiers protein_bind join(7741..7746,1..6) /function="gene regulation" /bound_moiety="E2 protein" /note="putative" polyA_signal 146..151 /note="putative" TATA_signal 152..158 /note="putative" TATA_signal 154..160 /note="putative" TATA_signal 156..162 /note="putative" TATA_signal 158..164 /note="putative" TATA_signal 160..166 /note="putative" CDS 200..673 /note="ORF E6 from bp 167 to 673" /product="transforming protein" /gene="E6" /note="putative" /codon_start=1 /translation="MAEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFC GNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEFNQFYEQTVLGRDIELA SGLSIFDIDIRCQTCLAFLDIIEKLDCCGRGLPFHKVRNAWKGICRQCKHFYHDW" CDS 663..974 /note="ORF E7 from bp 618 to 974" /product="transforming protein" /gene="E7" /note="putative" /codon_start=1 /translation="MIGKEVTVQDIILELSEVQPEVLPVDLFCEEELPNEQETEEEPD NERISYKVIAPCGCRNCEVKLRIFVHATEFGIRAFQQLLTGDLQLLCPDCRGNCKHDG S" CDS 961..2781 /note="ORF E1 from bp 913 to 2781" /product="replication protein" /gene="E1" /note="putative" /codon_start=1 /translation="MTDPNSKGSTSKEGFGDWCLLEADCSDVENDLGQLFERDTDSDI SDLLDDTELEQGNSLELFHQQECEQSEEQLQKLKRKYLSPKAVAQLSPRLESISLSPQ QKSKRRLFAEQDSGLELTLNNEAEDVTPEVEVPAIDSRPDDEGGSGDVDIHYTALLRS SNKKATLMAKFKESFGVGFNELTRQFKSHKTCCKDWVVSVYAVHDDLFESSKQLLQQH CDYIWVRGIGAMSLYLLCFKAGKNRGTVHKLITSMLNVHEQQILSEPPKLRNTAAALF WYKGCMGSGAFSHGPYPDWIAQQTILGHKSAEASTFDFSAMVQWAFHNHLLDEADIAY QYARLAPEDANAVAWLAHNNQAKFVRECAYMVRFYKKGQMRDMSISEWIYTKINEVEG EGHWSDIVKFIRYQNINFIVFLTALKEFLHSVPKKNCILIYGPPNSGKSSFAMSLIRV LKGRVLSFVNSKSQFWLQPLSECKIALLDDVTDPCWIYMDTYLRNGLDGHYVSLDCKY RAPTQMKFPPLLLTSNINVHGETNYRYLHTTIKGFEFPNPFPMKADNTPQFELTDQSW KSFFTRLWTQLDLSDQEEEGEDGESQRAFQCSARSANEHL" CDS 2723..4267 /note="ORF E2 from bp 2699 to 4267" /product="regulatory protein" /gene="E2" /note="putative" /codon_start=1 /translation="MENLSERFNALQDQLMNIYEAAEQTLQAQIKHWQTLRKEPVLLY YAREKGVTRLGYQPVPVKAVSETKAKEAIAMVLQLESLQTSDFAHEPWTLVDTSIETF RSAPEGHFKKGPLPVEVIYDNDPDNANLYTMWTYVYYMDADDKWHKARSGVNHIGIYY LQGTFKNYYVLFADDAKRYGTTGEWEVKVNKETVFAPVTSSTPPGSPGGQADTNTTPA TPTTSTTAVDSTSRQLTTSKQPQQTETRGRRYGRRPSSKSRRSQTQQRRSRSRHRSRS RSRSRSKSQTHTTRSTTRSRSTSLTKTRALTSRSRSRGRSPTTCRRGGGRSPRRRSRS PSTSSSCTTQRSQRARAESSTTRGARGSRGSRGGSRGGRGRRRGRSSSSSSPAHKRSR GGSAKLRGVSPGEVGGSLRSVSSKHTGRLGRLLEEARDPPVIIVKGAANTLKNVRNRA KIKYMGLFRSFSTTWSWVAGDGTERLGRPRMLISFSSYTQRRDFDEAVRYPKGVDKAY GNLDSL" CDS <3285..4022 /note="ORF E4 from bp 3285 to 4022" /gene="E4" /note="putative" /codon_start=1 /translation="KLIRKLCLLLSPAPRLQGRQEDKQTQTPPPRPPPPPQPPLTPRP DSSPHQNSHNKPKPEEEGTDGGPPASQGDRKRSKGDQGPDTGPGLGPGRGPSPKPTPL GPPPGPGPRRSPRLGPLQADRDPEEGPQPPAEGEVEGHPGGDQGHPPPPPPAPHNGHS GHEPKVQQPEGPEGREGHEEGAVGGEGGDEEGHPPPPPPPTNGHEGGLLSSVASLLVK WEGHFDQLVQSIQDDLEDYWKKLATPQ" CDS <3406..3912 /note="ORF E5 from bp 3406 to 3912" /gene="E5" /note="putative" /codon_start=1 /translation="LHVQTAHHIKTATTNRNQRKKVRTEALQQVKEIANAAKAIKVPT PVPVSVPVAVQVPNPHHSVHHQVPVHVAHQDSGPYKQIAIQRKVPNHLQKGRWKVTQA AIKVTLHLLLLHHTTVTAGTSRKFNNQRGPRVERVTRREPWGERAATRKVILLLLPRP QTVTRGVC" protein_bind 3537..3548 /function="gene regulation" /bound_moiety="E2 protein" /note="putative" CDS 4348..5904 /note="ORF L2 from bp 4240 to 5904" /product="minor capsid protein" /gene="L2" /note="putative" /codon_start=1 /translation="MARAKTVKRDSVTHIYQTCKQAGTCPPDVINKVEQTTVADNILK YGSAGVFFGGLGISTGRGTGGATGYVPLGEGPGVRVGGTPTVVRPSLVPETIGPVDIL PIDTVNPVEPTASSVVPLTESTGADLLPGEVETIAEIHPVPEGPSVDTPVVTTSTGSS AVLEVAPEPIPPTRVRVSRTQYHNPSFQIITESTPAQGESSLADHVLVTSGSGGQRIG GDITDIIELEEIPSRYTFEIEEPTPPRRSSTPLPRNQSVGRRRGFSLTNRRLVQQVQV DNPLFLTQPSKLVRFAFDNPVFEEEVTNIFENDLDVFEEPPDRDFLDVRELGRPQYST TPAGYVRVSRLGTRATIRTRSGAQIGSQVHFYRDLSSINTEDPIELQLLGQHSGDATI VHGPVESTFIDMDISENPLSESIEAYSHDLLLDETVEDFSGSQLVIGNRRSTNSYTVP RFETTRNGSYYTQDTKGYYVAYPESRNNAEIIYPTPDIPVVIIHPHDSTGDFYLHPSL HRRKRKRKYL" polyA_signal 4438..4443 /note="putative" CDS 5917..7467 /note="ORF L1 from bp 5905 to 7467" /product="major capsid protein" /gene="L1" /note="putative" /codon_start=1 /translation="MAVWHSANGKVYLPPSTPVARVQSTDEYIQRTNIYYHAFSDRLL TVGHPYFNVYNINGDKLEVPKVSGNQHRVFRLKLPDPNRFALPDMSVYNPDKERLVWA CRGLEIGRGQPLGVRSTGHPYFNKVKDTENSNAYITFSKDDRQDTSFDPKQIQMFIVG CTPCIGEHWDKAVPCAENDQQTGLCPPIELKNTYIQDGDMADIGFGNMNFKALQDSRS DVSLDIVNETCKYPDFLKMQNDIYGDACFFYARREQCYARHFFVRGGKTGDDIPRAQI DNGTYKNQFYIPGADGQAQKTIGNSMYFPTVSGSLVSSDAQLFNRPFWLQRAQGHNNG ILWANQMFITVVDNTRNTNFSISVYNQAGALKDVADYNADQFREYQRHVEEYEISLIL QLCKVPLKAQVLAQINAMNSSLLEDWQLGFVPTPDNPIQDTYRYIDSLATRCPDKNPP KEKEDPYKGLHFWDVDLTERLSLDLDQYSLGRKFLFQAGLQQTTVNGTKAVSYKGSNR GTKRKRKN" polyA_signal 6289..6294 /note="putative" protein_bind 7564..7575 /function="gene regulation" /bound_moiety="E2 protein" /note="putative" repeat_region 7703..7730 /rpt_type=tandem /rpt_unit=7703..7717, 7716..7730 /note="putative" source 1..7746 /organism="Human papillomavirus type 5" /sequenced_mol="DNA" BASE COUNT 2376 a 1547 c 1736 g 2087 t ORIGIN 354 bp upstream of HindIII site. 1 aacggtaagt tgcaatttcc ttgtaccagg tgcggtattg ggatttcaca attataatgg 61 ttgttgccaa ctaccatagg catattcaag tttttgcctg tatcgttttc gtatcctgta 121 ataatatcca atatatgtat acataaataa atatatatat atataagtgt ctaagattgg 181 gttcttctgt aatcaggcaa tggctgaggg agccgaacac caacagaaac tgacagaaaa 241 agataaggca gaattacctt taagtattag agacttagct gaagccttag gcatccctgt 301 gattgattgt ttaatacctt gcaatttctg tggcaacttt ctaaattatt tggaagcttg 361 tgaattcgac tacaaaaggc ttagtctaat ttggaaagat tattgtgtgt ttgcgtgctg 421 tcgcgtatgc tgtggcgcca ctgcaactta tgaatttaac caattttatg agcagacagt 481 gttaggaaga gatattgaat tagcttcagg actttcaata tttgatattg atatcaggtg 541 tcaaacttgc ttagcatttc ttgacattat agaaaagtta gattgctgtg gcagaggcct 601 tccctttcat aaggtgagga acgcctggaa gggaatctgt aggcagtgta agcattttta 661 tcatgattgg taaagaggtc accgtgcaag atattattct ggagctcagt gaggtgcagc 721 ccgaagtgct accagttgac ctgttttgtg aagaggaatt accaaacgag caggaaacgg 781 aggaggagcc tgacaacgaa aggatctctt acaaagttat agctccgtgc ggttgcagga 841 actgtgaggt caagcttcgc atttttgtcc acgccacaga atttggtatt agagctttcc 901 aacagctact gaccggagat ctgcagctcc tgtgccctga ctgtcgcgga aactgcaaac 961 atgacggatc ctaattctaa aggtagtaca tctaaagaag ggtttggtga ttggtgttta 1021 ttggaagctg actgtagtga tgtagaaaat gatttgggac aattatttga gagagataca 1081 gactctgata tatcggattt gttagatgat actgaactgg agcagggcaa ttccctggaa 1141 ctatttcatc aacaggagtg tgagcagagc gaggagcaat tgcaaaaact aaaacgaaag 1201 tatcttagtc caaaagctgt cgcacagctt agtccgcgac ttgagtcaat ttcattgtca 1261 ccccagcaga agtctaagcg aaggctcttt gcagagcagg acagcggact cgagctgact 1321 ttaaacaatg aagctgaaga tgttactcct gaggtggagg taccggctat tgactctcgg 1381 ccggatgacg agggaggttc aggggacgta gatatacatt acactgcatt gttgcgttct 1441 agcaacaaaa aagctacatt aatggctaag tttaaagagt cgtttggagt aggttttaat 1501 gaattgacac ggcaattcaa aagccacaaa acctgctgta aggactgggt tgtctctgta 1561 tatgcagtgc atgatgatct atttgaaagc tcaaagcagc tattgcaaca gcattgtgac 1621 tatatctggg tccgtgggat aggtgcaatg tcattatacc tattgtgttt taaggcggga 1681 aaaaatcgcg ggacagttca taagttaatt acctcaatgt taaatgtgca tgaacagcaa 1741 atattgtctg agccgccaaa attgagaaat acagccgctg cattgttctg gtataagggt 1801 tgtatgggat cgggggcgtt tagccatgga ccatatcctg attggattgc ccaacaaact 1861 atattaggtc acaaaagtgc tgaggcaagt acttttgatt tttcagcaat ggtccaatgg 1921 gcatttcata atcacttatt agacgaagca gatatagcat accagtatgc aaggcttgct 1981 cccgaagacg cgaatgcagt agcttggctt gcacataaca accaggccaa atttgtgaga 2041 gaatgtgcat atatggtacg attttataag aagggacaaa tgagagacat gagtatatct 2101 gaatggatat acactaaaat caatgaagta gaaggggaag ggcactggtc agatatagta 2161 aagtttatta gataccaaaa tataaacttt attgtattcc taactgcatt aaaagaattc 2221 ctacactcag tgccaaaaaa aaattgcatt ttaatttatg gtcctccaaa ttctggaaag 2281 tcatcatttg caatgtcatt aataagagtg ttgaagggta gagtgttgtc atttgtaaat 2341 tctaaaagtc agttttggct gcaacccctt tcagagtgca agatagctct attggatgat 2401 gtaacagacc cttgttggat atacatggat acatatttaa gaaatggctt ggatggacat 2461 tatgtttcat tagattgtaa atatagagcc ccaacgcaaa tgaaatttcc cccattatta 2521 ttaacatcta acattaatgt gcatggggaa actaattata gatatttaca cactacaata 2581 aaaggatttg aatttccaaa tccttttcct atgaaagcag ataatacacc tcagttcgaa 2641 ctaactgacc aaagctggaa atcttttttt acaaggcttt ggacacaatt agacctgagt 2701 gatcaagaag aggagggcga ggatggagaa tctcagcgag cgtttcaatg ctctgcaaga 2761 tcagctaatg aacatttatg aagctgcaga acaaacattg caggcacaaa ttaaacattg 2821 gcaaacctta cgaaaagaac ctgtattact ctactatgct agggagaaag gtgttacaag 2881 gcttggatat caacctgtgc ctgtaaaggc agtatcagaa acaaaggcta aagaagccat 2941 agcaatggtg ctgcagcttg agtcactaca gacatctgat tttgctcatg agccatggac 3001 tctagttgat accagcatag aaacatttag aagcgctcca gaaggtcact tcaaaaaagg 3061 ccccctccct gtagaagtta tttatgacaa tgatccagat aatgccaatt tgtatacaat 3121 gtggacctat gtgtattata tggatgcgga tgataagtgg cataaggcaa gaagtggggt 3181 gaatcacatt ggcatttatt atttacaagg aacttttaaa aactattatg tactgtttgc 3241 tgacgatgcg aaaagatatg gtacaactgg agaatgggaa gtaaaagtta ataaggaaac 3301 tgtgtttgct cctgtcacca gctccacgcc tccagggtcg ccaggaggac aagcagacac 3361 aaacaccacc cccgcgaccc ccaccacctc cacaaccgcc gttgactcca cgtccagaca 3421 gctcaccaca tcaaaacagc cacaacaaac cgaaaccaga ggaagaaggt acggacggag 3481 gccctccagc aagtcaagga gatcgcaaac gcagcaaagg cgatcaaggt cccgacaccg 3541 gtcccggtct cggtcccggt cgcggtccaa gtcccaaacc cacaccactc ggtccaccac 3601 caggtcccgg tccacgtcgc tcaccaagac tcgggccctt acaagcagat cgcgatccag 3661 aggaaggtcc ccaaccacct gcagaagggg aggtggaagg tcacccaggc ggcgatcaag 3721 gtcaccctcc acctcctcct cctgcaccac acaacggtca cagcgggcac gagccgaaag 3781 ttcaacaacc agaggggccc gagggtcgag agggtcacga ggagggagcc gtggggggag 3841 agggcggcga cgaggaaggt catcctcctc ctcctccccc gcccacaaac ggtcacgagg 3901 ggggtctgct aagctccgtg gcgtctctcc tggtgaagtg ggagggtcac ttcgatcagt 3961 tagttcaaag catacaggac gacttggaag attactggaa gaagctcgcg accccccagt 4021 aatcattgtc aaaggggcgg ctaacacact gaaaaatgtc cgcaacagag ctaaaattaa 4081 atacatggga ctgtttaggt catttagtac tacctggtca tgggtggcag gagatggcac 4141 tgagcgtcta ggcaggccca gaatgctcat tagcttttct tcctatactc aaaggagaga 4201 ttttgatgaa gcggtgcgat accccaaagg agttgataag gcctatggca acctggacag 4261 tctttaacat ttactaatgc tgcttttgct actaacatac taacataccc tagcatttta 4321 tatttttttt tacattttgt atttgctatg gcgcgtgcaa aaacggtcaa gcgagactct 4381 gtaactcata tttaccaaac ctgcaaacag gcaggcactt gcccccctga tgttattaat 4441 aaagtggaac aaacaacagt tgctgacaat attttaaaat atggcagtgc tggtgtattt 4501 tttggtggcc ttggtattag tacaggccga ggaactgggg gtgctacagg gtacgtgcca 4561 cttggggaag gtcctggtgt ccgtgtcgga ggaaccccca cggttgtaag gccttccttg 4621 gttcctgaaa caatcgggcc cgttgatatt ttgcccattg atacagttaa ccccgtggaa 4681 cctacagcat catccgtggt ccctctaact gagtccacag gcgctgattt acttccaggt 4741 gaagtagaaa caattgctga aatccatcct gtacctgagg ggccatcagt ggatacccct 4801 gtagttacca ctagcacagg ttccagtgct gttttagagg ttgccccaga gcctattcct 4861 ccaacacggg tcagggtttc acgcacacag tatcacaatc catcttttca aataataact 4921 gagtctactc cagcacaagg ggaatcgtct cttgcagatc acgttttggt gacatcgggt 4981 tctggggggc aacgaatagg gggtgatata actgacataa ttgagttaga ggaaattcct 5041 agtaggtata catttgaaat tgaagaacca actcctccac gccgcagcag tactccattg 5101 ccacgcaatc aatctgtagg ccgtaggagg ggtttctctt tgactaatag acgtttagta 5161 cagcaggtac aagtggacaa tccattgttt ctaactcaac catctaagtt agttcgtttt 5221 gcatttgata atcctgtttt tgaggaagaa gtgactaata tatttgaaaa tgatctggat 5281 gtctttgaag aacctccaga cagagatttt cttgatgtta gggaattggg acgtccacaa 5341 tattctacaa caccagcggg atatgttaga gtaagcaggt tggggactcg agccactatt 5401 cgcactcgct ctggtgcaca gatagggtcg caagtccatt tttacagaga tcttagctct 5461 attaatactg aagatcctat tgaattacaa ttattaggcc aacattcagg tgatgctact 5521 atagtccacg gacctgttga aagcacattt atagatatgg atatttctga aaatccatta 5581 tctgaaagca ttgaagcata ttcacatgat ttattattag atgaaacggt ggaagatttc 5641 agtgggtctc agctggttat aggtaatcga aggagcacaa actcttacac tgttcctagg 5701 tttgaaacta caagaaatgg ttcatactat acacaagaca caaagggata ttatgttgca 5761 tatccagagt cacgtaataa tgcagaaatc atttatccta cacctgatat tcctgtagtc 5821 attatacacc ctcatgacag tacaggggac ttttatttac atcccagtct tcacaggcgc 5881 aaacgtaaaa gaaaatattt gtgatttgca ttcgagatgg cagtgtggca ctcggctaat 5941 ggtaaagtat atcttccacc atcgacaccg gtggccagag tccaaagcac cgatgaatac 6001 attcaaagaa caaatatcta ctatcatgca tttagtgaca gattgttaac tgtaggtcat 6061 ccttatttca atgtatacaa tattaatggt gataagcttg aggttcctaa ggtttcagga 6121 aatcaacaca gagtatttcg cctaaaatta ccagatccta acagatttgc attacctgat 6181 atgtctgttt acaaccctga caaagaacgt ttggtttggg cctgtagagg cttagaaata 6241 ggtaggggcc agccattagg tgtacggagt actggtcacc cttatttcaa taaagtaaaa 6301 gatacagaaa acagtaatgc atacataaca ttttctaaag atgacagaca ggatacatct 6361 tttgatccta aacagatcca aatgtttatt gtaggatgca caccttgcat aggagagcat 6421 tgggataaag ctgttccatg tgcagaaaat gatcagcaaa ctggcctttg tcctcctatt 6481 gaactaaaaa acacatatat acaagatggt gatatggcag acataggttt tgggaacatg 6541 aattttaagg cacttcaaga tagtagatca gatgtcagtt tagacatcgt caatgaaact 6601 tgcaagtatc cagatttttt aaagatgcaa aacgatattt atggcgatgc gtgctttttt 6661 tatgctcgta gggagcaatg ttatgccaga cacttttttg ttagaggggg aaaaactggt 6721 gatgacattc cacgtgcaca aattgacaat ggtacataca aaaatcagtt ttacattcca 6781 ggggctgatg gccaagctca aaagactata ggaaattcca tgtatttccc aactgttagt 6841 ggctcattag tatccagtga tgctcaattg tttaacaggc ccttctggct ccaaagagcc 6901 caaggtcata ataatggcat cctgtgggct aatcaaatgt ttatcacagt ggttgacaac 6961 acaagaaata ctaatttcag tatttctgta tataatcagg ctggagcact aaaagatgtt 7021 gcagactata atgcagatca atttagagaa tatcaaagac atgtagaaga atatgaaata 7081 tctttaattc tacaactctg taaggttcct ttaaaggcac aggtattggc acagatcaat 7141 gcaatgaact cttcgttatt ggaggattgg cagttaggat ttgttcccac tcctgataat 7201 ccaattcagg acacctacag atatattgac tctttggcta cacggtgtcc agataagaat 7261 cctccgaaag aaaaggaaga cccttataag ggcttacatt tttgggatgt agatttaact 7321 gaaagattgt cattagattt agatcaatat tccttaggca gaaaattttt attccaagct 7381 gggttacaac aaacgaccgt taacggtaca aaagcagtgt cttataaagg gtctaataga 7441 ggaacaaaac gcaaacgtaa aaattgaggt ctgaccgaaa gtggtacatt tttataaact 7501 tttacacagt attcaaggaa tgtttgttta ctctgactaa gtataagtct tccaaggata 7561 ccgaccgcac ccggtacact cagtcaagtt gttgccaata tagaatcaga tcagtgccaa 7621 acacaccgtc ttggactcag aacagaccgt gttcgttata acatgctcgg attagggacc 7681 tccccaaaga agatttaatc tacaatcgct tttggcaatc gcatttggca ctgctaaaag 7741 accgtt