LOCUS       HPV22        7368 bp ds-DNA             VRL       04-JUL-1995
DEFINITION  Human papillomavirus type 22 (HPV22), complete genome.
ACCESSION   U31780
SOURCE      Human papillomavirus type 22 DNA.
  ORGANISM  Human papillomavirus type 22
            Viridae; ds-DNA nonenveloped viruses; Papovaviridae;
            Papillomavirus.
REFERENCE   1  (bases 1 to 7368)
  AUTHORS   Delius,H.
  TITLE     Direct Submission
  JOURNAL   Unpublished
REFERENCE   2
  AUTHORS   Kremsdorf,D., Favre,M., Jablonska,S., Obalek,S., Rueda,L.A.,
            Lutzner,M.A., Blanchet-Bardon,C., Van Voorst Vader,P.C., and
            Orth,G.
  TITLE     Molecular cloning and characterization of the genomes of nine
            newly recognized human papillomavirus types associated with
            epidermodysplasia verruciformis
  JOURNAL   Journal of Virology 52(3), 1013-1018 (1984)
COMMENT     HPV22 was originally isolated from macules on the chest of an
            Italian epidermodysplasia verruciformis (EV) patient [2].
            Cloned HPV22 DNA was obtained from the Papillomavirus Reference
            Center, Heidelberg and subsequently sequenced by Dr. H. Delius.
            The HPV22 genome, like that of HPVs 9, 15, 17a/b, 23, 37, and
            38, is smaller than most PV genomes at approximately 7.4 kb.
            Phylogenetic reconstructions based on DNA sequences of
            established types indicate that HPV22 is most closely related
            to HPVs 23 and 38, and then to 15, 17, 37 and 9. Although
            Kremsdorf et al [2] found substantial cross-hybridization
            between HPV22 and HPV19, nucleotide sequence comparison fails
            to support a close relationship between these two types.
FEATURES Location/Qualifiers CDS 149..646 /note="ORF E6 from bp 125 to 646" /product="transforming protein" /gene="E6" /note="putative" /codon_start=1 /translation="MQPLVVIYALLAYYLSRMGCYSVFMALQRPLTVQQLSDKLTVPV VDLLLPCRFCSRFLTYLELRQFDYKNLQLIWTDEDFVFACCSGCAYASAQFEFQQYYQ VTLYGREIEQEEQRPVGQIYMRCQYCLKSLDLLEKLDICCSNQPFHKVRDHWKGRCRH CKAIE" CDS 643..945 /note="ORF E7 from bp 634 to 945" /product="transforming protein" /gene="E7" /note="putative" /codon_start=1 /translation="MIGKQATLCDIVLEELVLPIDLHCHEELPELPEELEESVVEEEP EYTPYKIVVYCGGCDTKLKLYILATLSGIRDFQTSLLGPVKLLCPTCREEIRNGRR" CDS 932..2758 /note="ORF E1 from bp 896 to 2758" /product="replication protein" /gene="E1" /note="putative" /codon_start=1 /translation="MDDDKGTDTTDAKEGCSGWFMLEAACSDDSDLDNSLEKLFEDGT ESDVSDLINDDDTAAQGNSRELLCQQQSEECEQQIQYLKRKYFSPKAVQQLSPRLQSM NISPGHKSKRRLFVEHDSGLECSLNEAEDLTEEVEVPASAPAPAAQGGVGSGHYTSLL RCNNVKAVLLGKFKDAFGVSYNELTRQFRSNKTCCKHWVLAIYAAKDELIDASKQLLQ QHCTYLWLQTFSPMSLYLCCFNVGKSRETVMRLLSSMLQVNENHILSEPPKIRSMIAA LFWYKGSMNPNVYAFGEYPEWIMTQTMIHHQTADSVQFDLSEMIQWAYDQDYVDECTI AYQYARLADSNSNARAFLAHNSQAKYVRECAQMVRYYKRGEMRDMSISAWIHHCISKI EGDGHWQDIVKFLRYQGLNFIVFLDKFRTFLKNFPKKNCLLICGPPDTGKSMFSMSLM KALRGQVVSFANSKSHFWLQPLADAKLALLDDATEVCWQYIDAFLRNGLDGNMVSLDM KHRAPCQMKFPPLIITSNISLKKEKKFPYLHSRIYEFEFPNKFPFDANDTPLFKLTDQ SWASFFKRLWTQLELSDQEEEGENGETQRTFQCTTREVNGLI" CDS 2700..4010 /note="ORF E2 from bp 2676 to 4010" /product="regulatory protein" /gene="E2" /note="putative" /codon_start=1 /translation="MEKLSERFSALQEKLMDLYESGVEDLETQIQHWKLLRQEQVLFY YARRHGILRLGYQPVPTLATSESKAKDAIAMGLLLESLQKSQYAEEPWTLVETSLETV KSPPADCFKKGPKSVEVYFDGDPENVMSYTVWSYIYYQTDDESWEKVEGHVDYTGAYY IEGTFKTYYIKFETDAKRYGTTGHWEVHVNKDTVFTPVTSSTPPVGVASQNSAPEPAS TSDSPQRSSQVTHRYGRKASSPTITTIRRQKRRERQRQETPTRRRKTRSRSRSTEQRG GRATRRSLSRESAESPRRGGRGGGGPLTRSRSRSRSRTRESVDGGGVAPDEVGATLRS IGRQHSGRLAQLLDAAKDPPVILLRGAANTLKCYRYRFRKKHAGSFQFISTTWSWVGG HTTDRIGRSRILISFHTDREREKCLQQMKLPLGVEWSYGQFDDL" CDS <3184..3765 /note="ORF E4 from bp 3184 to 3765" /gene="E4" /note="putative" 
/codon_start=1 /translation="KGPLKPIILNLKQMLNDMVQQDIGRCMLIKILCLPLLPVLRRQL ESPPRTPHPNRHPPPTPHNGHHKSPTDTAEKHLVLQSPPSGGKKGERDKDKKPQQGEE KPDQGPEAPSSGEGGPPDDPSPENPQNPPGGEGEVEGAPSPGPAQGRDPVHESLLTGV ASRLTKWEQHFDQLVDSIVGDLRNYWTQLKTPQ" CDS 4077..5651 /note="ORF L2 from bp 4035 to 5651" /product="minor capsid protein" /gene="L2" /note="putative" /codon_start=1 /translation="MARARRTKRASVTDIYKGCKASGTCPPDVINKVEQNTLADKILK YGSVGVFFGGLGISTGKGTGGPTGYIPLGQGPGVRVGATPTVVRPGVIPEIIGPTELI PVDSVTPIDPAAPSIVTLTDSSAGADLLPGEVETIAEVHPVPIDNVELDTPLVSGDRH AILEVTDANPPFRRTVTRTQYHNPAFEIISESTPLIGESTPSDHVFVFEGSGGVQVGD ANESIELDTFPSRYSFDIEEPTPPRRVSTPIERISQEFRTLRRALYNRRLTEQVQVRD PLFIRSPSRLVRFQFDNPVFDEEVTQIFERDVAAVEEPPDRDFLDIERLGRPILTETA EGRVRVSRLGQRASLSTRSGARVGARVHFFTDISTINAEEPIELELLGEHSGDSSVVQ EPFESTILDVNIDNIPESLDTNIAETSVDYDSADLLLDNGVEDFSRSQLVIGPSDRSL PSITVPQFESPRETIVYIQDIEGNTVVYPKYEERPTIILPTPSGPAIIQSPTHSSFDY YLHPSLRRKKRKRKYL" CDS 5662..7194 /note="ORF L1 from bp 5569 to 7194" /product="major capsid protein" /gene="L1" /note="putative" /codon_start=1 /translation="MTLWLPTSGKIYLPPTPPVARVQNTDEYVERTDIYYHAISDRLL TVGHPYFDVRSSDGAKIEVPKVSGNQFRAFRVTFPDPNKFALGDMTIHDPERYRLVWA CKGLEIGRGQPLGVGTTGHPLFNKLHDTENPTERQEGTSDDRRNVSFDPKQVQMFIIG CIPCLGEYWDKAPVCEDAGSQVGLCPPLELKNGVIEDGDMFDIGFGNINNKTLSFNRS DVSLDIVNEICKYPDFLTMSNDVYGDSCFFCARREQCYARHNFVRGGLVGDAIPDDAV QQDHKYYLPAASQTALENSTYFPTVSGSLVTSDAQLFNRPFWLKRAQGHNNGILWNNQ MFVTVADNTRNTNFSISVASDGTTVNYDAKKIREFMRHVEEYQLSFILQLCRIPLEAE VLTQINAMNHGILENWQLGFVPTPDNSVHDTYRYLQSKATKCPDAVPDTQKEDPFGQY TFWNVDMSEKLSLDLDQYPLGRKFLFQSGLQRARASARVSVKRSATRKTSKTVKRRKL TS" source 1..7368 /organism="Human papillomavirus type 22" BASE COUNT 2315 a 1352 c 1614 g 2087 t ORIGIN 1 ccgccaaagc tttgccaggt cttggcagaa catttgctgg caaagactgc accgataacg 61 gtaagaactt ttaattttta accgtaggcg gttatttgtt attcgtagca acaattgtgg 121 ttaacaacaa tctcctgcca gaatatacat gcaaccgctt gtggtaattt atgcactgct 181 tgcatattat ttaagtagga tgggctgcta ttctgtattc atggctttgc aaagaccact 241 gacagtacag caacttagtg ataagttgac tgtacctgta 
gtagatcttt tgctaccttg 301 tagattctgc agtaggtttt taacctattt ggaattgcgg caatttgatt ataagaattt 361 gcaattaatt tggacagacg aggactttgt gtttgcatgt tgcagcggct gtgcctacgc 421 ttcagcccaa tttgaatttc agcagtatta tcaagttact ttgtatggtc gtgaaattga 481 gcaagaagaa caacgacctg taggccaaat ttatatgaga tgtcaatatt gcttgaagtc 541 tcttgatttg ctagaaaagt tagatatctg ctgttccaat caaccatttc acaaggttag 601 agatcattgg aagggaaggt gcaggcactg taaagcaata gaatgattgg gaaacaagct 661 actctgtgtg atatagttct tgaagagctt gtcctgccca ttgacctgca ttgccacgag 721 gagctgcctg aacttccaga agagttagaa gaatcagtgg tagaggagga gcctgagtac 781 actccttaca agattgtagt atattgtggg ggttgtgata caaagctgaa gctgtatata 841 ctagcaactc tctctggaat tcgcgacttt caaacatctc tacttggacc tgtaaaactt 901 ttgtgtccca cctgtcgaga agagattcgc aatggacgac gataaaggta ctgacacaac 961 tgatgctaaa gaaggatgta gtggttggtt tatgttagaa gctgcgtgct cagatgatag 1021 tgacttagat aatagtttgg aaaagttatt tgaagatggt acagagtcag atgtatctga 1081 tttaataaat gatgatgata ctgctgctca gggaaattcc cgcgaattgc tatgtcaaca 1141 gcaaagtgag gaatgtgagc agcagattca atatctaaaa cgaaagtatt tcagtccaaa 1201 ggctgttcag cagctaagtc cacgtctgca gtctatgaat atttcgcctg ggcataaatc 1261 taaaaggaga ttatttgtgg agcacgacag cggactggag tgttccctaa atgaagctga 1321 agatcttact gaagaggtgg aggtaccggc gagcgctcca gcgccggcag cacagggtgg 1381 tgtagggtcg ggacattaca ccagtttgtt aagatgcaac aatgtaaagg cagtattgct 1441 gggaaaattt aaagacgcat ttggagtgag ctataatgag ctgactagac aatttagaag 1501 taataagact tgctgtaagc attgggtatt ggccatatat gctgctaaag atgaattaat 1561 agatgcgtcc aaacaattgt tacaacagca ttgtacctat ttgtggttgc aaacattctc 1621 acccatgtca ttatatttat gttgttttaa tgttgggaaa agtagagaaa ctgtgatgcg 1681 attgttatct tccatgttac aagttaatga gaatcatatt ttatcagaac ctccaaaaat 1741 cagaagtatg atagctgctt tattttggta taaaggtagt atgaatccaa atgtctatgc 1801 atttggagag tatcctgagt ggattatgac acagactatg atacatcacc aaactgctga 1861 cagtgtacaa tttgacctgt ctgaaatgat acaatgggct tatgatcaag attatgttga 1921 tgaatgtact attgcatacc agtatgctag attggctgat agtaatagta atgccagagc 
1981 atttttagct cataatagtc aagccaaata tgttagagaa tgtgctcaaa tggttagata 2041 ttataaacgt ggagaaatgc gagatatgtc aatttctgca tggatacatc attgtatatc 2101 aaagatagaa ggcgatggtc actggcaaga tattgttaaa tttttgcgat accaagggtt 2161 aaattttata gtgtttttag ataaatttag aacatttcta aaaaattttc caaagaaaaa 2221 ttgtttgtta atatgtggtc ctccggatac aggaaaatct atgtttagca tgtcattaat 2281 gaaagcatta agaggacagg tagtttcatt tgcaaattct aaaagtcatt tttggctaca 2341 gcctttagca gatgcaaaac tggctttatt agatgatgct acagaagttt gctggcaata 2401 tattgatgct ttcttaagaa atggattaga tggtaacatg gtatctttag atatgaaaca 2461 tagagctcca tgtcaaatga aatttccacc tcttattata acatctaaca ttagtttaaa 2521 aaaagaaaaa aaatttccct atttacatag tagaatatat gaatttgagt ttcctaacaa 2581 atttcccttt gacgcaaatg atacacctct gtttaaactt actgaccaaa gctgggcgtc 2641 tttttttaaa aggctttgga cacaattaga actgagtgat caagaagaag agggagaaaa 2701 tggagaaact cagcgaacgt ttcagtgcac tacaagagaa gttaatggac ttatatgaat 2761 caggtgtaga ggatcttgaa acccaaattc aacattggaa attattaaga caagaacaag 2821 tgttatttta ttatgcaagg agacatggga tattgcgttt ggggtaccaa ccagtaccca 2881 ctctggcaac ttcagagagt aaagcaaaag atgctatagc catgggacta ttgctggaaa 2941 gcttacaaaa atcacaatat gcagaggaac cgtggacctt agtagaaact agtttggaga 3001 cagttaaaag ccctccagca gactgtttta aaaaaggacc taaatctgtg gaagtgtact 3061 ttgatggaga tcctgaaaat gtaatgtctt atacagtgtg gtcatacatt tattatcaga 3121 ctgatgatga gtcatgggaa aaggtggaag gtcatgtgga ctatacagga gcttactata 3181 tagaagggac ctttaaaacc tattatatta aatttgaaac agatgctaaa cgatatggta 3241 caacaggaca ttgggaggtg catgttaata aagatactgt gtttacccct gttaccagtt 3301 ctacgccgcc agttggagtc gcctcccaga actccgcacc cgaaccggca tccacctccg 3361 actccccaca acggtcatca caagtcaccc accgatacgg ccgaaaagca tctagtccta 3421 caatcaccac catcaggagg caaaaaaggc gagagagaca aagacaagaa accccaacaa 3481 ggcgaagaaa aaccagatca aggtcccgaa gcaccgagca gcggggaggg agggccacca 3541 gacgatccct ctccagagaa tccgcagaat cccccaggcg gggagggaga ggtggagggg 3601 gccccctcac caggtcccgc tcaaggtcgc gatcccgtac acgagagtct gttgacgggg 3661 
gtggcgtcgc gcctgacgaa gtgggagcaa cacttcgatc aattggtaga cagcatagtg 3721 ggcgacttgc gcaattactg gacgcagcta aagacccccc agtaattctg ctacgcggtg 3781 cagcaaatac attaaaatgc tatcgctata gatttagaaa gaaacatgct ggaagcttcc 3841 aatttattag tacaacgtgg tcctgggtag gggggcatac aaccgataga atcgggcgct 3901 ctaggatact aatatcattt catacagata gggaaagaga gaagtgcttg caacaaatga 3961 aacttccttt aggtgtagaa tggtcatatg gccagtttga tgatttataa actgcttttt 4021 tactaacaca ctaacattgc ctatttatac taacctattt gcttgctact aacaaaatgg 4081 cgcgagcgcg aagaacaaag cgagcgtcag taactgacat ttataaaggc tgtaaggcct 4141 ctgggacttg tccccctgat gttattaata aagtggaaca aaatacactt gctgataaaa 4201 ttttaaagta tggcagtgtt ggtgtgtttt ttggtggtct tggtataagt acaggtaagg 4261 gtaccggtgg tcctacaggc tatattcctt taggtcaagg tcctggagtg cgtgtgggcg 4321 ccactcccac agtggtccgc cccggggtca tacctgaaat aattggacca actgaattaa 4381 taccagttga ctcagtaaca ccaattgacc ctgcagcacc atccatagtg acattaacag 4441 acagtagtgc aggtgctgac cttttacctg gtgaagttga aactattgca gaagtacatc 4501 cggtcccaat agacaatgtg gaacttgaca cacctttagt ttctggggac cgtcacgcca 4561 ttttggaggt gactgatgct aatccccctt ttaggcgcac ggttacccgg acacaatatc 4621 ataatcctgc ttttgaaatt atttcagagt ctacaccatt aataggtgaa tctacaccct 4681 ctgaccatgt ttttgttttt gaaggctcgg gaggtgtaca ggtaggggat gctaatgaaa 4741 gcattgaatt ggatactttt ccttctagat atagttttga cattgaggag ccaacccctc 4801 ctcgtagagt tagtacacca attgaaagaa tcagtcagga atttagaact ttaagaagag 4861 ccttatacaa cagaagatta acagaacagg tccaagtaag agaccccttg tttattcgat 4921 ccccgtccag gcttgtgaga tttcaatttg ataatccagt attcgatgag gaagttacac 4981 aaatatttga aagagatgta gctgcagtag aagaaccacc agacagggat tttttagata 5041 ttgaaagact tggaaggcct atactaacag aaactgcaga aggccgtgtt cgtgtcagca 5101 ggttagggca acgtgcatcg ctgagcacac gcagcggcgc acgtgtaggt gctagagtgc 5161 atttctttac agatattagc actattaatg cagaagagcc cattgaatta gaattattag 5221 gtgagcattc tggcgacagc tctgtagtac aagaaccatt tgaaagcaca atattggatg 5281 tcaatattga caacatacct gaaagtttgg atacaaacat agcagaaaca tctgtagact 5341 atgattctgc 
tgatttgtta ttagacaacg gtgtggagga ctttagtagg tcacaattgg 5401 taataggtcc ttcagataga tcacttccat ctattactgt tccacaattt gaatccccta 5461 gagaaaccat tgtgtacata caagacatag agggtaatac agttgtatat cctaaatatg 5521 aagaaaggcc aactattata ttacctacac cctcggggcc tgctataatt caatcaccta 5581 cacattcctc ctttgactat tatttacatc ctagcttgcg aaggaaaaaa cgcaaacgca 5641 aatatttata atgtttttca gatgaccctc tggcttccaa cttcgggtaa gatatatttg 5701 cctcctacgc caccggtagc ccgagtacaa aacacggacg agtatgtgga gaggactgac 5761 atctattacc atgctataag tgaccgttta ttaactgtag gacatcctta ctttgatgtt 5821 agatcatcag atggagcaaa aatagaggtc cctaaagtgt ctggaaatca gtttagggct 5881 tttagagtaa catttccaga tcctaacaaa tttgctttgg gagatatgac aatccatgat 5941 cccgaaaggt atagattagt atgggcttgt aaagggttag aaataggaag aggacagccc 6001 ttaggtgtag gtaccacagg tcatccatta tttaataaat tacatgatac tgaaaaccct 6061 actgaacgcc aggaaggaac atcagatgat agaagaaatg tttcttttga tcctaaacag 6121 gttcaaatgt ttatcattgg atgtataccg tgtttaggtg aatattggga taaagctcct 6181 gtttgtgaag atgcaggcag tcaggtagga ttatgtcctc cactagaatt aaaaaatggt 6241 gttatagagg atggagatat gtttgatata ggatttggaa atataaataa taaaacacta 6301 tcatttaata gatctgatgt aagcttagac attgtaaatg aaatctgtaa atatcctgat 6361 tttcttacaa tgtcaaatga tgtctatggc gactcatgct ttttttgtgc acgtagggag 6421 caatgttatg cacgacacaa ttttgtacgt ggtggtcttg ttggtgatgc tataccagat 6481 gatgcagttc aacaagatca taaatattac ttgcctgcag cttcacagac tgctttagaa 6541 aactccactt actttccaac cgttagtggt tctttagtaa cctctgatgc ccaactattc 6601 aacaggcctt tttggttgaa gcgcgcgcag ggccataata atggtatttt gtggaacaac 6661 caaatgtttg taacagtagc tgataatacc cgtaacacta atttttctat tagtgtggca 6721 agtgacggca ccacagttaa ttatgatgct aaaaaaatca gagaatttat gcgccatgtg 6781 gaagaatacc aattatcctt tattttgcag ctatgtagaa taccattaga agcagaggta 6841 ttaactcaaa ttaatgccat gaatcatggc attttagaaa attggcaact aggctttgta 6901 cctacaccag acaattctgt ccatgatact tataggtatt tacaatctaa agctacaaaa 6961 tgtcctgatg ctgtacctga cacacaaaag gaagatccct ttggtcaata tactttttgg 7021 aatgtagaca tgtctgaaaa 
gttatcattg gatttagatc agtatccact gggtcgtaaa 7081 tttttatttc aatctgggtt acaacgtgca agggccagtg ccagggtcag tgtgaaacgt 7141 tctgctacgc ggaaaacgtc taaaactgta aaacgaagga aacttacctc ttaaccgttt 7201 tcggttgctt taataaaatc tattaactaa tctggtatgt gaagcatttt ttgaccacct 7261 ttgtgactaa accgaacaag tcaacaccag caaccgcacc cggtttttac attataaatt 7321 cctcgaggta agataaccat cagtagatac catcggcacc tggagcaa