ID HPV22 STANDARD; ds-DNA; VRL; 7368 bp. XX DE Human papillomavirus type 22 (HPV22), complete genome. XX AC U31780 XX DT 04-JUL-1995 XX OS Human papillomavirus type 22 DNA. OC Human papillomavirus type 22;Viridae; ds-DNA nonenveloped viruses; OC Papovaviridae;Papillomavirus. XX RN [1] RP 1 - 7368 RA Delius,H; RT "Direct Submission;" RL Unpublished. XX RN [2] RP RA Kremsdorf,D., Favre,M., Jablonska,S., Obalek,S., RA Rueda,L.A.,Lutzner,M.A., Blanchet-Bardon,C., Van Voorst Vader,P.C., and RA Orth,G; RT "Molecular cloning and characterization of the genomes of nine newly RT recognized human papillomavirus types associated with RT epidermodysplasia verruciformis;" RL Journal of Virology 52(3), 1013-1018 (1984). XX XX Created by HIV database on 1-NOV-1995 from GenBank: U31780. XX XX XX FT KEY Location/Qualifiers FT CDS 149..646 FT /note="ORF E6 from bp 125 to 646" FT /product="transforming protein" FT /gene="E6" FT /note="putative" FT /codon_start=1 FT /translation="MQPLVVIYALLAYYLSRMGCYSVFMALQRPLTVQQLSDKLTVPV FT VDLLLPCRFCSRFLTYLELRQFDYKNLQLIWTDEDFVFACCSGCAYASAQFEFQQYYQ FT VTLYGREIEQEEQRPVGQIYMRCQYCLKSLDLLEKLDICCSNQPFHKVRDHWKGRCRH FT CKAIE" FT CDS 643..945 FT /note="ORF E7 from bp 634 to 945" FT /product="transforming protein" FT /gene="E7" FT /note="putative" FT /codon_start=1 FT /translation="MIGKQATLCDIVLEELVLPIDLHCHEELPELPEELEESVVEEEP FT EYTPYKIVVYCGGCDTKLKLYILATLSGIRDFQTSLLGPVKLLCPTCREEIRNGRR" FT CDS 932..2758 FT /note="ORF E1 from bp 896 to 2758" FT /product="replication protein" FT /gene="E1" FT /note="putative" FT /codon_start=1 FT /translation="MDDDKGTDTTDAKEGCSGWFMLEAACSDDSDLDNSLEKLFEDGT FT ESDVSDLINDDDTAAQGNSRELLCQQQSEECEQQIQYLKRKYFSPKAVQQLSPRLQSM FT NISPGHKSKRRLFVEHDSGLECSLNEAEDLTEEVEVPASAPAPAAQGGVGSGHYTSLL FT RCNNVKAVLLGKFKDAFGVSYNELTRQFRSNKTCCKHWVLAIYAAKDELIDASKQLLQ FT QHCTYLWLQTFSPMSLYLCCFNVGKSRETVMRLLSSMLQVNENHILSEPPKIRSMIAA FT LFWYKGSMNPNVYAFGEYPEWIMTQTMIHHQTADSVQFDLSEMIQWAYDQDYVDECTI FT AYQYARLADSNSNARAFLAHNSQAKYVRECAQMVRYYKRGEMRDMSISAWIHHCISKI FT EGDGHWQDIVKFLRYQGLNFIVFLDKFRTFLKNFPKKNCLLICGPPDTGKSMFSMSLM FT KALRGQVVSFANSKSHFWLQPLADAKLALLDDATEVCWQYIDAFLRNGLDGNMVSLDM FT KHRAPCQMKFPPLIITSNISLKKEKKFPYLHSRIYEFEFPNKFPFDANDTPLFKLTDQ FT SWASFFKRLWTQLELSDQEEEGENGETQRTFQCTTREVNGLI" FT CDS 2700..4010 FT /note="ORF E2 from bp 2676 to 4010" FT /product="regulatory protein" FT /gene="E2" FT /note="putative" FT /codon_start=1 FT /translation="MEKLSERFSALQEKLMDLYESGVEDLETQIQHWKLLRQEQVLFY FT YARRHGILRLGYQPVPTLATSESKAKDAIAMGLLLESLQKSQYAEEPWTLVETSLETV FT KSPPADCFKKGPKSVEVYFDGDPENVMSYTVWSYIYYQTDDESWEKVEGHVDYTGAYY FT IEGTFKTYYIKFETDAKRYGTTGHWEVHVNKDTVFTPVTSSTPPVGVASQNSAPEPAS FT TSDSPQRSSQVTHRYGRKASSPTITTIRRQKRRERQRQETPTRRRKTRSRSRSTEQRG FT GRATRRSLSRESAESPRRGGRGGGGPLTRSRSRSRSRTRESVDGGGVAPDEVGATLRS FT IGRQHSGRLAQLLDAAKDPPVILLRGAANTLKCYRYRFRKKHAGSFQFISTTWSWVGG FT HTTDRIGRSRILISFHTDREREKCLQQMKLPLGVEWSYGQFDDL" FT CDS <3184..3765 FT /note="ORF E4 from bp 3184 to 3765" FT /gene="E4" FT /note="putative" FT /codon_start=1 FT /translation="KGPLKPIILNLKQMLNDMVQQDIGRCMLIKILCLPLLPVLRRQL FT ESPPRTPHPNRHPPPTPHNGHHKSPTDTAEKHLVLQSPPSGGKKGERDKDKKPQQGEE FT KPDQGPEAPSSGEGGPPDDPSPENPQNPPGGEGEVEGAPSPGPAQGRDPVHESLLTGV FT ASRLTKWEQHFDQLVDSIVGDLRNYWTQLKTPQ" FT CDS 4077..5651 FT /note="ORF L2 from bp 4035 to 5651" FT /product="minor capsid protein" FT /gene="L2" FT /note="putative" FT /codon_start=1 FT /translation="MARARRTKRASVTDIYKGCKASGTCPPDVINKVEQNTLADKILK FT YGSVGVFFGGLGISTGKGTGGPTGYIPLGQGPGVRVGATPTVVRPGVIPEIIGPTELI FT PVDSVTPIDPAAPSIVTLTDSSAGADLLPGEVETIAEVHPVPIDNVELDTPLVSGDRH FT AILEVTDANPPFRRTVTRTQYHNPAFEIISESTPLIGESTPSDHVFVFEGSGGVQVGD FT ANESIELDTFPSRYSFDIEEPTPPRRVSTPIERISQEFRTLRRALYNRRLTEQVQVRD FT PLFIRSPSRLVRFQFDNPVFDEEVTQIFERDVAAVEEPPDRDFLDIERLGRPILTETA FT EGRVRVSRLGQRASLSTRSGARVGARVHFFTDISTINAEEPIELELLGEHSGDSSVVQ FT EPFESTILDVNIDNIPESLDTNIAETSVDYDSADLLLDNGVEDFSRSQLVIGPSDRSL FT PSITVPQFESPRETIVYIQDIEGNTVVYPKYEERPTIILPTPSGPAIIQSPTHSSFDY FT YLHPSLRRKKRKRKYL" FT CDS 5662..7194 FT /note="ORF L1 from bp 5569 to 7194" FT /product="major capsid protein" FT /gene="L1" FT /note="putative" FT /codon_start=1 FT /translation="MTLWLPTSGKIYLPPTPPVARVQNTDEYVERTDIYYHAISDRLL FT TVGHPYFDVRSSDGAKIEVPKVSGNQFRAFRVTFPDPNKFALGDMTIHDPERYRLVWA FT CKGLEIGRGQPLGVGTTGHPLFNKLHDTENPTERQEGTSDDRRNVSFDPKQVQMFIIG FT CIPCLGEYWDKAPVCEDAGSQVGLCPPLELKNGVIEDGDMFDIGFGNINNKTLSFNRS FT DVSLDIVNEICKYPDFLTMSNDVYGDSCFFCARREQCYARHNFVRGGLVGDAIPDDAV FT QQDHKYYLPAASQTALENSTYFPTVSGSLVTSDAQLFNRPFWLKRAQGHNNGILWNNQ FT MFVTVADNTRNTNFSISVASDGTTVNYDAKKIREFMRHVEEYQLSFILQLCRIPLEAE FT VLTQINAMNHGILENWQLGFVPTPDNSVHDTYRYLQSKATKCPDAVPDTQKEDPFGQY FT TFWNVDMSEKLSLDLDQYPLGRKFLFQSGLQRARASARVSVKRSATRKTSKTVKRRKL FT TS" FT source 1..7368 FT /organism="Human papillomavirus type 22" XX SQ SEQUENCE 7368 bp; 2315 a; 1352 c; 1614 g; 2087 t; ccgccaaagc tttgccaggt cttggcagaa catttgctgg caaagactgc accgataacg 60 gtaagaactt ttaattttta accgtaggcg gttatttgtt attcgtagca acaattgtgg 120 ttaacaacaa tctcctgcca gaatatacat gcaaccgctt gtggtaattt atgcactgct 180 tgcatattat ttaagtagga tgggctgcta ttctgtattc atggctttgc aaagaccact 240 gacagtacag caacttagtg ataagttgac tgtacctgta gtagatcttt tgctaccttg 300 tagattctgc agtaggtttt taacctattt ggaattgcgg caatttgatt ataagaattt 360 gcaattaatt tggacagacg aggactttgt gtttgcatgt tgcagcggct gtgcctacgc 420 ttcagcccaa tttgaatttc agcagtatta tcaagttact ttgtatggtc gtgaaattga 480 gcaagaagaa caacgacctg taggccaaat ttatatgaga tgtcaatatt gcttgaagtc 540 tcttgatttg ctagaaaagt tagatatctg ctgttccaat caaccatttc acaaggttag 600 agatcattgg aagggaaggt gcaggcactg taaagcaata gaatgattgg gaaacaagct 660 actctgtgtg atatagttct tgaagagctt gtcctgccca ttgacctgca ttgccacgag 720 gagctgcctg aacttccaga agagttagaa gaatcagtgg tagaggagga gcctgagtac 780 actccttaca agattgtagt atattgtggg ggttgtgata caaagctgaa gctgtatata 840 ctagcaactc tctctggaat tcgcgacttt caaacatctc tacttggacc tgtaaaactt 900 ttgtgtccca cctgtcgaga agagattcgc aatggacgac gataaaggta ctgacacaac 960 tgatgctaaa gaaggatgta gtggttggtt tatgttagaa gctgcgtgct cagatgatag 1020 tgacttagat aatagtttgg aaaagttatt tgaagatggt acagagtcag atgtatctga 1080 tttaataaat gatgatgata ctgctgctca gggaaattcc cgcgaattgc tatgtcaaca 1140 gcaaagtgag gaatgtgagc agcagattca atatctaaaa cgaaagtatt tcagtccaaa 1200 ggctgttcag cagctaagtc cacgtctgca gtctatgaat atttcgcctg ggcataaatc 1260 taaaaggaga ttatttgtgg agcacgacag cggactggag tgttccctaa atgaagctga 1320 agatcttact gaagaggtgg aggtaccggc gagcgctcca gcgccggcag cacagggtgg 1380 tgtagggtcg ggacattaca ccagtttgtt aagatgcaac aatgtaaagg cagtattgct 1440 gggaaaattt aaagacgcat ttggagtgag ctataatgag ctgactagac aatttagaag 1500 taataagact tgctgtaagc attgggtatt ggccatatat gctgctaaag atgaattaat 1560 agatgcgtcc aaacaattgt tacaacagca ttgtacctat ttgtggttgc aaacattctc 1620 acccatgtca ttatatttat gttgttttaa tgttgggaaa agtagagaaa ctgtgatgcg 1680 attgttatct tccatgttac aagttaatga gaatcatatt ttatcagaac ctccaaaaat 1740 cagaagtatg atagctgctt tattttggta taaaggtagt atgaatccaa atgtctatgc 1800 atttggagag tatcctgagt ggattatgac acagactatg atacatcacc aaactgctga 1860 cagtgtacaa tttgacctgt ctgaaatgat acaatgggct tatgatcaag attatgttga 1920 tgaatgtact attgcatacc agtatgctag attggctgat agtaatagta atgccagagc 1980 atttttagct cataatagtc aagccaaata tgttagagaa tgtgctcaaa tggttagata 2040 ttataaacgt ggagaaatgc gagatatgtc aatttctgca tggatacatc attgtatatc 2100 aaagatagaa ggcgatggtc actggcaaga tattgttaaa tttttgcgat accaagggtt 2160 aaattttata gtgtttttag ataaatttag aacatttcta aaaaattttc caaagaaaaa 2220 ttgtttgtta atatgtggtc ctccggatac aggaaaatct atgtttagca tgtcattaat 2280 gaaagcatta agaggacagg tagtttcatt tgcaaattct aaaagtcatt tttggctaca 2340 gcctttagca gatgcaaaac tggctttatt agatgatgct acagaagttt gctggcaata 2400 tattgatgct ttcttaagaa atggattaga tggtaacatg gtatctttag atatgaaaca 2460 tagagctcca tgtcaaatga aatttccacc tcttattata acatctaaca ttagtttaaa 2520 aaaagaaaaa aaatttccct atttacatag tagaatatat gaatttgagt ttcctaacaa 2580 atttcccttt gacgcaaatg atacacctct gtttaaactt actgaccaaa gctgggcgtc 2640 tttttttaaa aggctttgga cacaattaga actgagtgat caagaagaag agggagaaaa 2700 tggagaaact cagcgaacgt ttcagtgcac tacaagagaa gttaatggac ttatatgaat 2760 caggtgtaga ggatcttgaa acccaaattc aacattggaa attattaaga caagaacaag 2820 tgttatttta ttatgcaagg agacatggga tattgcgttt ggggtaccaa ccagtaccca 2880 ctctggcaac ttcagagagt aaagcaaaag atgctatagc catgggacta ttgctggaaa 2940 gcttacaaaa atcacaatat gcagaggaac cgtggacctt agtagaaact agtttggaga 3000 cagttaaaag ccctccagca gactgtttta aaaaaggacc taaatctgtg gaagtgtact 3060 ttgatggaga tcctgaaaat gtaatgtctt atacagtgtg gtcatacatt tattatcaga 3120 ctgatgatga gtcatgggaa aaggtggaag gtcatgtgga ctatacagga gcttactata 3180 tagaagggac ctttaaaacc tattatatta aatttgaaac agatgctaaa cgatatggta 3240 caacaggaca ttgggaggtg catgttaata aagatactgt gtttacccct gttaccagtt 3300 ctacgccgcc agttggagtc gcctcccaga actccgcacc cgaaccggca tccacctccg 3360 actccccaca acggtcatca caagtcaccc accgatacgg ccgaaaagca tctagtccta 3420 caatcaccac catcaggagg caaaaaaggc gagagagaca aagacaagaa accccaacaa 3480 ggcgaagaaa aaccagatca aggtcccgaa gcaccgagca gcggggaggg agggccacca 3540 gacgatccct ctccagagaa tccgcagaat cccccaggcg gggagggaga ggtggagggg 3600 gccccctcac caggtcccgc tcaaggtcgc gatcccgtac acgagagtct gttgacgggg 3660 gtggcgtcgc gcctgacgaa gtgggagcaa cacttcgatc aattggtaga cagcatagtg 3720 ggcgacttgc gcaattactg gacgcagcta aagacccccc agtaattctg ctacgcggtg 3780 cagcaaatac attaaaatgc tatcgctata gatttagaaa gaaacatgct ggaagcttcc 3840 aatttattag tacaacgtgg tcctgggtag gggggcatac aaccgataga atcgggcgct 3900 ctaggatact aatatcattt catacagata gggaaagaga gaagtgcttg caacaaatga 3960 aacttccttt aggtgtagaa tggtcatatg gccagtttga tgatttataa actgcttttt 4020 tactaacaca ctaacattgc ctatttatac taacctattt gcttgctact aacaaaatgg 4080 cgcgagcgcg aagaacaaag cgagcgtcag taactgacat ttataaaggc tgtaaggcct 4140 ctgggacttg tccccctgat gttattaata aagtggaaca aaatacactt gctgataaaa 4200 ttttaaagta tggcagtgtt ggtgtgtttt ttggtggtct tggtataagt acaggtaagg 4260 gtaccggtgg tcctacaggc tatattcctt taggtcaagg tcctggagtg cgtgtgggcg 4320 ccactcccac agtggtccgc cccggggtca tacctgaaat aattggacca actgaattaa 4380 taccagttga ctcagtaaca ccaattgacc ctgcagcacc atccatagtg acattaacag 4440 acagtagtgc aggtgctgac cttttacctg gtgaagttga aactattgca gaagtacatc 4500 cggtcccaat agacaatgtg gaacttgaca cacctttagt ttctggggac cgtcacgcca 4560 ttttggaggt gactgatgct aatccccctt ttaggcgcac ggttacccgg acacaatatc 4620 ataatcctgc ttttgaaatt atttcagagt ctacaccatt aataggtgaa tctacaccct 4680 ctgaccatgt ttttgttttt gaaggctcgg gaggtgtaca ggtaggggat gctaatgaaa 4740 gcattgaatt ggatactttt ccttctagat atagttttga cattgaggag ccaacccctc 4800 ctcgtagagt tagtacacca attgaaagaa tcagtcagga atttagaact ttaagaagag 4860 ccttatacaa cagaagatta acagaacagg tccaagtaag agaccccttg tttattcgat 4920 ccccgtccag gcttgtgaga tttcaatttg ataatccagt attcgatgag gaagttacac 4980 aaatatttga aagagatgta gctgcagtag aagaaccacc agacagggat tttttagata 5040 ttgaaagact tggaaggcct atactaacag aaactgcaga aggccgtgtt cgtgtcagca 5100 ggttagggca acgtgcatcg ctgagcacac gcagcggcgc acgtgtaggt gctagagtgc 5160 atttctttac agatattagc actattaatg cagaagagcc cattgaatta gaattattag 5220 gtgagcattc tggcgacagc tctgtagtac aagaaccatt tgaaagcaca atattggatg 5280 tcaatattga caacatacct gaaagtttgg atacaaacat agcagaaaca tctgtagact 5340 atgattctgc tgatttgtta ttagacaacg gtgtggagga ctttagtagg tcacaattgg 5400 taataggtcc ttcagataga tcacttccat ctattactgt tccacaattt gaatccccta 5460 gagaaaccat tgtgtacata caagacatag agggtaatac agttgtatat cctaaatatg 5520 aagaaaggcc aactattata ttacctacac cctcggggcc tgctataatt caatcaccta 5580 cacattcctc ctttgactat tatttacatc ctagcttgcg aaggaaaaaa cgcaaacgca 5640 aatatttata atgtttttca gatgaccctc tggcttccaa cttcgggtaa gatatatttg 5700 cctcctacgc caccggtagc ccgagtacaa aacacggacg agtatgtgga gaggactgac 5760 atctattacc atgctataag tgaccgttta ttaactgtag gacatcctta ctttgatgtt 5820 agatcatcag atggagcaaa aatagaggtc cctaaagtgt ctggaaatca gtttagggct 5880 tttagagtaa catttccaga tcctaacaaa tttgctttgg gagatatgac aatccatgat 5940 cccgaaaggt atagattagt atgggcttgt aaagggttag aaataggaag aggacagccc 6000 ttaggtgtag gtaccacagg tcatccatta tttaataaat tacatgatac tgaaaaccct 6060 actgaacgcc aggaaggaac atcagatgat agaagaaatg tttcttttga tcctaaacag 6120 gttcaaatgt ttatcattgg atgtataccg tgtttaggtg aatattggga taaagctcct 6180 gtttgtgaag atgcaggcag tcaggtagga ttatgtcctc cactagaatt aaaaaatggt 6240 gttatagagg atggagatat gtttgatata ggatttggaa atataaataa taaaacacta 6300 tcatttaata gatctgatgt aagcttagac attgtaaatg aaatctgtaa atatcctgat 6360 tttcttacaa tgtcaaatga tgtctatggc gactcatgct ttttttgtgc acgtagggag 6420 caatgttatg cacgacacaa ttttgtacgt ggtggtcttg ttggtgatgc tataccagat 6480 gatgcagttc aacaagatca taaatattac ttgcctgcag cttcacagac tgctttagaa 6540 aactccactt actttccaac cgttagtggt tctttagtaa cctctgatgc ccaactattc 6600 aacaggcctt tttggttgaa gcgcgcgcag ggccataata atggtatttt gtggaacaac 6660 caaatgtttg taacagtagc tgataatacc cgtaacacta atttttctat tagtgtggca 6720 agtgacggca ccacagttaa ttatgatgct aaaaaaatca gagaatttat gcgccatgtg 6780 gaagaatacc aattatcctt tattttgcag ctatgtagaa taccattaga agcagaggta 6840 ttaactcaaa ttaatgccat gaatcatggc attttagaaa attggcaact aggctttgta 6900 cctacaccag acaattctgt ccatgatact tataggtatt tacaatctaa agctacaaaa 6960 tgtcctgatg ctgtacctga cacacaaaag gaagatccct ttggtcaata tactttttgg 7020 aatgtagaca tgtctgaaaa gttatcattg gatttagatc agtatccact gggtcgtaaa 7080 tttttatttc aatctgggtt acaacgtgca agggccagtg ccagggtcag tgtgaaacgt 7140 tctgctacgc ggaaaacgtc taaaactgta aaacgaagga aacttacctc ttaaccgttt 7200 tcggttgctt taataaaatc tattaactaa tctggtatgt gaagcatttt ttgaccacct 7260 ttgtgactaa accgaacaag tcaacaccag caaccgcacc cggtttttac attataaatt 7320 cctcgaggta agataaccat cagtagatac catcggcacc tggagcaa 7368