ID HPV21 STANDARD; ds-DNA; VRL; 7779 bp. XX DE Human papillomavirus type 21 (HPV21), complete genome. XX AC U31779 XX DT 04-JUL-1995 XX OS Human papillomavirus type 21 DNA. OC Human papillomavirus type 21;Viridae; ds-DNA nonenveloped viruses; OC Papovaviridae;Papillomavirus. XX RN [1] RP 1 - 7779 RA Delius,H; RT "Direct Submission;" RL Unpublished. XX RN [2] RP RA Kremsdorf,D., Favre,M., Jablonska,S., Obalek,S., RA Rueda,L.A.,Lutzner,M.A., Blanchet-Bardon,C., Van Voorst Vader,P.C., and RA Orth,G; RT "Molecular cloning and characterization of the genomes of nine newly RT recognized human papillomavirus types associated with RT epidermodysplasia verruciformis;" RL Journal of Virology 52(3), 1013-1018 (1984). XX RN [3] RP nal of Virology 52(3 RA Kiyono,T., Hiraiwa,A., and Ishibashi,M; RT "Differences in transforming activity and coded amino acid sequence RT among E6 genes of several papillomaviruses associated with RT epidermodysplasia verruciformis;" RL Virology 186(2), 628-639 (1992). XX XX Created by HIV database on 1-NOV-1995 from GenBank: U31779. XX XX XX FT KEY Location/Qualifiers FT CDS 200..706 FT /note="ORF E6 from bp 95 to 706" FT /product="transforming protein" FT /gene="E6" FT /note="putative" FT /codon_start=1 FT /translation="MADSSTDSADEGPSPKRRHLEEENTSSFLEPPLPATIRDLANLL FT EIPLDDCLVPCNFCGNFLTHLEVCEFDEKKLSLLWKDHCVFACCRVCCAATATYEYNE FT FYESTVVGRDIEEITGKSIFDIDVRCYNCMKFLDSIEKLDICGRKFFFHKVRGSWKGI FT CRLCKHFQ" FT CDS 706..1011 FT /note="ORF E7 from bp 694 to 1011" FT /product="transforming protein" FT /gene="E7" FT /note="putative" FT /codon_start=1 FT /translation="MIGKEVTLQDIVLELNELQPEVQPVDLFCEEELPSEQQETEEEL FT PERTAYKVVTPCGCCKVKLRIFVNATQFAIRTFQNLLFEELQLLCPECRGNCKHGGS" FT CDS 998..2809 FT /note="ORF E1 from bp 908 to 2809" FT /product="replication protein" FT /gene="E1" FT /note="putative" FT /codon_start=1 FT /translation="MADPKGSTSKEGLEDWCIVEAECSDVENDLEELFDRDTDSDISE FT LLDDNDLEQGNSRELFHQQESKESEEQLQKLKRKYLSPKAVAQLSPRLESITLSPQQK FT SKRRLFAEQDSGLECTLTNEEDVSSEVEVPALDSQPVAEAQLGTVDIHYKELLRASNN FT KAILMAKFKEFFGVGFNDLTRQFKSYKTCCNAWVLSVYAVHDDLLESSKQLLQQHCDY FT IWIRGIGAMSLFLLCFKVGKNRGTVHKLMTAMLNVHEKQIISEPPKLRNVAAALFWYK FT GAMGSGAFTYGPYPDWIAQQTIVGHQSTEASAFDMSAMVQWAFDNNYLDEADIAYQYA FT KLAPEDSNAVAWLAHNNQARYVREVASMVRFYKKGQMKEMSMSEWIHTRINEVEGEGH FT WSTIAKFLRYQQVNFIMFLAALKDMLHSVPKRNCILIYGPPNTGKSAFTMSLIHVLRG FT RVLSFVNSKSQFWLQPMSECKIALIDDVTDPCWIYMDTYLRNGLDGHVVSLDCKHKAP FT MQTKFPALLLTSNINVHNEVNYRYLHSRIKGFEFPNPFPMKADNTPEFELTDQSWKSF FT FTRLWNQLELSDQEDEGENGESQRSFQCSARSANEHL" FT CDS 2751..4262 FT /note="ORF E2 from bp 2727 to 4262" FT /product="regulatory protein" FT /gene="E2" FT /note="putative" FT /codon_start=1 FT /translation="MENLSDRFNVLQDQLMNIYESAANTIESQIEHWQTLRKEAVLLY FT FARQKGVTRLGYQYVPPLAVSESRAKQAIGMMLQLQSLQKSEYAKEPWSLVDTSAETF FT RSPPENHFKKGPVSVEVIYDNDKDNANAYTMWRYVYYVDDDDQWHKSPSGVNHTGIYF FT MQGTFRHYYVLFADDASRYSRTGHWEVNVNKETVFAPVTSSTPPDSPGGQADSNTSST FT TPATTTDSTSRLSSTRKQSQQTNTKGRRYGRRPSSRTRRTTQTHQRRRSRSKSRSRSR FT SRSRLRSRSRSRSRSYSRSRSQSSDQPQYRFRSGGQVSLITTATTTTTTATNYSTRGS FT GRGSSSTSSSTSKRPRRPRGGAIGGSSGRGRRSSSTSPSPSKRSRGKSESVRQRGISP FT DDVGKSLQSVSTRNTGRLGRLLDEALDPPVILVRGEPNTLKCFRNRAKLKYAGLYKAF FT STAWSWVAGDGTERLGRSRMLISFFSFEQRKDFDKTVKYPKGVDRSYGSFDSL" FT CDS <3037..4017 FT /note="ORF E4 from bp 3037 to 4017" FT /gene="E4" FT /note="putative" FT /codon_start=1 FT /translation="IPVQRHLEALLKIISKKGQCQLRLFMITIKTMLMLTPCGDMFIT FT WMMTTNGIKVQAVSTTQAYILCKELLDTTMFYLLMMQVDIAELDIGKLTLIRKLCLLL FT SPAPPHPTHQEDKQTQTPPPRPPPPPLTPRPDSRPPENSHNKPTPKEEGTDGDRPVGP FT GERPKRIKGGDRGPSPGRGRGRGRGSDPDPGPDPGPIPGPGLNRLTSRNTDSDPEGKC FT PSSLPPPPPPPPQPTTPPEGQGEGHPPPPPPPPNGHDGHEEGPLEGAVGGGDGHPPPP FT PAPPNGHEESQSLLGNVASLLTTWESLFNQLVQEIQVDLEDYWTKLSIPQ" FT CDS 4351..5913 FT /note="ORF L2 from bp 4273 to 5913" FT /product="minor capsid protein" FT /gene="L2" FT /note="putative" FT /codon_start=1 FT /translation="MARAKRVKRDSATNIYRTCKQAGTCPPDVINKVESTTIADKILQ FT YGSAGVFFGGLGISTGKGTGGTTGYVPLGEGPAVRVGNAPTVIRPALVPDTIGPSDII FT PVDTLNPVEPTTSSIVPLTDSTGPDLLPGEVETIAEIHPGPTRPPPDTAVTTSTNGSS FT AVLEVAPEPTPPSRVRVTRTQYHNPSFQVITESTPTTGESSLADHILVTSGTGGQTIG FT GSTPELIELQDFPSRYSFEIEEPTPPRRTSTPIQRIQNIIRRRGGGLTNRRLVQQVNV FT ENPLFVSRPSRLVQFQFDNPAFEEEVTQIFEQDIDTFNEPPDRDFLDIKTLGRPQYSE FT TPAGYVRVSRLGKRGTIRTRSGTQIGSQVHFYRDLSTINTEDPIELQLLGEHSGDATI FT VQGPVESTFIDINVDENPLSEDFSAHSDDLLLDEANEDFSGSQLVVGGRRSTSSYTVP FT RFETTRSGSYYVQDTKGYYVAYPEDRDTSTDIIYPTPDLPVVIIHTFDTSGDFYLHPS FT LSRKFKRRRKYL" FT CDS 5929..7485 FT /note="ORF L1 from bp 5914 to 7485" FT /product="major capsid protein" FT /gene="L1" FT /note="putative" FT /codon_start=1 FT /translation="MAVWQAASGKVYLPPSTPVARVQSTDEYVQRTNIYYHAYSDRLL FT TVGHPYFNVYDVNSAKIKVPKVSGNQHRVFRLKLPDPNRFALADMSVYNPDKERLVWA FT CRGIEIGRGQPLGVGSVGHPLFNKVGDTENPSSYKTQPNSTDDRQNVSFDPKQLQMFI FT IGCAPCLGEHWDKAIPCATDNPPPGSCPPIELINSAIQDGDMADIGYGNLNFKALQQN FT RSDVSLDIVNETCKYPDFLKMQNDVYGDSCFFYARREQCYARHFFVRGGKTGDDIPAG FT QIDEGSMKNAYYIPPMNDQAQYKIGNSMYFPTVSGSLVSSDAQLFNRPFWLQRAQGHN FT NGICWFNQLFVTVVDNTRNTNFSISVNPENADVSKIENYKAESFQEYLRHVEEYELSL FT ILQLCKVPLTAEVLAQINAMNANILEEWQLGFVPAPDNPIHDTYRYIDSAATRCPDKN FT PPKEREDPYKNMKFWDVDLTERLSLDLDQYSLGRKFLFQAGLQQTTVNGTKTLSSRVS FT TRGIKRKRKN" FT source 1..7779 FT /organism="Human papillomavirus type 21" XX SQ SEQUENCE 7779 bp; 2426 a; 1518 c; 1680 g; 2155 t; acggtaagtt atgcaccggg tgcggtcgaa ttattactca ttcgatagtt gttgttgcca 60 gctaccattt aggacagcat gtttttgcct gtaacgttat cgacacatac tcacaccata 120 tatatatata tatatatata tatatatata tatatatata tattcatata tacatactag 180 ggaagatgcc ctagtactca tggctgactc ttcaacagac agtgctgacg aaggtccttc 240 tcctaagcgt agacatttag aagaagaaaa tacatctagc tttttagagc caccattacc 300 agctacaatt cgtgacctag ccaatctgtt agagatacca ttggatgatt gtttagtacc 360 ttgtaacttt tgcggtaatt ttcttactca tttagaagtt tgtgagtttg atgagaaaaa 420 gcttagttta ctttggaaag atcattgtgt gtttgcctgt tgtcgtgttt gttgcgcagc 480 aacagcgaca tatgaatata atgaatttta tgaatctact gttgtaggta gagatataga 540 agaaataaca ggcaaatcta tttttgatat tgatgtcagg tgctacaatt gcatgaaatt 600 tttagactca atagaaaagc tagacatttg tggtaggaag tttttttttc ataaagtgag 660 aggctcttgg aaaggaatct gtaggctgtg taagcatttt caataatgat tggtaaagag 720 gtcacattgc aagatattgt tctggagtta aatgaattgc agcctgaggt acaaccagtt 780 gacctgtttt gtgaagagga gttaccgagc gagcagcagg aaacagagga ggagctacca 840 gaaaggaccg cgtacaaagt tgttacacct tgcggctgct gcaaggtcaa gcttcgcatc 900 tttgtaaacg ctacacaatt tgctattaga acatttcaga atctgctgtt tgaagaattg 960 cagctgttgt gtcctgagtg ccgcggaaac tgcaaacatg gcggatccta aaggtagtac 1020 atctaaagaa gggttggagg attggtgtat tgtggaagct gaatgtagcg atgtagaaaa 1080 tgatttggaa gaattatttg acagagatac agactcagat atttcagaat tgttagatga 1140 taatgacctg gagcagggaa attcgcggga actatttcac cagcaagaga gtaaggaaag 1200 cgaggagcaa ttacaaaaac taaaacgaaa gtacttaagt cctaaagctg tcgcacagct 1260 cagtccgcga ctcgaaagta taacgctgtc acctcagcag aagtctaaac gaaggctctt 1320 tgcagagcag gacagcgggc tcgagtgtac tcttacaaat gaagaagatg tttcttctga 1380 ggtggaggta ccggctctag actctcagcc ggttgctgag gcacaattag gaacagtaga 1440 cattcattat aaagagttat tacgtgccag caacaataag gcgattctta tggcaaaatt 1500 taaagagttt tttggggtag gatttaatga tctgacacgc caatttaaaa gttacaaaac 1560 ctgttgtaat gcttgggttc tgtctgtata tgcagttcat gatgatcttc ttgaaagctc 1620 aaagcagtta ttgcaacagc attgtgatta tatatggata cgtgggatag gagcaatgtc 1680 attgtttttg ttatgtttta aagttggaaa aaatcgtggg actgtgcata agttgatgac 1740 tgcaatgtta aatgtgcatg aaaagcagat catatctgag ccaccaaaat taagaaatgt 1800 tgctgctgca ttgttttggt ataagggtgc gatggggtct ggagcattta cttatggacc 1860 ttatcctgat tggattgccc agcaaacaat tgttggtcat caaagtacag aagccagtgc 1920 atttgatatg tctgcaatgg ttcaatgggc gtttgataat aactatttag atgaagctga 1980 tatagcctat caatatgcta agctagcacc agaagatagt aatgctgtag catggcttgc 2040 acataataat caggccagat atgttagaga agttgcatct atggtaagat tttataaaaa 2100 aggacaaatg aaagaaatgt ctatgtcaga gtggatacat actagaatta atgaagtaga 2160 aggagaagga cattggtcaa ctatagcaaa gttccttaga tatcagcaag taaattttat 2220 aatgtttcta gcagcattaa aagacatgct acattcagtt cctaaacgta attgtatatt 2280 gatttatggt ccccctaaca ctggaaagtc agcatttact atgtctttaa ttcatgtact 2340 aagagggagg gtgctatcat ttgtgaattc caaaagccag ttttggctgc agccaatgtc 2400 agaatgtaaa atagcattaa ttgatgatgt gacagatcca tgctggatat atatggatac 2460 ttatttaaga aatggcctag atggtcatgt tgtatcatta gactgcaaac ataaagcacc 2520 gatgcaaacc aaatttcctg cattactact tacatctaat atcaatgtgc ataatgaagt 2580 taattataga tatttgcata gcaggattaa aggctttgaa ttcccaaatc catttcccat 2640 gaaagcagac aatacccctg aatttgagct tactgaccaa agctggaaat ctttttttac 2700 aaggctttgg aatcaattag agctgagtga ccaagaagac gagggagaaa atggagaatc 2760 tcagcgatcg tttcaatgtt ctgcaagatc agctaatgaa catttatgag tctgcagcaa 2820 acactattga gtcgcaaatt gagcattggc aaacactgcg aaaagaagct gtgctgcttt 2880 attttgctag gcaaaagggt gtgacacggc ttggatatca atatgtacct ccattagcag 2940 tttcagaatc aagagctaaa caggctatag ggatgatgct gcagttgcaa tcattgcaaa 3000 aatctgaata tgcaaaggaa ccatggtcac tggtagatac cagtgcagag acatttagaa 3060 gccctcctga aaatcatttc aaaaaagggc cagtgtcagt tgaggttatt tatgataacg 3120 ataaagacaa tgctaatgct tacaccatgt ggagatatgt ttattacgtg gatgatgacg 3180 accaatggca taaaagtcca agcggtgtca accacacagg catatatttt atgcaaggaa 3240 cttttagaca ctactatgtt ttatttgctg atgatgcaag tagatatagc agaactggac 3300 attgggaagt taacgttaat aaggaaactg tgtttgctcc tgtcaccagc tccaccccac 3360 ccgactcacc aggaggacaa gcagactcaa acacctcctc cacgaccccc gccaccacca 3420 ctgactccac gtccagactc tcgtccacca gaaaacagtc acaacaaacc aacaccaaag 3480 gaagaaggta cggacggaga ccgtccagta ggacccggcg aacgacccaa acgcatcaaa 3540 ggcggcgatc gaggtccaag tccaggtcgc ggtcgcggtc gcggtcgcgg ctccgatccc 3600 gatcccggtc ccgatcccgg tcctattccc ggtcccggtc tcaatcgtct gaccagccgc 3660 aataccgatt cagatccgga gggcaagtgt ccctcatcac taccgccacc accaccacca 3720 ccaccgcaac caactactcc accagagggt cagggcgagg gtcatcctcc acctcctcct 3780 ccacctccaa acggccacga cggccacgag gaggggccat tggagggagc agtgggaggg 3840 ggagacggtc atcctccacc tcccccagcc cctccaaacg gtcacgagga aagtcagagt 3900 ctgttaggca acgtggcatc tctcctgacg acgtgggaaa gtctcttcaa tcagttagta 3960 caagaaatac aggtcgactt ggaagattac tggacgaagc tctcgatccc ccagtaatct 4020 tagtcagggg ggaacccaat acgctaaaat gctttcgcaa tagagccaag cttaaatacg 4080 cagggttgta taaggctttc agtacggcct ggtcgtgggt ggctggagat ggtactgagc 4140 gtctaggcag gtccagaatg ctcattagct tcttctcctt tgagcaaaga aaagattttg 4200 ataagactgt taaatatccg aaaggtgttg accggtcgta tggttccttt gatagcctat 4260 agcagccttt aacatactaa ctatagctct gctactaaca tattaacact ttttgattat 4320 atattttttt tttattttta tttttatgct atggcgcgtg ctaagcgagt caagcgagac 4380 tctgctacta atatttacag aacctgcaaa caagcaggca catgtccccc tgatgttatt 4440 aataaagttg aaagcacaac tattgctgat aaaatattgc agtatggtag tgctggtgtt 4500 tttttcgggg ggctgggcat aagcactgga aaaggtacag gcggtaccac aggttatgtg 4560 cctttgggag aaggtcctgc agtccgtgtt ggcaatgctc ctacggtcat tagacctgca 4620 ttggtccctg acaccattgg cccgtctgat attattcctg tggacacctt aaatccagtg 4680 gagcccacaa cttcctctat tgttccactc acagactcta caggcccaga tctgttacct 4740 ggagaagtgg aaactattgc agaaatacat cctggtccga ccaggcctcc acctgacact 4800 gcagtcacta ctagtacaaa tggttctagt gctgttttag aagtagcacc agagcctacc 4860 cctccttctc gtgttagagt aaccagaaca caatatcata atccatcttt tcaagtaata 4920 actgaatcaa ctcctactac aggcgaaagt tctttagcag atcatatatt agtaacatca 4980 gggactgggg gacaaactat agggggcagt acacctgaac tcatagaact ccaggacttt 5040 ccttctagat attcatttga aattgaggag ccaacacctc ctagaagaac tagtacaccc 5100 attcaaagaa ttcaaaatat tataaggaga aggggtggcg ggctcacaaa taggcgtttg 5160 gttcaacagg ttaatgtaga gaatcctttg tttgtatcca ggccttctag attagtgcag 5220 tttcaatttg ataaccctgc atttgaagaa gaagtgacac aaatatttga gcaagatatt 5280 gatactttca atgaaccacc agatagagac tttttagata ttaaaacact tggtaggcct 5340 caatactcag aaacccctgc aggttacgtg agagttagtc gtcttggtaa acgaggaact 5400 attcgtactc gttcaggaac acaaattggt tctcaggtcc atttttacag ggaccttagc 5460 accattaaca cagaggaccc tattgaactt caattattgg gtgagcattc tggcgatgct 5520 acaattgtcc agggtccagt tgaaagcaca tttattgata ttaacgttga tgaaaaccct 5580 ctttctgaag attttagtgc acattcagat gatttacttc tagatgaggc aaatgaagat 5640 tttagtggtt cccaattagt ggttggaggc cgccgctcca cttcttctta tactgttcca 5700 cgttttgaaa ctactagatc tggttcttat tacgtgcagg acaccaaggg ctattatgta 5760 gcctatcctg aagatcgaga cactagtaca gatataatct atccaacacc agatttgcca 5820 gttgtaatca tacacacatt tgatacaagc ggtgattttt acttacatcc gagtcttagc 5880 agaaaattta agagaagaag gaaatatttg taaccttttc ttttgcagat ggcagtttgg 5940 caagcagcta gtggtaaggt ttaccttcca ccgtctacac cagttgccag ggtccaaagc 6000 acggatgaat atgtacaaag aacaaacatc tactatcatg catatagtga tcgcttatta 6060 actgttggtc atccatattt taatgtctat gacgtcaata gtgctaagat aaaagtacct 6120 aaagtatctg ggaatcaaca cagggtattc agactcaaat tgccagatcc taatagattt 6180 gcacttgcag atatgtctgt atacaatcca gacaaggaaa gattagtttg ggcctgcaga 6240 ggtatagaaa taggaagagg gcaacccttg ggggtgggaa gtgtaggtca ccctttattt 6300 aataaagttg gggacacaga aaatcctagt tcatacaaaa ctcaaccaaa ttctactgat 6360 gatagacaaa atgtatcatt tgatcccaaa caactacaaa tgtttataat aggctgtgca 6420 ccttgcttag gagaacattg ggataaagct atcccatgtg caactgacaa tccacctcca 6480 ggatcgtgcc ctccgattga attaattaat tcagcaatac aagatggcga tatggcagat 6540 ataggatatg gcaatctaaa tttcaaagcc ttacaacaaa ataggtctga tgttagttta 6600 gacatagtta atgaaacgtg taagtatcca gacttcttaa aaatgcaaaa tgatgtgtat 6660 ggagattcat gtttctttta tgcacgcaga gagcaatgtt atgccagaca cttctttgtt 6720 agagggggca aaacaggaga tgacataccc gcaggacaaa ttgatgaggg tagtatgaag 6780 aatgcatatt acattccacc aatgaatgat caagcacagt acaagattgg taactccatg 6840 tatttcccaa ctgtcagtgg ctcattggtg tctagtgacg ctcaattgtt taacaggcca 6900 ttttggctac agcgtgcaca aggccataat aatggcatat gttggtttaa tcaattattt 6960 gttacagtag tagacaacac tcgtaacaca aactttagta tttcagtaaa tcctgagaat 7020 gcagacgtgt ctaaaattga aaattataaa gccgagagct ttcaagaata tttaagacac 7080 gttgaagaat atgaactttc tttaatttta caattatgta aagttccttt aacagcagaa 7140 gtcttagctc aaattaatgc aatgaatgca aatattttag aagaatggca gttaggattt 7200 gttcctgccc cagacaatcc tattcatgat acatatagat acattgactc tgcagctact 7260 agatgtcctg ataaaaaccc tccaaaagaa cgagaagatc cttataaaaa tatgaaattt 7320 tgggatgtag atttaacaga acggttgtct ctagacttag atcaatattc tcttggaaga 7380 aaatttttat ttcaagcagg tttgcagcag acgaccgtta acggtacaaa gacactttct 7440 tcaagggtat ctaccagagg aattaaacga aaacgcaaaa attagacatg accgttttcg 7500 gtacaataaa gtcaactttt acacagtatt caaggaatgt ttatttactc tgactaagca 7560 aaataccaac cgcgcccgac acataaaggt gagttgtgag ccaaatgagg tgagttgtaa 7620 gccaaaagag gtcagagcca agtctgttct gagccagatc agatactacg cgcgccagag 7680 ttggatcaca tctcgttgtt ctaacacgct aaggactcaa ggaaatgtaa gtctgccaat 7740 cgattttggc tcgtgttttg gcagaagtta ggaccgtta 7779