ID HPV29 STANDARD; ds-DNA; VRL; 7916 bp. XX DE Human papillomavirus type 29 (HPV29), complete genome. XX AC U31784 XX DT 04-JUL-1995 XX OS Human papillomavirus type 29 DNA. OC Human papillomavirus type 29;Viridae; ds-DNA nonenveloped viruses; OC Papovaviridae;Papillomavirus. XX RN [1] RP 1 - 7916 RA Delius,H; RT "Direct Submission;" RL Unpublished. XX XX Created by HIV database on 1-NOV-1995 from GenBank: U31784. XX XX XX FT KEY Location/Qualifiers FT CDS 102..548 FT /note="ORF E6 from bp 6 to 548" FT /product="transforming protein" FT /gene="E6" FT /note="putative" FT /codon_start=1 FT /translation="MSRGDGYPKNIFLLCRDSGVPFEDLRLQCVFCTKELTSPELAAF FT CIRELNVVWKSGAPYGACARCLLFEGIKRRLKYWQYSCFVEGVEAETNESIYTQLIRC FT YMCHKPLVREEKDKHRNEKRRLHKISGYWRGSCLYCWSRCMGQSPR" FT CDS 524..796 FT /note="ORF E7 from bp 308 to 796" FT /product="transforming protein" FT /gene="E7" FT /note="putative" FT /codon_start=1 FT /translation="MHGPKPTVKDIELDLAPEAVPLVCNEQLDSSDEEDCIDVVEPAQ FT QAYRVVTLCTKCSTTLRLVVESSEADIRAFQELLLRTLKIVCPRCA" FT CDS 803..2785 FT /note="ORF E1 from bp 797 to 2785" FT /product="replication protein" FT /gene="E1" FT /note="putative" FT /codon_start=1 FT /translation="MADNSGTEGEEEDCSEAERAGGWFMVEAIVDRRTGDTISSDEDE FT EDEGEDMVDFIDDRPIGDGQEVAQELLLQQAAADDDEAVHTVKRKFAPSPYFSPVCVP FT SIEHELSPRLDAIKLGRQSSKAKRRLFQLPDSGYGQTQVDTDTGPSQVQDGCETGDQN FT GRQQYKEGSGTKDGENGSQEEERAGGDGEESQPLSTETEKGACGVLSILKASNQKATL FT LGKFKEQFGLGYNELVRHFKSSRTACVDWVVCVFGVYCTVAEGIKQLIQPLCEYAHIQ FT VLPCQWGMTVLMLVRYKRAKNRETVAKGLSTLLNVPESHMLIEPPKLRSSPAALYWYK FT TSMSNISDVYGETPEWIVRQTMVGHALQEVQFSLSEMVQWAYDHDITDEGTLAYEYAL FT IADVDSNAAAFLASNCQAKYVKDACTMCRHYKRGEQARMSMSEWIRFRSNKVQGEGDW FT KPIVHFLRYQNVEFIPFLCAFKLFLQGIPKKSCLVFYGPADTGKSYFCMSLLKFMGGV FT VISYANSHSHFWLQPLSEAKMGLLDDATSQCWSYVDTYLRNALDGNVMCIDRKHRSLL FT QLKCPPLLITTNVNPLEDDRWKYLRSRLQVFTFSNPCPLTSKGEPVYTLNDQNWKSFF FT QRLWARLSLTDPDDEEENGEPSEPFRCVPGQNTRTV" FT CDS 2727..3893 FT /note="ORF E2 from bp 2697 to 3893" FT /product="regulatory protein" FT /gene="E2" FT /note="putative" FT /codon_start=1 FT /translation="MENLANRLDACQDKILELYERDSDKLEDQITHWYLMRVESALYY FT KARECGMTRIGHQVVPTLSVAKAKACSAIEMHVALQQLQQSAYGKEPWTLRDTSREMW FT DAVPKRCWKKRGVTVEVRYDGDETKAMCHVLWKDIIVQNLSDDQWVKVKGQVSYEGLY FT YVHEDVKVFYVKFHKDARVYGETGIWEVHVGGKVIHHNAFDPVSSTQEVPATGPLYAS FT HNTTRSPTQAPLGPEEGQERKRRRLEAVGPGPQQQQQQQHQQQQQQQTPTHTPSTQAC FT ARTGGPVDSNRTRDCDSTSQNPYRHPSDPDCAPVIHLRGDPNSLKCFRYRLQNGKKGL FT YCKASSTWRWSCEPENQSAFVTIWYTSVTQRAEFLANVKIPPGMQAILGHMSVF" FT CDS <3310..3657 FT /note="ORF E4 from bp 3310 to 3657" FT /gene="E4" FT /note="putative" FT /codon_start=1 FT /translation="FITMHLTLYLAHKKYPLLDLYTPPTTPPARPPKPRWGLRRDRNG FT NDAGLKQSGLGHSSSSSSSTSSSSSNRPRPTPPPRKPVHERVDQWTVTGPGTVTLQVK FT TPTGTQVILTVHL" FT CDS 4411..5832 FT /note="ORF L2 from bp 4348 to 5832" FT /product="minor capsid protein" FT /gene="L2" FT /note="putative" FT /codon_start=1 FT /translation="MVAHRARRRKRASATELYKTCKVAGTCPPDVIPKVEGTTLADRI FT LQWGSLGVYLGGLGIGTGSGTGGRTGYVPVGTRPGTVVDVSIPTRPPVVIEPVGPSDP FT SIVTLLEESSVINSGATIPTFTGTSGFEITSSATTTPAVLDITPAGDNVVITSTNFNN FT PLFTEPSLLEIPQTGETSGRVLVGTPTSGVHGYEEIPMDTFATSGTGLEPISSTPVPG FT VSRVAGPRLYGKALTQVRVSDPAFLTQPSSFVTFDNPVYDPEDETIIFERPSPGTRVP FT DPDFMDIVKLHRPALTSRRGTVRFSRVGQKFSMRTRSGTNIGARVHYYHDLSPILPTE FT DIELEPLLPPADPTAEESLYDIYADVDEADMAFTGGGRGATTYGGRITPSVFSSTLST FT RYGNVTIPFVSPVDVPLHTGPDIILPSSAQWPFVPVAPADTTHYVYIDGGDYFLWPVT FT FPVSRKRRRKRLSYFLADGFVAL" FT CDS 5813..7324 FT /note="ORF L1 from bp 5696 to 7324" FT /product="major capsid protein" FT /gene="L1" FT /note="putative" FT /codon_start=1 FT /translation="MALWRSSDNLVYLPPTPVSKVISTDDYVTRTNIYYYAGSSRLLT FT VGHPHYSIPKSSGNKVDVPKVSAFQYRVFRVRLPDPNKFGLPDARIYNPEAERLVWAC FT TGVEVGRGQPLGVGLSGHPLYNKLNDTENSNIAHAENGQDSRDNIAVDYKQTQLCILG FT CTPPMGEHWGKGTVCARTSSAAGDCPPLELMTTHIEDGDMVDTGYGAMDFAALQVNKS FT DVPLDICQSTCKYPDYLGMAADPYGDSMFFFLRREQLFARHFFNRAGVVGDKIPDSLY FT LKGNNGRETPGSAIYSPTPSGSMVTSEAQIFNKPYWLQQAQGHNNGICWANQVFLTVV FT DTTRSTNMSLCATTESQPLTTYDATKIKEYLRHGEEYDLQFIFQLCKVTLTPEIMAYL FT HTMNSALLEDWNFGLTLPPSTSLEDTYRFVTSSAITCQKDLAPTEKQDPYAKLNFWDV FT DLKDRFTLDLSQFPLGRKFLLQIGARRRSVVPSRKRRTTTTAPTPAKRKRSKK" FT source 1..7916 FT /organism="Human papillomavirus type 29" XX SQ SEQUENCE 7916 bp; 2177 a; 1665 c; 1955 g; 2119 t; tataaactat catcttcata ataaaaagta gggagggacc gaaaacggta cgaccgaatg 60 gggtacatat aaaaagacat cactgcagcg tggcagaagc catgtccaga ggtgatggct 120 atccaaaaaa tatattcctg ttgtgcagag acagtggagt accatttgag gaccttcgcc 180 tacagtgtgt tttctgcacg aaagagctaa ccagcccaga actggcagca ttttgcattc 240 gggaattaaa tgtggtgtgg aaaagtggag ctccgtacgg tgcatgtgca cgctgcttat 300 tgtttgaagg cataaagcgg cgcctaaaat actggcagta ttcttgtttt gtggaaggcg 360 tggaagcgga gacaaacgag tccatatata cacagctaat tcgctgctac atgtgccaca 420 agccacttgt cagagaggaa aaagacaaac accgaaacga aaagcgaaga ctacacaaaa 480 tttctggata ctggagaggg agttgcctgt attgttggtc acgatgcatg ggccaaagcc 540 cacggtaaaa gatattgaat tggatcttgc accagaggcc gtacctttag tatgcaatga 600 gcaattagac agctcagatg aagaagattg tatagatgtt gtggaaccag cacaacaggc 660 gtatagggtg gtaactttgt gtacaaagtg tagtacaaca ctgcgactgg tggtagagag 720 cagcgaagca gatataaggg cattccagga gctcctacta cgcacattga agatcgtgtg 780 tcctcgctgt gcgtaactgg acatggccga taactcaggt acagaggggg aggaggagga 840 ctgttctgag gcggaacggg ctggaggatg gttcatggta gaggctatag tagacagacg 900 gacaggggac acaatatcca gtgacgagga tgaggaggat gagggtgaag acatggtaga 960 ctttatagat gatagaccta taggggacgg acaggaagta gcacaggaac tgttgctgca 1020 gcaagcagct gcggatgacg atgaagcagt gcacactgta aaacgaaagt ttgctcccag 1080 tccctatttc agccctgtgt gtgtgcccag catagaacat gagctaagtc ccaggctaga 1140 cgccataaag ctgggacggc agtcctctaa agccaaacgg aggctattcc aactaccgga 1200 cagtgggtat ggccaaacac aggtggatac ggacacggga ccaagccagg tacaagatgg 1260 ttgcgagacg ggtgatcaaa atggccgaca gcagtataag gaggggagtg gtacaaagga 1320 tggggaaaat ggcagccaag aggaggagcg tgcaggaggg gatggggagg aatcgcaacc 1380 tctgagtaca gaaacagaga aaggagcatg tggtgtgttg tctatactga aagctagtaa 1440 tcagaaagca accctactag gtaagtttaa agaacaattt ggacttggat ataatgaatt 1500 ggttaggcat tttaaaagta gtaggacagc atgtgtggat tgggtagtgt gtgtgtttgg 1560 ggtgtactgc actgtggccg agggcataaa acagttgata cagccactat gtgagtatgc 1620 acatatacaa gtgctaccct gtcaatgggg aatgacagtg ttaatgctgg tgcggtacaa 1680 acgtgccaag aatagggaga cagtagcaaa aggtcttagt actttattaa atgtaccaga 1740 aagccatatg ttaattgagc cacctaaact aagaagtagt ccagcagcat tgtattggta 1800 caaaactagt atgtccaata ttagtgatgt gtatggcgag acacctgaat ggatagtaag 1860 acagacaatg gtaggtcacg cattacaaga agtacagttc agtttatctg aaatggtaca 1920 atgggcatat gatcatgata taacagatga aggtaccttg gcatacgagt atgcattgat 1980 agcagatgta gactctaatg ctgcagcttt tcttgccagc aattgtcaag ctaaatatgt 2040 aaaggatgct tgcacaatgt gcagacatta caaacggggt gagcaggcac gaatgtccat 2100 gtctgagtgg atacggttta gaagcaacaa agtacaggga gagggggact ggaaaccaat 2160 agtacacttt ttaagatacc aaaatgtaga atttatacca tttctgtgtg cctttaagtt 2220 attcctacaa ggcataccca agaaaagctg tttagtgttt tatggacctg cagacacagg 2280 gaagtcatat ttttgcatga gtctgctaaa atttatgggc ggtgttgtaa tttcatatgc 2340 gaattcacac agccattttt ggctgcagcc attgtctgaa gctaaaatgg gtctgctaga 2400 cgatgcaaca agccaatgtt ggagttatgt agacacatat ttaagaaatg cattggatgg 2460 gaacgtaatg tgcatagata gaaaacacag gtccctacta caactcaaat gccctccact 2520 actaataact accaatgtga atccgttgga ggatgacaga tggaagtatt tgcgcagcag 2580 actgcaggta ttcacattca gcaatccatg tccattaaca agtaaaggag agccagttta 2640 tacactaaat gatcaaaatt ggaaatcatt ttttcaaagg ttatgggcac gtttaagcct 2700 taccgaccct gacgacgagg aggaaaatgg agaacctagc gaaccgttta gatgcgtgcc 2760 aggacaaaat actagaactg tatgaaagag atagcgacaa acttgaggac cagatcacgc 2820 attggtatct tatgcgtgta gagagtgcgt tgtattataa agcaagagaa tgtggaatga 2880 cacgtatagg ccaccaggtg gtgccaacac ttagtgtagc taaagctaaa gcatgcagtg 2940 ctattgaaat gcatgtagct ttacaacaat tgcaacaaag tgcatatgga aaggaaccat 3000 ggacacttcg ggacacttca cgagaaatgt gggacgcagt accaaagagg tgctggaaaa 3060 aaagaggagt gactgtggaa gttagatatg atggagacga gactaaagca atgtgccatg 3120 tactgtggaa ggacataatt gtacaaaacc ttagtgatga ccagtgggtt aaagttaaag 3180 gtcaagtctc atatgagggg ctatattatg tgcacgaaga cgtaaaagtg ttttacgtga 3240 aattccataa agacgcacgt gtgtatgggg aaacaggcat atgggaggtg catgtgggag 3300 gcaaagtaat tcatcacaat gcatttgacc ctgtatctag cacacaagaa gtacccgcta 3360 ctggacctct atacgcctcc cacaacacca cccgctcgcc cacccaagcc ccgttggggc 3420 ctgaggaggg acaggaacgg aaacgacgca ggcttgaagc agtcgggcct gggccacagc 3480 agcagcagca gcagcagcac cagcagcagc agcagcaaca gaccccgacc cacaccccct 3540 ccacgcaagc ctgtgcacga acgggtggac cagtggacag taacaggacc cgggactgtg 3600 actctacaag tcaaaacccc taccggcacc caagtgatcc tgactgtgca cctgtaatac 3660 acttacgagg tgacccaaac agtttaaaat gttttagata taggttacaa aacggaaaaa 3720 aagggttgta ctgtaaagca tcgtccacgt ggcggtggtc ctgtgaacca gaaaatcaat 3780 cagcatttgt aacaatatgg tacacaagtg ttacacagcg agccgaattt ttggctaatg 3840 ttaaaatacc accaggtatg caggccattt taggccatat gtctgtgttt tgactactgt 3900 gccacaacgt gtagcagcct ggatttttat ctgtgtcgac tgtctctgtg ggtgtatttt 3960 gtgttgcttc tgtgtctttt ctggctatct gtgcttcctg cgcttacttg ctacttggcc 4020 attgtgttgt gtttatacct aggattggtg gcactatatt tacaagttgt gcagcacatt 4080 gcacgaaaca cttaggctat catgtatcct ataataatta tagatgggta tggggatcgt 4140 actgtattgc tgtttgagcc aagggacgtg tatgtgttgg gattgttaat actaatggta 4200 tgcctgttgt tatttatagt ttatagacat ttgggattat tataacctgt atataacctg 4260 tatttgtaca tatacatgta ttttatatgt ggctgtggta atacgtgtat tgtatacatg 4320 gccatacaat tgtgcatgtg tttttaaagt tctaccactt tttttgtttt ttttgttgtt 4380 cctttgtttt tacagttcaa taaagcaacc atggtggcac atcgtgcaag gcgtcgcaag 4440 cgtgcatccg ccacagagct ttataaaacc tgcaaagttg caggcacatg cccccctgat 4500 gttattccaa aagttgaggg caccacactg gccgacagga tattgcaatg gggcagtcta 4560 ggtgtctatt tgggtgggtt aggtatcggt actgggtctg gcactggagg tcgcacaggt 4620 tatgtccctg tcggcactcg gccaggcact gttgtggatg ttagtattcc tacgcggcct 4680 cctgtggtta ttgagcctgt gggcccttct gatccttcta ttgttaccct gttagaagaa 4740 tccagtgtaa ttaattcggg tgctaccata cccaccttta ctggtacatc cgggtttgag 4800 ataacatcat ctgccacaac taccccggct gtgttagata taacccctgc tggtgacaat 4860 gtagtcatta ctagcacaaa ctttaataat cctttattca ccgagccttc actcttggaa 4920 attccacaaa ctggagaaac ttctggacgt gtcctggtgg gcacacccac ctcgggtgtc 4980 cacgggtatg aagaaatacc catggacacg tttgccacct ctggaactgg gttagagcct 5040 attagcagca ctcccgtccc tggtgtcagc agggttgcag gtccccgcct ctatggcaag 5100 gccctaacac aggttagggt gtctgatcct gcgtttttga ctcagccttc ttcgtttgta 5160 acctttgata atcctgtgta tgatcctgag gatgaaacta ttatttttga gcgtccttct 5220 cccggcactc gtgtgcctga tcccgatttt atggatattg ttaagctgca taggcccgca 5280 ttaacatctc gcaggggcac ggtgcgcttc agtcgcgttg gtcagaagtt tagcatgcgc 5340 actcgcagtg gcacaaacat aggtgccagg gttcactatt atcatgacct gagtcccata 5400 cttcccaccg aggacataga gttggaacca ctgctccccc ctgcagatcc cactgctgag 5460 gagtctctgt atgatatata tgctgatgtg gacgaggctg acatggcttt tacaggcggt 5520 ggtcgcggcg ccaccactta cgggggtcgc attactccat ctgtattttc ctccacactg 5580 tctacgaggt atggcaatgt cactattcca ttcgtgtcgc cagttgatgt gcctttacac 5640 acgggccctg atattattct gccctcctct gcacaatggc cttttgttcc tgtagcaccc 5700 gcagacacga cacattatgt gtacattgat ggaggggatt attttttgtg gcctgttacc 5760 tttcctgtgt cccgaaaacg tcgccgtaaa cgtctttcat attttcttgc agatggcttt 5820 gtggcgctct agtgacaacc tggtgtacct gcctcccacc ccagtctcaa aagttatcag 5880 cacggacgac tatgtgacac gcacaaatat ttattattat gcaggcagtt ctcgcctgct 5940 cactgtgggt catccacatt attcaattcc caaatcctct ggtaataagg tagatgtgcc 6000 taaggtgtct gcatttcagt acagggtttt ccgtgtgcgt ttgcctgacc ctaataagtt 6060 tggtttgccc gatgcccgca tatataaccc tgaggcagaa cgtttggtgt gggcctgcac 6120 tggtgtggag gtaggtcgag ggcaacctct cggtgtcggg ttgagtggac accctctgta 6180 taacaaactg aatgacacag aaaactctaa tattgcacat gctgaaaatg gtcaggattc 6240 cagggacaac attgctgttg actataagca aacacaactg tgcattctgg gctgtacgcc 6300 tcccatgggc gaacactggg gtaagggcac tgtgtgtgca cgcactagtt ccgctgctgg 6360 tgattgcccc cccctggagt taatgaccac acatattgag gatggcgata tggtggatac 6420 cgggtacggt gccatggact ttgctgctct gcaagttaat aagtctgatg tgccccttga 6480 tatttgccag tctacgtgta aatatcctga ctacttaggc atggctgctg acccctatgg 6540 cgacagcatg tttttttttc tgcgtaggga acaactgttt gccaggcact tctttaatcg 6600 tgctggtgta gtaggggaca aaatcccaga ttccttgtac ttaaagggta acaacgggcg 6660 agaaactcct ggcagtgcca tatacagtcc cacacctagt gggtccatgg taacgtctga 6720 ggctcaaata tttaataagc cttactggct acagcaggcc cagggacaca acaatggtat 6780 atgctgggcc aatcaggtat ttttaactgt ggtggacacc acacgcagca ccaatatgtc 6840 gttgtgtgct accacagagt ctcaaccgtt gaccacttat gatgctacca agattaaaga 6900 atatttgaga catggggagg aatatgattt gcagtttatt ttccagttgt gtaaagttac 6960 attgacacct gaaattatgg cttaccttca tactatgaac agtgccttac ttgaagactg 7020 gaattttgga ttgacattgc caccttccac tagcttggaa gacacgtata ggtttgtaac 7080 atcctctgcc ataacttgtc aaaaagattt ggcccctaca gaaaagcagg atccgtatgc 7140 aaagctaaat ttctgggatg tagatttaaa ggatagattt accctggatt tgtcacagtt 7200 tcccctggga cgtaaatttt tattacagat cggtgcgcgc cggcgttcag tagtcccctc 7260 cagaaagcgc cgaacgacca ccacggcccc cacccctgca aagcgaaaac gctcgaaaaa 7320 gtaaccccag tgttgttgtg tgctgtatgt tgtgtaatgt aatgtgtgta tgtatttatt 7380 accatatgtg tttgtatgtc tgtatgtctg tacaatgtat gtatgtatac ttcaactatg 7440 tatgtgtgga tgtataaata aagtatgtca catagtttta tattttatac atataattgt 7500 ttgctgagta agaagttaag gtataggtca ggggaccgat ttcggtctaa aatggccgcc 7560 ggtgcaggtg tgcacaccac taattactca tattattcaa tttcctgcga catgccgtct 7620 cacgcacagt tttggcagca attttttgct ttccactgtt tattttactg ctgtatcatt 7680 cttcttggca agtttgcaca tatacattgc aaattcgctg cttctgggca ccaacttatt 7740 atgactactt tcacataatt actgtcttgg cccagttttc taagttatct tgccaataaa 7800 acgtgtttgc aaatctccac cttaacaatg tgtttccatg acacacctaa tccggtcgct 7860 gcttgctttc taaccttaat taatgcagct gccacacctg tctttctaac tataat 7916