ID HPV66 STANDARD; ds-DNA; VRL; 7824 bp. XX DE Human papillomavirus type 66 (HPV66), complete genome. XX AC U31794 XX DT 04-JUL-1995 XX OS Human papillomavirus type 66 DNA. OC Human papillomavirus type 66;Viridae; ds-DNA nonenveloped viruses; OC Papovaviridae;Papillomavirus. XX RN [1] RP 1 - 7824 RA Delius,H; RT "Direct Submission;" RL Unpublished. XX XX Created by HIV database on 1-NOV-1995 from GenBank: U31794. XX XX XX FT KEY Location/Qualifiers FT CDS 102..569 FT /note="ORF E6 from bp 96 to 569" FT /product="transforming protein" FT /gene="E6" FT /note="putative" FT /codon_start=1 FT /translation="MDSIFSNTQERPRSLHHLSEVLQIPLLDLRLSCVYCKKELTSLE FT LYRFACIELKLVYRNNWPYAVCRVCLLFYSKVRKYRYYKYSVYGATLESITKKQLSDL FT SIRCYRCQCPLTPEEKQLHCEHKRRFHYIAYAWTGSCLQCWRHTSRQATESTV" FT CDS 572..889 FT /note="ORF E7 from bp 545 to 889" FT /product="transforming protein" FT /gene="E7" FT /note="putative" FT /codon_start=1 FT /translation="MHGKVPTLQEVILELAPQTEIDLQCNEQLDSSEDEDEDEIDHLL FT ERPQQARQAEQHKCYLIHVPCCKCELVVQLDIQSTKEELRVVQQLLMGALTVTCPLCA FT SSK" FT CDS 895..2787 FT /note="ORF E1 from bp 886 to 2787" FT /product="replication protein" FT /gene="E1" FT /note="putative" FT /codon_start=1 FT /translation="MASPEGTDGEGMGCCGWFQVEAIVERKTGDTISDDESEEENETD FT TDVDGFIDNTLINNTQEDRETAQQLLQVQTAHADAQTLQKLKRKYIGSPLSDISNQQT FT VYREEVKRRLILSEDSGYGNTLETLETSQQVEYEKGNGCGSSQNGGSQNSNCSEHSVS FT NMDIDTNMETPTHQLQELFKSSNVQGRLHFKFKEVYGVPYTELVRTFKSDSTCCNDWI FT CAIFGVNETLAEALKTILKPQCVYYHMQCLTCSWGVIVMMLIRYICGKNRKTITKSLS FT SILNVPQEQMLIQPPKLRSPAVALYFYKTAMSNISEVYGETPEWIQRQTQLQHSLQDN FT QFELSKMVQWAFDNEVTDDSQIAFLYAQLADIDSNAQAFLKSNMQAKYVKDCGIMCRH FT YKRAQQQQMNMCQWIKHICSKVDEGGDWKPIVQFLRYQGVDFISFLSYFKLFLQGTPK FT HNCLVLCGPPNTGKSCFAMSLINFFQGSVISFVNSQSHFWLQPLDNAKLGLLDDATDT FT CWRYIDDYLRNLLDGNPISLDRKHKQLVQIKCPPVIITTNVNPMQDAKLRYLHSRISV FT FKFENPFPLDNNGNPVYELSNVNWKCFFERTWSRLNLDNDEDKENNGDSIPTFRCVPE FT QNTRLL" FT CDS 2729..3838 FT /note="ORF E2 from bp 2699 to 3838" FT /product="regulatory protein" FT /gene="E2" FT /note="putative" FT /codon_start=1 FT /translation="METLSQRLDACQNKILDCYEKDSKCIIDHIDYWKAVRHEYVLYY FT KARENDINVLNHQMVPSLQVCKAKACSAIELQIALEAISNTIYKNEEWTLRDTCDELW FT RTEPKNCFKKEGQHIEVWFDGNKNNCMEYVVWKFIYYNGECGWCKVSSGVDYRGIYYM FT HDGHKTYYTDFEQEAKKYGCTNIWEVHMETESIYCPDSVSSTCRYNVPPVETVNEYNN FT HRTTTTASTFVGAQDAAVSHRPGKRPRASESEPDSSRESYAHCVTTDTDISNNANSRS FT PRINTQSHCGDKTTPVIHLKGEANRLKCCRYRFQKYKTLFTDVTTTYHWTSTDNKDSS FT IITILYKDETQRDTFLNVVKIPPSVQVILGQMSCP" FT CDS <3081..3605 FT /note="ORF E4 from bp 3081 to 3605" FT /gene="E4" FT /note="putative" FT /codon_start=1 FT /translation="KCGLMVTKIIVWNMWCGNLYIIMESVGGVKCHQGWITEAYIICM FT MATKHITQTLNRRPKNMGVQTYGKYIWKPRVFTVLTLCLVPVDTTYPLLRLLTNTTTT FT GPPPPPPPLWAPKTPRYPTDQENDPEQVNQNLTPPESPTHTVSQQTQTSVTTPTVEVH FT VSTHKATVVIKLRL" FT CDS <3994..4251 FT /note="ORF E5 from bp 3994 to 4251" FT /gene="E5" FT /note="putative" FT /codon_start=1 FT /translation="SPYIATIDFCVICVFALCFCVCLCVCHFVPLLLSASLFTSCLIL FT IILFWFVVATSFFDTFILFLLFFYIPTLCIYCHALWLINHL" FT CDS 4272..5666 FT /note="ORF L2 from bp 4167 to 5666" FT /product="minor capsid protein" FT /gene="L2" FT /note="putative" FT /codon_start=1 FT /translation="MVAHRATRRKRASATQLYKTCKLSGTCPEDVINKVEQKTWADRI FT LQWGSLFTYFGGLGIGTGSGSGGRAGYVPLGSRPSTIVDVTPARPPIVVESVGPTDPS FT IVTLVEESSVINSGAGVPNFTGSGGFEVTSSSTTTPAVLDITPTSSTVHVSSTTITNP FT LYIDPPVIEAPQTGEVSGNILISTPTSGIHSYEEIPMQTFAIHGTGNEPISSTPIPGF FT RRLAAPRLYSRAFQQVRVTDPAFLDNPTTLISADNPVFEGADTTLTFSPSGVAPDPDF FT MDIVALHRPAFTTRRTGVRFSRLGKKATMQTRRGTQIGARVHYYYDISPIAQADEIEM FT QPLLSTDNSFDGLYDIYANIDDEAPISFRQSGATPSAQLPIKPSTLSFASNTANVTAP FT LGNVWETPFYSGPDIVLPTGPSTWPFVPQSPSDVTHDVYIQGATFALWPVYFFKRRRR FT KRIPYFFADGDVAA" FT CDS 5647..7158 FT /note="ORF L1 from bp 5494 to 7158" FT /product="major capsid protein" FT /gene="L1" FT /note="putative" FT /codon_start=1 FT /translation="MAMWRPSDNKVYLPPTPVSKVVATDTYVKRTSIFYHAGSSRLLA FT VGHPYYSVSKSGTKTNIPKVSAYQYRVFRVRLPDPNKFGLPDPSFYNPDQERLVWACV FT GLEVGRGQPLGAGLSGHPLFNRLDDTEVSNLAGNNVIEDSRDNISVDCKQTQLCIVGC FT APALGEHWTKGAVCKSTPGNTGDCPPLALVNTPIEDGDMVDTGFGAMDFKLLQESKAE FT VPLDIVQSTCKYPDYLKMSADAYGDSMWFYLRREQLFARHYFNRAGNVGEAIPTDLYW FT KGGNGRDPPPSSVYVATPSGSMITSEAQLFNKPYWLQRAQGHNNGICWGNQVFVTVVD FT TTRSTNMTINAAKSTLTKYDAREINQYLRHVEEYELQFVFQLCKITLTAEVMAYLHNM FT NNTLLDDWNIGLSPPVATSLEDKYRYIKSTAITCQREQPPAEKQDPLAKYKFWEVNLQ FT DSFSADLDQFPLGRKFLMQLGPRPPRPKASVSASKRRAAPTSSSSSPAKRKKR" XX SQ SEQUENCE 7824 bp; 2471 a; 1355 c; 1653 g; 2345 t; gaaagtttca atcatacttt attatattgg gagtaaccga aatgggttta ggaccgaaaa 60 cggtacatat aaaaggcagc ctgttgtgcc tgtagatatc catggattcc atattcagca 120 atacacagga acgtccacga agcctgcacc atctgagcga ggtattacaa atacctttac 180 ttgatcttag attatcatgt gtatactgca aaaaggaact tacaagttta gagctatata 240 ggtttgcatg tattgagtta aaactagtat atagaaacaa ttggccatat gcagtatgta 300 gggtatgttt attgttttat agtaaggtta gaaaatatag gtactataaa tattcagtgt 360 atggggcaac attagaaagt ataactaaaa aacagttatc tgatttatca ataaggtgct 420 accgatgtca atgtccgtta acaccggagg aaaaacaatt gcactgtgaa cataaaagac 480 gatttcatta tatagcatat gcatggaccg ggtcatgttt gcagtgttgg agacatacga 540 gtagacaagc tacagaatct acagtataac catgcatggt aaagtaccaa cgttgcaaga 600 ggttatatta gaacttgcac cgcaaacgga aattgaccta caatgcaatg agcaattgga 660 cagctcagag gatgaggatg aggatgaaat agaccatttg ctggagcggc cacagcaagc 720 tagacaagct gaacaacata agtgttacct aattcacgta ccttgttgta agtgtgagtt 780 ggtggtgcag ttggacattc agagtaccaa agaggagcta cgtgtggtac aacagctgct 840 tatgggtgcg ttaacagtaa cgtgcccact ctgcgcatca tctaaataac tgcaatggca 900 tcacctgaag gtacagatgg ggaggggatg ggatgttgtg gatggtttca ggtagaagca 960 attgtagaaa gaaaaacggg ggatacaata tcagatgatg aaagcgagga ggagaatgaa 1020 acagatacag atgtagatgg atttatagac aatacactta taaacaatac acaggaagac 1080 agggagacag ctcaacaatt attgcaagta caaacagcac atgcagatgc acagacgttg 1140 caaaaactaa aacgaaagta tataggtagt cccttaagtg atattagtaa tcagcaaact 1200 gtgtaccgag aggaagtaaa acgaaggcta atattatcag aagacagcgg gtatggcaat 1260 acattggaaa cattggaaac atcacaacag gtagaatacg aaaagggaaa tgggtgcggg 1320 agctcacaaa atggaggctc gcaaaacagt aattgtagtg agcactcggt atcaaatatg 1380 gatatagata caaatatgga aacaccaaca caccaattgc aggaactatt taaaagtagt 1440 aacgtacaag gaagattaca ttttaaattt aaagaagtgt atggagtgcc atatacagag 1500 ttggtgcgaa catttaaaag cgatagtaca tgttgtaacg attggatatg tgcaatattt 1560 ggcgttaatg aaacattagc agaggcgtta aaaactatac taaaaccaca atgtgtgtac 1620 tatcatatgc aatgcttaac atgttcatgg ggagtaattg taatgatgct aattagatat 1680 atatgtggaa aaaatagaaa aacaattaca aaatcgctaa gctcaatttt aaatgtacca 1740 caagagcaaa tgttaattca accaccaaaa ctacgaagtc ctgctgtagc attatatttt 1800 tataaaacag caatgtcaaa tattagtgag gtgtatgggg aaacaccaga atggatacaa 1860 agacagacac aattgcaaca cagtttacaa gacaatcaat ttgaattgtc taaaatggta 1920 cagtgggcat ttgataatga agtaacagat gatagccaaa ttgccttttt atatgcacaa 1980 ctagcagaca tagatagcaa tgcacaagca tttttaaaaa gtaatatgca agcaaaatat 2040 gtaaaggatt gtggaataat gtgtagacat tacaaaaggg cacagcaaca gcaaatgaat 2100 atgtgccagt ggataaagca tatatgtagt aaagtagatg aagggggtga ttggaaaccc 2160 attgtgcaat ttttacgata tcaaggggtc gacttcattt catttttaag ttattttaaa 2220 ttatttttac aaggaacgcc taaacataat tgtttggtac tgtgtggacc accaaataca 2280 ggtaaatcat gttttgctat gagccttata aattttttcc aagggtcagt catttcattt 2340 gttaattcac aaagccactt ttggttacag ccactagaca atgccaaatt aggtttgctg 2400 gatgatgcaa cagatacgtg ttggagatac atagatgatt atctaagaaa tttattagat 2460 gggaatccca taagtttaga taggaaacat aaacaattag tacaaataaa atgtcctcca 2520 gttattatta caactaatgt aaatcctatg caagatgcaa aattaagata tttacacagt 2580 agaatttcag tgtttaagtt tgaaaatcca tttccattag ataacaatgg taatcctgtg 2640 tatgaattaa gtaatgtaaa ttggaaatgt ttttttgaaa ggacatggtc cagattaaat 2700 ttggataacg acgaggacaa agaaaacaat ggagactcta tcccaacgtt tagatgcgtg 2760 ccagaacaaa atactagact gttatgaaaa agatagtaaa tgcattatag atcacataga 2820 ctattggaaa gctgtacgac atgaatatgt attatattat aaagcaagag aaaatgacat 2880 taatgtacta aaccaccaga tggtgccctc tttacaagtg tgtaaagcaa aagcatgtag 2940 tgcaatagaa ttacaaatag cactggaagc aataagtaac acaatatata aaaatgaaga 3000 gtggacatta cgtgatacat gtgatgaact gtggcgcacg gagcctaaaa actgttttaa 3060 aaaagaagga caacacatag aagtgtggtt tgatggtaac aaaaataatt gtatggaata 3120 tgtggtgtgg aaatttatat attataatgg agagtgtggg tggtgtaaag tgtcatcagg 3180 ggtggattac agaggcatat attatatgca tgatggccac aaaacatatt acacagactt 3240 tgaacaggag gccaaaaaat atgggtgtac aaacatatgg gaagtacata tggaaaccga 3300 gagtatttac tgtcctgact ctgtgtctag tacctgtaga tacaacgtac cccctgttga 3360 gactgttaac gaatacaaca accacaggac caccaccacc gcctccacct ttgtgggcgc 3420 ccaagacgcc gcggtatccc acagaccagg aaaacgaccc agagcaagtg aatcagaacc 3480 tgactcctcc agagagtcct acgcacactg tgtcacaaca gacacagaca tcagtaacaa 3540 cgccaacagt agaagtccac gtatcaacac acaaagccac tgtggtgata aaactacgcc 3600 tgtaatccat ttaaaaggtg aagctaatag attaaagtgt tgtagataca gatttcaaaa 3660 atataaaaca ttatttacag atgtaacaac aacatatcat tggacaagta cagataataa 3720 agacagtagt attattacaa tattatataa agatgaaaca caacgggaca cctttttaaa 3780 tgttgtaaaa ataccaccta gtgtacaggt tattttggga caaatgagtt gtccataaag 3840 tgttgtatat attgtatata catatgtgtt attgtaacac tggtacaggt gaagtgtaat 3900 tgccatacat tgctgctaag catatatatt gcacccatta attgtatttg gtatattatg 3960 tgttattgta acactgggaa aggtaacgtg taatcgccat atattgcaac cattgatttt 4020 tgtgtaattt gtgtgtttgc gctttgcttt tgtgtttgtc tgtgtgtgtg ccattttgtc 4080 ccgcttttgc tatctgcatc tttatttaca agttgtctta tactaattat tttattttgg 4140 tttgttgtgg ctacatcatt ttttgatact tttatactgt ttttactatt tttttatata 4200 cctacactgt gtatatattg ccatgctttg tggttaataa accatttgta acagtagtaa 4260 tttttgctac tatggttgcc caccgtgcca cacgacgcaa acgcgcatct gccacacaat 4320 tatataaaac atgcaaatta tctggtacat gtcctgagga tgttattaat aaggtggagc 4380 aaaaaacatg ggctgatagg attttacaat ggggaagttt atttacatat tttggggggc 4440 ttggcattgg tactgggtct gggtcgggtg gtcgggcggg ctatgttccc ttaggctcta 4500 ggccttctac tatagttgat gtcactcctg cacgaccacc tattgtggtg gagtcagttg 4560 ggcctacaga tccttctatt gttacactgg tagaagaatc tagtgttatt aactcagggg 4620 ctggtgttcc caattttact gggtcagggg gatttgaagt tacatcctct tccacaacca 4680 cacctgctgt gttggatatt acacccacat ctagtactgt acatgtaagt agtactacta 4740 taacaaaccc actatatatt gatcctccag taattgaggc tccacaaact ggagaggtat 4800 ctggtaatat tttgattagc actcctacat ctggaataca tagctatgag gaaataccta 4860 tgcaaacatt tgctatacac ggtactggca acgaacctat tagtagtacc cctattccag 4920 gttttagacg ccttgctgct cccaggttat atagtagggc ttttcagcag gttagggtca 4980 ctgacccagc atttttggac aaccccacaa cattaatatc tgctgataat cctgtttttg 5040 aaggtgctga cacaacgttg accttttctc cctcgggtgt ggctcctgat cctgatttta 5100 tggatatagt tgcattacat aggcctgcat ttactacacg tagaacaggt gtgcgtttta 5160 gtaggctagg caaaaaggct accatgcaaa cacgtagggg tacgcaaata ggtgctcgtg 5220 tgcattatta ttatgatata agtcctattg cacaggctga tgaaattgaa atgcagccat 5280 tattgtctac agacaattca tttgatggcc tatatgatat ttatgcaaat attgatgatg 5340 aggcacccat ttcatttcgt cagtctggtg ctacaccttc tgcacaatta cctattaaac 5400 cttctacatt atcctttgct agtaacacag ctaatgttac tgcccctttg ggaaatgttt 5460 gggaaacacc attttattca ggtcctgata tagttttacc tacaggcccc agtacttggc 5520 ccttcgtacc tcagtctcct tctgatgtta cacatgatgt atatatacag ggagctacat 5580 ttgcactatg gcctgtatat ttttttaaac gtaggcgccg taaacgtatt ccctattttt 5640 ttgcagatgg cgatgtggcg gcctagtgac aataaggtgt acctacctcc aacacctgtt 5700 tcaaaggttg tggcaacgga tacatatgta aaacgtacca gtatatttta tcatgcaggt 5760 agctctaggt tgcttgctgt tggccatcct tattactctg tttccaaatc tggtaccaaa 5820 acaaacatcc ctaaagttag tgcatatcag tatagagtgt ttagggtacg gttgcctgat 5880 cctaataagt ttggccttcc tgatccatct ttctataatc ctgaccagga acgtttggta 5940 tgggcctgtg taggtttgga ggtaggccga ggtcaacctt taggtgctgg gttaagtggt 6000 catccattat ttaataggct ggatgacact gaggtctcta atttagcagg taataatgtt 6060 atagaagata gccgggacaa tatatctgtt gattgtaaac aaacccagtt atgtattgtg 6120 ggatgtgcac cagcattagg ggaacattgg actaagggcg cggtgtgtaa gtctacacca 6180 ggtaatacag gggattgtcc acctcttgca ttagttaata ccccgataga ggacggtgac 6240 atggtggaca ccgggtttgg tgcaatggac tttaagctat tacaggaatc aaaggctgag 6300 gtgccattgg acattgtaca atctacatgt aaatatcctg attatttaaa aatgtctgca 6360 gatgcctatg gggattctat gtggttttac ttacgcaggg aacaattgtt tgccagacat 6420 tactttaata gggcaggtaa tgttggggaa gccattccta cagatttgta ttggaagggt 6480 ggcaatggca gggaccctcc tcccagttct gtatatgttg ctactcctag tgggtccatg 6540 attacctctg aggcccaatt atttaataaa ccttattggt tgcaacgtgc acagggccat 6600 aataatggca tatgctgggg taatcaggta tttgttactg ttgtggatac taccagaagc 6660 accaacatga ctattaatgc agctaaaagc acattaacta aatatgatgc ccgtgaaatc 6720 aatcaatacc ttcgccatgt ggaggaatat gaactacagt ttgtgtttca actttgtaaa 6780 ataaccttaa ctgcagaagt tatggcatat ttgcataata tgaataatac tttattagac 6840 gattggaata ttggcttatc cccaccagtt gcaactagct tagaggataa atataggtat 6900 attaaaagca cagctattac atgtcagagg gaacagcccc ctgcagaaaa gcaggatccc 6960 ctggctaaat ataagttttg ggaagttaat ttacaggaca gcttttctgc agacctggat 7020 cagtttcctt tgggtagaaa atttttaatg caactaggcc ctagaccccc tagacccaag 7080 gctagtgtat ctgcctctaa aaggcgggcg gctcctacct cttcctcttc ttcaccagct 7140 aaacgtaaaa aacgatagtt gtgtgttgtg tgttgtatgt attgtatggt tgtgcttgta 7200 ctgtatgttt ttgtgtatgt ttatgtattt tataattgtg tatgtgctat gtgtatgtat 7260 gactgtatgt atgtgtaatg ttttgtgtgt atgtaataaa catgcatggt tacttttacg 7320 cgtggttgca taaactaagg tgcggtagta tccttgggca gtgtgtgtca ggttaggtgg 7380 tgttccttac tgtttaatgt tatattaaat aggttgtttg tatgcactat agtaacacac 7440 caaactccat tttagtgctg tacgccattt tatgcatgca accgaattcg gttgcctagc 7500 cttttgtcct tatttaaacc caaaacgact tttcagcaaa acagttaatc ctttggcata 7560 ttgccgtttc ctgttgtatg attcaggtat gtacactgcc ttaccctgta ttactcacct 7620 gtatttctgt gccaactatg cttttatctg catactttgg cgctgttggg catatgtttt 7680 tatgcaggtg tttgcaatat attttgttgg cgtgtagccc ttattgtata agccaagtat 7740 ctgtcttgca aatatgtaac catatactta ctcattttac aaaaccgttt acggtcgtgc 7800 taaaacaggt ttcttttaat tgtt 7824