ID HPV44 STANDARD; ds-DNA; VRL; 7833 bp. XX DE Human papillomavirus type 44 (HPV44), complete genome. XX AC U31788 XX DT 04-JUL-1995 XX OS Human papillomavirus type 44 DNA. OC Human papillomavirus type 44;Viridae; ds-DNA nonenveloped viruses; OC Papovaviridae;Papillomavirus. XX RN [1] RP 1 - 7833 RA Delius,H; RT "Direct Submission;" RL Unpublished. XX XX Created by HIV database on 1-NOV-1995 from GenBank: U31788. XX XX XX FT KEY Location/Qualifiers FT CDS 105..557 FT /note="ORF E6 from bp 72 to 557" FT /product="transforming protein" FT /gene="E6" FT /note="putative" FT /codon_start=1 FT /translation="MESANASTSAQSIDQLCKECNIPMHNLQILCVFCRKTLSTAEVY FT SFAYKQLYVVYRGNFPFAACAICLELQGKVNQFRHFNYAGYAVTVEEETNKSILDVLI FT RCYLCHKPLCHVEKVRHILDKARFIKLQDTWKGRCFHCWTSCMETILP" FT CDS 533..826 FT /note="ORF E7 from bp 488 to 826" FT /product="transforming protein" FT /gene="E7" FT /note="putative" FT /codon_start=1 FT /translation="MHGNYTTLKEIVLQLEPPDPVGLHCNEQLDSSEDEVDELATQAT FT QDVTQPYQIVTTCGTCSRKVRLVVQCTGTDIHHLHTLLLGSLDILCPVCAPKT" FT CDS 832..2763 FT /note="ORF E1 from bp 715 to 2763" FT /product="replication protein" FT /gene="E1" FT /note="putative" FT /codon_start=1 FT /translation="MADNTGTEGTGCSGWFLVEAIVENTTGQQISEDEDEAVEDSGLD FT MVDFIDDRPITHNSMEAQALLNEQEADAHYAAVQDLKRKYLGSPYVSPLSNIEQAVEC FT DISPRLDAITLSRQPKKVKRRLFDRPELTDSGYGNTEVEAETQVERNGEPEDCGGGGQ FT GRDTEGVEQVETEVQTHSNTQQHTGTTRVLELLKCKNIRATLLGKFKDCYGLSYTDLI FT RQFKSDKTTCGDWVIAAFGVHHSVSEAFQNLIQPVTTYSHIQWLTNAWGMVLLALVRF FT KVNKNRCTVARMMATRLNIPEDHMLIEPPKIQSGVAALYWFRSGISNASIVTGETPEW FT ITRQTIVEHGLADNQFKLADMVQWAYDNDFCEESEIAFEYAQRADIDANARAFLNSNC FT QAKYVKDCATMCKHYKTAEMKKMNMKQWIKFRSSKFEDTGNWKPIVQFLRHQNIEFIP FT FLTKLKMWLHGTPKKNCIAIVGPPDTGKSCFCMSLIKFLGGTVISYVNSSSHFWLQPL FT CNAKVALLDDVTQSCWVYMDTYMRNLLDGNPMTIDRKHKSLALIKCPPLIVTSNIDIT FT KEEKYKYLCSRVTLFTFPNPFPFDRNGNALYDLCETNWKCFFARLSSSLDIQTSEDED FT DGDNSQAFRCVPGTVVRTV" FT CDS 2705..3838 FT /note="ORF E2 from bp 2678 to 3838" FT /product="regulatory protein" FT /gene="E2" FT /note="putative" FT /codon_start=1 FT /translation="METIAKHLDVCQEQLLELYEENSNKLTKHIQHWKCIRYECVLLH FT KAKQMGLNHIGMQVVPALAVSQTKGHQAIEMQMTLETLLNSDYGTEPWTLQETSREMW FT LTPPKYCFKKQGQTVEVKFDCNADNAMEYVWWKVIYVFDTDKWVKVTGHIDYKGLYYV FT HGGHKTYYTNFEKEAEKYGNSLQWEVCIGSSIICSPASISSTVQDVSIAGPASHSSSS FT TTTTLAQASSTLPIGTAEDCVDAPPCKRPRGPPTNTNNARNTVCVRNSDSVDSTNNNI FT LPNSYNSNKGRDNNYCTATPVVQLQGDANCLKCLRYRLHAKYKTLFVAASSTWRWTCS FT DTSSNALVTLTYVDEQQRQQFLNTVKLPPKVTYKVGYMSLQLL" FT CDS <3162..3596 FT /note="ORF E4 from bp 3162 to 3596" FT /gene="E4" FT /note="putative" FT /codon_start=1 FT /translation="TIKGCIMYMVGIKPIIQILKRRPKNMGTLYNGRYVLAAVSYVLL FT HLYLVLCKTYPLLGLLHTPPPPPPPPLHRPHPHCPLAPPRTAWTRRHVNDPEDPPQTP FT TTPETPSVSETATPWTVQTTTSSLTVTTVTKDGTTIIVQLRL" FT CDS 3874..4152 FT /note="ORF E5 from bp 3859 to 4152" FT /gene="E5" FT /note="putative" FT /codon_start=1 FT /translation="MEHIPIDATIGATSTSLLPVVIALFVCFVSIVLIICISDFIVYT FT SILVLTLLLYLLLWLLLTSALQFYLLTLCVCFFPAWYIHFHIVHTQQE" FT CDS 4325..5707 FT /note="ORF L2 from bp 4289 to 5707" FT /product="minor capsid protein" FT /gene="L2" FT /note="putative" FT /codon_start=1 FT /translation="MAHSRARRRKRASATQLYQTCKAAGTCPSDIIPKVEHNTIADQI FT LKWGSLGVFFGGLGIGTGSGTGGRTGYIPLQSTPRPDIPSVPTARPPILVDTVAPGDP FT SIVSLVEESAIINSGAPELVPPSHAGFEITTSESTTPAILDVSVTTHTTSTSVFKNPS FT FADPSVVQSQPAVEAGGHILISTSSISSHPVEEIPLDTFIVSSSDSNPASSTPIPASG FT ARPRIGLYSKALHQVQVTDPAFLSSPQRLITFDNPAYEGEDVTLHFAHNTIHEPPDDA FT FMDIIRLHRPAIQSRRGRVRFSRIGQRGSMYTRSGKHIGGRIHFYQDISPISAAAEEI FT ELHPLVATAQDSGLFDIYAEPDPDVTEEPVSLSFSTSTPFQRSSVSATPWGNTTVPLS FT LPADMFVQPGPDIIFPTASTTTPYSPVTPALPTGPVFISGAAFYLYPTWYFARKRRKR FT VSLFFADVAA" FT CDS 5694..7196 FT /note="ORF L1 from bp 5616 to 7196" FT /product="major capsid protein" FT /gene="L1" FT /note="putative" FT /codon_start=1 FT /translation="MWRPSENQVYVPPPAPVSKVIPTDAYVKRTNIYYHASSSRLLAV FT GNPYFAIRPANKTLVPKVSGFQYRVFKMVLPDPNKFALPDTSIYDPTTQRLVWACIGL FT EVGRGQPLGVGISGHPLLNKLDDVENSASYAAGPGQDNRVNVAMDYKQTQLCLVGCAP FT PLGEHWGKGKQCNNVSVKDGDCPPLELITSVIEDGDMVDTGFGAMNFAELQPNKSDVP FT LDICTATCKYPDYLQMAADPYGDRLFFYLRKEQMFARHFFNRAGTVGEDVSQDLVIKS FT ASKNTVPNAIYFNTPSGSLVSSETQLFNKPFWLQKAQGHNNGICWGNQLFVTVVDTTR FT STNMTICAATTQSPPSTYTSEQYKQYMRHVEEFDLQFMFQLCSITLTAEVMAYLHTMN FT AGILEQWNFGLSPPPNGTLEDKYRYVQSQAITCQKPPPEKAKQDPYAKLSFWEVDLRE FT KFSSELDQYPLGRKFLLQTGVQARSSVRVGRKRPASAATSSSKQKRSRKK" XX SQ SEQUENCE 7833 bp; 2383 a; 1545 c; 1678 g; 2227 t; ttaataataa tctaaccttt acaaaaaaga ggaggaaccg aattcggttc caaccgaaaa 60 cggttatata aaaaccagcc caaaaattaa gcaagcgggg cataatggaa agtgcaaatg 120 cctccacgtc tgcacaaagt atagaccagt tgtgcaagga gtgcaacatt cctatgcaca 180 atctgcaaat tttatgcgtg ttttgcagaa aaacgttaag tactgcagag gtttattcat 240 tcgcatataa acagttatat gtagtgtacc gaggaaactt tccatttgca gcctgtgcca 300 tttgtttaga actacaaggt aaggtcaatc aatttaggca ttttaactac gcgggatatg 360 cagtaacagt ggaagaagaa acaaataagt caattctgga cgtgctgata cgctgctatt 420 tgtgccacaa accattgtgc cacgtggaaa aggtgcgcca catattggac aaggcgcgat 480 tcattaaatt acaagatacc tggaagggtc gctgcttcca ttgttggaca tcatgcatgg 540 aaactatact accttaaagg aaattgtttt acagctggaa cctcctgacc ctgtaggcct 600 acattgcaat gagcaattag acagctcaga agatgaggtg gatgaactag ccacgcaagc 660 cacgcaagac gttacacagc cttaccaaat agtaaccacc tgtggtacat gtagtcggaa 720 ggttcggctg gttgtgcagt gcacaggaac agacatccat cacctacata cgcttctgct 780 gggttcactg gatatattgt gtcctgtgtg tgcgcccaaa acctaacaac gatggctgac 840 aatacaggta cagagggaac gggatgctca ggatggtttc tagtagaggc tatagtggag 900 aacacaaccg ggcaacaaat atcagaggat gaggatgagg cagtggagga tagtgggttg 960 gatatggtgg actttataga tgacaggcct attacacaca attccatgga agcacaggca 1020 ttgttaaacg agcaggaggc ggatgctcat tatgcggctg tgcaggacct aaaacgaaag 1080 tatttaggta gtccatatgt tagtccttta agtaatattg agcaggcagt ggagtgtgac 1140 attagcccac ggctggacgc tataacatta agtagacaac caaaaaaagt aaagcgacgg 1200 ctgtttgaca gaccagaatt aacggacagt ggatatggca atactgaagt ggaagctgaa 1260 acgcaggtag agagaaatgg cgaaccggaa gattgtgggg gaggtggaca aggaagggac 1320 acagaggggg tggaacaggt ggaaacggaa gtgcagacac atagcaacac acaacagcac 1380 accgggacca cgcgggtact agaactattg aaatgtaaga atataagggc tacactgctt 1440 ggtaagttta aggattgcta tgggttatca tatacagatt taattagaca atttaaaagt 1500 gacaagacaa catgtgggga ctgggtaatt gcagcctttg gggtgcacca tagtgtgtca 1560 gaggcgtttc aaaatttaat acagccagta acaacatata gccacataca atggcttaca 1620 aatgcatggg gaatggtcct actggcatta gtaaggttta aggtaaataa aaacagatgt 1680 acagtggcac gtatgatggc aacccgttta aatatacctg aggaccacat gttaattgaa 1740 cctcctaaaa tacaaagcgg tgttgcagcg ttatattggt ttagaagtgg tatatccaat 1800 gccagtatag taactggaga aacaccggaa tggataacaa ggcaaaccat tgtagaacat 1860 gggcttgcag acaaccaatt taaattagca gacatggttc aatgggcata tgataatgac 1920 ttttgtgagg aaagtgaaat tgcatttgaa tatgcacaac gtgcagatat agatgccaat 1980 gccagagcat tcctaaatag taattgtcag gcaaaatatg taaaagactg tgccacaatg 2040 tgcaagcact ataaaactgc agaaatgaaa aaaatgaata tgaaacagtg gataaaattt 2100 aggagcagta aatttgaaga cacaggaaat tggaaaccaa tagtgcaatt tttaagacac 2160 caaaacatag aatttattcc gtttttaact aaattaaaga tgtggctgca tggtacacca 2220 aaaaaaaact gtattgcaat agtgggccca ccagacacag gtaaatcgtg tttttgtatg 2280 agtttaatta aattcttagg aggcactgta attagttatg taaactccag cagtcacttt 2340 tggctacagc ccttatgcaa tgcaaaagta gcattattag atgatgtaac ccaatcctgc 2400 tgggtatata tggatacata tatgagaaac ctattagatg gaaaccctat gaccattgac 2460 agaaaacaca aatcattagc attaataaaa tgtccgcctt taatagtaac atcaaacata 2520 gacattacta aagaagagaa atacaaatat ttatgtagca gggtaacatt atttacattt 2580 ccaaatccat tcccctttga cagaaatggg aatgcactat atgacctgtg tgaaacaaac 2640 tggaaatgtt tctttgcaag attatcatca agtctagata tacaaacatc agaggacgag 2700 gacgatggag acaatagcca agcatttaga tgtgtgccag gaacagttgt tagaactgta 2760 tgaagaaaat agtaataaac ttacaaaaca tatacaacat tggaaatgta tacgatatga 2820 atgtgtgtta ctacacaaag ctaagcaaat gggcctgaac cacattggaa tgcaagtggt 2880 gccagcatta gcagtgtcac agacaaaggg acaccaggca attgaaatgc aaatgacatt 2940 agaaacatta ctaaactctg actatggtac ggaaccatgg acattgcaag agacaagtcg 3000 ggaaatgtgg ttaacaccac ccaaatattg ctttaaaaag cagggacaaa ctgtggaagt 3060 aaaatttgac tgcaatgcag acaatgcaat ggagtatgta tggtggaaag tcatttatgt 3120 atttgacaca gacaaatggg taaaagtgac aggacacata gactataaag ggttgtatta 3180 tgtacatggt gggcataaaa cctattatac aaattttgaa aaggaggccg aaaaatatgg 3240 gaactcttta caatgggagg tatgtattgg cagcagtatc atatgttctc ctgcatctat 3300 atctagtact gtgcaagacg tatccattgc tgggcctgct tcacactcct cctcctccac 3360 caccaccacc cttgcacagg cctcatccac actgcccatt ggcaccgccg aggactgcgt 3420 ggacgcgccg ccatgtaaac gaccccgagg accccccaca aacaccaaca acgccagaaa 3480 caccgtctgt gtcagaaaca gcgactccgt ggacagtaca aacaacaaca tcctccctaa 3540 cagttacaac agtaacaaag gacgggacaa caattattgt acagctacgc ctgtagttca 3600 attacaaggt gatgctaatt gtttaaagtg tttaagatat agattacatg caaagtataa 3660 aacattgttt gtagcagcat cgtccacatg gcgctggaca tgttcagata catccagtaa 3720 tgcactggta acattaacat atgttgatga acagcaacgc cagcagtttt taaacactgt 3780 aaagttacca ccaaaagtta catataaagt tggatatatg tctttacaat tgttataatg 3840 tgtgttgtat atatctaatt gtatatattg tacatggaac acatacctat agatgctact 3900 ataggggcaa ccagcacatc attactgcca gttgtaattg ccctgtttgt atgctttgtt 3960 agcattgtat taattatttg tatttctgat tttatagtgt acacatctat attggtacta 4020 accttactgc tatatctgtt actttggctt ttactaacct ctgccctgca attttattta 4080 ctaacactgt gtgtctgctt ttttcctgcg tggtatatac atttccatat tgtacataca 4140 caacaagaat aactattaca atgctaacat gtacgtttga tgatggtgat acatggctgt 4200 tattgtggtt gttattaaca ttaattgtta ccattatagc attgttatta atgcatttaa 4260 aaactgtaca atgcgttaca tgcagtaaat aagtatttgt atatttggtg tgtattgtat 4320 aaatatggca cacagtaggg cacgtagacg taaacgtgca tctgctaccc aattatatca 4380 aacatgtaag gctgcaggca cctgtccctc tgatattatt cctaaggtgg aacataacac 4440 tattgcagat cagatattaa agtggggcag tttgggggtt ttttttgggg gactggggat 4500 tggtacaggc tctggcacag gcggtagaac agggtatata cctttacaat ccaccccgcg 4560 tcctgacatt ccctctgtac ctaccgcaag gccacctata cttgttgata ctgttgcacc 4620 tggggacccg tccattgtat ccttggttga agaatctgct attataaatt cgggggcccc 4680 ggaattggtc cctccttccc atgcaggatt tgaaatcact acatctgaat ctaccacacc 4740 agctatatta gatgtgtctg tcaccacaca tactacctct acaagtgtat ttaaaaaccc 4800 tagctttgct gacccatctg ttgtacagtc gcagcctgct gttgaagctg gtggccacat 4860 acttatctct acctcatcta tatcgtccca ccctgtagaa gaaatacctt tggatacatt 4920 tatagtatct tcctctgata gtaatcctgc atctagcact cccattccag catctggtgc 4980 acggccgcgt attggcctat acagtaaggc tttgcaccag gtacaggtaa cggatcctgc 5040 ctttttgtcc tctccccagc gcctaataac atttgataat cctgcatatg aaggggagga 5100 tgttacttta cactttgcac acaatactat acatgaacct ccagatgatg cgtttatgga 5160 tattatacga ttgcacagac cggctataca gtccaggcgt ggtcgtgtgc ggtttagtag 5220 aattggacaa cgagggtcta tgtacacacg tagtggcaaa catattggtg gcaggataca 5280 tttctatcaa gacatttctc ctatatctgc tgctgcagaa gaaatagaac tgcaccccct 5340 tgtggccact gcacaggata gtggcctgtt tgatatttat gcagaacctg accctgatgt 5400 tacagaagaa cctgtttcat tgtctttttc tacctccaca ccctttcagc ggtcttctgt 5460 gtcagccacc ccatggggca atactactgt ccctctttca ttacctgctg acatgtttgt 5520 acagcctggt cctgacataa tctttcctac tgcatccact acaactccct atagtcctgt 5580 cactcctgct ttacctacag gtcctgtttt tataagtggt gctgcatttt atttatatcc 5640 tacatggtat tttgcacgca aacgccgtaa acgtgtttcc ttgttttttg cagatgtggc 5700 ggcctagtga aaaccaggta tatgtgcctc ctcccgcccc agtatccaaa gtaataccta 5760 cggatgccta tgtcaaacgc accaacatat attaccatgc tagcagttct agacttcttg 5820 ctgtgggcaa cccttatttt gccatacgac cagcaaacaa gacacttgtg cctaaggttt 5880 cgggatttca atatagggtt tttaagatgg tattgccaga ccctaataaa tttgccttac 5940 ctgacacatc tatatatgac cccactacgc aacgcctggt atgggcctgc atcgggctgg 6000 aggtaggtag aggacagccc ttaggtgttg gtattagtgg gcatccatta ttaaataaat 6060 tggatgatgt agaaaattca gctagttatg cagccggtcc gggtcaggat aacagggtaa 6120 atgtggccat ggactataaa caaacacaat tatgtttggt tggctgtgca cccccgttag 6180 gtgagcattg gggtaaaggc aagcagtgta ataatgttag tgttaaggat ggggactgcc 6240 ctcccttgga attaattact agtgtaattg aggatggtga tatggtggac actggttttg 6300 gagccatgaa ttttgctgaa ttgcagccaa ataaatctga tgttccatta gatatatgca 6360 ctgctacatg taaatatcct gactatttac aaatggctgc agatccatat ggggacagat 6420 tgttttttta cttacgaaag gaacagatgt ttgccagaca tttttttaat agggctggaa 6480 cagttggtga ggacgtttcc caggatctgg ttattaaaag tgctagtaaa aatactgttc 6540 ctaatgctat atactttaat acacccagtg gttctcttgt atcttctgaa acccaattat 6600 ttaataagcc tttttggttg caaaaggcgc agggccacaa taatggtatt tgttggggaa 6660 atcagttatt tgttactgtt gtagatacta cccgtagtac aaacatgaca atatgtgctg 6720 ccactacaca gtcccctccg tctacatata ctagtgaaca atataagcaa tacatgcgac 6780 atgttgagga gtttgactta caatttatgt ttcaattatg tagtattacc ttaacggcgg 6840 aggtaatggc ctatcttcat actatgaatg ctggtatttt agaacagtgg aactttgggt 6900 tgtcgccgcc cccaaatggt accttagagg acaaatacag atatgtgcag tcccaggcca 6960 ttacatgtca aaagccaccc cctgaaaagg caaagcagga cccctatgca aaattaagtt 7020 tttgggaggt ggatcttaga gaaaagtttt ctagtgagtt ggatcaatat ccccttggta 7080 gaaaattttt attacaaacg ggtgtgcagg cccgttcctc tgttcgtgtg ggtaggaaac 7140 gtcctgcgtc tgcagccact tcctccagta aacaaaaacg gtctaggaag aagtagtatg 7200 tgttattgtt ttgtttgtat gtgtgtcata tgttattgtg ttatatatgt gttgtgttgt 7260 atatatgttg tatgtgtatg ttgtgtaatg ttgtctgtaa tggaatgcat gtgtgtgttg 7320 tacataataa acttaatctg tgtgtcctgt tccaccccat gagtaagtgt tgtagtgttg 7380 tgttctatgt ttggtatata taatatataa catatgtaca gccatgttag tttttaaaca 7440 tattcctcca ttttgggtgc aaccgttttc ggttgttcat tttgggtgca accgttttcg 7500 gttgttactc attacccaca tcctgtaccc aatttgttat agcaagcaaa atatttaatc 7560 atctctgcca gaactttatt atgttactaa gtacacacct ggcgcacagc taggcgcggt 7620 ttggcaacta cacaatacat tcctaatctc tatactactg ctgtctcgtt tgtgaacaat 7680 agtgcgctgg tagccaactt tttaaaagca tttttggcta ctagcactgc atttttgtac 7740 agttactgtt ggttttataa aatgagtaac ctaaggtcac acacctgcga ccggtatcgg 7800 ttgacacaca ccctgtacac ttccttatca tag 7833