ID HPV6b STANDARD; ds-DNA; VRL; 7902 bp. XX DE Human papillomavirus type 6b (HPV-6b), complete genome. XX AC X00203 XX DT 11-MAR-1994 XX OS Human papilloma virus type 6b DNA. OC Human papillomavirus type 6b OC Viridae; ds-DNA nonenveloped viruses; Papovaviridae; OC Papillomavirus. XX RN [1] RP 1-7902 RA Schwarz,E., Duerst,M., Demankowski,C., Lattermann,O., Zech,R., RA Wolfsperger,E., Suhai,S. and Zur Hausen,H.; RT "DNA sequence and genome organization of genital human RT papillomavirus type 6b"; RL EMBO J. 2, 2341-2348 (1983) XX XX HPV-11 and HPV-6 are responsible for the large majority of XX exophytic condylomas in the genital tract. Even though these XX lesions are frequently present in the genital tract, they are XX virtually absent in higher grade neoplasias and in cervical XX cancers. HPV-6 also infects other mucosal types; the respiratory XX tract, oral cavity and conjunctiva. It has been recovered from XX approximately 50% of respiratory tract lesions and 50% of all XX childhood conjuncitival papillomas. Respiratory papillomatosis is XX a rare disease that can be life-threatening because of its XX recurrent nature and the possibility of obstruction of the airways XX and respiratory distress. The most frequent sites of infection are XX the vocal cords in the larynx, but papillomas may also be present XX in the trachea, lungs, nose and oral cavity. These respiratory XX papillomas progress to malignancy rarely, as they account for less XX than 0.1% of all respiratory cancers. XX XX The 7902 bp complete genome of HPV-6b has been cloned in pBR322 and XX in lambda and was originally recovered from a genital wart. The XX sense strand has been numbered by comparative analysis with BPV-1 XX and HPV-1a. Both the E6 and E7 ORFs contain conserved Cys-X-X-Cys XX cysteine doublet motifs. The E6 ORF contains four of these motifs XX separated by 29, 36 and 29 intervening amino acids, while the E7 XX ORF contains just two separated by 29 amino acids. The E6 ORF also XX contains a small intron. The E5a ORF codes for a protein of 91 XX amino acids. It has a stretch of 13 amino acids which is very rich XX in Leusine. The L2 ORF contains an extremely conserved cluster of XX basic residues both at the N terminus and at the C terminus ends. XX The authors feel that the conserved region of this part of this XX peptide may interact with the conserved L1 structural peptide, XX where the variable region may be involved with host or tissue XX specific functions. XX XX Between the end of L1 and the beginning of E6 lies a small open XX reading frame E8. The first methionine is located in the middle of XX the ORF and it has no analog to other papillomaviruses sequenced at XX the time of publication. Because of these facts, this ORF is XX probably not functional. Thus, the region from the end of L1 to XX the beginning of E6 is probably the noncoding region containing the XX promoter and origin of replication. Within the first segment of XX this region lies a monotonous repetition of thymine-purine which is XX just slightly disturbed. Two repeats can be identified within the XX LCR; a 24 bp tandem repeat and a 9 bp direct repeat. A TATA box is XX located at nt 64 and a cap site is located directly in front of the XX E6 methionine codon. XX XX FT KEY Location/Qualifiers FT 5'UTR join(7292..7902,1..29) FT /note="putative" FT CDS join(7746..7902,1..5) FT /note="probably not functional" FT /note="E8 from bp 7611 to 5" FT /gene="E8" FT /note="putative" FT /codon_start=1 FT TATA_signal 64..70 FT misc_feature 98..104 FT /note="cap site" FT /note="putative" FT CDS 102..554 FT /note="ORF E6 from bp 30 to 554" FT /product="transforming protein" FT /gene="E6" FT /note="putative" FT /codon_start=1 FT misc_feature 141..142 FT /note="splice acceptor following E7" FT /note="putative" FT intron 450..503 FT /note="contained in the E6 ORF" FT /note="putative" FT CDS 530..826 FT /note="ORF E7 from 440 to 826" FT /product="transforming protein" FT /gene="E7" FT /note="putative" FT /codon_start=1 FT misc_feature 820..821 FT /note="splice acceptor following E1" FT /note="putative" FT CDS 832..2781 FT /note="ORF E1 from bp 715 to 2781" FT /product="replication protein" FT /gene="E1" FT /note="putative" FT /codon_start=1 FT misc_feature 1279..1280 FT /note="splice donor in E1" FT /note="putative" FT misc_feature 2678..2679 FT /note="splice acceptor following E2" FT /note="putative" FT CDS 2723..3829 FT /note="ORF E2 from bp 2696 to 3829" FT /product="regulatory protein" FT /gene="E2" FT /note="putative" FT /codon_start=1 FT misc_feature 3242..3243 FT /note="splice acceptor following E4" FT /note="putative" FT CDS 3255..3584 FT /note="ORF E4 from bp 3240 to 3584" FT /gene="E4" FT /note="putative" FT /codon_start=1 FT misc_feature 3596..3597 FT /note="splice donor behind E4" FT /note="putative" FT CDS 3887..4162 FT /note="ORF E5a from bp 3872 to 4162" FT /gene="E5a" FT /note="putative" FT /codon_start=1 FT CDS 4159..4377 FT /note="ORF E5b from bp 4003 to 4377" FT /gene="E5b" FT /note="putative" FT /codon_start=1 FT misc_feature 4405..4406 FT /note="splice acceptor following L2" FT /note="putative" FT CDS 4423..5802 FT /note="ORF L2 from bp 4378 to 5802" FT /product="minor capsid protein" FT /gene="L2" FT /note="putative" FT /codon_start=1 FT polyA_signal 4554..4560 FT /note="putative" FT misc_feature 5788..5789 FT /note="splice acceptor following L1" FT /note="putative" FT CDS 5789..7291 FT /note="ORF L1 from bp 5678 to 7291" FT /product="major capsid protein" FT /gene="L1" FT /note="putative" FT /codon_start=1 FT repeat_region 7292..7339 FT /rpt_unit=7292..7315,7316..7339 FT /standard_name="24 bp tandem repeat" FT /note="putative" FT misc_feature 7368..7369 FT /note="splice donor noncoding region" FT /note="putative" FT polyA_signal 7407..7412 FT /note="putative" FT repeat_region 7450..7474 FT /rpt_unit=7450..7458, 7466..7474 FT /standard_name="nonanucleotide direct repeat" FT /note="putative" FT source 1..7902 FT /organism="Human papillomavirus type 6" FT /sequenced_mol="DNA" XX SQ SEQUENCE 7902 bp; 2438 a; 1530 c; 1699 g; 2235 t; gttaataaca atcttggttt aaaaaatagg agggaccgaa aacggttcaa ccgaaaacgg 60 ttgtatataa accagcccta aaatttagca aacgaggcat tatggaaagt gcaaatgcct 120 ccacgtctgc aacgaccata gaccagttgt gcaagacgtt taatctatct atgcatacgt 180 tgcaaattaa ttgtgtgttt tgcaagaatg cactgaccac agcagagatt tattcatatg 240 catataaaca cctaaaggtc ctgtttcgag gcggctatcc atatgcagcc tgcgcgtgct 300 gcctagaatt tcatggaaaa ataaaccaat atagacactt tgattatgct ggatatgcaa 360 caacagttga agaagaaact aaacaagaca tcttagacgt gctaattcgg tgctacctgt 420 gtcacaaacc gctgtgtgaa gtagaaaagg taaaacatat actaaccaag gcgcggttca 480 taaagctaaa ttgtacgtgg aagggtcgct gcctacactg ctggacaaca tgcatggaag 540 acatgttacc ctaaaggata ttgtattaga cctgcaacct ccagaccctg tagggttaca 600 ttgctatgag caattagtag acagctcaga agatgaggtg gacgaagtgg acggacaaga 660 ttcacaacct ttaaaacaac atttccaaat agtgacctgt tgctgtggat gtgacagcaa 720 cgttcgactg gttgtgcagt gtacagaaac agacatcaga gaagtgcaac agcttctgtt 780 gggaacacta aacatagtgt gtcccatctg cgcaccgaag acctaacaac gatggcggac 840 gattcaggta cagaaaatga ggggtctggg tgtacaggat ggtttatggt agaagctata 900 gtgcaacacc caacaggtac acaaatatca gacgatgagg atgaggaggt ggaggacagt 960 gggtatgaca tggtggactt tattgatgac agcaatatta cacacaattc actggaagca 1020 caggcattgt ttaacaggca ggaggcggac acccattatg cgactgtgca ggacctaaaa 1080 cgaaagtatt taggtagtcc atatgttagt cctataaaca ctatagccga ggcagtggaa 1140 agtgaaataa gtccacgatt ggacgccatt aaacttacaa gacagccaaa aaaggtaaag 1200 cgacggctgt ttcaaaccag ggaactaacg gacagtggat atggctattc tgaagtggaa 1260 gctggaacgg gaacgcaggt agagaaacat ggcgtaccgg aaaatggggg agatggtcag 1320 gaaaaggaca caggaaggga catagagggg gaggaacata cagaggcgga agcgcccaca 1380 aacagtgtac gggagcatgc aggcacagca ggaatattgg aattgttaaa atgtaaagat 1440 ttacgggcag cattacttgg taagtttaaa gaatgctttg ggctgtcttt tatagattta 1500 attaggccat ttaaaagtga taaaacaaca tgtttagatt gggtggtagc agggtttggt 1560 atacatcata gcatatcaga ggcatttcaa aaattaattg agccattaag tttatatgca 1620 catatacaat ggctaacaaa tgcatgggga atggtattgt tagtattatt aagatttaaa 1680 gtaaataaaa gtagaagtac cgttgcacgt acacttgcaa cgctattaaa tatacctgaa 1740 aaccaaatgt taatagagcc accaaaaata caaagtggtg ttgcagccct gtattggttt 1800 cgtacaggta tatcaaatgc cagtacagtt ataggggaag caccagaatg gataacacgc 1860 caaacagtta ttgaacacgg gttggcagac agtcagttta aattaacaga aatggtgcag 1920 tgggcgtatg ataatgacat atgcgaggag agtgaaattg catttgaata tgcacaaagg 1980 ggagattttg attctaatgc acgagcattt ttaaatagca atatgcaggc aaaatatgtg 2040 aaagattgtg caactatgtg tagacattat aaacatgcag aaatgaggaa gatgtctata 2100 aaacaatgga taaaacatag gggttctaaa atagaaggca caggaaattg gaaaccaatt 2160 gtacaattcc tacgacatca aaatatagaa ttcattcctt ttttaactaa atttaaatta 2220 tggctgcacg gtacgccaaa aaaaaactgc atagccatag taggccctcc agatactggg 2280 aaatcgtact tttgtatgag tttaataagc tttctaggag gtacagttat tagtcatgta 2340 aattccagca gccatttttg gttgcaaccg ttagtagatg ctaaggtagc attgttagat 2400 gatgcaacac agccatgttg gatatatatg gatacatata tgagaaattt gttagatggt 2460 aatcctatga gtattgacag aaagcataaa gcattgacat taattaaatg tccacctctg 2520 ctagtaacgt ccaacataga tattactaaa gaagataaat ataagtattt acatactaga 2580 gtaacaacat ttacatttcc aaatccattc ccttttgaca gaaatgggaa tgcagtgtat 2640 gaactgtcaa atacaaactg gaaatgtttt tttgaaagac tgtcgtcaag cctagacatt 2700 caggattctg aggacgagga agatggaagc aatagccaag cgtttagatg cgtgccagga 2760 acagttgtta gaactttatg aagaaaacag tactgaccta cacaaacatg tattgcattg 2820 gaaatgcatg agacatgaaa gtgtattatt atataaagca aaacaaatgg gcctaagcca 2880 cataggaatg caagtagtgc caccattaaa ggtgtccgaa gcaaaaggac ataatgccat 2940 tgaaatgcaa atgcatttag aatcattatt aaggactgag tatagtatgg aaccgtggac 3000 attacaagaa acaagttatg aaatgtggca aacaccacct aaacgctgtt ttaaaaaacg 3060 gggcaaaact gtagaagtta aatttgatgg ctgtgcaaac aatacaatgg attatgtggt 3120 atggacagat gtgtatgtgc aggacaatga cacctgggta aaggtgcata gtatggtaga 3180 tgctaagggt atatattaca catgtggaca atttaaaaca tattatgtaa actttgtaaa 3240 agaggcagaa aagtatggga gcaccaaaca ttgggaagta tgttatggca gcacagttat 3300 atgttctcct gcatctgtat ctagcactac acaagaagta tccattcctg aatctactac 3360 atacaccccc gcacagacct ccacccttgt gtcctcaagc accaaggaag acgcagtgca 3420 aacgccgcct aggaaacgag cacgaggagt ccaacagtcc ccttgcaacg ccttgtgtgt 3480 ggcccacatt ggacccgtgg acagtggaaa ccacaacctc atcactaaca atcacgacca 3540 gcaccaaaga cggaacaaca gtaacagttc agctacgcct atagtgcaat ttcaaggtga 3600 atccaattgt ttaaagtgtt ttagatatag gctaaatgac agacacagac atttatttga 3660 tttaatatca tcaacgtggc actgggcctc ctcaaaggca ccacataaac atgccattgt 3720 aactgtaaca tatgatagtg aggaacaaag gcaacagttt ttagatgttg taaaaatacc 3780 ccctaccatt agccacaaac tgggatttat gtcactgcac ctattgtaat ttgtatatat 3840 gtaaatgtgt aaatatatgg tattggtgta atacaactgt acatgtatgg aagtggtgcc 3900 tgtacaaata gctgcaggaa caaccagcac attcatactg cctgttataa ttgcatttgt 3960 tgtatgtttt gttagcatca tacttattgt atggatatct gagtttattg tgtacacatc 4020 tgtgctagta ctaacactgc ttttatattt actattgtgg ctgctattaa caaccccctt 4080 gcaatttttc ctactaactc tacttgtgtg ttactgtccc gcattgtata tacactacta 4140 tattgttacc acacagcaat gatgctaaca tgtcaattta atgatggaga tacctggctg 4200 ggtttgtggt tgttatgtgc ctttattgta gggatgttgg ggttattatt gatgcactat 4260 agagctgtac aaggggataa acacaccaaa tgtaagaagt gtaacaaaca caactgtaat 4320 gatgattatg taactatgca ttatactact gatggtgatt atatatatat gaattagagt 4380 aaaccgtttt ttatatttgt aacagtgtat gctttgtata ccatggcaca tagtagggcc 4440 cgacgacgca agcgtgcgtc agctacacag ctatatcaaa catgtaaact cactggaaca 4500 tgccccccag atgtaattcc taaggtggag cacaacacca ttgcagatca aatattaaaa 4560 tggggaagtt tgggggtgtt ttttggaggg ttgggtatag gcacgggttc cggcactggg 4620 ggtcgtactg gctatgttcc cttacaaact tctgcaaaac cttctattac tagtgggcct 4680 atggctcgtc ctcctgtggt ggtggagcct gtggcccctt cggatccatc tattgtgtct 4740 ttaattgaag aatcggcaat cattaacgca ggggcgcctg aaattgtgcc ccctgcacac 4800 ggtgggttta caattacatc ctctgaaaca actacccctg caatattgga tgtatcagtt 4860 actagtcaca ctactactag tatatttaga aatcctgtct ttacagaacc ttctgtaaca 4920 caaccccaac cacccgtgga ggctaatgga catatattaa tttctgcacc cactgtaacg 4980 tcacacccta tagaggaaat tcctttagat acttttgtgg tatcatctag tgatagcggt 5040 cctacatcca gtacccctgt tcctggtact gcacctcggc ctcgtgtggg cctatatagt 5100 cgtgcattgc accaggtgca ggttacagac cctgcatttc tttccactcc tcaacgctta 5160 attacatatg ataaccctgt atatgaaggg gaggatgtta gtgtacaatt tagtcatgat 5220 tctatacaca atgcacctga tgaggctttt atggacataa ttcgtttgca cagacctgcc 5280 attgcgtccc gacgtggcct tgtgcggtac agtcgcattg gacaacgggg gtctatgcac 5340 actcgcagcg gaaagcacat aggggcccgc attcattatt tttatgatat ttcacctatt 5400 gcacaggctg cagaagaaat agaaatgcac cctcttgtgg ctgcacagga tgatacattt 5460 gatatttatg ctgaatcttt tgaacctggc attaacccta cccaacaccc tgttacaaat 5520 atatcagata catatttaac ttccacacct aatacagtta cacaaccgtg gggtaacacc 5580 acagttccat tgtcacttcc taatgacctg tttttacaat ctggccctga tataactttt 5640 cctactgcac ctatgggaac accctttagt cctgtaactc ctgctttacc tacaggccct 5700 gttttcatta caggttctgg attttatttg catcctgcat ggtattttgc acgtaaacgc 5760 cgtaaacgta ttcccttatt tttttcagat gtggcggcct agcgacagca cagtatatgt 5820 gcctcctcct aaccctgtat ccaaagttgt tgccacggat gcttatgtta ctcgcaccaa 5880 catattttat catgccagca gttctagact tcttgcagtg ggacatcctt atttttccat 5940 aaaacgggct aacaaaactg ttgtgccaaa ggtgtcagga tatcaataca gggtatttaa 6000 ggtggtgtta ccagatccta acaaatttgc attgcctgac tcgtctcttt tcgatcccac 6060 aacacaacgt ttagtatggg catgcacagg cctagaggtg ggcaggggac agccattagg 6120 tgtgggtgta agtggacatc ctttcctaaa taaatatgat gatgttgaaa attcagggag 6180 tggtggtaac cctggacagg ataacagggt taatgtaggt atggattata aacaaacaca 6240 attatgcatg gttggatgtg cccccccttt gggcgagcat tggggtaaag gtaaacagtg 6300 tactaataca cctgtacagg ctggtgactg cccgccctta gaacttatta ccagtgttat 6360 acaggatggc gatatggttg acacaggctt tggtgctatg aattttgctg atttgcagac 6420 caataaatca gatgttccta ttgacatatg tggcactaca tgtaaatatc cagattattt 6480 acaaatggct gcagacccat atggtgatag attatttttt tttctacgga aggaacaaat 6540 gtttgccaga cattttttta acagggctgg cgaggtgggg gaacctgtgc ctgatacact 6600 tataattaag ggtagtggaa atcgcacgtc tgtagggagt agtatatatg ttaacacccc 6660 gagcggctct ttggtgtcct ctgaggcaca attgtttaat aagccatatt ggctacaaaa 6720 agcccaggga cataacaatg gtatttgttg gggtaatcaa ctgtttgtta ctgtggtaga 6780 taccacacgc agtaccaaca tgacattatg tgcatccgta actacatctt ccacatacac 6840 caattctgat tataaagagt acatgcgtca tgtggaagag tatgatttac aatttatttt 6900 tcaattatgt agcattacat tgtctgctga agtaatggcc tatattcaca caatgaatcc 6960 ctctgttttg gaagactgga actttgggtt atcgcctccc ccaaatggta cattagaaga 7020 tacctatagg tatgtgcagt cacaggccat tacctgtcaa aagcccactc ctgaaaagga 7080 aaagccagat ccctataaga accttagttt ttgggaggtt aatttaaaag aaaagttttc 7140 tagtgaattg gatcagtatc ctttgggacg caagtttttg ttacaaagtg gatatagggg 7200 acggtcctct attcgtacag gtgttaagcg ccctgctgtt tccaaagcct ctgctgcccc 7260 taaacgtaag cgcgccaaaa ctaaaaggta atatatgtgt atatgtactg ttatatatat 7320 gtgtgtatgt actgttatgt atatgtgtgt gtgtgttctg tgtgtaatgt aagttatttg 7380 tgtaatgtgt atgtgtgttt atgtgcaata aacaattacc tcttgttaca ccctgtgact 7440 cagtggctgt tgcacgcgtt ttggtttgca cgcgccttac acacataagt aatatacatg 7500 cacaatatat atatttttgt ttaaaatact atacttttat atttgcaacc gttttcggtt 7560 gcccttagca tacactttcc accaatttgt tacaacgtgt ttcctcttaa tcctatatat 7620 tttgtgccag gtacacattg ccctgccaag ttgcttgcca agtgcatcat atcctgccaa 7680 ccacacacct ggcgccaggg tgcggtattg ccttactcat aaacctgtct ttgtgttata 7740 cttttatgca ctgtagccaa ctcttaaaag catttttggc ttgtagcagc acattttttt 7800 gctcttactg tttggtatac aataacataa aaatgagtaa cctaaggtca cacacctgcg 7860 accggtttcg gttatccaca ccctacatat ttccttctta ta 7902 //