ID HPV35 STANDARD; ds-DNA; VRL; 7851 bp. XX DE Human papillomavirus type 35 (HPV35), complete genome. XX AC M74117 XX DT 30-JAN-1992 XX OS Human papillomavirus type 35 DNA recovered from a cervical OS adenocarcinoma. OC Human papillomavirus type 35 OC Viridae; ds-DNA nonenveloped viruses; Papovaviridae; OC Papillomavirus. XX RN [1] RP 1-7851 XX RA Lorincz,A.T., Quinn,A.P., Lancaster,W.D. and Temple,G.F.; RT "A New Type of Papillomavirus Associated with Cancer of the Uterine RT Cervix;" RL Virology 159, 187-190 (1991). XX XX In developing countries, cancer of the cervix is responsible for XX 24% of all cancers in women. In these areas, it is the most XX frequent female malignancy. In developed countries, it ranks XX behind cancers of the breast, lung, uterus, and ovaries and XX accounts for 7% of all female cancers. HPV-35 is most often found XX in lesions of the genital mucosa which may have a risk for XX malignant progression. Prevalence studies indicate that HPV-35 was XX present in 2/158 (1%) of anogenital intraepithelial neoplasia and XX in 3/69 (4%) of anogenital cancers. Thus HPV-35 is a XX low-prevalence HPV associated with anogenital intraepithelial XX neoplasia and cancer. XX XX The 7851 base pair genome of HPV-35 was first recovered from a XX cervical adenocarcinoma. The numbering of the sequence was XX determined by homology to HPV-31. Both the E6 and E7 ORFs of XX HPV-35 exhibit the conserved zinc finger protein motifs seen in XX other HPV E6 and E7 proteins. The E6 ORF also exhibits putative XX splice donor and acceptor sites similar to those seen in other XX oncogenic types (16, 18, 31, and 33). The E7 protein shows XX sequence homology to the adenovirus 5 E1A protein binding site for XX the retinoblastoma tumor suppressor gene product seen in type 16. XX XX The long control region (LCR) of HPV-35 contains features which are XX either conserved among all or many of the papillomaviruses or are XX conserved among just those associated with anogenital lesions. A XX feature which appears to be common among the mucosal types is the XX glucocorticoid response element. These elements have been shown to XX mediate the hormonal response in the presence of glucocorticoids. XX Besides the presence in HPV-35, potential GREs have been identified XX in types 6, 11, 16, 18, 31, 33, and 39. The author notes that XX their irregular distribution within the LCR raises questions XX concerning the premise that they play a role in the life cycle of XX the mucosal HPVs. In the LCR of many types, sequences homologous XX to binding sites for transcription factors, activator protein 1 XX (AP-1), and nuclear factor 1 (NF-1) have been identified. In XX HPV-35, two putative AP-1 regions, and five putative NF-1 regions XX are present. Also found in the LCR of many HPV types are tandem XX direct repeats, direct repeats, and inverted repeats. Like those XX other HPVs, HPV-35 contains tandem direct repeats of 8 bp, direct XX repeats of 11 bp, and inverted repeats of 7 and 8 bp. The XX CK-octamer motif has been identified in HPV type 6, 11, 16, 18, and XX 33. This element is present in HPV-35 and may explain the tissue XX specificity of HPV infection. Three putative E2 binding sites XX with the consensus ACC(Nx6)GGT have been identified in HPV-35. The XX author contends that these sites may not be involved with E2 XX binding as only 7/10 mucosal types have sequences with this XX consensus and that extracts of E2-transfected COS-1 cells did not XX protect this site in HPV 18. XX XX Of particular interest is the author's identification of a 20 bp XX sequence which is conserved between the oncogenic mucosal types XX (16, 18, 31, 33, 35, 39, and 51) and has not been found in other XX nononcogenic types (1a, 2a, 5, 6, 8, 9, 11, 17, 19, 20, 25, 36, 47 XX or 57). This 20 bp region is located approximately 30 bp 5' to the XX keratinocyte-specific octamer. XX XX FT KEY Location/Qualifiers FT 5'UTR join(7092..7851,1..109) FT /function="regulatory region" FT /standard_name="LCR" FT protein_bind 17..22 FT /bound_moiety="Sp-1" FT CAAT_signal complement(18..26) FT protein_bind 24..35 FT /function="gene transcription" FT /bound_moiety="E2" FT /note="putative" FT protein_bind 39..50 FT /function="gene transcription" FT /bound_moiety="E2" FT /note="putative" FT TATA_signal 54..59 FT /note="putative" FT CDS 110..559 FT /note="putative" FT /note="E6 ORF from bp 59 to 559" FT /product="transforming protein" FT /gene="E6" FT /note="putative" FT /codon_start=1 FT misc_feature 230..238 FT /standard_name="Splice donor" FT misc_feature 405..416 FT /standard_name="Splice acceptor" FT CDS 562..861 FT /note="putative" FT /product="transforming protein" FT /gene="E7" FT /note="putative" FT /codon_start=1 FT CDS 868..2760 FT /note="putative" FT /product="replication protein" FT /gene="E1" FT /note="putative" FT /codon_start=1 FT CDS 2693..3796 FT /function="regulation of gene expression" FT /note="putative" FT /product="regulatory protein" FT /gene="E2" FT /note="putative" FT /codon_start=1 FT CDS <3273..3563 FT /note="putative" FT /gene="E4" FT /note="putative" FT /codon_start=1 FT /partial FT CDS 3793..4038 FT /note="putative" FT /gene="E5" FT /note="putative" FT /codon_start=1 FT polyA_signal 4159..4164 FT /note="putative" FT CDS 4184..5593 FT /note="putative" FT /product="minor capsid protein" FT /gene="L2" FT /note="putative" FT /codon_start=1 FT CDS 5574..7091 FT /note="putative" FT /product="major capsid protein" FT /gene="L1" FT /note="putative" FT /codon_start=1 FT repeat_region 7090..7105 FT /rpt_unit=7090..7097, 7098..7105 FT repeat_region 7123..7160 FT /rpt_unit=7123..7133, 7150..7160 FT protein_bind 7415..7426 FT /function="gene transcription" FT /bound_moiety="E2" FT /note="putative" FT protein_bind 7477..7491 FT /bound_moiety="hormone receptor" FT /standard_name="glucocorticoid responsive element" FT protein_bind 7514..7519 FT /bound_moiety="NF-1" FT protein_bind 7527..7532 FT /bound_moiety="NF-1" FT protein_bind 7534..7539 FT /bound_moiety="NF-1" FT enhancer complement(7670..7677) FT /standard_name="keratinocyte specific enhancer FT (CK-octomer)" FT /note="putative" FT protein_bind 7671..7676 FT /bound_moiety="NF-1" FT protein_bind 7695..7700 FT /bound_moiety="NF-1" FT source 1..7851 FT /organism="Human papillomavirus type 35" FT /proviral FT /sequenced_mol="DNA" FT /tissue_type="cervical carcinoma" XX SQ SEQUENCE 7851 bp; 2553 a; 1343 c; 1568 g; 2387 t; ccctataaaa aaaacaggga gtgaccgaaa acggtcgtac cgaaaacggt tgccataaaa 60 gcagaagtgc acaaaaaagc agaagtggac agacattgta aggtgcggta tgtttcagga 120 cccagctgaa cgaccttaca aactgcatga tttgtgcaac gaggtagaag aaagcatcca 180 tgaaatttgt ttgaattgtg tatactgcaa acaagaatta cagcggagtg aggtatatga 240 ctttgcatgc tatgatttgt gtatagtata tagagaaggc cagccatatg gagtatgcat 300 gaaatgttta aaattttatt caaaaataag tgaatataga tggtatagat atagtgtgta 360 tggagaaacg ttagaaaaac aatgcaacaa acagttatgt catttattaa ttaggtgtat 420 tacatgtcaa aaaccgctgt gtccagttga aaagcaaaga catttagaag aaaaaaaacg 480 attccataac atcggtggac ggtggacagg tcggtgtatg tcctgttgga aaccaacacg 540 tagagaaacc gaggtgtaat catgcatgga gaaataacta cattgcaaga ctatgtttta 600 gatttggaac ccgaggcaac tgacctatac tgttatgagc aattgtgtga cagctcagag 660 gaggaggaag atactattga cggtccagct ggacaagcaa aaccagacac ctccaattat 720 aatattgtaa cgtcctgttg taaatgtgag gcgacactac gtctgtgtgt acagagcaca 780 cacattgaca tacgtaaatt ggaagattta ttaatgggca catttggaat agtgtgcccc 840 ggctgttcac agagagcata atctacaatg gctgatcctg caggtacaga tgaaggggag 900 gggacgggat gtaatggatg gttttttgta gaagcagtag ttagtagacg tacgggatcc 960 agtgtagagg acgaaaatga agatgactgt gacagggggg aggatatggt ggactttata 1020 aatgatacag atatattaaa catacaggca gaaacagaga cagcacaagc attatttcat 1080 gcacaggagg agcaaacaca caaagaggct gtacaggtcc taaaacgaaa gtatgctagt 1140 agtccactta gcagcgtgag cttatgtgtt aataataaca taagtccacg tttaaaagct 1200 atttgcattg aaaataaaaa tacagcagca aagcgacgat tatttgaact accagacagc 1260 ggttatggca attctgaagt ggaaatacac gagatacaac aggtagaggg gcatgataca 1320 gttgaacaat gtagtatggg cagtggggat agtataacct ctagtagcga tgaaagacat 1380 gatgagactc caacgcgaga cataatacaa atactaaaat gtagtaatgc aaacgcagct 1440 atgttggcta aatttaaaga actatttggt attagtttta cagaacttat tagaccattt 1500 aagagtgata aatccacatg tacagattgg tgtgtggccg catttggaat agccccaagt 1560 gtggcgaact ttaaacatat aacatatgta tacatataca atgtttatcg tgttcatggg 1620 gctatggtaa ttctagcatt attacgattt aaagtcgaaa aacgagaaca acaattgaaa 1680 actattgatg ctaaattgct atgtatttca gctgcaagta tgctaataca accaccaaaa 1740 ttacgtagta ccccagctgc gttatattgg tttaaaacag caatgtcaaa tattagtgag 1800 gttgatggag aaacaccaga atggattcaa agacaaacag tattacagca tagttttaat 1860 gatgcaatat ttgacctatc tgaaatggta caatgggcat atgacaatga ttttatagat 1920 gatagtgata tagcatataa atatgcacaa ttggcagaaa ctaatagtaa tgcatgtgct 1980 tttttaaaaa gtaattcgca agctaaaatt gtaaaagatt gtgcaacaat gtgtagacat 2040 tataaacgag ctgaaaaaag agaaatgaca atgtcacagt ggattaaaag gcgatgtgca 2100 caggtggacg atgacggtga ctggagggac atagtacgat ttttaagata tcaacaagta 2160 gattttgtgg catttttatc tgcactaaaa aattttttac atggtgtgcc taaaaaaaat 2220 tgcatactaa tatatggagc accaaacaca ggtaaatcat tatttggaat gagtctaatg 2280 catttcttac aaggagctat tatatcctat gtaaattcta aaagccattt ttggttgcag 2340 ccattatatg atgccaaaat agctatgtta gatgatgcta catcgccatg tggcatatat 2400 agaccaatat ttaagaaatg cactagatgg aaatcctata tttcatttag atgtaaagca 2460 ttaagcatag tgcatataat gcccaccttt acttattaca tcaatataaa tgcaggcaaa 2520 gatgacaggt ggccatactt acatagcagg gtagtggtct ttacatttca caatgaattc 2580 ccatttgata aaaatggaaa cccagagtat gggcttaatg ataaaaactg gaaatccttt 2640 ttctcaagga cgtggtgcag attaaatttg cacgaggaag aggtcaaaga aaatgatgga 2700 gacgctttcc cagcgtttaa gtgtgtgtca ggacaaaata ctagaacatt acgagactga 2760 tagcacatgt ttgtctgatc acatacagta ttggaaactg attcgtcttg aatgtgcagt 2820 attttataaa gcaagagaaa tgggaattaa aactcttaac caccaagtgg ttccaacgca 2880 ggccatttca aaagccaaag caatgcaagc aattgaactg caattaatgt tagagacatt 2940 aaatacaact gagtatagca cagaggactg gacactgcaa gaaacaagta ttgaactata 3000 tacaacagtt cctacaagat gtttaaaaaa agatgtttat actgtggaag cacaatttga 3060 tggtgataaa caaaatacta tgcattatac taattggaca catatatata tattagagga 3120 cagtatatgt actgttgtaa agggactggt aaattataaa ggtatttatt atgtgcatca 3180 gggtgtagaa acatattatg ttacttttag ggaagaggct aaaaagtatg gaaaaaaaaa 3240 tatatgggaa gtgcatgtgg gtggtcaggt aattgtttgt cctgaatctg tatttagcag 3300 cacagaacta tccactgctg aaattgctac acagctacac gcctacaaca ccaccgagac 3360 ccataccaaa gcctgctccg tgggcaccac agaaacccag aagacaaatc acaaacgact 3420 tcgagggggt accgagctcc cctacaaccc caccaagcga gtgcgactca gtgccgtgga 3480 cagtgttgac agaggggtct actctacatc tgactgcaca aacaaagacc ggtgtggtag 3540 ttgtagtaca actacaccta tagtacattt aaaaggtgat gcaaatacat taaagtgttc 3600 aagatataga ttgggtaaat ataaagcatt gtatcaagat gcttcatcta catggagatg 3660 gacatgtaca aacgataaaa aacaaatagc aattgtaaca ttaacttaca caacagaata 3720 tcaaagggat aaatttttaa ctacagtaaa aatacctaac acagttacag tgtctaaagg 3780 atatatgtct atatgataga ccttacagct tccagtactg tgttgctgtg ctttttgttg 3840 tgcttttgtg tgcttttgtg cttgtgtctg cttgtacgtt cgctattgct atctgtgtca 3900 ttatactcag cattaatatt actggtttta atactgtggg ttactgtagc aacaccacta 3960 cttgcttttg ttgtttcttg cttttgtata tacctatgga tgattaacgc tcatgcacaa 4020 tatttggcag tacagtaatt gtatacaaac attgtgtttg gtactgtgta acatgtgtgt 4080 atggtggttt tattttttgt tgttcattgt atattttgtt tttttactgt ttttaaacat 4140 ttttatttct gtgtttttaa taaattgatc acatggtata accatgcgac acaaaaggtc 4200 tacaaaacgt gttaaacgtg catctgcaac acaactatat cgtacttgca aagctgcagg 4260 aacttgtcca ccagatgtta tacctaaggt tgagggtaat actgttgctg atcaaatttt 4320 aaaatatggc agcatggctg tgttttttgg ggggttagga attggttctg gatctggcac 4380 aggtggaaga tctggatatg ttccactggg tacaacacct ccaacggctg ccacaaacat 4440 tcctatacga ccccctgtaa ctgtggaaag tataccatta gacacaattg gccctttaga 4500 ttcttctata gtgtcattag tagaggaaac tagttttatt gagtctggtg cccctgttgt 4560 tacaccaagg gtcccaccta caacaggttt tacaataacc acatctacag ataccacacc 4620 tgctatttta gatgtgacat ccataagtac acatgataat cctactttca ctgatccttc 4680 tgttttacac ccacccacgc ctgcagaaac ttcaggtcat tttgtacttt catcatcttc 4740 tattagtaca cataattatg aagaaatccc tatggatact tttattgttt ccacagacag 4800 caataatata actaatagca cgcctattcc agggtctcgc cctacgacac gcctaggatt 4860 atatagtaaa ggtacccagc aggttaaggt tgttgaccct gcctttatga cttctcctgc 4920 aaaacttatt acatatgata atcctgcata tgaaggcctt aaccctgata caaccttaca 4980 atttgagcat gaggatatta gcttagctcc ggatcctgac tttatggaca ttatagcttt 5040 acataggcct gcactaacat ctaggaaagg cactattaga tatagtagag taggtaataa 5100 acgtactatg catacacgaa gtggaaaagc tataggggca cgggtacatt attatcagga 5160 tttaagtagt attactgaag atatagaatt acaaccctta caacatgtac catcctcttt 5220 accacatacc actgtttcaa catcattaaa tgatggtatg tttgatattt atgctcctat 5280 agatactgag gaagatatta tattttcagc atcttctaac aatactttat atactacatc 5340 taacactgca tatgttccta gcaatactac tataccatta agtagtggct atgatattcc 5400 tataacagca gggccagaca ttgtatttaa ctctaatact attactaact ctgtactacc 5460 ggtacccaca ggtcctatat attctattat tgcagatggg ggtgactttt atttacaccc 5520 tagttattat ttattaaaac gacgtcgtaa agctatccca tatttttttg cagatgtctc 5580 tgtggcggtc taacgaagcc actgtctacc tgcctccagt gtcagtgtct aaggttgtta 5640 gcactgatga atatgtaaca cgcacaaaca tctactatca tgcaggcagt tctaggctat 5700 tagctgtggg tcacccatac tatgctatta aaaaacaaga ttctaataaa atagcagtac 5760 ccaaggtatc tggtttgcaa tacagagtat ttagagtaaa attaccagat cctaataagt 5820 ttggatttcc agacacatca ttttatgatc cctgcctcca gcgtttggtt tgggcctgta 5880 caggagttga agtaggtcgt ggtcagccat taggagtagg tattagtggt catcctttat 5940 taaataaatt ggatgatact gaaaatctta ataaatatgt tggtaactct ggtaactctg 6000 gtacagataa cagggaatgc atttctatgg attataaaca aacacaattg tgtttaatag 6060 gttgtaggcc tcctataggt gaacattggg gaaaaggcac accttgtaat gctaaccagg 6120 taaaagcagg agaatgtcct cctttggagt tactaaacac tgtactacaa gacggggaca 6180 tggtagacac aggatttggt gcaatggatt ttactacatt acaagctaat aaaagtgatg 6240 ttcccctaga tatatgcagt tccatttgca aatatcctga ttatctaaaa atggtttctg 6300 agccatatgg agatatgtta tttttttatt tacgtaggga gcaaatgttt gttagacatt 6360 tatttaatag ggctggaact gtaggtgaaa cagtacctgc agacctatat attaagggta 6420 ccactggcac attgcctagt actagttatt ttcctactcc tagtggctct atggtaacct 6480 ccgatgcaca aatatttaat aaaccatatt ggttgcaacg tgcacaaggc cataataatg 6540 gtatttgttg gagtaaccaa ttgtttgtta ctgtagttga tacaacccgt agtacaaata 6600 tgtctgtgtg ttctgctgtg tcttctagtg acagtacata taaaaatgac aattttaagg 6660 aatatttaag gcatggtgaa gaatatgatt tacagtttat ttttcagtta tgtaaaataa 6720 cactaacagc agatgttatg acatatattc atagtatgaa cccgtccatt ttagaggatt 6780 ggaattttgg ccttacacca ccgccttctg gtaccttaga ggacacatat cgctatgtaa 6840 catcacaggc tgtaacttgt caaaaaccca gtgcaccaaa acctaaagat gatccattaa 6900 aaaattatac tttttgggag gttgatttaa aggaaaagtt ttctgcagac ttagatcagt 6960 ttccgttggg ccgtaaattt ttgttacaag caggactaaa ggccaggcct aattttagat 7020 taggcaggcg tgcagctcca gcatctacat ctaaaaaatc ttctactaaa cgtagaaaag 7080 taaaaagtta atgtgtaaat gtgtatgcat gtatactgtg tgttatgtgt tgtagtgctt 7140 gtatatatat tatgtgttgt ggtgcctgtt tgtgttgtac atggcgtgta aatgtgtgta 7200 taatattgtg caatgtgttg tacgtgggtg tttttgtact tagtgtgtag tagttcagta 7260 gccataaagt gatgtgtgtg tttataatta acactgtatt gttgtatgac tatggtgcac 7320 cgatatgagc ttacataatt acatgacagc tatattgtgt atataaataa tctacctcca 7380 ttttgtgtgt tagtgtcctt tacattacct ttcaaccgat ttcggttgct gttggtaagc 7440 tttatatgtt ttttacaaaa acattcctac ctcagcagaa cacttaatcc ttgtgttcct 7500 gatatatatt gtttgccaac tttatattgg cttttgccaa tctttaaact tgattcatct 7560 tgcagtatta gtcatttttc atacttgtgg tccacccaca cttgtaacac ttgtaacagt 7620 gcttttaggc acatattttt tgcatttcta aagggcttta attgcacacc ttggctttac 7680 atattatgtg tgtttgccaa caccacccta cacatcctgc caactttaag ttaaaacatg 7740 catgtaaaac attactcact gtattacaca ttgttatatg cacacaggtg tgtccaaccg 7800 atttggatta cagttttata agcatttctt tttattatag ttagtaacaa t 7851 //