LOCUS HPV14d 7439 bp ds-DNA VRL 04-OCT-1993 DEFINITION Human papillomavirus type 14D (HPV-14), complete genome. ACCESSION X74467 SOURCE Human papillomavirus type 14D DNA. ORGANISM Human papillomavirus type 14D Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 7439) AUTHORS Egawa,K., Delius,H., Matsukura,T., Kawashima,M. and De Villiers,E.M. TITLE Two novel types of human papillomavirus, HPV63 and HPV65 comparisons of their distinct clinical and histological features and their DNA sequences to other HPV types JOURNAL Virology 194, 789-799 (1993) STANDARD full staff_review COMMENT Submitted (27-JAN-1993) on tape to the EMBL Data Library by: H. Delius, Deutsches Krebsforschungszentrum, Abt ATV, Im Neuenheimer Feld 506, W -6900 Heidelberg, FRG. Clone = insert in HindIII site of pBR322. HPV-14 is most often isolated from the flat warts of Epidermodysplasia Verruciformis (EV). It has been isolated from both benign lesions and squamous cell carcinomas. EV is a rare genetic disease which is thought to be autosomal recessive, although a few single cases of X-linked inheritance have been reported. This genetic defect results in a patient who is immunotolerant. These patients are then infected by HPV types which are rarely seen in the general population and which often persist their entire life. This infection is characterized by either flat warts or pigmented (red or reddish-brown) macular plaques. Most often these warts are benign but occasionally they progress to malignancy. Ultraviolet light may be a cofactor in this progression. Benign lesions are associated with diverse EV HPVs, while in about 90% of cases cancers harbor HPV-5 or HPV-8. Studies indicate that carcinoma develops in approximately one-third of all EV patients. It is also interesting to note that other immunocompromised patients, such as renal allograft recipients and HIV positive patients, suffer from infection of EV HPV types and these infections have been known to progress to malignancy. Due to an apparent deletion of approximately 300 bp, the E6 and E7 ORFs of HPV-14d seem to be disrupted. Based on its homology to other HPV types, the ORF appearing in the sequence from bp 82 to 744 is similar at its 5' end to the beginning of E6, and at its 3' end to the end of E7. On this basis, we have chosen to assign bp 196-627 to E6 and bp 628-744 to E7. FEATURES Location/Qualifiers CDS 196..627 /note="putative" /note="ORF E6 from bp 82 to 627" /product="transforming protein" /gene="E6" /note="putative" /codon_start=1 /translation="MATTDSSTDSADEGPSPKSNYCDSTETKSSFIEPPLPATIFGLA NLLEIPLDDCLVPCNFCGNFLTHLEVCEFDEKKLSLIWKGHCVFACCRVCCTATATYE FNEFYESTVEGREIESVTGKSIFDVDVRCYTCMKFLDSIEKL" CDS 628..744 /note="putative" /product="transforming protein" /gene="E7" /note="putative" /codon_start=1 /translation="RIFITATEFALRTFQNLLFEQLQLLCPECRGNCKHGGS" CDS 731..2548 /note="putative" /note="ORF E1 from bp 641 to 2548" /product="replication protein" /gene="E1" /note="putative" /codon_start=1 /translation="MADPKGSTSKDGLDDWCIVEAECSDIENDLEELFDRDTDSDISE LLDDNDDLDQGNSRELFHQQESKESEEHLQKLKRKYLSPQAIAQLSPRLESITLSPQQ KSKRRLFAEQDSGLELTLTNEAEDVSSEVEVPALDSQPVAEAQIGTVDIHYTELLRAS NNKAILMAKFKEAFGVGFNDLTRQFKSYKTCCNHWVLSVYAVHDDLLESSKKLLQQHC DYVWIRGIAAMSLFLLCFKVGKNRGTVHKLMTSMLNVHEKQILSEPPKLRNVAAALFW YKGAMGSGTFTYGPYPDWMAHQTIVGHQSTEANAFDMSVMVQWAFDNNYLDEADIAYQ YAKLAPEDSNAVAWLAHNNQARFVRECASMVRFYKKGQMKEMSMSEWIHTRITEVEGE GHWSTIAKFLRYQQVNFIMFLAALKDMLHSVPKRNCILIYGPPNTGKSAFTMSLIRVL RGRVLSFVNSKSQFWLQPMSECKIALIDDVTDPCWLYMDTYLRNGLDGHYVSLDCKHK APIQTKFPALLLTSNINVHNEITYRYLHSRIKGFEFPNPFPMKADNTPEFELTDQSWK SFFTRLWNQLELSDQEDEGDNGESQRPFQCSARSANEHL" CDS 2490..3941 /note="putative" /note="ORF E2 from bp 2466 to 3941" /product="regulatory protein" /gene="E2" /note="putative" /codon_start=1 /translation="MENLSDRFNALQDQLMNIYETAANTLESQIEHWQTLRKEAVLLY FARQNGVTRLGYQVVPTLAISEAKAKQAIGMVLQLQSLQKSQFGSEPWSLVDTSGETF RSAPENHFKKGPVSVEVIYDNDKDNANAYTMWKHIYYQDDDEQWHKSASGVNHTGIYY MQGTFRNYYVLFADDATRYSKTGHWEVKVNKETVFTPVTSSTPPESPGGQADSNTSSK TPTTATDSTSRLSPADSRKQSQQANTKGRRYGRRPSSRTRRTTETRQRRRSRSKSRSR SRSRSRLRSRSRSQSSERRSRYRSRSRSRQKEVSRITTTTRGRGRGSSSTSSKRSQRA RGRGRGGSRGRRSSSTSPTSSKRSRRESESSRQRGISPSDVGKSLQSVSSRNTGRLGR LLDEALDPPVILVRGDPNTLRCFRNRAKQKFTGLYRAFSTAWSWVAGDGTERLGRSRM LISFFSFNQRRDFDQTVKYPKGVDRSFGSFDSL" CDS 2851..3696 /note="putative" /note="ORF E4 from bp 2848 to 3696" /gene="E4" /note="putative" /codon_start=1 /translation="MITIKTMQMLILCGSTYITRMMTNSGIKVQAGSTTQAYIICKEP LETTMFCLLMMQLDIVKLDIGKLKLIRKLCLLLSPAPPLPSHQEDKQTQTPPPRPPPP PLTPRPDSRPQIPENSHNKPTPKEEGTDADRPVGPGERPKRGRGGDRGPSPGRGRGRG LGSDLDPGRNRLSGGLGTDQDPDPDKKKCPESQPPPEGEVEGHPPPPPNGHNGHEEGA VGGAGGDGHPPPPPPPPNGHDESQSLLGSVASLLVTWESHFNQLVQEIQEDLEGYWTK LSIPQ" CDS 4028..5587 /note="putative" /note="ORF L2 from bp 3959 to 5587" /product="minor capsid protein" /gene="L2" /note="putative" /codon_start=1 /translation="MARARRVKRDSATNIYRTCKQAGTCPPDVINKVESTTIADKILQ YGSAGVFFGGLGISTGKGTGGTTGYVPLGEGPAVRVGGAPTIIRPALVPDTIGPSDII PVDTLDPVEPTTSSIVPLTDSTGPDLLPGEVETIAEVHPGPSRPPTDTPVTTSTGGSS AILEVAPEPTPPSRVRVTRTQYHNPSFQVITESTPTTGESSLADNILVTSGSGGQTIG GATPELIELQELPSRYSFEIEEPTPPRRTSTPLQRIQTAIRRRGGLTNRRLVQQVSVE NPLFLTRPSRLVQFQFDNPAFEEEVTQIFEQDIEDFNEPPDRDFLDVQRLGRPQYSET PAGYLRVSRLGQRRTIRTRSGAQIGSQVHFYRDLSSINTEDPIELQLLGQHSGDATIV QGPVESTFVDINVDENPLSEDFSAHSDDLLLDEANEDFSGSQLVVGNRRSTSSYTVPR FETTRSGSYYAQDTKGYYVAYPEDRDISMDIIYPTPELPVVIIHTYDTSGDFYLHPSL HKRLKRKRKYL" CDS 5603..7159 /note="putative" /note="ORF L1 from bp 5588 to 7159" /product="major capsid protein" /gene="L1" /note="putative" /codon_start=1 /translation="MAVWQAASGKVYLPPSTPVARVQSTDEYVQRTNIYYHAYSDRLL TVGHPYFNIYDVQSAKIKVPKVSGNQHRVFRLKLPDPNRFALADMSVYNPDKERLVWA CRGIEIGRGQPLGVGSVGHPLFNKVGDTENPNSYRQQANSTDDRQNVSFDPKQLQMFI IGCAPCMGEHWDRALPCVEDKPPPGSCPPIELKNTVIEDGDMADIGYGNLNFKALQEN RSDVSLDIVNEICKYPDFLKMQNDVYGDSCFFYARREQCYARHFFVRGGKTGDDIPAA QVDEGSLKNVYYIPPMTNQPQNNIGNAMYFPTVSGSLVSSDAQLFNRPFWLQRAQGHN NGICWFNQLFVTVVDNTRNTNFSISVSSENTEVSKIDNYTSQKFQEYLRHVEEYEMSL ILQLCKIPLTAEVLAQINAMNSNILEEWQLGFVPAPDNPIHDTYRYIESAATRCPDKN PPKEREDPYKNFNFWNVDLTERLSLDLDQYSLGRKFLFQAGLQQSTVNGTKTVSTRGS IKGIKRKRKN" source 1..7439 /organism="Human papillomavirus type 14d" /sequenced_mol="DNA" BASE COUNT 2337 a 1432 c 1612 g 2058 t ORIGIN 195 bp upstream from beginning of E6/E7 fused cds 1 aacggtaagt tattctgcac cgggtgcggt cactgtatta ctcactatgt ggttgttgtt 61 gccaactacc attgctgata gcatgttttt gcctgtaacg ttatcgacac atacatatct 121 atgtatatat atatatatat atatatatat atatatatat atatatacta cagaaaaaac 181 agagaatgca gactcatggc gacaactgac tcttcaacag acagtgcaga tgaaggtcct 241 tctcctaaga gtaactattg tgatagcaca gaaaccaaat cttcttttat agagccacca 301 ttacctgcaa ctatatttgg cttagcaaac ctattggaaa taccactaga tgattgttta 361 gtaccttgta acttttgtgg taattttttg actcatttag aagtctgtga atttgatgag 421 aaaaaactaa gtctaatttg gaaaggtcat tgtgtatttg cttgttgccg tgtatgctgc 481 acagcaacag caacgtatga gtttaatgaa ttttatgaga gtactgttga aggcagagaa 541 atagagagtg taacaggcaa atctattttt gatgttgatg tcaggtgcta tacctgcatg 601 aaatttttag attcaattga aaagcttcgc atctttataa ctgctacaga atttgctctt 661 agaaccttcc agaacctgtt atttgaacaa ctgcagctgt tgtgtcctga gtgccgtggg 721 aactgcaaac atggcggatc ctaaaggtag tacatctaaa gacgggttgg atgattggtg 781 tattgtggaa gctgaatgta gcgatataga aaatgatttg gaagaattat ttgacagaga 841 tacagactca gatatttcag aattattaga tgataatgat gacttggacc agggaaattc 901 tcgggaacta tttcatcaac aagagagtaa ggaaagcgag gagcacttgc aaaaactaaa 961 acgaaagtac ttgagtcctc aagctatcgc acagcttagt ccgcgacttg aaagtataac 1021 attgtcacct cagcagaagt ctaaacgaag gctctttgca gagcaggaca gcgggttgga 1081 gttaactctt acaaatgaag ctgaagatgt ttcttctgag gtggaggtac cggctctaga 1141 ctctcagccg gttgctgagg cacaaatagg aacagtagac attcattata cagaattatt 1201 acgtgccagc aacaataagg caattcttat ggcaaaattt aaggaggctt ttggggtagg 1261 ctttaatgat ttgacacgtc agtttaaaag ttacaaaacc tgctgtaatc attgggttct 1321 gtctgtatat gcagtgcatg atgatcttct tgaaagctca aagaagttat tgcaacagca 1381 ttgtgattat gtatggatac gtgggatagc tgctatgtca ttatttttat tgtgtttcaa 1441 agtgggaaaa aatcgtggga cagtacataa attaatgacc tcaatgttaa atgtgcatga 1501 aaagcaaata ttgtctgagc ctccaaagct acgaaatgtt gctgctgcat tgttctggta 1561 taaaggtgca atggggtcag ggacatttac ttatggtccc taccctgatt ggatggcaca 1621 tcaaactatt gttggccatc aaagtacaga agcaaatgca tttgatatgt ctgttatggt 1681 gcagtgggca tttgataaca attatttaga tgaagctgat atagcctatc aatatgctaa 1741 gttagcacca gaagatagta atgctgtggc ctggcttgcc cataataatc aggccaggtt 1801 tgttagagaa tgtgcatcta tggttagatt ttataaaaaa ggtcaaatga aagaaatgtc 1861 tatgtcagaa tggatacata ctagaataac tgaagtagaa ggagaaggtc attggtcaac 1921 aatagcaaaa tttcttagat atcaacaagt aaactttata atgtttttag ctgctttgaa 1981 agatatgcta cattcagttc ccaaacgtaa ttgtatatta atatatggtc ctccaaatac 2041 tgggaagtca gcatttacca tgtctttaat tcgtgtgtta agaggaaggg tgctttcatt 2101 tgttaattct aaaagccaat tttggctgca accaatgtca gagtgtaaaa tagctttaat 2161 tgatgatgtc acagatccat gttggttgta tatggacact tatttgagga atggccttga 2221 tggtcattat gtttctttag attgcaaaca taaagcaccg atacaaacta aatttcctgc 2281 actattactt acatctaata ttaatgtaca caatgaaata acgtatagat atttgcatag 2341 tagaattaag ggatttgaat ttccaaatcc atttccaatg aaagcagaca atacacctga 2401 atttgaactc actgaccaaa gctggaaatc tttctttaca aggctttgga atcaattaga 2461 gctgagtgac caagaagacg agggagacaa tggagaatct cagcgaccgt ttcaatgctc 2521 tgcaagatca gctaatgaac atttatgaga ctgcagcaaa cacacttgag tcgcaaattg 2581 agcattggca aactcttcga aaagaagctg tgctactata ttttgctagg caaaatggtg 2641 tgacacgact tggataccaa gttgtgccta cattagccat ttcagaagca aaagccaagc 2701 aggccatagg gatggtgctg cagttgcaat cactgcaaaa gtctcagttt ggcagtgaac 2761 catggtcact ggttgatacc agtggagaaa catttagaag tgctccagaa aatcatttca 2821 aaaagggtcc agtatcagta gaggtgattt atgataacga taaagacaat gcaaatgctt 2881 atactatgtg gaagcacata tattaccagg atgatgacga acagtggcat aaaagtgcaa 2941 gcggggtcaa ccacacaggc atatattata tgcaaggaac ctttagaaac tactatgttt 3001 tgtttgctga tgatgcaact agatatagta aaactggaca ttgggaagtt aaagttaata 3061 aggaaactgt gtttactcct gtcaccagct ccacccctcc cgagtcacca ggaggacaag 3121 cagactcaaa cacctcctcc aagaccccca ccaccgccac tgactccacg tccagactct 3181 cgcccgcaga ttccagaaaa cagtcacaac aagccaacac caaaggaaga aggtacggac 3241 gcagaccgtc cagtaggacc cggcgaacga ccgaaacgcg gcagaggcgg agatcgaggt 3301 ccaagtccag gtcgcggtcg aggtcgcggt ctcggctccg atctagatcc cggtcgcaat 3361 cgtctgagcg gcggtctcgg taccgatcaa gatccagatc cagacaaaaa gaagtgtcca 3421 gaatcacaac caccaccaga gggagaggtc gagggtcatc ctccacctcc tccaaacggt 3481 cacaacgggc acgaggaagg ggccgtgggg ggagcagggg gagacggtca tcctccacct 3541 cccccacctc ctccaaacgg tcacgacgag agtcagagtc ttctaggcag cgtggcatct 3601 ctcctagtga cgtgggaaag tcacttcaat cagttagttc aagaaataca ggaagacttg 3661 gaaggttact ggacgaagct ctcgatcccc cagtaatctt agtcaggggg gaccctaaca 3721 cgctacgatg ctttcgcaat agagctaagc aaaagtttac agggctttac agggccttta 3781 gcacggcttg gtcgtgggtg gctggagatg gcactgagcg tctaggcagg tccagaatgc 3841 tcattagctt tttctccttt aaccagagaa gagattttga tcagactgtt aagtacccga 3901 aaggagtgga ccggtcgttt ggctcatttg atagcctata acacccctaa catactaaca 3961 taatagcttg ctactaacat ctaacattta ttgcattttt gctttttgtt tgcattattt 4021 taatgctatg gcgcgtgcta ggcgagtcaa gcgtgactct gctactaaca tttacagaac 4081 ctgcaagcaa gcaggcacgt gtcctcctga tgtcattaat aaagttgaaa gcacaactat 4141 tgctgataaa attttgcagt atggtagtgc tggtgttttt tttgggggtt tgggcataag 4201 cactggaaaa ggtacaggag gtaccacagg ctatgtgcct ttgggagagg gcccagcagt 4261 acgtgttggt ggtgcgccaa caattatcag acctgctctg gtcccagaca ccattggtcc 4321 atcagatatt atacctgtgg acaccttaga tccagtggag cctacgacct cttctattgt 4381 tccactcacg gattccacag gaccagacct tttgcctggc gaggtggaaa ctattgcaga 4441 ggtgcatcct ggcccgtcta ggcctcctac tgacactcct gtcacaacta gtacaggagg 4501 ctccagtgct atattagaag tagcaccgga acctactccg ccctcacgtg ttagggtgac 4561 ccggacacaa tatcataatc cctcctttca agtaattacc gaatccaccc ctaccacagg 4621 tgaaagttca ttagcagaca atatattggt tacctctggt tctgggggac aaactattgg 4681 aggcgctaca cctgaactta tagaacttca agagttacca tctagatatt catttgaaat 4741 cgaagaacca acacccccta gaagaactag taccccatta caaaggatac agacagctat 4801 aagaaggagg ggtgggctta caaataggcg cttagtccaa caagtttctg tagaaaaccc 4861 cttattttta acaagaccat ctagactagt gcaatttcag tttgataatc cagcatttga 4921 ggaggaagta acacaaatat ttgaacaaga tattgaagat tttaatgagc ctccagacag 4981 agattttcta gatgttcaaa ggctgggtag gccccaatat tcagaaactc cagcagggta 5041 tctccgagtt agtcgtcttg ggcaaaggcg gactatacgc actcgttctg gagcacaaat 5101 tgggtctcaa gttcattttt atagagatct aagtagtata aacacagaag atcctattga 5161 gcttcaatta ttaggtcagc attctgggga tgctactatt gtccaaggtc cagttgaaag 5221 cacatttgta gacataaatg tagatgaaaa tccactttct gaggatttta gtgcacattc 5281 tgatgacttg cttttagatg aagctaatga agactttagt ggctctcaat tagttgtggg 5341 taatcgacgc tcaacatctt catataccgt ccctcgtttt gaaacaacca gatctgggtc 5401 atattatgca caggatacaa aaggttatta tgtagcttat cctgaggata gggacattag 5461 catggatatt atttatccta ccccagagtt gcctgttgtc attattcaca catatgatac 5521 aagtggtgat ttttacctgc atcctagtct tcacaaaaga ctcaaaagaa aacgaaaata 5581 tttgtaactt tttcttttac agatggcagt ttggcaagca gctagtggta aggtttacct 5641 tccaccatct acaccagttg ccagggtcca aagtacggac gaatatgtgc aaaggactaa 5701 catctattat catgcataca gtgacagatt attaactgtt ggtcatccat atttcaatat 5761 atatgacgtg caaagtgcta agataaaagt accaaaagta tctggaaatc aacatagggt 5821 tttcagacta aagttgccag accctaatcg atttgcatta gctgacatgt ctgtttataa 5881 tccagataaa gaaagactgg tttgggcatg cagaggtata gaaataggca gaggacaacc 5941 tttaggtgta ggtagtgtag gacatccatt atttaataag gttggtgata cagaaaatcc 6001 caactcatac aggcaacaag ctaactccac tgatgacaga caaaatgtgt catttgatcc 6061 taagcaactg caaatgttta taataggctg tgcaccttgc atgggggaac attgggatag 6121 ggccttgcca tgtgtagaag ataaaccacc ccctggttct tgccctccaa ttgaattaaa 6181 aaatacagtg attgaagatg gtgacatggc agatataggc tatggaaatt taaattttaa 6241 ggcattacaa gaaaatagat ctgatgtaag tttggatata gttaatgaaa tttgcaaata 6301 tccagacttt ctgaaaatgc aaaatgatgt atatggagat tcctgctttt tttatgcacg 6361 cagggaacaa tgttatgcca gacacttttt tgttagaggg ggtaagacag gagatgacat 6421 accagcagca caagttgatg agggtagcct aaagaatgtt tattacattc caccaatgac 6481 aaatcaacca caaaacaata ttggcaatgc catgtatttc ccaactgtca gtggctcatt 6541 ggtatccagt gatgctcaac tgttcaatag accattttgg ttacagcgcg cacaaggcca 6601 caataatggt atttgttggt ttaatcagtt atttgttact gttgtggaca acacacgtaa 6661 cacaaatttt agtatatcag ttagttcaga aaacactgag gtatccaaaa ttgacaatta 6721 tacctctcag aaatttcaag aatatttaag acatgtagaa gaatatgaaa tgtctctaat 6781 tttacaacta tgtaaaatac ctttaacagc tgaagtctta gctcaaatta atgcaatgaa 6841 ttctaatatt ttagaggagt ggcaattagg atttgtacct gcaccagaca atcctattca 6901 tgatacatac agatatattg agtctgcagc gactaggtgt cctgataaaa atcctcctaa 6961 agaaagagaa gatccttata aaaactttaa cttttggaat gtagatttaa cagagagact 7021 atctttagac ctagatcaat attctcttgg gagaaaattt ttatttcagg caggtttgca 7081 gcaatcgacc gttaacggta caaaaacagt ttcgactagg ggatccatca agggtattaa 7141 acgaaaacgc aagaattaga cattatcgat ttcggtgcaa taaagtcaac ttttacacag 7201 tattcaagga atgtttattc actctgacta agcaaatatg agccgcgccc gatacataaa 7261 ggtgccaaat gaggtgagtt gtttgccaga agaggtcaga gccaactcag gtttgcgcca 7321 gatcagatac agcgcgagcc gcgttggatc aagctacatc gtctgaacac gcaaaagact 7381 caaggaaatg taagtgtgcc agtctattgt gttcgaattt ggcaaagttg aagaccgtt