ID AII80173; SV 1; linear; genomic DNA; STD; VRL; 2127 BP. XX PA KM192302.1 XX DT 13-AUG-2014 (Rel. 121, Created) DT 13-AUG-2014 (Rel. 121, Last updated, Version 1) XX DE Human betaherpesvirus 5 (Human cytomegalovirus) capsid maturation protease XX KW . XX OS Human betaherpesvirus 5 (Human cytomegalovirus) OC Viruses; Herpesvirales; Herpesviridae; Betaherpesvirinae; Cytomegalovirus. XX RN [1] RA Hsu J.-L., van Boomen D.J.H., Tomasec P., Weekes M.P., Antrobus R., RA Stanton R.J., Ruckova E., Sugrue D., Wilkie G.S., Davison A.J., RA Wilkinson G.W.G., Lehner P.J.; RT "Plasma membrane profiling defines an expanded class of cell surface RT proteins selectively targeted for degradation by HCMV US2 in cooperation RT with UL141"; RL Unpublished. XX RN [2] RA Davison A.J.; RT ; RL Submitted (14-JUL-2014) to the INSDC. RL MRC-University of Glasgow Centre for Virus Research, 8 Church Street, RL Glasgow G11 5JR, UK XX DR MD5; 7e56c14ea434c347674af04823a9d3f2. XX FH Key Location/Qualifiers FH FT source 1..2127 FT /organism="Human betaherpesvirus 5" FT /host="Homo sapiens" FT /strain="Merlin" FT /isolate="RCMV1678" FT /mol_type="genomic DNA" FT /db_xref="taxon:10359" FT CDS KM192302.1:116450..118576 FT /codon_start=1 FT /gene="UL80" FT /product="capsid maturation protease" FT /EC_number="3.4.21.97" FT /note="serine protease (N-terminal region); minor scaffold FT protein (remainder of protein, clipped near C terminus); FT involved in capsid morphogenesis" FT /db_xref="GOA:F5HEA7" FT /db_xref="InterPro:IPR001847" FT /db_xref="InterPro:IPR035443" FT /db_xref="UniProtKB/TrEMBL:F5HEA7" FT /protein_id="AII80173.1" FT /translation="MTMDEQQSQAVAPVYVGGFLARYDQSPDEAELLLPRDVVEHWLHA FT QGQGQPSLSVALPLNINHDDTAVVGHVAAMQSVRDGLFCLGCVTSPRFLEIVRRASEKS FT ELVSRGPVSPLQPDKVVEFLSGSYAGLSLSSRRCDDVEAATSLSGSETTPFKHVALCSV FT GRRRGTLAVYGRDPEWVTQRFPDLTAADRDGLRAQWQRCGSTAVDASGDPFRSDSYGLL FT GNSVDALYIRERLPKLRYDKQLVGVTERESYVKASVSPEAACVIKAASAERSGDSRSQA FT ATPAAGARVPSSSPSPPVEPPSPVQPPALPASPSVLPAESPPSLSPSEPAEAASMSHPL FT SAAVPAATAPPGATVAGASPAVSSLAWPHDGVYLPKDAFFSLLGASRSAAPVMYPGAVA FT APPSASPAPLPLPSYPASYGAPVVGYDQLAARHFADYVDPHYPGWGRRYEPAPSLHPSY FT PVPPPPSPAYYRRRDSPGGMDEPPSGWERYDGGHRGQSQKQHRHGGSGGHNKRRKETAA FT ASSSSSDEDLSFPGEAEHGRARKRLKSHVNSDGGSGGHAGSNQQQQQRYDELRDAIHEL FT KRDLFAARQSSTLLSAALPSAASSSPTTTTVCTPTSELTSGGGETPTALLSGGAKVAER FT AQAGVVNASCRLATASGSEAATAGPSTAGSSSCPASVVLAAAAAQAAAASQSPPKDMVD FT LNRRIFVAALNKLE" XX SQ Sequence 2127 BP; 334 A; 707 C; 676 G; 410 T; 0 other; atgacgatgg acgagcagca gtcgcaggct gtggcgccgg tctacgtggg cggctttctc 60 gcccgctacg accagtctcc ggacgaggcc gaattgctgt tgccgcggga cgtagtggag 120 cactggttgc acgcgcaggg ccagggacag ccttcgttgt cggtcgcgct cccgctcaac 180 atcaaccacg acgacacggc cgttgtagga cacgttgcgg cgatgcagag cgtccgcgac 240 ggtctttttt gcctgggctg cgtcacttcg cccaggtttc tggagattgt acgccgcgct 300 tcggaaaagt ccgagctggt ttcgcgcggg cccgtcagtc cgctgcagcc agacaaggtg 360 gtggagtttc tcagcggcag ctacgccggc ctctcgctct ccagccggcg ctgcgacgac 420 gtggaggccg cgacgtcgct ttcgggctcg gaaaccacgc cgttcaaaca cgtggctttg 480 tgcagcgtgg gtcggcgtcg cggtacgttg gccgtgtacg ggcgcgatcc cgagtgggtc 540 acacagcggt ttccagacct cacggcggcc gaccgtgacg ggctacgtgc acagtggcag 600 cgctgcggca gcactgctgt cgacgcgtcg ggcgatccct ttcgctcaga cagctacggc 660 ctgttgggca acagcgtgga cgcgctctac atccgtgagc gactgcccaa gctgcgctac 720 gacaagcaac tagtcggcgt gacggagcgc gagtcgtacg tcaaggcgag cgtttcgcct 780 gaggcggcgt gcgttattaa agcggcgtcc gccgagcgtt cgggcgacag ccgcagtcag 840 gccgccacgc cggcggctgg ggcgcgcgtt ccctcttcgt ccccgtcgcc tccagtcgaa 900 ccgccatctc ctgtacagcc gcctgcgctt ccagcgtcgc cgtccgttct tcccgcggaa 960 tcaccgccgt cgctttctcc ctcggagccg gcagaggcgg cgtccatgtc gcaccctctg 1020 agtgctgcgg ttcccgccgc tacggctcct ccaggtgcta ccgtggcagg tgcgtcgccg 1080 gctgtgtcgt ctctagcgtg gcctcacgac ggagtttatt tacccaaaga cgcttttttc 1140 tcgctacttg gggccagtcg ctcggcagcg cccgtcatgt atcccggcgc cgtagcggcc 1200 cctccttctg cttcgccagc accgctgcct ttgccgtctt atcccgcgtc ctacggcgcc 1260 cccgtcgtgg gttacgacca gttggcggca cgtcactttg cggactacgt ggatccccat 1320 tatcccgggt ggggtcggcg ttacgagccc gcgccgtctt tgcatccgtc ttatcccgtg 1380 ccgccgccac catcaccggc ctattaccgt cggcgcgact ctccgggcgg tatggatgaa 1440 ccaccgtccg gatgggagcg ttacgacggt ggtcaccgtg gtcagtcgca gaagcagcac 1500 cgtcacgggg gcagcggcgg acacaacaaa cgccgtaagg aaaccgcggc ggcgtcgtcg 1560 tcgtcctcgg acgaagactt gagtttccca ggcgaggccg agcacggccg ggcacgaaag 1620 cgtctaaaaa gtcacgtcaa tagcgacggt ggaagtggcg ggcacgcggg ttccaatcag 1680 cagcagcaac aacgttacga tgaactgcgg gatgccattc acgagctgaa acgcgatctg 1740 tttgccgcgc ggcagagttc tacgttactt tcggcggctc tcccctctgc ggcctcttcc 1800 tccccaacta ctactaccgt gtgtactccc accagcgagc tgacgagtgg cggaggagaa 1860 acacccacgg cacttctatc cggaggtgcc aaggtagctg agcgcgctca ggccggcgtg 1920 gtgaacgcca gttgccgcct cgctaccgcg tcgggttctg aggcggcaac ggccgggccc 1980 tcgacggcag gttcttcttc ctgcccggct agtgtcgtgt tagccgccgc tgctgcccaa 2040 gccgccgcag cttcccagag cccgcccaaa gacatggtag atctgaatcg gcggattttt 2100 gtggctgcgc tcaataagct cgagtaa 2127 //