ID   JF909141; SV 1; circular; genomic DNA; STD; VRL; 2796 BP.
XX
AC   JF909141;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic Kenya virus isolate Comoros:Moheli:MO16BF1:2009
DE   segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic Kenya virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2796
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2796
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; 56e99695aa85c6a09f29c80031d95cfc.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2796
FT                   /organism="East African cassava mosaic Kenya virus"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Moheli:MO16BF1:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Moheli"
FT                   /lat_lon="12.3 S 43.64 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:393599"
FT   gene            172..528
FT                   /gene="AV2"
FT   CDS             172..528
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LXA7"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXA7"
FT                   /protein_id="AEG90180.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCS"
FT   gene            332..1105
FT                   /gene="AV1"
FT   CDS             332..1105
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LYC2"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYC2"
FT                   /protein_id="AEG90179.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGTVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1102..1506)
FT                   /gene="AC3"
FT   CDS             complement(1102..1506)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LYC6"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYC6"
FT                   /protein_id="AEG90183.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFDITNHDRRPGNMNHDI
FT                   ITFQIRFNHNIRKALGIHKCFLNFKVWTTLRPPTGLFLKVFRYQVLKYLDMIGVISINT
FT                   VVQAVDHVLYNVLLNTLQVTEHHAIKFNLY"
FT   gene            complement(1247..1654)
FT                   /gene="AC2"
FT   CDS             complement(1247..1654)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LYC5"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYC5"
FT                   /protein_id="AEG90182.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRQATREHEPRHHHIPDTVQPQ
FT                   HPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1578..2642)
FT                   /gene="AC1"
FT   CDS             complement(1578..2642)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LYC4"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYC4"
FT                   /protein_id="AEG90181.1"
FT                   /translation="MPRAGRFSIKAKNYFLTYPKCYLSKEEALNQLRQLQTPTNKLFIK
FT                   ICRELHENGEPHLHALIQFEGKYNCTNQRFFDLISPSRSAHFHPNIQGAKSSSDVKSYL
FT                   DKDGDTIQWGEFQIDGRSARGGQQSANDAYAKALNSANKSEALNVIRELAPKDFVLQFH
FT                   NLNSNLERIFQEPLTPYISPFISSSFTNVPEELEAWVSENVMGSAARPWRPSSIVIEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSED"
FT   gene            complement(2195..2491)
FT                   /gene="AC4"
FT   CDS             complement(2195..2491)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYC7"
FT                   /protein_id="AEG90184.1"
FT                   /translation="MKMGNLICMPSFSSKASTIVPTNDSSTSFPLPGPPISTQIFRELN
FT                   QAPTSSPIWIRTETPSNGASFRSTDALLAADNNPPMTLTPRLLTQQISQRLLM"
XX
SQ   Sequence 2796 BP; 726 A; 557 C; 727 G; 786 T; 0 other;
     accggatggc cgcgcccgaa aaagcaggtg gaccccacaa tggccgcgcc cgttaaagaa        60
     agtggtcccc gcgcacttgt gttggtcggc cagtcatatt cacgcgtgaa agtctagata       120
     tttggtgttt gtctttatag acttcgtcgc gaagtagatg agcgcgtcaa catgtgggat       180
     ccattgttga acgattttcc cgaaaccgtt cacggtttcc gttctatgct tgctgttaaa       240
     tacctgttac atctggaaca ggaatacgat cgcggtactg tcggggcgga gtatatacgt       300
     gatttaatag gggttctacg gtgtaagagt tatgtcgaag cgaccaggag atataataat       360
     ctcaacaccc gtatccaagg tgcggaggag gctgaacttc gacagcccat acacgaaccg       420
     tgttgttgcc cccactgtcc gcgtcaccag aagcaaaata tgggccaaca ggcccatgta       480
     tcggaagccc aagatgtaca gaatgtatcg aagcccagat gttcctaagg gctgtgaagg       540
     cccatgtaag gttcagtcct atgaacagag ggatgatgtg aagcacacag gtacggtccg       600
     atgtgtcagt gatgttactc gtggatcagg cattacccat agagtcggga agaggttttg       660
     tgtgaagtcc atatatatat tgggcaagat ttggatggat gagaacatca agaagcaaaa       720
     tcatacgaat catgttatgt tcttccttgt tcgagataga aggccttatg gtcagagtcc       780
     tcaagatttt ggacaagtgt tcaacatgtt tgataatgaa cctactacgg caactgtgaa       840
     gaatgatctt agggaccgat atcaggtgtt acgtaaattc tatgcgactg ttgttggtgg       900
     accctcaggg atgaaggaac aagctctggt taagaggttt tttaggatca ataatcatgt       960
     agtgtataat catcaggaac aggccaagta tgagaatcat actgagaatg cgttgttatt      1020
     gtatatggca tgtacacatg cctcgaatcc tgtgtacgct acgctgaaaa tacgcattta      1080
     tttctatgat gcagtgacaa attaataaag gttgaatttt attgcatggt gctccgtaac      1140
     ttggagtgtg tttagtaata cattgtacag aacatgatca acagcttgaa ctacagtgtt      1200
     aatggaaata acgcctatca tatctaaata cttgagcact tgatatctaa atacttttaa      1260
     gaaaagacca gtcggaggcc gtaaggtcgt ccagaccttg aagttgagaa aacacttgtg      1320
     aatccccaat gccttccgga tgttgtggtt gaaccgtatc tggaatgtga tgatgtcgtg      1380
     gttcatgttc cctggtcgcc tgtcgtggtt ggtgatgtcg aaatagaggg gatttgttat      1440
     ttcccaggta aaaacgccat tctttgcttg aggcgcagtg atgagttccc ctgtgcgaga      1500
     atccatgatt gatgcagtcg atatggagat agaacgagca gccgcattcg aggtctaccc      1560
     gcctacgtcg gacggcccta gtcttcgctg tgcggtgttg gactttgatg ggcacttgag      1620
     aacaatggct cgtggagggt gatgaaggtg gcattcttta aagcccaggc tttaagggac      1680
     tggttctttt cctcgtccag aaactcttta tatgatgatg ttggtcctgg attgcatagg      1740
     aagatagtgg gaatgccgcc tttaatttga attggcttcc cgtattttgt attgctttgc      1800
     cagtcccttt gggcccccat gaattctttg aaatgcttga ggtagtgggg gtcgacgtca      1860
     tcaatgacgt tgtaccatgc gtcgttgctg tatacctttg gactgagatc caggtgtcca      1920
     cacaagtagt tatgtggtcc caaagagcga gcccacattg tcttccctgt cctactatct      1980
     ccctcgatta cgatactact aggtctccat ggccgcgcag cggaacccat cacgttctcg      2040
     gaaacccagg cttcaagttc ctcaggaacg ttagtgaaag aagaagaaat aaagggagaa      2100
     atataaggag tgagaggctc ttgaaaaatc ctctctaaat tgctatttaa attatgaaac      2160
     tgtaaaacaa aatcttttgg ggctagttcc cgtattacat taagagcctc tgacttattt      2220
     gctgagttaa gagccttggc gtaagcgtca ttggcggatt gttgtccgcc gcgagcagag      2280
     cgtccgtcga tctgaaactc gccccattgg atggtgtctc cgtccttatc cagataggac      2340
     ttgacgtcgg agcttgattt agctccctga atatttgggt ggaaatgggc ggaccgggaa      2400
     ggggaaatga ggtcgaagaa tcgttggttg gtacaattgt acttgccttc gaactgaatg      2460
     agggcatgca gatgaggttc cccattttca tggagctctc tgcagatctt gatgaacaat      2520
     ttattggttg gggtttggag ttgtcggagc tgattcaagg cctcttcttt cgatagataa      2580
     catttgggat atgtgaggaa atagtttttg gctttgatgc taaaacgacc agcccttggc      2640
     atttgcgctg tcgtatagca atcggggggc actcaaagtc tgtagcaatc gggggaatgg      2700
     gggggcaatt tatatgatgc cccccaaatg gcatttatgt aatatcctca tgaaatttga      2760
     atttcaaacg tggaaagcgg ccatccgtat aatatt                                2796
//