ID   JF909166; SV 1; circular; genomic DNA; STD; VRL; 2801 BP.
XX
AC   JF909166;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Mayotte:YT04E67:2005 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2801
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2801
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; f2baa5e08ca9ff773e33744208e42925.
XX FH Key Location/Qualifiers FH FT source 1..2801 FT /organism="East African cassava mosaic virus-Kenya" FT /segment="DNA-A" FT /host="Manihot esculenta (cassava)" FT /isolate="Comoros:Mayotte:YT04E67:2005" FT /mol_type="genomic DNA" FT /country="Mayotte" FT /lat_lon="12.81 S 45.2 E" FT /collection_date="2005" FT /db_xref="taxon:1229189" FT gene 174..530 FT /gene="AV2" FT CDS 174..530 FT /codon_start=1 FT /gene="AV2" FT /product="movement protein" FT /db_xref="GOA:I6LX59" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:I6LX59" FT /protein_id="AEG90330.1" FT /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS FT EAQDVQNVSKPRCP" FT gene 334..1107 FT /gene="AV1" FT CDS 334..1107 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:I6LYS2" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:I6LYS2" FT /protein_id="AEG90329.1" FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA FT NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGPGITHR FT VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE FT PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT gene complement(1104..1508) FT /gene="AC3" FT CDS complement(1104..1508) FT /codon_start=1 FT /gene="AC3" FT /product="replication enhancer" FT /db_xref="GOA:I6LYS0" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:I6LYS0" FT /protein_id="AEG90333.1" FT /translation="MDFRTGELITAPQAKNGVFTWEITNPLYFDITNHDKRPGNMNHDI FT ITLQIRFNHNLRKALAIHKCFLNFKVWTTLRPQTGLFLRVFRYQVLKYLDMIGVISLNT FT VITAVDHVLYDVLLNTLQVTEQHAIKFNLY" FT gene complement(1249..1656) FT /gene="AC2" FT CDS complement(1249..1656) FT /codon_start=1 FT /gene="AC2" FT /product="transcription activator protein" FT /db_xref="GOA:I6LYS5" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:I6LYS5" FT 
/protein_id="AEG90332.1" FT /translation="MPPSSPSMSHCSQVPIKVQHRTAKTRALRRRRVDLECGCSFYLHI FT DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRQEAREHEPRHNHTPDTFQPQ FT PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI" FT gene complement(1565..2644) FT /gene="AC1" FT CDS complement(1565..2644) FT /codon_start=1 FT /gene="AC1" FT /product="replication associated protein" FT /db_xref="GOA:I6LYS4" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:I6LYS4" FT /protein_id="AEG90331.1" FT /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR FT VCRELHQDGVPHLHVLIQFEGKFQCTNQRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI FT EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH FT NISANADRIFQAPPQTYVSPFLSSSFTHVPEELEVWASENICSPAARPWRPVSIVLEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW FT QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLEEEKNQSLKAWALKNATFVTLHEPL FT FSSAHQSPTPHSEDQGPPT" FT gene complement(2254..2487) FT /gene="AC4" FT CDS complement(2254..2487) FT /codon_start=1 FT /gene="AC4" FT /product="C4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:I6LYS7" FT /protein_id="AEG90334.1" FT /translation="MGCLISMFSSNSKASSNVQTKDSSISFPHPDQHISIRTFRELNRR FT PMSRLTLKREGNFLTMEFSKSMPEVHGGRASI" XX SQ Sequence 2801 BP; 729 A; 563 C; 721 G; 788 T; 0 other; accggatggc cgcgcccgaa aaagcagatg gaccccactg tatgaccgcg cccgtgaaag 60 aaagtggtcc ccgcgcactg gggttggtcg gccagtcata ttcacgcgtg aaggtctaga 120 tatttgttgt ttgtctttat agacttcgtc acgaagtagt ggagcgcgtc aacatgtggg 180 atccattgtt gaatgatttt cccgaaaccg ttcacggttt ccgttctatg cttgctgtta 240 aatacctctt acatctggaa caggaatacg atcgcggtac tgtcggggct gagtatatac 300 gggatctaat aggggttctc cggtgtaaga gttatgtcga agcgaccagg agatataata 360 atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagccc atacacgaac 420 cgtgttgttg cccccactgt ccgcgtcacc agaagcaaaa tatgggccaa caggcccatg 480 tatcggaagc ccaagatgta cagaatgtat cgaagcccag atgtccctaa gggctgtgaa 540 ggcccatgta 
aggttcagtc gtacgaacag agggatgatg ttaagcacac tggtatggtc 600 cgatgtgtca gtgatgttac tcgtgggcca ggcatcaccc atagagtcgg gaagaggttt 660 tgtgtgaagt ccatatatat attgggcaag atctggatgg atgagaatat caagaagcaa 720 aatcatacga accatgttat gttcttcctc gttcgagata gaaggcctta tggtccgagc 780 ccgcaagatt ttggacaagt gttcaacatg tttgataatg aacctactac ggcaacggtg 840 aagaatgatc tgagggatcg gtatcaggtg ttacgaaaat tctatgcgac cgttgttggt 900 ggaccctccg ggatgaagga acaagcgctg gttaagaggt tttttaggat caataatcat 960 gttgtgtata atcatcagga acaggccaag tatgagaatc atacggagaa tgcgttgtta 1020 ttgtatatgg catgtacaca tgcctcaaat cctgtgtacg ctactctgaa aatacgcatc 1080 tatttctatg atgcagtgac aaattaataa aggttgaatt ttattgcatg ttgctccgta 1140 acttggagcg tgtttagtaa tacatcgtac agaacatgat caacagctgt aattacagtg 1200 ttaagggaaa taacgcctat catatctaaa tacttgagca cttgatatct aaatactctt 1260 aagaaaagac cagtctgagg ccgtaaggtc gtccagacct tgaagttgag aaaacacttg 1320 tgaatcgcca atgccttccg gaggttgtgg ttgaaacgta tctggagtgt gattatgtcg 1380 tggttcatgt tccctggcct cttgtcgtgg ttggtgatgt cgaaatagag gggatttgtt 1440 atttcccagg taaaaacgcc attctttgct tgaggcgcag tgatgagttc ccctgtgcga 1500 aaatccatgg ttgatgcagt cgatatggag atagaacgag cagccacatt cgaggtctac 1560 ccgcctacgt cggagggccc tggtcttcgc tgtgcggtgt tggactttga tgggcacttg 1620 agaacaatgg ctcatggagg gtgacgaagg tggcattctt taaagcccag gctttaaggg 1680 actgattctt ttcctcttcc agaaactctt tatatgatga tgttggtcct ggattgcaga 1740 ggaagatagt gggaatgccg cctttaattt gaattggctt cccgtacttg gtattgcttt 1800 gccagtctct ttgggccccc atgaattctt tgaagtgttt gagataatgc gggtctacgt 1860 cgtcaatgac gttgtaccat gcgtcgtttg aatatacctt tggagacaga tccaggtgtc 1920 cacatagata attatggggt cccagtgaac gagcccacat ggttttcccg gttcggctat 1980 caccttcgag aacaatactg accggtctcc atggccgcgc agcgggactg catatatttt 2040 ctgatgccca tacctcgagt tcttcgggaa cgtgtgtaaa tgatgatgat aagaacggac 2100 taacgtaagt ttgtggcgga gcctggaaga ttctatctgc gttagcagat atgttatgga 2160 actgtaaaaa aaaggacttt ggatcttttt ctttaataat ttgaagagct tctgatttag 2220 aagaagcatt caacgcgtct 
gcatatacct gagctaaatg ctggccctcc ccccgtgcac 2280 ttcgggcatc gacttggaaa attccatcgt caagaaattc ccctcccttt tcaatgtaag 2340 ccttgacatc ggacgacgat ttagctccct gaatgttcgg atggaaatgt gttgatctgg 2400 atggggaaat gagatcgaag aatctttggt ttgtacattg gaacttgcct tcgaattgga 2460 tgagaacatg gagatgaggc accccatctt gatgcagttc tctgcaaacc ctaatgaatt 2520 tgatattcgt cgggtaagaa agggctttta actgggaaag ggcctcttcc ttggttaatg 2580 agcatcgggg ataggttatg aaataatttt tggcatttat ttgaaaacga ccggctcttg 2640 gcatatttgc tgtcgttttg gatcggggga cactcaaaac tccaggggaa cggtggaatg 2700 gggggcatta tatatgatgt cccccaatgg catatgtgta aatatgtcga cctccattca 2760 aattttgaat tgcgaatatt ggcggccatc cgattaatat t 2801 //