ID   JF909074; SV 1; circular; genomic DNA; STD; VRL; 2800 BP.
XX
AC   JF909074;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Anjouan:AJ21AL3:2009 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2800
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2800
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; dd073a226e0f2d1466132c29e2859569.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2800
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Anjouan:AJ21AL3:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Anjouan"
FT                   /lat_lon="12.28 S 44.4 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:1229189"
FT   gene            173..529
FT                   /gene="AV2"
FT   CDS             173..529
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LX71"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX71"
FT                   /protein_id="AEG89778.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKNYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAEDVQNVSKPRCP"
FT   gene            333..1106
FT                   /gene="AV1"
FT   CDS             333..1106
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LX04"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX04"
FT                   /protein_id="AEG89777.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVIGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1103..1507)
FT                   /gene="AC3"
FT   CDS             complement(1103..1507)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LX74"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX74"
FT                   /protein_id="AEG89781.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWELTNPLYFEITHHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALAIHKCFLNFKVWTTLRPQTGLFLRVFRYQVLKYLDMIGVISINT
FT                   VISAVDHVLYAVLLNTLQVTEQHAIKFNIY"
FT   gene            complement(1248..1655)
FT                   /gene="AC2"
FT   CDS             complement(1248..1655)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LX73"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX73"
FT                   /protein_id="AEG89780.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKNRAIRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGTNKSPLFRNHPPRQEAREHEPRHHHTPDTFQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1564..2643)
FT                   /gene="AC1"
FT   CDS             complement(1564..2643)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LX72"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX72"
FT                   /protein_id="AEG89779.1"
FT                   /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLKALSYPTTIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNQRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTQVPEELEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLEEEKNQSLKAWALKNATFVTLHEPL
FT                   FSSAHQSPTPHSEEQGHQT"
FT   gene            complement(2253..2486)
FT                   /gene="AC4"
FT   CDS             complement(2253..2486)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYX5"
FT                   /protein_id="AEG89782.1"
FT                   /translation="MGCLISMFSSNSKASSNVPTRDSSISFPPPDQHISIRTFRELNHR
FT                   PMSKLTLKREGNFLTMEFSKSMPEVQGARASI"
XX
SQ   Sequence 2800 BP; 733 A; 555 C; 729 G; 783 T; 0 other;
     accggatggc cgcgcccgaa aaagcaggtg gaccccacca gatggccgcg cccgtgaaag        60
     acagtggtcc ccgcgcacgt gttacggtcg gccagtcata ttgacgcgtg aaagtctaga       120
     tatttgttgt tgtctttata gacttcgtcg cgaagtagtg aagcgcgtca acatgtggga       180
     tccattattg aacgatttcc cagaaaccgt tcacggtttt cgttctatgc ttgctgttaa       240
     atacctgtta catctggaac aggaatacga tcgcggtact gtcggggctg agtatatacg       300
     ggatctaata ggggttctac ggtgtaagaa ttatgtcgaa gcgaccagga gatataataa       360
     tctcaacacc cgtatccaag gtgcggagga ggctgaactt cgacagccca tacacgaacc       420
     gtgttgttgc ccccactgtc cgcgtcacca gaagcaaaat atgggccaac aggcccatgt       480
     atcggaagcc gaagatgtac agaatgtatc gaagcccaga tgtccctaag ggctgtgaag       540
     gcccatgtaa ggttcagtcg tatgaacaga gggatgatgt taagcacacg ggtatggtcc       600
     gatgtgtcag tgatgttacg cgtgggtcag gcattaccca tagagtcggg aagaggtttt       660
     gtgtaaagtc catatatata ttgggcaaga tctggatgga tgagaatatc aagaagcaaa       720
     atcatacgaa tcatgttatg ttcttcctcg ttcgagatag aaggccttat gggccgagcc       780
     cgcaagattt tggacaagtg ttcaacatgt ttgataatga acctactact gcaactgtta       840
     agaatgatct tagggaccgg tatcaggtgt tacgtaaatt ctatgcgact gttattggtg       900
     gaccctccgg gatgaaggaa caagcgctgg ttaagaggtt ttttaggatc aataatcatg       960
     tagtgtataa tcatcaggaa caggccaagt atgaaaatca tactgagaat gcgttgttat      1020
     tgtatatggc atgtacacat gcctcaaatc ctgtgtacgc gactttgaaa atacgcatct      1080
     atttctatga tgcagtgaca aattaataaa tgttgaattt tattgcatgt tgctccgtaa      1140
     cttggagtgt gtttagtaat acagcgtaca gaacatgatc aacagcgcta attacagtgt      1200
     taatggaaat aacgcctatc atatctaaat acttgagcac ttgatatcta aatactctta      1260
     agaaaagacc agtctgaggc cgtaaggtcg tccagacctt gaagttgaga aaacacttgt      1320
     gaatcgccaa tgccttccgg aggttgtggt tgaaacgtat ctggagtgtg atgatgtcgt      1380
     ggttcatgtt cccgggcctc ttgtcgtggt gggtgatttc gaaatagagg ggatttgtta      1440
     gttcccaggt aaaaacgcca ttctttgctt gaggcgcagt gatgagttcc cctgtgcgag      1500
     aatccatggt tgatgcagtc gatatggaga tagaacgagc agccgcattc gaggtctacc      1560
     cgcctacgtc tgatggccct gttcttcgct gtgcggtgtt ggactttgat gggcacttga      1620
     gaacaatggc tcgtggaggg tgacgaaggt ggcattcttt aaagcccagg ctttaaggga      1680
     ctgattcttt tcctcttcca gaaactcttt atatgatgat gttggtcctg gattgcagag      1740
     gaagatagtg ggaatgccgc ctttaatttg aattggcttc ccgtactttg tattgctttg      1800
     ccagtccctt tgggccccca tgaattcttt gaagtgcttg aggtagtggg ggtcgacgtc      1860
     atcaatgacg ttgtaccatg cgtcgtttga atataccttt ggagacagat ccaggtgtcc      1920
     acatagataa ttatggggtc ccagtgaacg agcccacatg gttttcccgg tccggctatc      1980
     gccttcgaga acaatactga tcggtctcca tggccgcgca gcgggactgc atatattttc      2040
     cgatacccat acctctagtt cttcggggac ttgtgtaaat gaggatgata agaacggact      2100
     aacgtaagtt tgtggcggag cctggaagat tctatctgcg ttagcagata tgttatggaa      2160
     ctgtaaaaaa aaggacttgg gatctttttc tttgataatt tgaagagctt ctgatttaga      2220
     agaagcattc aacgcgtcgg catatacctg agctaaatgc tggccctcgc cccttgcact      2280
     tcgggcatcg acttggaaaa ttccatcgtc aagaaattcc cctccctttt caatgtaagc      2340
     tttgacatcg gacgatgatt tagctccctg aatgttcgga tggaaatgtg ttgatctgga      2400
     gggggaaatg agatcgaaga atctctggtt ggtacattgg aacttgcctt cgaattggat      2460
     gagaacatgg agatgaggca ccccatcttg atgtagttct ctgcaaaccc taatgaattt      2520
     gatagtcgtc gggtaagaaa gggcttttaa ttgggaaagg gcctcttcct tggttaatga      2580
     gcatcgggga taggttatga aataattttt ggcatttatt tgaaaacgac cggctcttgg      2640
     catattggct gtcgtttagg atcgggggac actcaaaact ccaggggaat ggtggaacgg      2700
     ggggcaatat atatgatgtc ccccaatggc atatgtgtaa ataggtcgac ctccattcaa      2760
     aatttgaatt gcgaatattg gcggccatcc gattaatatt                            2800
//