ID   AY795988; SV 1; linear; genomic DNA; STD; VRL; 2799 BP.
XX
AC   AY795988;
XX
DT   24-MAR-2005 (Rel. 83, Created)
DT   22-APR-2005 (Rel. 83, Last updated, Version 3)
XX
DE   East African cassava mosaic virus isolate TZ10 segment A, complete
DE   sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2799
RX   PUBMED; 15784145.
RA   Ndunguru J., Legg J., Aveling T., Thompson G., Fauquet C.;
RT   "Molecular biodiversity of cassava begomoviruses in Tanzania: evolution of
RT   cassava geminiviruses in Africa and evidence for East Africa being a center
RT   of diversity of cassava geminiviruses";
RL   Virol J 2:21-21(2005).
XX
RN   [2]
RP   1-2799
RA   Ndunguru J., Fauquet C.M.;
RT   ;
RL   Submitted (27-OCT-2004) to the INSDC.
RL   ILTAB, Donald Danforth Plant Science Center, 975 North Warson Road, St.
RL   Louis, MO 63132, USA
XX
DR   MD5; 317558bda69e499d4e04bf7c9947f316.
DR   EuropePMC; PMC1079959; 15784145.
DR   EuropePMC; PMC3163225; 21812981.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2799
FT                   /organism="East African cassava mosaic virus"
FT                   /segment="A"
FT                   /strain="Uganda 2"
FT                   /isolate="TZ10"
FT                   /mol_type="genomic DNA"
FT                   /country="Tanzania"
FT                   /db_xref="taxon:62079"
FT   gene            172..528
FT                   /gene="AV2"
FT   CDS             172..528
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="AV2"
FT                   /db_xref="GOA:A0A3S5ZP69"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:A0A3S5ZP69"
FT                   /protein_id="AAX39370.1"
FT                   /translation="MWDPLVNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKNYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   ETQDVQNVSKPRCP"
FT   gene            332..1105
FT                   /gene="AV1"
FT   CDS             332..1105
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:Q58WG8"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WG8"
FT                   /protein_id="AAX39371.1"
FT                   /translation="MSKRPGDIIISTPVSKVRKRLNFDSPYTNRVVVPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSFEQRDDVKHLGICKVISDVTRGPGLTHR
FT                   VGKRFCIKSIYILGKIWMDENIKKQNHTNNVMFYLLRDRRPYGNAPQDFGQIFNMFDNE
FT                   PSTATIKNDLRDRFQVLRKFHVTVVGGPSGMKEQALVKRFYKLNHHVTYNHQEAGKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1102..1506)
FT                   /gene="AC3"
FT   CDS             complement(1102..1506)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:Q58WG7"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WG7"
FT                   /protein_id="AAX39375.1"
FT                   /translation="MDSRTGELITAPQARNGVFTWDITNPLYFEITDHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGRFLRVFRYQVLKYLDMIGVISINT
FT                   VLQAVDHVMYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1247..1654)
FT                   /gene="AC2"
FT   CDS             complement(1247..1654)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transactivator protein"
FT                   /db_xref="GOA:Q58WG6"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WG6"
FT                   /protein_id="AAX39374.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAIRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGHNKSPLFRNHRPRQEAREHEPRHHHTPDTVQPQ
FT                   PSEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1563..2642)
FT                   /gene="AC1"
FT   CDS             complement(1563..2642)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication-associated protein"
FT                   /db_xref="GOA:Q58WG5"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WG5"
FT                   /protein_id="AAX39372.1"
FT                   /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLQALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDAGLFQVDARSARGEGQHLAQVYADALNASSKSEARQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTQVPEDIEIWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQKDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGHQT"
FT   gene            complement(2252..2485)
FT                   /gene="AC4"
FT   CDS             complement(2252..2485)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="AC4"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WG4"
FT                   /protein_id="AAX39373.1"
FT                   /translation="MGYLISMFSSNSKASSNVPTRDFSISFPHLDQHISIRTFRELNHR
FT                   PMSKLTLKREGNFLTLDFSKSMPEVQGGRANI"
XX
SQ   Sequence 2799 BP; 736 A; 542 C; 726 G; 794 T; 1 other;
     accggatggc cgcgcccgaa aaagcatgtg gaccccataa tgaccgcgcc cgtgaaagaa        60
     agtggtccct gcgcacttgt tttggtcggc cagtcatatt cacgcgtgaa agtctagata       120
     tttgttgttt gtctttatag acttcgtcgc gaagtagtag aacgcgtcaa catgtgggat       180
     ccattggtga atgattttcc cgaaaccgtt cacggtttcc gttctatgct tgctgttaaa       240
     tacctgttac atttggaaca ggaatacgat cgcggtactg tcggggctga gtatatacgg       300
     gatctaatag gggttctacg gtgtaagaat tatgtcgaag cgaccaggag atataataat       360
     ctcaacaccc gtatccaagg tgcggaagag gctgaacttc gacagcccat acacgaaccg       420
     tgttgttgtc cccactgtcc gcgtcaccag aagcaaaata tgggccaaca ggcccatgta       480
     tcggaaaccc aagatgtaca gaatgtatcg aagcccagat gtccctaagg gctgtgaagg       540
     cccatgtaag gtccagtcgt ttgagcagag ggatgatgtg aagcaccttg gtatctgtaa       600
     ggtgattagt gatgtgacgc gtgggcctgg gctgacacac agggtcggaa agaggttttg       660
     tatcaagtcc atttatattc ttggtaagat ctggatggat gaaaatatta agaagcagaa       720
     tcacactaat aatgtgatgt tttacctgct gagggataga aggccgtatg gcaatgcgcc       780
     ccaagacttt gggcagatat ttaacatgtt tgataatgag cccagtactg caacaattaa       840
     gaacgatttg agggataggt tccaggtgtt gaggaaattt catgtcactg ttgttggtgg       900
     tccatctggc atgaaggagc aggcgttggt gaaaaggttt tacaagctga atcatcacgt       960
     gacatataat catcaggagg cagggaagta tgagaatcac acagagaatg cgttgttatt      1020
     gtatatggca tgtacacatg cntcgaatcc tgtgtatgct acgctgaaaa tacgcatcta      1080
     tttttatgat gcagtgacaa attaataaag gttgaatttt attgcatgtt gctccgtaac      1140
     ttggagtgtg ttaagtaata catcgtacat aacatgatca acagcttgaa ggacagtgtt      1200
     aatggaaata acgcctatca tatctaaata cttgagcact tgatatctaa atactcttaa      1260
     gaaacgacca gtctgaggcc gtaaggtcgt ccagaccttg aagttgagaa aacacttgtg      1320
     aatccccaat gccttccgaa ggttgtggtt gaaccgtatc tggagtgtga tgatgtcgtg      1380
     gttcatgttc cctggcctct tgtcgtggtc ggtgatttcg aaatagaggg gatttgttat      1440
     gtcccaggta aaaacgccat tccttgcttg aggcgcagtg atgagttccc ctgtgcgaga      1500
     atccatggtt gatgcagtcg atatggagat agaacgagca tccgcattcg aggtctaccc      1560
     gcctacgtct gatggccctg gtcttcgctg tgcggtgttg gactttgatg ggcactagag      1620
     aacaatggct cgtggagggt gatgaaggtg gcattcttta aagcccaggc tttaagggat      1680
     tggttctttt cctcgtccag aaactcttta tatgatgatg ttggtcctgg attgcagagg      1740
     aagatagtgg gaatgccgcc tttaatttga attggcttcc cgtattttgt attgctttgc      1800
     cagtcctttt gggcccccat gaattctttg aagtgtttga ggtagtgggg gtcgacgtca      1860
     tcaatgacgt tgtaccatgc gtcgtttgaa tataccttgg gagacagatc caggtgtcca      1920
     caaagataat tatggggtcc cagtgaacga gcccacattg ttttgccggt acggctatca      1980
     ccttctagaa caatactgat cggtctccat ggccgcgcag cgggactgca tatattttcg      2040
     gatacccata tctctatgtc ttctgggact tgtgtaaaag atgatgataa gaacggacta      2100
     acgtaagttt gtggcggagc ctggaagatt ctatctgcgt tagcagatat gttatggaac      2160
     tgtaaaaaaa aggactttgg atctttttct ttaataattt gacgagcctc ggatttagac      2220
     gaagcattca acgcgtctgc atatacctga gctaaatgtt ggccctcccc ccttgcactt      2280
     cgggcatcga cttggaaaag tccagcgtca agaaattccc ctcccttttc aatgtaagct      2340
     ttgacatcgg acgatgattt agctccctga atgttcggat ggaaatgtgt tgatcgagat      2400
     ggggaaatga gatcgaaaaa tctcgggttg gtacattgga acttgccttc gaattggatg      2460
     agaacatgga gatgaggtac cccatcctga tgtagttctc tgcaaaccct aatgaatttg      2520
     atattcgtcg ggtacgaaag ggcttgtaat tgggaaaggg cctcttcttt tgttaatgag      2580
     catcggggat aggttatgaa ataatttttg gcattaattt gaaaacgacc ggctcttggc      2640
     atattggctg tcgttttgga tcgggggaca ctcaaaactc caggggaacg gtggaatggg      2700
     gggcattata tatgatgtcc cccaatggca tatgtgtaaa taggtagact tccattcaaa      2760
     atttgaattc cgaatattgg cggccatccg attaatatt                             2799
//