ID JQ975068; SV 1; linear; genomic RNA; STD; VRL; 2280 BP. XX AC JQ975068; XX DT 08-OCT-2012 (Rel. 114, Created) DT 08-OCT-2012 (Rel. 114, Last updated, Version 1) XX DE Sugarcane streak mosaic virus isolate M113 polyprotein gene, partial cds. XX KW . XX OS Sugarcane streak mosaic virus OC Viruses; Riboviria; Potyviridae; Poacevirus. XX RN [1] RP 1-2280 RA He Z., Li W., Ma W., Ge B., Huang Y., Wang X., Li S.; RT "Genetic variability and population structure of Sugarcane streak mosaic RT virus (SCSMV)"; RL Unpublished. XX RN [2] RP 1-2280 RA He Z., Li W., Ma W., Ge B., Huang Y., Wang X., Li S.; RT ; RL Submitted (20-APR-2012) to the INSDC. RL State Key Laboratory for Biology of Plant Diseases and Insect Pests, RL Institute of Plant Protection, Chinese Academy of Agricultural Sciences, RL West 2 Yuanmingyuan Road, Haidian, Beijing 100193, China XX DR MD5; 5ad83855e1b70c38e52e47ccec0bc0aa. XX FH Key Location/Qualifiers FH FT source 1..2280 FT /organism="Sugarcane streak mosaic virus" FT /host="sugarcane" FT /isolate="M113" FT /mol_type="genomic RNA" FT /country="China" FT /collected_by="Wenfeng Li and Zhen He" FT /collection_date="03-Jun-2011" FT /db_xref="taxon:53954" FT 5'UTR 1..199 FT CDS 200..>2280 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="InterPro:IPR025910" FT /db_xref="UniProtKB/TrEMBL:K0I285" FT /protein_id="AFU50439.1" FT /translation="MATITKKQVWKPKERVVSEPPKAEIQESRTTLLFNDYAEVEDFIQ FT RFPAGSVFWTVKGKPKTIVNNLFKATQYGLAYDIAAEVYVCPICMTCARNKVYFTTNHQ FT NCGELFRNKQAYISTSLRLEVVDTFDVFPRYATVEQEKLVGDWMADMEAYAHAEDDSID FT IPYQIFNSDTGEVEERIKQVDLSVHGEIEEVERTYKVKIARSNATMLPHQRRANRVIMR FT TNEIKELIDSTLEICHNRNIKVSFVDHERKRKLFPQIPLKHTIEPQTLCDPHHDIIPAT FT EKFISQWKDVGEPTMHINEQWVQKGWSGVVLHKDDLEAHPSLQEKCVDNLFVVLGRCKH FT GDLQNALRPDCCEDLVFYTDAHKARSHILWDAMMKCHPDDHKPIVNVWTDEAYENMGYW FT LMATYPFKAICKECVNVKSIRDWVQNMRASKAYQFLRGGTSKHSRDLFRWLAVIQSELM FT TFNIRDAQNTQEDLNRNFLGTIPIGPLFEIANQMNQAVVDIRRGLQQMHKLVTDVEITH FT QARDEQILNEIARLRGLEFMQTEKLMTNMKHVAMTYRNLINTASQPLSIHTMRQLLLDA FT RSDEAYEFDIMRGKGAIAIVAPGVFRKFDKIYSEPGVYNVEWTHLTPGGELRTDFDYLR FT TDLKISQLHDKIHKWPENPLIDETCIVSEGEMSYHLCERVYECFVPIPHIMRVGNPQNP FT " FT mat_peptide 200..1273 FT /product="P1 protein" FT mat_peptide 1274..>2280 FT /product="Hc-Pro protein" XX SQ Sequence 2280 BP; 733 A; 467 C; 518 G; 562 T; 0 other; aaatgtaatt tcaaattgac tacaatcaac tctcttccaa tcgctcaagc tctcacaagc 60 cttcaaaagc gaccaaaaga gcccagtagc cgaactcggg tggagacacg ccgggtgcta 120 ctgtttcaag cgatcaggag agaatttagc tttggccaga gacagtttga cgataagttc 180 acgagtcgtc tgggaagcta tggccactat cactaagaag caagtgtgga agccgaagga 240 gcgggtggtt agcgaaccac ctaaggctga aattcaagag tcgcgcacga ctcttctttt 300 caacgactat gcggaagttg aggatttcat tcaacgcttc ccagctggaa gcgtcttttg 360 gacggttaaa ggtaagccaa aaacgattgt aaacaatttg tttaaggcta cacaatacgg 420 attagcatac gacatcgcag cagaagtata tgtgtgccct atctgtatga cctgcgcacg 480 caacaaagtt tacttcacca ccaaccatca aaactgtggt gaactcttca ggaacaagca 540 ggcatacatt tcaacttctc tgagactcga agtggttgac acttttgacg ttttcccacg 600 ctacgcaacc gttgagcaag agaagttagt tggagattgg atggctgata tggaagctta 660 tgcccacgcc gaggatgatt caattgatat cccatatcaa atcttcaata gtgacactgg 720 cgaagttgaa gaacgcatca agcaagttga tttatcagta catggcgaga ttgaagaagt 780 tgagcgcacg tacaaggtaa agattgcgcg ttctaatgcc acaatgttac cacatcagcg 840 tcgggcaaac cgtgtcatca tgcgcaccaa tgaaattaag gagctgatcg actcgacact 900 cgaaatatgc cacaacagaa acatcaaagt aagcttcgtg gatcatgaac gaaagcgcaa 960 attatttcca caaatcccgt tgaaacacac catagaacct caaaccctat gtgacccaca 1020 ccacgacatt atcccagcaa ctgagaaatt tattagtcag tggaaggatg tgggagaacc 1080 aacaatgcac attaatgagc aatgggttca gaaaggatgg agcggtgtag ttctacacaa 1140 agacgatctt gaagcgcacc cgagtctcca agagaaatgt gttgacaact tgtttgtggt 1200 acttggaagg tgcaaacatg gggatttgca aaatgcactg aggccagatt gctgtgaaga 1260 tttagtgttt tatactgatg ctcataaggc aaggtcacac attttgtggg acgcgatgat 1320 gaagtgtcat ccagatgatc acaaacccat tgttaacgtt tggacagacg aagcttacga 1380 gaacatgggt tattggctaa tggctacata tccctttaaa gcaatatgca aagagtgtgt 1440 aaatgtaaaa tcaattagag actgggttca aaatatgaga gcatctaaag cataccaatt 1500 tttaagagga ggaacatcaa aacactcgcg ggatctcttc aggtggctcg cggttataca 1560 atctgagctg atgactttca acatacgaga tgcccagaac acgcaagaag atctcaatag 1620 aaatttcctt ggaacaatac caattggacc tctgtttgaa attgctaatc aaatgaatca 1680 ggcagttgtt gatattcgac gcgggttgca gcaaatgcat aaactggtaa cagacgttga 1740 gattacacat caagcccgtg atgagcagat cctgaacgaa attgcacggc ttcgtggttt 1800 agaattcatg caaactgaaa aacttatgac caacatgaag cacgttgcaa tgacatatag 1860 gaatttaatc aacacggcaa gccaaccatt gtcgattcac acaatgaggc aactcttatt 1920 agatgcaaga agtgatgagg catatgagtt tgacataatg cgcggtaaag gagcaattgc 1980 tattgtagca cctggagtgt tccggaaatt tgataagatt tattcagagc caggtgtgta 2040 caatgtggag tggacgcatc taacaccagg tggagaacta agaactgatt ttgattactt 2100 gagaactgat ctcaaaatct cccagttaca tgataaaatt cataaatggc cagaaaatcc 2160 actaattgac gaaacgtgta ttgtctctga gggcgaaatg tcgtatcact tgtgtgaacg 2220 agtttatgaa tgctttgtgc ctattccaca tatcatgcga gttggcaatc cacagaatcc 2280 //