ID   D85597; SV 1; linear; genomic DNA; STD; PLN; 8322 BP.
XX
AC   D85597;
XX
DT   21-SEP-1997 (Rel. 52, Created)
DT   23-DEC-2010 (Rel. 107, Last updated, Version 6)
XX
DE   Oryza australiensis retrotransposon RIRE1 DNA.
XX
KW   .
XX
OS   Oryza australiensis
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC   Oryzoideae; Oryzeae; Oryzinae; Oryza.
XX
RN   [1]
RP   1-8322
RA   Ohtsubo E.;
RT   ;
RL   Submitted (27-MAY-1996) to the INSDC.
RL   Contact:Eiichi Ohtsubo Univ.Tokyo, Institute of Molecular and Cellular
RL   Biosciences; Yayoi 1-1-1, Bunkyo-ku, Tokyo 113, Japan
XX
RN   [2]
RX   DOI; 10.1266/ggs.72.131.
RX   PUBMED; 9339541.
RA   Noma K., Nakajima R., Ohtsubo H., Ohtsubo E.;
RT   "RIRE1, a retrotransposon from wild rice Oryza australiensis";
RL   Genes Genet. Syst. 72(3):131-140(1997).
XX
DR   MD5; e6428594e08eb718454dad009286c5c8.
DR   EuropePMC; PMC1940013; 17617907.
DR   EuropePMC; PMC2576258; 18842156.
DR   EuropePMC; PMC5258746; 28174588.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..8322
FT                   /organism="Oryza australiensis"
FT                   /strain="W1538"
FT                   /mol_type="genomic DNA"
FT                   /clone_lib="The pCRII vector was used for the genomic
FT                   library."
FT                   /db_xref="taxon:4532"
FT   mobile_element  1..8322
FT                   /mobile_element_type="retrotransposon: RIRE1"
FT   repeat_region   1..1523
FT                   /rpt_type=LONG_TERMINAL_REPEAT
FT                   /note="5' LTR"
FT   regulatory      1053..1057
FT                   /regulatory_class="CAAT_signal"
FT   regulatory      1079..1083
FT                   /regulatory_class="TATA_box"
FT   regulatory      1334..1337
FT                   /note="termination signal"
FT                   /regulatory_class="other"
FT   primer_bind     1525..1542
FT                   /note="complementary to 3' end of initiator methionyl tRNA"
FT                   /note="putative (-)-strand primer binding site"
FT   CDS             2826..6779
FT                   /codon_start=1
FT                   /product="polyprotein"
FT                   /db_xref="GOA:O23864"
FT                   /db_xref="InterPro:IPR001584"
FT                   /db_xref="InterPro:IPR001878"
FT                   /db_xref="InterPro:IPR012337"
FT                   /db_xref="InterPro:IPR013103"
FT                   /db_xref="InterPro:IPR025724"
FT                   /db_xref="InterPro:IPR036397"
FT                   /db_xref="InterPro:IPR036875"
FT                   /db_xref="InterPro:IPR039537"
FT                   /db_xref="UniProtKB/TrEMBL:O23864"
FT                   /protein_id="BAA22288.1"
FT                   /translation="MAANTTPSTFNLRSILEKEKLNGTNFMDWYRNLRIVLKQERKEYV
FT                   LEVPYPEELPNNATATARRGFEKHTNDALDISCLMLATMSPELQKQYESSDAHTTIQGL
FT                   RGMFENQARDERFNTSKSLFACRLVEGNPVSPHVIKMIGYIESLEKLGFPLSQELATDV
FT                   ILQSLPPSFEPFILNYHMNNMDRTLAELHGMLKTVEESIQKNGHHVMMMQNAKRKPPVK
FT                   KLCTKRKLTPDEIASASNAKKGKKGSAASDAVCFYCKETGHWKRNCKKYMEDLKKKQST
FT                   TSASGINVIDINLATSPTDSWVFDTGSVAHSCKSLQGMRRSRGLRRGEVNLRVGNGASV
FT                   ATVAVGTVPLHLPSGLVLELNNCYCVPTLCQNVISASCLQAEGYDFRSMNNGCSIYLRD
FT                   MFYFHAPLVNGLYVLNLEASPIYNINTERQLSNDINPTFIWHCRLGHINKKRMEKLHKD
FT                   GLLHSFDFESFETCESCLLGKMTKAPFTGHSERASDLLALVHTDVCGPMSSTARGGYQY
FT                   FITFTDDFSRYGYIYLMRHKSESFEKFKEFQNEVQNHLGKTIKFLRSDRGGEYVSQEFG
FT                   NHLKDCGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMSQSDLPLSFWGYALETAALT
FT                   LNRVPSKSVEKTPYEIWTGQPPSLSFLKIWGCEAYVKRLQSDKLTPKSDKCFVVGYPKE
FT                   TKGYYFYNREQAKVFVARHGVFLEKEFLSRRVSGIRVHLEEVQETPETVSATTEPQQED
FT                   QSVAPPVVDTPAPRRSERSRRAPDRYTGAEQRDILLLDNDEPKTYEEAMVGHDSNKWLG
FT                   AMKSEIESMYDNQVWNLVDPPDGVKTIECKWLFKKKADMDGNVHIYKARLVAKGFKQIQ
FT                   GVDYDETFSPVAMLKSIRIILAIAAYFDYEIWQMDVKTAFLNGNLSEDVYMIQPQGFVD
FT                   PESPGKICKLQKSIYGLKQASRSWNIRFDEVIKGFGFIKNEEEACVYKKVSGSAIVFLI
FT                   LYVDDILLIGNDIPMLESVKSSLKNSFSMKDLGEAAYILGIRIYRDRSKRLIGLSQSTY
FT                   IDKVLKRFNMHDSKKGFLPMSHGINLSKNQCPQTHDERNKMGMVPYASAIGSIMYAMLC
FT                   TRPDVSYALSATSRYQSDPGEGHWTAVKNILKYLRRTKDMFLVYGGEEDLVVSGYTDAS
FT                   FQTDKDDYRSQSGFVFCLNGGAVSWKSSKQDTVADSTTEAEYIAASEAAKEAVWIKKFV
FT                   SELGVMTSTTGPMSLYCDNSGAIAQAKEPRSHQKSKHILRRYHLIREIVDRGDVKICKV
FT                   HTDLNIADPLTKPLPQPKHEAHTRAMGIRYLHD"
FT   mat_peptide     2826..3674
FT                   /gene="gag"
FT                   /product="gag protein"
FT                   /note="putative"
FT   misc_feature    3588..3629
FT                   /note="RNA binding motif"
FT   mat_peptide     3675..4121
FT                   /gene="pro"
FT                   /product="aspartic proteinase"
FT                   /note="putative"
FT   misc_feature    3738..3746
FT                   /note="protease motif"
FT   mat_peptide     4122..5021
FT                   /gene="int"
FT                   /product="integrase"
FT                   /note="putative"
FT   misc_feature    4146..4251
FT                   /note="zinc finger motif"
FT   misc_feature    4329..4628
FT                   /note="integrase motif"
FT   mat_peptide     5022..6287
FT                   /gene="rt"
FT                   /product="reverse transcriptase"
FT                   /note="putative"
FT   misc_feature    5796..5807
FT                   /note="reverse transcriptase motif"
FT   mat_peptide     6288..6776
FT                   /gene="rh"
FT                   /product="RNase H"
FT                   /note="putative"
FT   misc_feature    6315..6701
FT                   /note="RNase H motif"
FT   primer_bind     6789..6798
FT                   /note="polypurine tract as putative (+)-strand priming
FT                   site"
FT                   /note="putative (+)-strand primer binding site"
FT   repeat_region   6800..8322
FT                   /rpt_type=LONG_TERMINAL_REPEAT
FT                   /note="3' LTR"
XX
SQ   Sequence 8322 BP; 2323 A; 1638 C; 2072 G; 2289 T; 0 other;
     tgttagtgat atgccctaga ggcaacatgg agatgttgga tatcacgtca atgtttatgt        60
     ataaatgaat aaagtgttat tcagttccat gaaatagaca tcttgtattg atcaataagt       120
     acgtgactta tttatgagac tctatatgta tgaatctatt tctaaacgat ccctgatcat       180
     atgtcatatt gttggaacaa atatgattct aagatcggca tgtttattga ttgatgatca       240
     tgtctcatag atcatgggta tggagatacc aaatcaataa acatggacat atgtgttaga       300
     gaacacattg ttggatagac ccaccatgag acactacagg aattaatgtg ccattagttg       360
     gtctcaggta gtgttggtac aaagtcctta gacctgagat caccatggat tccaacatgt       420
     gtagtagcct actttgggac taccaaacgc tattccgtaa ctgggtagtt ataaaggtag       480
     ttttcgggtt tgctatgaaa catggagtgg gatgtgagtg atcaagatgg aatttgcccc       540
     tcctttggag agatatctct gggcccctcg aggtaatgga ttatggaaat gcatggccat       600
     gctaagttga ttgaggagtc aatcaacaga cagataatcc aatacggatc gagtgaatgg       660
     ttaagctatt gaagggatgg cacacatctt gcctatagct taactggtat cgtgaggcaa       720
     agggatttgt gtacacatta caggttcaga gccgatatta tattcttgta tactatggtg       780
     tcaatatgtg ctgctaggcg ccgctgttga caggtcggct gagttggact cgagccgacg       840
     atcggctaag ttagactata gccgaaaccg agtatacctg aacctacagg gtcgcacgct       900
     taaggggata agaacaggca ttcgagttgg actcggatac ggctcgatcg gatagggatc       960
     cgatcggata gagtcctatg ggcttacgac gtatgggcgg ccaactctat gagatacgga      1020
     tgagatccga ttcaaaatag agtcctgatc ggccactagt cccagtcggc caactctata      1080
     taaaggaggg aggggtggag agctgcggta cgtcaattgc atcgcgtgca acagaagtcg      1140
     ccgagatcaa tctccacgga aaccctcccc actttcgagc caaaccctag ccttgcatcg      1200
     cgtgctagca cacgggtgtt cggcgttctg tccccggacg tgtggatacc ggtagaggcg      1260
     ctgctacgtt gcacgctgtt gatcggctgc ggatcggcta cgacatccgc gaatcggctg      1320
     attggtgaat cggttgtccc cgggtatcgg ctgtcggctt gcgggaatcg gctacactgt      1380
     tccggatcgg atagccctga ttggctattt ccgctgcata ctccaatcgg taacgatcta      1440
     tcggccctta cttgcaagtg ttcctggttt gcgcggtaaa aagtttttgt tttcggctag      1500
     cgtagcttcc gcgtaaccct tcagtggtat cagagctaat cttgcttagt tcgggtttta      1560
     gattggatct ggaacaagtt ttgcgcacgg gttgattaga tcaatcggtt ttacgggatt      1620
     tgttgaggag agttgatggt aattgcgttc cttgatagtg tatcggaacg acgtaatgtg      1680
     attacgtcga agcaatatcc atataactcg gtttgagttg ggttcgagat agcgttcctt      1740
     gatagtgtat cggaacggta ttatgtgatt ataccgaagc tatctcgtag tccgactttg      1800
     tttgctgcct cgcgcctacg caaaccatct cccttgatag tgtatcggga cggcataata      1860
     tgattatgcc gatatggttt ccgtaagcga tttcatagat tggatctatg aattcgtgtg      1920
     gtgttacggg cgccatgtgc gtaccccctt ataggtcgtc tcggtgcccg atctctgcga      1980
     taggtagaaa acgtttctac gcctaacttg tttttgcaga caatgcctgc agatggtatg      2040
     gccctattgt atgtatgaga tgcatgtgat gcatgtaata ttctttcata tgtgcaatcc      2100
     ctgtaatatt ctttcatacg tgcaatgctt gtaatattcg ttcacgacct gcgagtcgat      2160
     gtaatcttca attagagctt ctatagtagt agcttagttt gaagaaatca agcattcggc      2220
     actcggaaga acggtgatga agatggagat cctctacacc ggcatggaga tggagatcac      2280
     catgtgaagg gggccatact atttcactat atgctattct acttgctttc atatatgtga      2340
     tatgtttgtt tgataggata gcatccctcg caaaattaag tagtaatgat gcccttccaa      2400
     atgttgcacc cgtcaccgtt atgctcgtca atggtgggtc tgccaaagca gagtgccatt      2460
     ctctatcata acacgagggg gtatatgtca gacatataca tgcaggagca tttggtttac      2520
     ttaacaaggc tatcaaaaca gttttgggcc ttggggcata ggttggggcc ggggcatgga      2580
     gatgccacac caacaacaag agtcacatag agatgtgatt agcaaggtgt tgcttaccga      2640
     tgttattaat cttctagcag tgatgccaaa gctcactaga aaattagtta acatgtggat      2700
     cttgacccag tacgtaacct agagggaagg tgcaaaatac gtaatgggag tcttatgtta      2760
     aattgtttaa gaatagacat agcatgaacg ttgtgctaaa ctttgctttt tctgttgtag      2820
     aataaatggc ggctaatact acacccagta cctttaattt gcgatctatt ctcgaaaagg      2880
     aaaaattgaa tggaacaaac tttatggatt ggtatcgcaa cttgagaatt gttctcaagc      2940
     aagagcgtaa ggagtacgtt cttgaagtac cctatcctga ggagttgcct aataatgcca      3000
     ctgctactgc acggaggggt ttcgagaagc acactaatga tgccctagat ataagctgtc      3060
     tcatgctagc tacaatgtcc cctgagcttc agaagcaata tgagagcagc gatgctcaca      3120
     ctactattca gggactgcgt ggtatgtttg agaaccaagc tcgggacgag aggttcaaca      3180
     cctcaaagtc cttgtttgcg tgcaggcttg ttgaggggaa tcccgtcagt ccgcatgtga      3240
     taaagatgat tggctatatt gagagtctgg aaaaactagg ttttcccctt agccaagagt      3300
     tggctactga tgtgattctc cagtcgctcc ctccgagctt cgagccattc atattaaact      3360
     atcacatgaa caatatggat agaaccttgg ctgaattaca tgggatgcta aagacagttg      3420
     aggagagtat ccagaaaaat ggtcatcatg tgatgatgat gcagaatgct aagcgcaaac      3480
     cacctgtcaa gaaactttgc accaagagga agttaactcc cgatgagatc gcgagtgcct      3540
     ctaatgcaaa gaagggcaag aaggggtcgg cggcatcaga tgccgtttgc ttctattgca      3600
     aggagacagg ccactggaag aggaactgca agaagtacat ggaggatctc aaaaagaagc      3660
     aaagtacgac ttctgcttca ggtattaatg ttatagacat taatcttgct acttcaccta      3720
     ctgactcttg ggtatttgat accggatcag tagctcatag ttgcaaatcg ttgcagggaa      3780
     tgagaagaag tagaggattg agaaggggcg aggtgaacct gcgcgtcggc aatggagcaa      3840
     gcgttgctac agttgctgtc ggcacagtac cacttcatct accttcagga ttagttttgg      3900
     aattgaataa ttgttattgt gttccaacac tatgtcaaaa cgttatttcc gcttcatgtt      3960
     tgcaagcgga aggatatgat tttagatcaa tgaacaatgg ttgttcaata tatctcagag      4020
     atatgttcta ttttcatgct ccattggtga atggattata cgttttgaat cttgaagcgt      4080
     ctcccatcta taacattaat acagaaaggc aactctctaa tgatataaac cccacattta      4140
     tctggcattg tcgcttaggc catataaata agaaacgcat ggagaagctc cataaggatg      4200
     gattgcttca ctcttttgat tttgaatcat ttgagacatg tgagtcttgt ttacttggta      4260
     agatgacaaa ggcacctttc acgggacata gtgagagagc aagtgactta ttggcactcg      4320
     tacatactga tgtatgtgga ccaatgagct caacggccag aggtggttat caatacttca      4380
     ttacctttac cgatgacttt agtagatatg gctatatcta cttaatgagg cataagtccg      4440
     aatcctttga aaagttcaaa gaattccaga atgaagtaca gaatcattta gggaaaacaa      4500
     tcaagtttct acgatcagat cgtggagggg aatacgtgag ccaagagttt ggtaatcatc      4560
     tgaaagattg tggaattgtt ccacagttga ctccgccagg aactccacaa tggaacggag      4620
     tgtccgaacg gagaaatcgc accttgttgg acatggtgcg gtcgatgatg agccaaagtg      4680
     atcttccgtt atccttctgg ggatatgctc ttgaaacagc tgcgctcaca ctaaatagag      4740
     ttccatctaa gtcagttgaa aagacaccat atgagatatg gacagggcaa ccccctagtt      4800
     tgtcttttct caaaatttgg ggatgtgagg cttatgtaaa acgtttacaa tctgacaagc      4860
     tcacacccaa atctgacaag tgcttcgttg tgggatatcc taaggaaact aagggatatt      4920
     acttttataa tcgggaacaa gccaaagtgt ttgtcgcccg acatggtgtc ttcttggaga      4980
     aagagtttct ttcaagaagg gttagtggga tcagggtgca tcttgaagaa gttcaagaaa      5040
     caccagaaac cgtttcagcg accacagaac cacaacagga ggaccaaagt gttgcgccac      5100
     cagttgtaga tacaccagcc ccacgaaggt ctgaaagatc acgtcgtgcg cctgacaggt      5160
     acacaggtgc ggaacaacgt gatatattgt tgttggacaa cgatgaacct aagacctatg      5220
     aggaagcgat ggtgggacac gattccaaca agtggcttgg agccatgaaa tccgaaatag      5280
     aatccatgta cgacaatcaa gtttggaact tggttgatcc acctgatggt gtcaaaacca      5340
     tcgagtgtaa atggcttttt aagaaaaagg ccgatatgga tggaaatgtt cacatctata      5400
     aggcgcgatt ggtggcgaaa ggttttaaac agattcaagg agttgattat gatgaaacct      5460
     tctcgcccgt cgcaatgctt aaatctattc ggattatcct agcgattgct gcatatttcg      5520
     attatgagat atggcagatg gatgtcaaaa cggctttcct aaatggaaac ctaagcgagg      5580
     atgtatacat gatacaacct cagggttttg tcgatccaga atcgcctgga aagatatgca      5640
     agctacagaa atccatttat ggattgaagc aagcatctcg gagttggaat attcgttttg      5700
     atgaagtaat caaagggttt ggtttcatca aaaacgaaga agaggcctgt gtttacaaaa      5760
     aggtcagtgg gagcgcaatt gtatttctaa tcttatatgt ggatgacata ttactgattg      5820
     gaaatgatat ccctatgcta gaatccgtca agtcttcatt gaaaaatagt ttttccatga      5880
     aagacttagg ggaggcagca tacatattgg gcattcggat ctatagagat agatccaaga      5940
     ggctaattgg attaagccaa agtacataca ttgacaaggt gttgaaaagg ttcaacatgc      6000
     atgattccaa gaaaggtttc ttgcccatgt cacatggcat taatcttagc aagaatcagt      6060
     gccctcagac acatgatgag cggaataaga tgggtatggt tccatatgct tcggcaattg      6120
     gatccatcat gtatgccatg ctttgtacac gcccagatgt ctcgtacgct ttgagtgcta      6180
     cgagcagata ccagtcagat ccaggtgaag gtcactggac tgccgtaaag aatatcctta      6240
     agtacttgag aagaactaag gatatgttcc tagtctatgg aggtgaagaa gatctcgttg      6300
     taagtggtta caccgatgct agcttccaaa cagacaagga tgattataga tcgcaatctg      6360
     ggttcgtgtt ctgcctgaac ggaggcgcag tcagctggaa gagttccaag caggatactg      6420
     ttgctgattc tacaacggag gccgagtaca ttgctgcttc ggaagctgca aaggaggctg      6480
     tttggatcaa gaaattcgtt tctgagcttg gtgtgatgac tagtacgact ggtccaatgt      6540
     ctctctattg tgataatagt ggagccattg cgcaagccaa ggagccgagg tcacatcaga      6600
     agtccaaaca catacttcgg cgatatcatc tcatccgcga gatagtggac agaggtgatg      6660
     tcaagatatg caaagtgcac acggatctca acatagccga tccgctgaca aaacctctcc      6720
     ctcagccgaa gcatgaggcg cacacaagag caatgggtat tagatactta catgattgac      6780
     tctagtgcaa gtgggagatt gttagtgata tgccctagag gcaacatgga gatgttggat      6840
     atcacgtcaa tgtttatgta taaatgaata aagtgttatt cagttccatg aaatagacat      6900
     cttgtattga tcaataagta cgtgacttat ttatgagact ctatatgtat gaatctattt      6960
     ctaaacgatc cctgatcata tgtcatattg ttggaacaaa tatgattcta agatcggcat      7020
     gtttattgat tgatgatcat gtctcataga tcatgggtat ggagatacca aatcaataaa      7080
     catggacata tgtgttagag aacacattgt tggatagacc caccatgaga cactacagga      7140
     attaatgtgc cattagttgg tctcaggtag tgttggtaca aagtccttag acctgagatc      7200
     accatggatt ccaacatgtg tagtagccta ctttgggact accaaacgct attccgtaac      7260
     tgggtagtta taaaggtagt tttcgggttt gctatgaaac atggagtggg atgtgagtga      7320
     tcaagatgga atttgcccct cctttggaga gatatctctg ggcccctcga ggtaatggat      7380
     tatggaaatg catggccatg ctaagttgat tgaggagtca atcaacagac agataatcca      7440
     atacggatcg agtgaatggt taagctattg aagggatggc acacatcttg cctatagctt      7500
     aactggtatc gtgaggcaaa gggatttgtg tacacattac aggttcagag ccgatattat      7560
     attcttgtat actatggtgt caatatgtgc tgctaggcgc cgctgttgac aggtcggctg      7620
     agttggactc gagccgacga tcggctaagt tagactatag ccgaaaccga gtatacctga      7680
     acctacaggg tcgcacgctt aaggggataa gaacaggcat tcgagttgga ctcggatacg      7740
     gctcgatcgg atagggatcc gatcggatag agtcctatgg gcttacgacg tatgggcggc      7800
     caactctatg agatacggat gagatccgat tcaaaataga gtcctgatcg gccactagtc      7860
     ccagtcggcc aactctatat aaaggaggga ggggtggaga gctgcggtac gtcaattgca      7920
     tcgcgtgcaa cagaagtcgc cgagatcaat ctccacggaa accctcccca ctttcgagcc      7980
     aaaccctagc cttgcatcgc gtgctagcac acgggtgttc ggcgttctgt ccccggacgt      8040
     gtggataccg gtagaggcgc tgctacgttg cacgctgttg atcggctgcg gatcggctac      8100
     gacatccgcg aatcggctga ttggtgaatc ggttgtcccc gggtatcggc tgtcggcttg      8160
     cgggaatcgg ctacactgtt ccggatcgga tagccctgat tggctatttc cgctgcatac      8220
     tccaatcggt aacgatctat cggcccttac ttgcaagtgt tcctggtttg cgcggtaaaa      8280
     agtttttgtt ttcggctagc gtagcttccg cgtaaccctt ca                         8322
//