ID   BAE28059; SV 1; linear; mRNA; HTC; MUS; 1101 BP.
XX
PA   AK147667.1
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus (house mouse) hypothetical protein
XX
KW   CAP trapper; HTC; HTC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; e347667973b1cf732bc472f399fd38f0.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1101
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="M5C1105H05"
FT                   /cell_line="UNDEFINED_CELL_LINE"
FT                   /tissue_type="brain"
FT                   /db_xref="taxon:10090"
FT   CDS             AK147667.1:302..1402
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="SET domain-containing protein 7 (MGD|MGI:1920501
FT                   GB|AK129428, evidence: BLASTN, 99%, match=4993)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q8VHL1"
FT                   /db_xref="InterPro:IPR001214"
FT                   /db_xref="InterPro:IPR003409"
FT                   /db_xref="InterPro:IPR017155"
FT                   /db_xref="MGI:MGI:1920501"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q8VHL1"
FT                   /protein_id="BAE28059.1"
FT                   /translation="MDSDDEVVEEAVEGHLDDDGLPHGFCTVTYSSTDRFEGNFVHGEK
FT                   NGRGKFFFFDGSTLEGYYVDDALQGQGVYTYEDGGVLQGTYVDGELNGPAQEYDSDGRL
FT                   IFKGQYKDNNRHGVCWIHYPDGGSLVGEVNEDGEMTGEKIAYVYPDQRTALYGKFIDGE
FT                   MLEGKLATLMATEEGRPHFEVTSGSSVYHFDKSTSSCISSDALLPDPYESERVYVADSL
FT                   ISSAGEGLFSKVAVGPNTVMSFYNGVRITHQEVDSRDWALNGNTLSLDEETVIDVPEPY
FT                   NHVSKYCASLGHKANHSFTPNCVYDLFVHPRFGPIKCIRTLRAVEAEEELTVAYGYDHS
FT                   PPGKSGPEAPEWYQVELKAFQATQQK"
XX
SQ   Sequence 1101 BP; 265 A; 278 C; 326 G; 232 T; 0 other;
     atggatagcg acgatgaggt ggtggaggaa gccgtggaag ggcacctgga cgatgacgga        60
     ttaccacacg ggttctgcac tgtcacctac tcctccacgg acagatttga gggcaacttt       120
     gtgcatggag aaaagaatgg gcgtgggaaa ttcttcttct ttgacggcag caccctggag       180
     ggatattacg tggacgatgc cctgcagggc cagggcgttt acacctatga ggatggaggt       240
     gttctccaag gcacctatgt ggatggagag ctgaatggac cggcccagga gtacgactct       300
     gacgggaggc tgatcttcaa gggacagtac aaagacaaca atcgacacgg agtgtgttgg       360
     atccattacc cggatggagg cagcctcgtc ggggaagtca atgaagatgg agagatgacg       420
     ggagaaaaga tcgcctatgt gtaccctgac cagaggactg cgctctatgg gaagttcatt       480
     gacggagaga tgctcgaagg gaaactggcc accctcatgg ccaccgaaga agggaggcca       540
     cactttgaag tgacgtcggg aagttccgtg taccactttg ataaatcgac ttcctcctgc       600
     atctccagcg atgctctcct tccagaccct tatgagtctg agagggttta tgttgccgac       660
     tctctcatct ccagtgctgg agaagggctg ttttccaagg tagcagttgg acctaatact       720
     gttatgtcgt tttataatgg agtccgaatt acacaccaag aggttgacag cagggattgg       780
     gccctaaatg ggaacactct ctccctggac gaggagacag tcattgatgt gcccgagccc       840
     tacaaccacg tgtccaagta ctgtgcctcc ttgggacaca aggccaatca ctccttcact       900
     ccgaactgcg tctatgacct gtttgtccac ccccgtttcg ggccgatcaa gtgcatccgc       960
     accctgcgag ctgtggaggc cgaggaagag ctgaccgttg cctacggcta tgaccacagc      1020
     cccccgggga agagtgggcc tgaagccccc gagtggtatc aagtggagct gaaggccttc      1080
     caggccaccc agcagaagtg a                                                1101
//