The non-redundant patent sequence databases have been created at two levels from the patent class (PAT) in EMBL-Bank and from the patent protein databases. At level 1, sequences that are 100% identical over the same length are merged into a single entry; at level 2, merged sequences must be identical and also belong to the same patent family (the same invention). Level-2 sequence clusters have been enriched with biological information and additional data from the patent documents.
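The two levels can be illustrated with a small sketch. It is not the pipeline used to build the databases. The record fields (`id`, `sequence`, `family`), the sample data, and the case normalisation are all assumptions made for illustration only.

```python
from collections import defaultdict


def cluster_level1(records):
    """Group records whose sequences are 100% identical over the same length."""
    clusters = defaultdict(list)
    for rec in records:
        # Exact string equality implies equal length; upper-casing is an assumption.
        clusters[rec["sequence"].upper()].append(rec)
    return dict(clusters)


def cluster_level2(records):
    """Group records that share both the sequence and the patent family."""
    clusters = defaultdict(list)
    for rec in records:
        key = (rec["sequence"].upper(), rec["family"])
        clusters[key].append(rec)
    return dict(clusters)


# Hypothetical records: identifiers and family labels are made up.
records = [
    {"id": "A1", "sequence": "MKTAYIAK", "family": "F1"},
    {"id": "A2", "sequence": "MKTAYIAK", "family": "F1"},
    {"id": "B1", "sequence": "MKTAYIAK", "family": "F2"},
    {"id": "C1", "sequence": "MKTAYIAKQ", "family": "F1"},
]

level1 = cluster_level1(records)
level2 = cluster_level2(records)
print(len(level1), len(level2))
```

In this toy data set the three identical sequences form one level-1 cluster, which level 2 splits by patent family, so level 2 has more and smaller clusters than level 1.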

Non-redundant Patent Sequence Coverage 

NR database                                Abbreviation   Coverage
Non-redundant patent proteins level-1      NRPL1          EPO, JPO, KIPO and USPTO patent proteins
Non-redundant patent proteins level-2      NRPL2          EPO, JPO, KIPO and USPTO patent proteins
Non-redundant patent nucleotides level-1   NRNL1          EMBL-Bank patents
Non-redundant patent nucleotides level-2   NRNL2          EMBL-Bank patents

Data Search and Query

Users can run a similarity and homology search tool, e.g. FASTA, to search sequences against the non-redundant databases, and use EBI Search (NRNL1, NRNL2, NRPL1 and NRPL2) for text querying.

Downloads 

Database   FASTA format data   EMBL-like format data
NRNL1      nrnl1               nrnl1.annot
NRNL2      nrnl2               nrnl2.annot
NRPL1      nrpl1               nrpl1.annot
NRPL2      nrpl2               nrpl2.annot
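As a minimal sketch of working with the FASTA format downloads, the reader below yields header and sequence pairs from a text stream. The helper name `read_fasta` and the sample entries are hypothetical. The header layout of the real files is not described here, so the header is kept as an opaque string.

```python
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text stream."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous entry, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence lines may be wrapped
    if header is not None:
        yield header, "".join(chunks)


# Made-up sample data standing in for a downloaded file.
sample = io.StringIO(
    ">example_1 hypothetical entry\nMKTAYIAK\nQRQISFVK\n>example_2\nACGT\n"
)
entries = list(read_fasta(sample))
print(entries)
```

For a downloaded file, the same function could be called on an open text handle, e.g. `with open("nrpl1") as fh: ...`, assuming the file is stored locally as uncompressed text under that name.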

Documentation

Publications 

Original paper: Li W., McWilliam H., Richart de la Torre A., Grodowski A., Benediktovich I., Goujon M., Nauche S. and Lopez R. (2010) Non-redundant patent sequence databases with value-added annotations at two levels. Nucleic Acids Research. 38(Database issue):D52-D56. DOI: 10.1093/nar/gkp960

Database update: Li W., Kondratowicz B., McWilliam H., Nauche S. and Lopez R. (2013) The annotation-enriched non-redundant patent sequence databases. Database: The Journal of Biological Databases and Curation. 2013:bat005. DOI: 10.1093/database/bat005

People and collaborators

The project is a collaboration between the EPO life science team and the External Services team at the EBI.

Funding

This work is funded by the European Commission under FELICS, contract number 021902 (RII3), within the Research Infrastructure Action of the FP6 "Structuring the European Research Area" Programme. Additional funding has been provided by EMBL-EBI and the European Patent Office.

Contact

Enquiries regarding EBI patent data should be sent to EBI Support.