The UniProt databases
UniProt comprises three databases (Figure 1), each optimised for a different use:
- The UniProt Knowledgebase (UniProtKB) is used to access functional information on proteins. A subset of UniProtKB entries forms the Proteomes dataset, which consists of the set of proteins thought to be expressed by an organism whose genome has been completely sequenced
- The UniProt Reference Clusters (UniRef) provide clustered sets of sequences at three resolutions: 100%, 90% and 50% sequence identity (UniRef100, UniRef90 and UniRef50)
- The UniProt Archive (UniParc) is a sequence archive which contains all protein sequences from the main publicly available protein sequence databases
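
As a rough illustration of how the databases listed above are addressed separately, the sketch below builds the URL of a single entry in each one. It assumes the public UniProt REST service at `https://rest.uniprot.org` and the path segments shown; neither is stated in the text above. The helper name `entry_url` is hypothetical, and the identifiers are illustrative examples only. No network request is made.

```python
# Assumed base URL of the UniProt REST service (not given in the text above).
BASE_URL = "https://rest.uniprot.org"

# Assumed path segment for each dataset described in the list above.
DATASET_PATHS = {
    "UniProtKB": "uniprotkb",
    "Proteomes": "proteomes",
    "UniRef": "uniref",
    "UniParc": "uniparc",
}


def entry_url(dataset: str, identifier: str, fmt: str = "json") -> str:
    """Build the URL of one entry in the given dataset (hypothetical helper)."""
    try:
        path = DATASET_PATHS[dataset]
    except KeyError:
        raise ValueError(f"unknown dataset: {dataset!r}") from None
    return f"{BASE_URL}/{path}/{identifier}.{fmt}"


if __name__ == "__main__":
    # Illustrative identifiers: each dataset uses its own identifier scheme.
    examples = [
        ("UniProtKB", "P05067"),        # protein accession
        ("Proteomes", "UP000005640"),   # proteome identifier
        ("UniRef", "UniRef90_P05067"),  # cluster identifier with resolution prefix
        ("UniParc", "UPI000002DB1C"),   # archive identifier
    ]
    for dataset, identifier in examples:
        print(dataset, "->", entry_url(dataset, identifier))
```

The point of the sketch is only that each database has its own identifier scheme (accession, proteome ID, cluster ID, archive ID), so the database a record belongs to is usually evident from its identifier.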