0%

The UniProt databases

UniProt comprises three different databases (Figure 1) which are optimised for different uses:

  1. The UniProt Knowledgebase (UniProtKB) is used to access functional information on proteins. A subset of UniProtKB entries forms the Proteomes dataset. This consists of the set of proteins thought to be expressed by an organism whose genome has been completely sequenced
  2. The UniProt Reference Clusters (UniRefs) provide clustered sets of sequences at several resolutions
  3. The UniProt Archive (UniParc) is a sequence archive which contains all protein sequences from the main publicly available protein sequence databases
Figure 1 Sources and flow of data for UniProt databases.