Dataset updates for September 2024
The latest dataset releases are available for searching in our Sequence Similarity Search bioinformatics applications and for entry retrieval through Dbfetch.
Overview of the current biological databases
The dataset composition as of 1st October 2024 is shown below. It can also be browsed on the JD Data Statistics page.
Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
---|---|---|---|---|
AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
ChEMBL | protein | 1 | 14,321 | 05/09/2024 14:34:46 |
EMVec | nucleotide | 1 | 7,561 | 17/07/2024 04:59:53 |
ENA | nucleotide | 115 | 78,584,200 | 18/07/2024 19:50:37 |
ENA cds | nucleotide | 88 | 394,866,750 | 19/07/2024 17:34:29 |
ENA expcon | nucleotide | 1 | 43,043 | 17/07/2024 04:49:43 |
ENA ncr | nucleotide | 64 | 49,145,362 | 28/09/2024 02:11:28 |
ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
ENA spacer | nucleotide | 45 | 212,565 | 01/07/2023 02:49:57 |
Ens | mixed | 10,870 | 297,469,886 | 11/09/2024 16:15:30 |
EnsCovid | mixed | 4 | 26 | 31/12/2023 00:24:02 |
EnsGenomes | mixed | 167,173 | 365,466,580 | 10/06/2024 15:34:31 |
EPO | protein | 1 | 4,441,107 | 23/05/2023 17:05:48 |
HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
IMGTHLAcds | nucleotide | 1 | 40,618 | 22/07/2024 11:21:13 |
IMGTHLAgen | nucleotide | 1 | 22,583 | 22/07/2024 11:21:11 |
IMGTHLApro | protein | 1 | 40,416 | 22/07/2024 11:21:13 |
IMGTLIGM | nucleotide | 1 | 246,842 | 22/07/2024 11:23:42 |
IntAct | protein | 1 | 124,556 | 19/09/2024 00:38:24 |
InterPro | protein | 1 | 45,899 | 30/07/2024 01:26:47 |
IPDKIRcds | nucleotide | 1 | 1,534 | 22/07/2024 11:21:13 |
IPDKIRgen | nucleotide | 1 | 880 | 22/07/2024 11:22:11 |
IPDKIRpro | protein | 1 | 1,387 | 22/07/2024 11:22:15 |
IPDMHCcds | nucleotide | 1 | 11,506 | 22/07/2024 11:20:11 |
IPDMHCgen | nucleotide | 1 | 3,008 | 22/07/2024 11:21:43 |
IPDMHCpro | protein | 1 | 11,506 | 22/07/2024 11:22:42 |
IPDNHKIRcds | nucleotide | 1 | 1,072 | 22/07/2024 11:22:11 |
IPDNHKIRgen | nucleotide | 1 | 13 | 22/07/2024 11:22:11 |
IPDNHKIRpro | protein | 1 | 1,072 | 22/07/2024 11:23:12 |
IPRMC | protein | 1 | 245,963,890 | 30/07/2024 12:07:26 |
IPRMC_UNIPARC | protein | 1 | 1 | 04/08/2024 01:51:21 |
JPO | protein | 1 | 6,565,074 | 27/09/2024 00:52:22 |
KIPO | protein | 1 | 2,087,869 | 27/09/2024 00:30:16 |
MP | protein | 1 | 1,228,767 | 22/07/2024 11:18:39 |
MPEP | protein | 1 | 1,228,278 | 22/07/2024 11:18:07 |
MPRO | protein | 1 | 5,098 | 22/07/2024 11:17:40 |
PANTHER | protein | 1 | 123,151 | 10/11/2023 01:10:31 |
Patent Equivalents | protein | 1 | 119,710 | 03/04/2023 14:31:12 |
PDB | protein | 1 | 812,286 | 26/09/2024 02:10:52 |
PDBaa | protein | 1 | 812,286 | 26/09/2024 02:10:15 |
PDBna | nucleotide | 1 | 51,740 | 26/09/2024 02:13:00 |
Pfam | protein | 1 | 21,979 | 26/06/2024 01:22:33 |
Rfam | nucleotide | 1 | 4,178 | 19/09/2024 01:03:21 |
TAXONOMY | other | 1 | 1 | 01/10/2024 00:29:36 |
TESTDB | protein | 1 | 3 | |
TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
UniParc | protein | 401 | 1,614,380,330 | 29/07/2024 09:54:03 |
UniProtKB | protein | 3 | 10,695,431 | 29/07/2024 09:43:57 |
UniProtKB Divisions | protein | 18 | 149,459,298 | 29/07/2024 11:40:27 |
UniRef | protein | 3 | 44,000,000 | 29/07/2024 11:40:47 |
UniVec | nucleotide | 1 | 6,111 | 24/08/2024 00:16:42 |
USPTO | protein | 1 | 9,733,759 | 12/07/2024 01:39:53 |
WormBase | mixed | 1,120 | 22,358,127 | 24/04/2024 15:23:51 |
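For readers who want to work with these figures programmatically, the following is a minimal sketch of how rows in the table above could be parsed and totalled by sequence type. It assumes the pipe-delimited layout shown here, that counts use commas as thousands separators, and that timestamps are in day/month/year order (inferred from values such as 28/07/2022). The helper name `parse_row` and the dictionary keys are illustrative only and are not part of any EMBL-EBI tool.

```python
from collections import defaultdict
from datetime import datetime

# A few rows copied verbatim from the table above; other rows parse the same way.
SAMPLE_ROWS = """\
AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
ENA | nucleotide | 115 | 78,584,200 | 18/07/2024 19:50:37 |
TESTDB | protein | 1 | 3 | |
"""


def parse_row(line):
    """Turn one pipe-delimited table row into a dict of typed fields.

    Hypothetical helper: assumes exactly five columns and dd/mm/yyyy timestamps.
    """
    # Drop the trailing pipe, then split into the five cells.
    cells = [c.strip() for c in line.strip().rstrip("|").split("|")]
    name, seq_type, n_datasets, n_entries, updated = cells
    return {
        "name": name,
        "seq_type": seq_type,
        "datasets": int(n_datasets.replace(",", "")),
        "entries": int(n_entries.replace(",", "")),
        # Some rows (e.g. TESTDB) have no timestamp; keep those as None.
        "updated": datetime.strptime(updated, "%d/%m/%Y %H:%M:%S") if updated else None,
    }


rows = [parse_row(line) for line in SAMPLE_ROWS.splitlines()]

# Sum the entry counts per sequence type.
totals = defaultdict(int)
for row in rows:
    totals[row["seq_type"]] += row["entries"]

for seq_type, count in sorted(totals.items()):
    print(f"{seq_type}: {count:,} entries")
```

Parsing the timestamp with an explicit format string avoids the day/month ambiguity that locale-dependent parsers can introduce for dates such as 05/09/2024.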