TargetDB Statistics Summary Report |
Last updated: Jul 2 2008 |
Target Status Statistics
Total number of targets deposited by worldwide SG Centers in TargetDB:
188563
Table 1: TargetDB Status Statistics
| Status |
Total Number of Targets |
(%) Relative to "Cloned" Targets | (%) Relative to "Expressed" Targets | (%) Relative to "Purified" Targets |
(%) Relative to "Crystallized" Targets |
| Cloned | 122933 | 100.0 | - | - | - |
| Expressed | 81220 | 66.1 | 100.0 | - | - |
| Soluble | 32543 | 26.5 | 40.1 | - | - |
| Purified | 28760 | 23.4 | 35.4 | 100.0 | - |
| Crystallized | 10451 | 8.5 | 12.9 | 36.3 | 100.0 |
| Diffraction-quality Crystals | 5195 | 4.2 | 6.4 | 18.1 | 49.7 |
| Diffraction | 4750 | 3.9 | 5.8 | 16.5 | 45.5 |
| NMR Assigned | 1766 | 1.4 | 2.2 | 6.1 | - |
| HSQC | 3240 | 2.6 | 4.0 | 11.3 | - |
| Crystal Structure | 3881 | 3.2 | 4.8 | 13.5 | 37.1 |
| NMR Structure | 1684 | 1.4 | 2.1 | 5.9 | - |
| In PDB1 | 5710 | 4.6 | 7.0 | 19.9 | 39 |
| Work Stopped | 30855 | - | - |
- | - |
| Test Target | 1 | - | - |
- | - |
| Other | 6963 | - | - |
- | - |
Last updated: Jul 2 2008
Note 1:
Number of targets with status "in PDB". A target may reference several
PDB IDs (example: structure of the same polypeptides with different ligands).
Multiple targets in TargetDB may identify the same PDB structure when a
stucture is a result of collaboration between different centers and each
center includes the target on its target list.
Figure 1: Experimental Status in TargetDB

Last updated: Jul 2 2008
This graph is normalized
relative to number of cloned targets in TargetDB.
Targets that progressed to status "Cloned" constitute 65% of TargetDB.
Table 2: TargetDB Status Statistics by Organism
| Organism |
Total Number1 |
Work Stopped |
Cloned |
Expressed |
Purified |
Crystallized |
Crystal Structure |
NMR Structure |
In PDB2 |
| Total Viruses | 603 | 117 | 343 | 228 | 124 | 32 | 27 | 8 | 31 |
| Archaea | 13476 | 1758 | 9488 | 6470 | 3098 | 1235 | 600 | 45 | 679 |
| Bacteria | 109941 | 12416 | 73152 | 53578 | 19294 | 7775 | 2782 | 192 | 3103 |
| Total Prokaryotes | 123417 | 14174 | 82640 | 60048 | 22392 | 9010 | 3382 | 237 | 3782 |
| Yeast | 2604 | 664 | 1906 | 1335 | 774 | 112 | 54 | 14 | 50 |
| Plasmodium | 5200 | 336 | 2957 | 1263 | 200 | 67 | 19 | 0 | 19 |
| Trypanosoma | 6419 | 79 | 3974 | 1930 | 300 | 59 | 9 | 0 | 8 |
| Leishmania | 9597 | 288 | 4575 | 2208 | 404 | 146 | 21 | 0 | 17 |
| Arabidopsis | 8099 | 5390 | 4030 | 1273 | 319 | 82 | 36 | 52 | 87 |
| Rice | 134 | 101 | 125 | 63 | 12 | 4 | 1 | 0 | 1 |
| Nematode | 15068 | 3463 | 12645 | 5590 | 425 | 99 | 28 | 7 | 36 |
| Fly | 877 | 271 | 171 | 93 | 42 | 5 | 3 | 3 | 6 |
| Mouse | 2281 | 922 | 1773 | 1404 | 755 | 203 | 67 | 267 | 336 |
| Human | 12321 | 4404 | 6354 | 4671 | 2655 | 520 | 159 | 1080 | 1241 |
| Other Eukaryotes | 1940 | 646 | 1437 | 1111 | 356 | 112 | 75 | 14 | 94 |
| Total Eukaryotes | 64540 | 16564 | 39947 | 20941 | 6242 | 1409 | 472 | 1437 | 1895 |
| Synthetic | 3 | 0 | 3 | 3 | 3 | 1 | 1 | 2 | 3 |
| Unknown | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| Total | 188564 | 30855 | 122934 | 81221 | 28761 | 10452 | 3882 | 1684 | 5711 |
Last updated: Jul 2 2008
Note 1:
Total counts in this table may differ from total number of targets. If targtet
is a hybrid complex (for example:a complex of human and mouse polypeptides)
it is counted in different organism classifications.
Note 2:
Number of targets with status "in PDB". A target may reference several
PDB IDs (example: structure of the same polypeptides with different ligands).
Multiple targets in TargetDB may identify the same PDB structure when a stucture
is a result of collaboration between different centers and each center includes
the target on its target list.
Figure 2: Source Organisms in TargetDB

Last updated: Jul 2 2008
back to top
Deposited Structure StatisticsNumber of released X-Ray structures reported to TargetDB: 4128
Number of released NMR structures reported to TargetDB: 1679
Number of released Cryo-Electron Microscopy structures reported to TargetDB: 3
Total number of released structures from worldwide SG Centers reported to TargetDB:
5810
View list of all reported to TargetDB structures deposited by worldwide SG Centers to the PDB
Table 3: PDB Status Statistics for Structural Genomics Structures
| Status | All Centers | PSI Centers | Non-PSI SG Centers in North America |
SG Centers in Europe | SG Centers in Asia |
| Total Deposited | 5943 | 3040 | 111 | 134 | 2670 |
| Released | 5810 | 2943 | 108 | 130 | 2641 |
| Release on Publication | 27 | 0 | 1 | 0 | 26 |
| Release on Certain Date | 44 | 41 | 0 | 0 | 3 |
| In Process | 62 | 56 | 2 | 4 | 0 |
|
Last updated: Jul 2 2008 | | 1:
Some PDB IDs are cross referenced by different centers. Example: PDB_id 106Y is associated with SPINE and TB centers. Therefore difference between number of structures in "ALL Centers" column and direct sum of number of
structures from projects/geographical regions can be observed. |
| 2:
"Total Deposited" are all structures in the PDB including structures released to the public
and structures that are in the process to be released("Released on Publication"
, "Released on Certain Date", etc.). |
Figure 3: Structures Released by SG Centers by Year

Last updated: Jul 2 2008
back to top
Sequence Redundancy Statistics
Table 4: TargetDB Sequence Redundancy Statistics by Experimental Status
| Sequence Identity(%) | Novel Targets
Status: Selected |
Novel Targets Status: Cloned |
Novel Targets
Status: Expressed |
Novel Targets Status: Purified |
Novel Targets
Status: Crystallized |
Novel Targets Status: Crystal Structure | Novel Targets
Status: NMR Structure | Novel Targets Status: in PDB |
| <100 | 126672 | 90922 | 60928 | 23701 | 9052 | 3488 | 1566 | 4875 |
| <90 | 116074 | 85184 | 57312 | 22408 | 8643 | 3323 | 1546 | 4685 |
| <70 | 105202 | 78621 | 53545 | 21259 | 8429 | 3277 | 1422 | 4527 |
| <50 | 86255 | 66658 | 46023 | 18648 | 7826 | 3121 | 1269 | 4214 |
| <30 | 48127 | 40569 | 28867 | 12470 | 6022 | 2577 | 894 | 3295 |
|
Last updated: 08-04-08 | | Sequence redundancy is calculated
by clustering analysis
using BLASTClust program with similarity threshold set
to percent of sequence identity.
Please view
detailed explanation of sequence redundancy calculations and
BLASTClust threshold settings. Sequence redundancy calculations are based on
comparison to all protein sequences in TargetDB which are in the same
experimental status category and at least 20 amino acids long |
Table 5: Sequence Redundancy Statistics for Structures Released by SG Centers in the PDB by Year
| Year | Released Structures |
Number of Released Structures <30% Sequence Identity at Time of Release | Percent(%) of Released Structures <30% Sequence Identity at Time of Release | | <= 2000 | 84 | 31 | 37 |
| 2001 | 59 | 24 | 41 |
| 2002 | 152 | 58 | 38 |
| 2003 | 388 | 154 | 40 |
| 2004 | 911 | 384 | 42 |
| 2005 | 1023 | 367 | 36 |
| 2006 | 1087 | 455 | 42 |
| 2007 | 1548 | 583 | 38 |
| 2008 | 558 | 229 | 41 |
| Total | 5810 | 2285 | 39 |
| Last updated: 08-07-02 | | Sequence redundancy is calculated
by clustering analysis
using BLASTClust program with similarity threshold set
to percent of sequence identity.
Please view
detailed explanation of sequence redundancy calculations and
BLASTClust threshold settings. Sequence redundancy calculations are based on
comparison to all protein sequences in the PDB which are at least
20 amino acids long |
Figure 4: Comparison of Novel Structures with Number of Structures Released By SG Centers

| Last updated: 08-07-02 | | Sequence redundancy is calculated
by clustering analysis
using BLASTClust program with similarity threshold set
to percent of sequence identity.
Please view
detailed explanation of sequence redundancy calculations and
BLASTClust threshold settings. Sequence redundancy calculations are based on
comparison to all protein sequences in the PDB which are at least
20 amino acids long |
back to top
Summary Statistics Reports by Project or Geographical Region:
|