Protein Structure Initiative
Print Page | Close Window

Statistics Summary Report for Non-PSI Structural Genomics Centers in North America

                         Last updated: Jul 2 2008


|BSGI| |S2F| |SSGCID|

Target Status Statistics

Total number of targets deposited by Non-PSI Structural Genomics Centers in North America to TargetDB: 2387

Table 1: Status Statistics for Non-PSI Structural Genomics Centers in North America

Status Total Number of Targets(%) Relative to "Cloned" Targets(%) Relative to "Expressed" Targets(%) Relative to "Purified" Targets(%) Relative to "Crystallized" Targets
Cloned1761100.0---
Expressed122269.4100.0--
Soluble60034.149.1--
Purified65937.453.9100.0-
Crystallized21812.417.833.1100.0
Diffraction-quality Crystals955.47.814.443.6
Diffraction633.65.29.628.9
NMR Assigned10.10.10.2-
HSQC352.02.95.3-
Crystal Structure653.75.39.929.8
NMR Structure120.71.01.8-
In PDB11086.18.816.445
Work Stopped249----
Test Target0----
Other0----

Last updated: Jul 2 2008
Note 1:   Number of targets with status "in PDB". A target may reference several PDB IDs (example: structure of the same polypeptides with different ligands).
Multiple targets in TargetDB may identify the same PDB structure when a stucture is a result of collaboration between different centers and each center includes
the target on its target list.

Figure 1: Target Experimental Status for Non-PSI Structural Genomics Centers in North America

Last updated: Jul 2 2008

This graph is normalized relative to number of cloned targets in TargetDB.
Targets that progressed to status "Cloned" constitute 74% of TargetDB.

Table 2: Status Statistics for Non-PSI Structural Genomics Centers in North America by Organism

Organism Total Number1 Work Stopped Cloned Expressed Purified Crystallized Crystal Structure NMR Structure In PDB2
Archaea20103541101
Bacteria2013239153410415821815412100
Total Prokaryotes2033249153710465861825512101
Leishmania202000000
Total2035249153910465861825512101

Last updated: Jul 2 2008

Note 1:   Total counts in this table may differ from total number of targets. If targtet is a hybrid complex
(for example:a complex of human and mouse polypeptides) it is counted in different organism classifications.

Note 2:   Number of targets with status "in PDB". A target may reference several PDB IDs (example: structure of the same polypeptides with different ligands).
Multiple targets in TargetDB may identify the same PDB structure when a stucture is a result of collaboration between different centers and each center includes
the target on its target list.

Figure 2: Source Organisms in Non-PSI Structural Genomics Centers in North America

Last updated: Jul 2 2008 back to top


Deposited Structure Statistics for Non-PSI Structural Genomics Centers in North America

Number of Released X-Ray Structures: 93

Number of Released NMR Structures: 15

Total number of released structures from Non-PSI Structural Genomics Centers in North America in the PDB: 108

Table 3: PDB Status Statistics for Structures from Non-PSI Structural Genomics Centers in North America

PDB StatusBSGIS2FSSGCIDTotal
Total Deposited53517111
Released51516108
Release on Publication1001
Release on Certain Date0000
In Process1012
Last updated: Jul 2 2008
Note 1:   "Total Deposited" are all structures in the PDB including structures released to the public and structures that are in the process to be released("Released on Publication" , "Released on Certain Date", etc.).
Note 2:  Some PDB IDs are cross referenced by different centers. Therefore difference between "Total" number of structures and direct sum of number of structures from individual centers can be observed

Figure 3: Structures Released by Non-PSI Structural Genomics Centers in North America by Year

Last updated: Jul 2 2008

Table 4: List of Structures Deposited in the PDB by Non-PSI Structural Genomics Centers in North America

Total number of structures: 112

Structures of distinct targets: 1051

1   A target may reference several PDB IDs (example: structure of the same polypeptides with different ligands).
In this case only one structure is counted to compute number of structures of distinct targets

Related PDB_ID(s): PDB_ID(s) associated with the same target in targetDB

PDB_ID CenterTitleTarget_id Deposition DateReleased Date PDB StatusRelated PDB_ID in TargetDB
3CQJ BSGIcrystal structure of l-xylulose-5-phosphate 3-epimerase ulae (form b) complex with zn2+ULAE_ECO572008-04-030000-00-00AUTHnone
3BE6 BSGIcrystal structure of fite (crystal form 2), a group iii periplasmic siderophore binding proteinFITE_ECO572007-11-160000-00-00HPUBnone
1SBZ BSGIcrystal structure of dodecameric fmn-dependent ubix-like decarboxylase from escherichia coli o157:h7PAD1_ECO572004-02-112004-10-26RELnone
2AMJ BSGIcrystal structure of modulator of drug activity b from escherichia coli o157:h7MDAB_ECO572005-08-092006-07-18RELnone
1LKZ BSGIcrystal structure of d-ribose-5-phosphate isomerase (rpia) from escherichia coli.RPIA_ECOLI2002-04-262002-05-08RELnone
1NT4 BSGIcrystal structure of escherichia coli periplasmic glucose-1-phosphatase h18a mutant complexed with glucose-1-phosphateAGP_ECOLI2003-01-282004-01-13RELnone
2FLO BSGIcrystal structure of exopolyphosphatase (ppx) from e. coli o157:h7PPX_ECO572006-01-062006-06-06RELnone
2FSU BSGIcrystal structure of the phnh protein from escherichia coliPHNH_ECOLI2006-01-232007-03-27RELnone
1FC4 BSGI2-amino-3-ketobutyrate coa ligaseKBL_ECOLI2000-07-172001-05-02RELnone
2D40 BSGIcrystal structure of z3393 from escherichia coli o157:h7Z3393_ECO572005-10-052006-09-26RELnone
1TQ5 BSGIcrystal structure of yhhw from escherichia coliYHHW_ECOLI2004-06-162005-06-14RELnone
3B8N BSGIstructure of fepe- bacterial polysaccharide co-polymeraseFEPE_ECO572007-11-012008-01-22RELnone
1Y00 BSGIsolution structure of the carbon storage regulator protein csraCSRA_ECOLI2004-11-132005-06-21RELnone
2B59 BSGIthe type ii cohesin dockerin complexCIPA_CLOTM NON-BSGI:P71143_CLOTM2005-09-272005-10-11RELnone
1GZ0 BSGI23s ribosomal rna g2251 2'o-methyltransferase rlmbRLMB_ECOLI2002-05-032002-10-17RELnone
3B8O BSGIstructure of wzze- bacterial polysaccharide co-polymeraseWZZE_ECO572007-11-012008-01-22RELnone
2FT0 BSGIcrystal structure of tdp-fucosamine acetyltransferase (wecd)- complex with acetyl-coaWECD_ECOL62006-01-232006-08-01RELnone
1XK7 BSGIcrystal structure- c2 form- of escherichia coli crotonobetainyl-coa: carnitine coa transferase (caib)CAIB_ECOLI2004-09-272005-03-15RELnone
1ZPS BSGIcrystal structure of methanobacterium thermoautotrophicum phosphoribosyl-amp cyclohydrolase hisiHIS3_METTH2005-05-172005-08-30RELnone
1K75 BSGIthe l-histidinol dehydrogenase (hisd) structure implicates domain swapping and gene duplication.HISX_ECOLI2001-10-182002-02-27RELnone
1RRZ BSGIsolution structure of glgs protein from e. coliGLGS_ECOLI2003-12-092004-06-01RELnone
2OVI BSGIstructure of the heme binding protein chuxCHUX_ECO572007-02-132008-02-12RELnone
1KON BSGIcrystal structure of e.coli yebcYEBC_ECOLI2001-12-212002-07-17RELnone
1FC5 BSGIcrystal structure of molybdopterin biosynthesis moea proteinMOEA_ECOLI2000-07-172001-07-25RELnone
1R6Y BSGIcrystal structure of ygin from escherichia coliYGIN_ECOLI2003-10-172004-11-02RELnone
1U9T BSGIcrystal structure analysis of chus, an e. coli heme oxygenaseCHUS_ECO572004-08-102005-10-25RELnone
1U5W BSGIcrystal structure of hypothetical protein yjjx from escherichia coliYJJX_ECOLI2004-07-282005-08-23RELnone
2OZN BSGIthe cohesin-dockerin complex of nagj and nagh from clostridium perfringensCohesin NON-BSGI:FivarDoc2007-02-262008-05-06RELnone
1ZYL BSGIcrystal structure of hypothetical protein yihe from escherichia coliRDOA_ECOLI2005-06-102006-09-19RELnone
2R1A BSGIcrystal structure of the periplasmic lipopolysaccharide transport protein lpta (yhbn), trigonal formYHBN_ECOLI2007-08-222008-04-29RELnone
1PTM BSGIcrystal structure of e.coli pdxaPDXA_ECOLI2003-06-232003-11-04RELnone
1N3B BSGIcrystal structure of dephosphocoenzyme a kinase from escherichia coliCOAE_ECOLI2002-10-252003-01-28RELnone
1FG7 BSGIcrystal structure of l-histidinol phosphate aminotransferase with pyridoxal-5'-phosphateHIS8_ECOLI2000-07-282001-08-22RELnone
2H7A BSGInmr structure of the conserved protein ycgl from escherichia coli representing the duf709 family reveals a novel a/b/a sandwich foldYCGL_ECOL62006-06-012007-04-17RELnone
1TVM BSGInmr structure of enzyme gatb of the galactitol-specific phosphoenolpyruvate-dependent phosphotransferase systemPTKB_ECO572004-06-292005-09-06RELnone
1MC3 BSGIcrystal structure of rffhRMLA2_ECOLI2002-08-052002-11-20RELnone
1SG7 BSGInmr solution structure of the putative cation transport regulator chabCHAB_ECO572004-02-232005-03-08RELnone
1O9B BSGIquinate/shikimate dehydrogenase ydib complexed with nadhYDIB_ECOLI2002-12-122003-02-01RELnone
3B8P BSGIfragment of wzzb, polysaccharide co-polymerase from salmonella typhimuriumWZZB_SALTY NON-BSGI:WZZB_ECO572007-11-012008-01-22RELnone
1YQC BSGIcrystal structure of ureidoglycolate hydrolase (alla) from escherichia coli o157:h7ALLA_ECO572005-02-012005-10-18RELnone
1Q18 BSGIcrystal structure of e.coli glucokinase (glk)GLK_ECO572003-07-182004-07-27RELnone
1L1P BSGIsolution structure of the ppiase domain from e. coli trigger factorTIG_ECOLI2002-02-192003-06-24RELnone
1KK9 BSGIcrystal structure of e. coli ycioYCIO_ECOLI2001-12-062002-12-11RELnone
1XZO BSGIidentification of a disulfide switch in bssco, a member of the sco family of cytochrome c oxidase assembly proteinsYPMQ_BACSU2004-11-122005-03-01RELnone
1YNI BSGIcrystal structure of n-succinylarginine dihydrolase, astb, bound to substrate and product, an enzyme from the arginine catabolic pathway of escherichia coliASTB_ECOLI2005-01-242005-02-15RELnone
1KSL BSGIstructure of rsuaRSUA_ECOLI2002-01-132002-04-24RELnone
1PRZ BSGIcrystal structure of pseudouridine synthase rlud catalytic moduleRLUD_ECO572003-06-202003-11-04RELnone
2FPU BSGIcrystal structure of the n-terminal domain of e.coli hisb- complex with histidinolHIS7_ECO572006-01-172006-09-05RELnone
2AHW BSGIcrystal structure of acyl-coa transferase from e. coli o157:h7 (ydif)-thioester complex with coa- 2YDIF_ECO572005-07-282005-11-01RELnone
1P9K BSGIthe solution structure of ybcj from e. coli reveals a recently discovered alfal motif involved in rna-bindingYBCJ_ECOLI2003-02-202003-03-11RELnone
1SG5 BSGIsolution structure of yaeo, a rho-specific inhibitor of transcription terminationROF_ECOLI2004-02-232005-07-12RELnone
1P9N BSGIcrystal structure of escherichia coli mobb.MOBB_ECOLI2003-05-122003-05-20RELnone
2I6R BSGIcrystal structure of e. coli hype, a hydrogenase maturation proteinHYPE_ECO572006-08-292007-10-23RELnone
2F79 BSGIsolution structure of napd a pathway specific chaperone in napa maturationNAPD_ECOLI2005-11-300000-00-00WDRNnone
1M65 S2Fycdx proteinycdX2002-07-122003-04-22RELnone
1NNV S2Fthe solution structure of hi1450HI14502003-01-142004-01-27RELnone
1JJV S2Fdephospho-coa kinase in complex with atpHI08902001-07-092002-05-01RELnone
1NMP S2Fstructural genomics, ybgi protein, unknown functionybgI2003-01-102004-01-20RELnone
1JOV S2Fcrystal structure analysis of hi1317HI13172001-07-312003-06-24RELnone
1J9A S2FoligoribonucleaseHI17152001-05-242003-06-24RELnone
1NO5 S2Fstructure of hi0073 from haemophilus influenzae, the nucleotide binding domain of the hi0073/hi0074 two protein nucleotidyl transferase.HI00732003-01-152004-03-16RELnone
1IMU S2Fsolution structure of hi0257, a ribosome binding proteinHI02572001-05-112001-10-03RELnone
1NMO S2Fstructural genomics, protein ybgi, unknown functionybgI2003-01-102004-01-20REL1NMP
1JOE S2Fcrystal structure of autoinducer-2 production protein (luxs) from heamophilus influenzaeHI04912001-07-272001-08-08RELnone
1HTW S2Fcomplex of hi0065 with adp and magnesiumHI00652001-01-012002-08-07RELnone
1NRK S2Fygfz proteinygfZ2003-01-242004-03-09RELnone
1MXI S2Fstructure of yibk from haemophilus influenzae (hi0766): a methyltransferase with a cofactor bound at a site formed by a knotHI07662002-10-022003-02-25RELnone
1RW1 S2Fyffb (pa3664) proteinPA36642003-12-152004-11-02RELnone
1R8G S2Fstructure and function of ybdkybdK2003-10-242004-08-17RELnone
1IN0 S2Fyajq protein (hi1034)HI10342001-05-112003-06-10RELnone
2OUT S2Fsolution structure of hi1506, a novel two domain protein from haemophilus influenzaeHI15062007-02-122007-05-01RELnone
1JOP S2Fyhch protein (hi0227)HI02272001-07-302003-06-17RELnone
1NU0 S2Fstructure of the double mutant (l6m; f134m, semet form) of yqgf from escherichia coli, a hypothetical proteinyqgF2003-01-302004-03-02RELnone
1J8B S2Fstructure of ybab from haemophilus influenzae (hi0442), a protein of unknown functionHI04422001-05-212003-01-14RELnone
1MWW S2Fthe structure of the hypothetical protein hi1388.1 from haemophilus influenzae reveals a tautomerase/mif foldHI1388.12002-10-012003-11-18RELnone
1DBX S2Fcrystal structure of cysteinyl-trna(pro) deacylase from h. influenzae (hi1434)HI14341999-11-032000-06-14RELnone
1FL9 S2Fthe yjee proteinHI00652000-08-132002-08-07REL1HTW
1K1E S2Fstructure of the cobalt-bound form of the deoxy-d-mannose-octulosonate 8-phosphate phosphatase (yrbi) from haemophilus influenzae (hi1679)HI16792001-09-252002-02-27RELnone
1J7G S2Fstructure of yihz from haemophilus influenzae (hi0670), a d-tyr-trna(tyr) deacylaseHI06702001-05-162003-04-22RELnone
1JOG S2Fstructure of hi0074 from heamophilus influenzae reveals the fold of a substrate binding domain of a nucleotidyltransferaseHI00742001-07-292002-12-18RELnone
2AIZ S2Fsolution structure of peptidoglycan associated lipoprotein from haemophilus influenza bound to udp-n-acetylmuramoyl-l-alanyl-d-glutamyl-meso-2,6-diaminopimeloyl-d-alanyl-d-alanineHI03812005-08-012006-03-14RELnone
1X6I S2Fcrystal structure of ygfy from escherichia coliygfY2004-08-112005-02-08REL1X6J
1J85 S2Fstructure of yibk from haemophilus influenzae (hi0766), a truncated sequence homolog of trna (guanosine-2'-o-) methyltransferase (spou)HI07662001-05-202003-02-25REL1MXI
1LQA S2Ftas protein from escherichia coli in complex with nadphb28342002-05-092003-07-08RELnone
1RXX S2Fstructure of arginine deiminasePA51712003-12-182004-01-13RELnone
3CA8 S2Fcrystal structure of escherichia coli ydcf, an s-adenosyl-l-methionine utilizing enzymeydcF2008-02-192008-05-06RELnone
1OQF S2Fcrystal structure of the 2-methylisocitrate lyaseprpB2003-03-082004-04-27RELnone
1IM8 S2Fcrystal structure of yeco from haemophilus influenzae (hi0319), a methyltransferase with a bound s-adenosylhomocysteineHI03192001-05-102001-11-07RELnone
2APN S2Fhi1723 solution structureHI17232005-08-162006-10-03RELnone
1J8D S2Fstructure of the metal-free form of the deoxy-d-mannose-octulosonate 8-phosphate phosphatase (yrbi) from haemophilus influenzae (hi1679)HI16792001-05-212002-02-27REL1K1E
1MW5 S2Fstructure of hi1480 from haemophilus influenzaeHI14802002-09-272003-11-18RELnone
1JOS S2Fribosome binding factor a(rbfa)HI12882001-07-302003-06-24RELnone
1IZM S2Fstructure of ygfb from haemophilus influenzae (hi0817), a conserved hypothetical proteinHI08172002-10-092003-12-02RELnone
1JO0 S2Fstructure of hi1333, a hypothetical protein from haemophilus influenzae with structural similarity to rna-binding proteinsHI13332001-07-262002-11-20RELnone
1X6J S2Fcrystal structure of ygfy from escherichia coliygfY2004-08-112005-02-08RELnone
1MWQ S2Fstructure of hi0828, a hypothetical protein from haemophilus influenzae with a putative active-site phosphohistidineHI08282002-09-302003-11-25RELnone
1DBU S2Fcrystal structure of cysteinyl-trna(pro) deacylase protein from h. influenzae (hi1434)HI14341999-11-032000-06-14REL1DBX
1XAX S2Fnmr structure of hi0004, a putative essential gene product from haemophilus influenzaeHI00042004-08-262005-01-18RELnone
1NNX S2Fstructure of the hypothetical protein ygiw from e. coli.ygiW2003-01-142004-03-09RELnone
1NMN S2Fstructure of yqgf from escherichia coli, a hypothetical proteinyqgF2003-01-102004-03-02REL1NU0
1JN1 S2Fstructure of 2c-methyl-d-erythritol 2,4-cyclodiphosphate synthase from haemophilus influenzae (hi0671)HI06712001-07-212002-08-21RELnone
1J7H S2Fsolution structure of hi0719, a hypothetical protein from haemophilus influenzaeHI07192001-05-162003-02-11RELnone
1JAL S2Fychf protein (hi0393)HI03932001-05-302003-03-25RELnone
1NIJ S2Fyjia proteinyjiA2002-12-242003-06-24RELnone
1NNG S2Fstructure of hi0827, a thioesterase acting on short-chain acyl-coa compounds.HI08272003-01-132004-03-09Replaced by:1YLInone
3D63 SSGCIDcrystal structure of inorganic pyrophosphatase from burkholderia pseudomalleiBupsA.00023.a2008-05-180000-00-00AUTHnone
3D6B SSGCID2.2 a crystal structure of glutaryl-coa dehydrogenase from burkholderia pseudomalleiBupsA.00027.a2008-05-192008-06-03RELnone
3D53 SSGCID2.2 a crystal structure of inorganic pyrophosphatase from rickettsia prowazekiiRiprA.00023.a2008-05-152008-05-27RELnone
3D64 SSGCIDcrystal structure of s-adenosyl-l-homocysteine hydrolase from burkholderia pseudomalleiBupsA.00032.a2008-05-182008-06-03RELnone
3CEZ SSGCIDcrystal structure of methionine-r-sulfoxide reductase from burkholderia pseudomalleiBupsA.00033.a2008-02-292008-03-18RELnone
3DAH SSGCID2.3 a crystal structure of ribose-phosphate pyrophosphokinase from burkholderia pseudomalleiBupsA.00035.a2008-05-292008-06-10RELnone
3D5T SSGCIDcrystal structure of malate dehydrogenase from burkholderia pseudomalleiBupsA.00005.a2008-05-162008-06-03RELnone

Note 1:  Last updated: Jul 2 2008

back to top

Sequence Redundancy Statistics

Table 5: Sequence Redundancy Statistics for Non-PSI Structural Genomics Centers in North America by Experimental Status

Sequence Identity(%)Novel Targets
Status:
Selected
Novel Targets
Status:
Cloned
Novel Targets
Status:
Expressed
Novel Targets
Status:
Purified
Novel Targets
Status:
Crystallized
Novel Targets
Status:
Crystal Structure
Novel Targets
Status:
NMR Structure
Novel Targets
Status:
in PDB
<100191914831014568180501293
<9017231385961550173501193
<7016861359940541172501193
<5016101305909530171501193
<3012401112816500165501192
Last updated: 08-04-08
Sequence redundancy is calculated by clustering analysis using BLASTClust program with similarity threshold set to percent of sequence identity.   Please view detailed explanation of sequence redundancy calculations and BLASTClust threshold settings.  Sequence redundancy calculations are based on comparison to all protein sequences in TargetDB which are in the same experimental status category and at least 20 amino acids long

Table 6: Sequence Redundancy Statistics for Structures Released by Non-PSI Structural Genomics Centers in North America by Year

Year Released Structures Number of Released Structures <30% Identity at Time of Release Percent(%) of Released Structures <30% Identity(%) at Time of Release
<= 20002150
20016350
200215640
2003251352
2004181161
200517741
20068338
20074250
200813431
Total1085046
Last updated:08-07-02
Sequence redundancy is calculated by clustering analysis using BLASTClust program with similarity threshold set to percent of sequence identity.   Please view detailed explanation of sequence redundancy calculations and BLASTClust threshold settings.  Sequence redundancy calculations are based on comparison to all protein sequences in the PDB which are at least 20 amino acids long

Figure 4:   Comparison of Novel Structures with Number of Structures Released by Non-PSI Structural Genomics Centers in North America by Year

Note 1:  Last updated:  08-07-02
Sequence redundancy is calculated by clustering analysis using BLASTClust program with similarity threshold set to percent of sequence identity.   Please view detailed explanation of sequence redundancy calculations and BLASTClust threshold settings.  Sequence redundancy calculations are based on comparison to all protein sequences in the PDB which are at least 20 amino acids long
back to top

© RCSB PDB