Dataset Open Access
This page provides the single mutation data extracted with MicroMiner from the PDB. The data contains amino acid pairs in protein structures from the PDB, exemplifying single mutations’ local structural changes for single chains and pairs for protein–protein interfaces. Mutations to non-standard residues are also provided.
See the MicroMiner publication for details:
Sieg, J.; Rarey, M. Searching similar local 3D micro-environments in protein structure databases with MicroMiner, 2023 (accepted in Briefings in Bioinformatics)
Data content:
A row in the TSV files describes the residue position of the single mutation in the wild-type (query) and mutant (hit). Multiple local structural and sequential similarity measures are provided, computed from the residue 3D micro-environments. The column fullSeqId contains the global sequence similarity. The first two rows of a TSV file look this:
queryName queryChain queryAA queryPos hitName hitChain hitAA hitPos siteIdentity siteBackBoneRMSD siteAllAtomRMSD nofSiteResidues alignmentLDDT fullSeqId
10GS A CYS 47 2J9H A ALA 48 0.938 0.223 0.431 16.0 0.996 0.976 0.976
queryName: query PDB-ID
queryChain: query chain ID
queryAA: query amino acid type (three letter code)
queryPos: query sequence position of the amino acid residue
hitName: hit PDB-ID
hitChain: hit chain ID
hitAA: hit amino acid type (three letter code)
hitPos: hit sequence position of the amino acid residue
siteIdentity: sequence identity of the aligned micro-environments
siteBackBoneRMSD: Calpha-RMSD of the aligned micro-environments
siteAllAtomRMSD: all-atom-RMSD of the aligned micro-environments
nofSiteResidues: number of residues in the micro-environments
alignmentLDDT: mean LDDT score of all residues in the aligned micro-environments
fullSeqId: global sequence identity of the query chain and hit chain (as specified by the chain IDs)
This work was supported by the German Federal Ministry of Education and Research as part of de.NBI [grant number 031L0105] and protP.S.I. [grant number 031B0405B].
| Name | Size | |
|---|---|---|
|
filtered_single_mutations_pdb_monomer.tsv.tar.gz
md5:1992f83a3356c0dbf846d242688ee4cb |
78.6 MB | Download |
|
filtered_single_mutations_pdb_ppi.tsv.tar.gz
md5:7e96f16aa5cf24438dae392a6f137ea0 |
13.3 MB | Download |
|
pdb_all_monomer.tsv.tar.gz
md5:7f71061b76e5ad8583e87996cffb0bb4 |
2.9 GB | Download |
|
pdb_all_ppi.tsv.tar.gz
md5:ebd06bcc67a2785ba0f623366b3628ed |
517.8 MB | Download |
|
single_mutations_pdb_monomer_non_standard_aa.tsv.tar.gz
md5:0c93df9fa680d0ff7bab7f423e9aa675 |
3.6 MB | Download |
|
single_mutations_pdb_ppi_non_standard_aa.tsv.tar.gz
md5:1f3f15dc28fd661f772b09718c3c2816 |
984.0 kB | Download |