Segmentation data model prototype
Segmentation is the decomposition of 3D volumes into regions that can be associated with defined objects. Following several consultations with the EM community (Patwardhan et al., 2012; Patwardhan et al., 2014; Patwardhan et al., 2017), the EMDB developed a prototype to explore supporting the deposition of volume segmentations with structured biological annotation which is here defined as the association of data with identifiers (e.g., accession codes from UniProt) and ontologies taken from well established bioinformatics resources. To our knowledge, none of the segmentation formats widely used in electron microscopy and related fields currently support structured biological annotation. Third party use of segmentations is further impeded by the prevalence of segmentation file formats and their lack of interoperability. EMDB therefore proposed an open segmentation file format called EMDB-SFF to capture basic segmentation data from application-specific segmentation file formats and provide the means for structured biological annotation. In this way, file formats like EMDB-SFF could not only enable depositions of segmentations but also act as a file interchange format between different applications and facilitate analysis of 3D reconstructions. Furthermore EMDB-SFF prototypes the description of multiple transforms for a segment, thus allowing a segment to be used to describe the placement of a sub-tomogram average onto a tomographic reconstruction.
Model
EMDB-SFF files have the follow features:
- Segmentation metadata:
- name
- version (of schema)
- details (free-form text)
- global external references, e.g. specimen scientific identifier
- bounding box
- primary descriptor contained i.e. one of ‘three_d_volume’, ‘mesh_list’, or ‘shape_primitive_list’ (see schema documentation)
- list of software used to create the segmentation (name, version, processing details)
- list of transforms referenced by segments e.g. transform to place the sub-tomogram average in the tomogram
- Hierarchical ordering of segments through the use of segment IDs and parent IDs;
- Four geometrical representations of segments (volumes, contours, meshes, shapes);
- Can store subtomogram averages and how they map into the parent tomogram through the use of transforms;
- List of associated external references per segment;
- List of associated complexes and macromolecules in a related EMDB entry
Each segment in a segmentation can consist of two types of descriptors:
- textual descriptors;
- geometric descriptors.
Textual descriptors consist of either free-form text or standardised terms. Standard terms should be provided from a [published] ontology or list of identifiers.
Geometric descriptors can take one or more of the following representations:
- ‘three_d_volume’ for 3D volumes;
- ‘mesh_list’ for lists of meshes each of which consists of a set of vertices and polygons;
- lists of shape primitives (ellipsoid, cuboid, cone, cylinder).
Documentation
Download
The current schema (version 0.8.0.dev1) is available here.
Documentation
Complete documentation of the schema is available here.
Auxiliary Tools
sfftk-rw
sfftk-rw is a Python toolkit for reading and writing EMDB-SFF files only. It is part of a family of tools designed to work with EMDB-SFF files.
sfftk-rw has the following utilities:
- convert - interconvert between XML, HDF5 and JSON file formats of the EMDB-SFF data model;
- view - view a file summary
The full documentation is available at readthedocs.
Download
The latest version runs only on Python 3 (version 0.7.1) and may be installed using pip install sfftk-rw. Alternatively, feel free to obtain the source code from Github.
sfftk
sfftk provides a shell command and a Python API to process EMDB-SFF files.
The following utilities are available using sfftk:
- convert - Conversion of application-specific segmentation file formats to EMDB-SFF. Currently, sfftk supports the following formats:
- AmiraMesh (.am)
- Amira HyperSurface (.surf)
- Segger (.seg)
- EMDB Map masks (.map)
- Stereolithography (.stl)
- IMOD (.mod)
- notes - Annotation of EMDB-SFF files.
- view - Brief summaries of segmentation files.
Read the full documentation here.
Download
The latest development version (version 0.5.5.dev1) of sfftk may be downloaded/installed from PyPI or the source may be obtained from GitHub.
Quick links
Recent Entries
(Show all)CryoEM structure of human NSUN2 with tRNA Lys(CTT) (Conformation 2)
In situ subtomogram average of the M. maripaludis Hdr-Vhu-Fwd supercomplex
Map of the 50S ribosomal subunit with E-tRNA and RsfS from the alphaproteobacteria Asaia platycodi
CryoEM structure of human NSUN2(C271A) with SAH cross-linked to tRNA Lys(CTT) (D-arm conformation)
CryoEM structure of human NSUN2 with tRNA Lys(CTT) (Conformation 1)
CryoEM structure of human NSUN2(C271A) with SAH cross-linked to tRNA Lys(TTT) (No D-arm conformation)
CryoEM structure of human NSUN2 with tRNA Lys(CTT) and SFG (D-arm conformation)
CryoEM structure of human NSUN2 with tRNA Lys(CTT) and SFG (No D-arm conformation)
CryoEM structure of human NSUN2(C271A) with SAH cross-linked to tRNA Lys(TTT) (D-arm conformation)
CryoEM structure of human NSUN2(C271A) with SAH cross-linked to tRNA Lys(CTT) (No D-arm conformation)
CryoEM structure of human NSUN2(C271A) with SAH cross-linked to a pre-tRNA Leu(CAA) intron
Zebrafish Abcb4 in IF-Narrow conformation in the presence of Elacridar (ELA-IF-Narrow)
Zebrafish Abcb4 in IF-narrow conformation in the presence of Tariquidar (TQR-IF-Narrow)
Zebrafish Abcb4 in IF-Wide conformation in the presence of Tariquidar (TQR-IF-Wide)
Zebrafish Abcb4 in IF-medium conformation in the presence of Tariquidar (TQR-IF-Medium)
Zebrafish Abcb4 in IF-Medium conformation in the presence of Elacridar (ELA-IF-Medium)
Zebrafish Abcb4 in IF-Wide conformation in the presence of Elacridar (ELA-IF-Wide)
Zebrafish Abcb4 in IF-narrow conformation in the presence of Vincristine (VCR-IF-Narrow)
Cryo-EM structure of PPAT-NUDT5 complex bound to 6-benzylthioinosine-5'-monophosphate (6-benzylTIMP)
Cryo-EM structure of PPAT-NUDT5 complex bound to 6-methylthioinosine-5'-monophosphate (6-meTIMP)
Zebrafish Abcb4 in IF-Wide conformation in the presence of Vincristine (VCR-IF-Wide)
Cryo-EM structure of PPAT-NUDT5 complex bound to adenosine-5'-monophosphate (AMP)
Zebrafish Abcb4 in IF-medium conformation in the presence of Vincristine (VCR-IF-Medium)
State 1 MLL4FC bound to a nucleosome premodified with H2BK120ub and H4K16ac
State 3 MLL4FC bound to a nucleosome premodified with H2BK120ub and H4K16ac
State 2 focused on PHD FYR of MLL4FC bound to a nucleosome premodified with H2BK120ub and H4K16ac
State 2 focused on H3 N terminal tail of MLL4FC bound to a nucleosome premodified with H2BK120ub and H4K16ac
In situ subtomogram average of a ribosome bound to ribosome associated vesicle in primary neurons expressing KDEL tagged with mNeonGreen (mNeon-KDEL)
RPS26dC HEK mutant 80S ribosome bound to Kozak mRNA (RPS26dC-Kozak)
electron tomograms of epon embedded P. falciparum stage V gametocytes (NF54)
electron tomograms of epon embedded P. falciparum stage V gametocytes with PfMIC60 knockout
electron tomograms of epon embedded P. falciparum stage V gametocytes with PfMIC19/PfMIC60 double knockout
electron tomograms of epon embedded P. falciparum stage V gametocytes with PfMIC19 knockout
Neuraminidase NA isolated from the H1N1 strain A/Victoria/2570/2019 propagated in eggs
Cryo-EM structure of P. urativorans 70S ribosome after 72h acute stress - State 3: E-tRNA/P-tRNA/Balon
Cryo-EM structure of P. urativorans 70S ribosome after 72h acute stress - State 4: E-tRNA/RaiA/Balon
Structure of Geobacillus stearothermophilus RNase P ribozyme in complex with precursor tRNA in 10 mM Ca2+
Cryo-EM map of vaccine elicited antibody 22F5 bound to post-fusion conformation of Langya virus F protein
Cryo-EM structure of antibody 22F5 in complex with pre-fusion stabilized LayV-F
Human AGO2 bound to a miR-20a guide and a position 10-11 mismatched target
A complex of PTH1R/Gs bound to a PTHrP analogue with three beta-amino acids
A complex of PTH1R/Gs bound to a PTHrP analogue with five beta-amino acids
MprF from Pseudomonas aeruginosa mutant, H566C in nanodisc, C2 symmetry
MprF from Pseudomonas aeruginosa mutant- H386C/F389C in nanodisc, C2 symmetry
Cryo-EM structure of ATP-bound Arabidopsis thaliana fatty acid transporter CTS
Cryo-EM structure of Arabidopsis thaliana fatty acid transporter CTS
Cryo-EM structure of IBA-CoA-bound Arabidopsis thaliana fatty acid transporter CTS
Cryo-EM structure of C12:0-CoA-bound Arabidopsis thaliana fatty acid transporter CTS
Cryo-EM structure of 2,4-DB-CoA-bound Arabidopsis thaliana fatty acid transporter CTS
Structure of SARS-CoV-2 Spike in complex with antibodies S309 and CT1-1
Cryo-EM structure of the family 2A encapsulin from Mycolicibacterium smegmatis
Structure of SARS-CoV-2 Spike in complex with antibodies S309 and CT1-5.
Cryo-EM structure of reduced form of formatedehydrogenase from Rhodobacter aestuarii (RaFDH) with NADH
Local refinement region of SARS-CoV-2 spike RBD in complex with antibodies CT1-5.
Local refinement region of SARS-CoV-2 spike RBD in complex with antibodies S309 and CT1-1.
Consensus cryo-EM map for the structure of human PI3KC3-C1 complex
Designed one-component T=3 quasisymmetric protein nanocage pentamer sub-particle region
Subtomogram average structure of flagellar export apparatus and MS-ring of deleted flhA in Borrelia burgdorferi
CryoEM structure of the soluble-WRAPed membranous portion of MspA (Mycobacterium smegmatis porin), dimerized along the native interface.
Designed one-component T=13 quasisymmetric protein nanocage hexamer sub-particle region
Designed one-component T=13 quasisymmetric protein nanocage pentamer sub-particle region
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Go complex in nucleotide-free NC state 1
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Go complex in nucleotide-free C state
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GDP-bound, AHD-closed C state 2, plunge-frozen 15 seconds after GDP addition
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GDP-bound, AHD-closed C state 2, plunge-frozen 8 seconds after GDP addition
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Go complex in nucleotide-free NC state 2
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GDP-bound, AHD-closed C state 1, plunge-frozen 15 seconds after GDP addition
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Gi1 complex in nucleotide-free C state 2
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GDP-bound, AHD-closed C state 1, plunge-frozen 8 seconds after GDP addition
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Gi1 (delipidated) complex in nucleotide-free C state
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-open C state 2, plunge-frozen 0-5 seconds after GTP addition
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Gi1 complex in nucleotide-free NC state 3
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Gi1 complex in nucleotide-free C state 1
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-closed C state 2, plunge-frozen 8 seconds after GTP addition
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Gi1 complex in nucleotide-free NC state 2
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Gi1 complex in nucleotide-free NC state 1
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Gq (delipidated) complex in nucleotide-free C state
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-closed C state 1, plunge-frozen 8 seconds after GTP addition
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-open C state 1, plunge-frozen 0-5 seconds after GTP addition
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-open C state, plunge-frozen 8 seconds after GTP addition
Cryo-EM structure of human Neurotensin Receptor 1 (hNTSR1)-Gi1 (delipidated) complex in nucleotide-free NC state
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-open NC state 4, plunge-frozen 0-5 seconds after GTP addition
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-open NC state 2, plunge-frozen 0-5 seconds after GTP addition
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-open NC state 3, plunge-frozen 0-5 seconds after GTP addition
Cryo-EM structure of the human neurotensin receptor 1 (hNTSR1)-Gi1 complex in the GTP-bound, AHD-open NC state 1, plunge-frozen 0-5 seconds after GTP addition
Structure of Klebsiella pneumoniae trypsin-HamAB bound with DNA, trimer
Structure of Klebsiella pneumoniae trypsin-HamAB bound with DNA, dimer
Structure of Klebsiella pneumoniae trypsin-HamAB bound with DNA, monomer
Subtomogram average of D. discoideum ribosome (from EMPIAR-11899) using particles selected at least 43 nm away from lamella surface
Subtomogram average of the D. discoideum vault complex using D39 symmetry (from EMPIAR-11845)
Subtomogram average of gamma tubulin ring complex capping microtubule minus ends from purified S. uvarum spindle pole bodies
Subtomogram average of D. discoideum ribosome (from EMPIAR-11899) using particles selected within 43 nm of lamella surface
In situ subtomogram average of filamentous IMPDH in mycophenolic acid-treated HeLa cells
Subtomogram average of D. discoideum nuclear pore complex (from EMPIAR-11943)
Subtomogram average of H. sapiens T-complex protein Ring Complex (TRiC) (from EMPIAR-11538)
Subtomogram average of C. reinhardtii ribosome (from EMPIAR-11830)
Structure of a stalled E. coli 70S RNC-NuoK-86 in complex with the membrane protein insertase SecYEG-YidC
Structure of a stalled E. coli 70S RNC-NuoK-86 in complex with the membrane protein insertase SecYEG-YidC (Focused Refinement)
Structure of a stalled E. coli 70S RNC-NuoK-86 in complex with the membrane protein insertase SecYEG-YidC (Composite map)
cryoEM structure of S1P3 in complex with heterotrimeric G protein
Cryo-EM structure of S1P2 in complex with heterotrimeric G protein
Cryo-EM structure of SHIV-elicited CN81-2029 Fab in complex with HIV Env trimer Q23-SCT27
Cryo-EM structure of SHIV-elicited CI93-1365 Fab in complex with HIV Env trimer Q23-SCT27
Cryo-EM structure of SHIV-elicited CE79-1571 Fab in complex with HIV Env trimer Q23-SCT27
Cryo-EM structure of Protein involved in colonization (Pic) from Enteroaggregative Escherichia coli complexed with the fragment antigen binding domain of monoclonal antibody 40
Cryo-EM structure of Enterotoxigenic Escherichia coli autotransporter A (EatA) complexed with the fragment antigen binding domain of monoclonal antibody G12
Pneumococcal choline binding CbpE surface protein bounds to 5992-2 Fab
Cryo-EM structure of Secreted extracellular protein A (SepA) from Shigella flexneri complexed with the fragment antigen binding domain of monoclonal antibody 40
Cryo-EM structure of Enterotoxigenic Escherichia coli autotransporter A (EatA) complexed with the fragment antigen binding domain of monoclonal antibody 40
Cryo-EM structure of Enterotoxigenic Escherichia coli autotransporter A (EatA) complexed with the fragment antigen binding domain of monoclonal antibody 15
Pneumococcal choline binding CbpE surface protein bounds to multiple 5992-2 Fabs
Cryo-EM structure of Enterotoxigenic Escherichia coli autotransporter A (EatA) complexed with the fragment antigen binding domain of monoclonal antibody 25
Cryo-EM structure of nucleotide-free Streptococcus thermophilus FoeAB 2
Cryo-EM structure of Streptococcus thermophilus FoeAB in complex with AMPPNP
ICA-1103811 bound KCNQ2/3 heteromer with 3:1 stoichiometry, state 2
Cryo-EM structure of Streptococcus thermophilus FoeAB E504Q mutant in complex with ATP
Cryo-EM structure of nucleotide-free Streptococcus thermophilus FoeAB 1
Cryo-EM structure of Streptococcus thermophilus FoeAB in complex with ADP
ICA-1103811 bound KCNQ2/3 heteromer with 3:1 stoichiometry, state 3
ICA-1103811 bound KCNQ2/3 heteromer with 3:1 stoichiometry, state 1
Cryo-EM structure of Streptococcus thermophilus FoeAB in complex with AMPPNP in peptidisc
Cryo-EM structure of nucleotide-free Streptococcus thermophilus FoeAB 3
Cryo-EM structure of Streptococcus thermophilus FoeAB in complex with ATP and ADP
SARS-CoV-2 polymerase with incorporated and pre-incorporated AT-9052-Sp
XEN1101 bound KCNQ2/3 heteromer co-expressed with CaM, 2:2 stoichiometry
XEN1101 bound KCNQ2/3 heteromer co-expressed with CaM, 3:1 stoichiometry, state 2
XEN1101 bound KCNQ2/3 heteromer co-expressed with CaM, 3:1 stoichiometry, state 1
XEN1101 bound KCNQ2/3 heteromer co-expressed with CaM, 3:1 stoichiometry, state 3
XEN1101 bound KCNQ2/3 heteromer co-expressed with CaM, 3:1 stoichiometry, state 4
VPS34-CII (VPS34 199-REIE-202 > 199-AAAA-202 mutant) bound to RAB5A-GTP (Q79L); unsharpened composite map
VPS34-CII (VPS34 199-REIE-202 to 199-AAAA-202 mutant) bound to RAB5A (Q79L), focused refinement on the base of the complex
VPS34-CII (VPS34 199-REIE-202 to 199-ERIR-202 mutant) bound to RAB5A (Q79L) on the VPS15 subunit, focused refinement on the adaptor arm
VPS34-CII (VPS34/VPS15/BECLIN1/UVRAG) bound to RAB5A (Q79L) on the VPS34 and VPS15 subunits, primary map processed with CryoTEN
VPS34-CII (VPS34 199-REIE-202 to 199-AAAA-202 mutant) bound to RAB5A (Q79L, focused refinement on the interface between RAB5A and VPS34)
Cryo-EM structure of the PseTnsAB paired-end complex (right end) in the presence of Mn
VPS34-CII (VPS34 199-REIE-202 to 199-AAAA-202 mutant) bound to RAB5A (Q79L), primary composite map processed with CryoTEN
VPS34-CII (VPS34 199-REIE-202 to 199-AAAA-202 mutant) bound to RAB5A (Q79L, focused refinement on the adaptor arm)
Apo VPS34-CII (VPS34/VPS15/BECLIN1/UVRAG), consensus refinement map processed with CryoTEN
VPS34-CII (VPS34 199-REIE-202 to 199-AAAA-202 mutant) bound to RAB5A (Q79L, focused refinement on the kinase arm)
VPS34-CII (VPS34 199-REIE-202 > 199-AAAA-202 mutant) bound to RAB5A-GTP (Q79L), consensus refinement
VPS34-CII (VPS34 199-REIE-202 to 199-ERIR-202 mutant) bound to RAB5A (Q79L) on the VPS15 subunit, focused refinement of the RAB5A interface with VPS15
VPS34-CII (VPS34 199-REIE-202 to 199-ERIR-202 mutant) bound to RAB5A (Q79L) on the VPS15 subunit, focused refinement on the kinase arm
VPS34-CII (VPS34 199-REIE-202 to 199-ERIR-202 mutant) bound to RAB5A (Q79L) on the VPS15 subunit; unsharpened composite map
VPS34-CII (VPS34 199-REIE-202 to 199-ERIR-202 mutant) bound to RAB5A (Q79L) on the VPS15 subunit, composite map processed with CryoTEN.
VPS34-CII bound to RAB5A-GTP 1-212 (C19S, C63S, Q79L) on the VPS34 subunit, primary map processed with CryoTEN
VPS34-CII bound to RAB5A-GTP 1-212 (C19S, C63S, Q79L) on the VPS15 subunit, primary map processed with CryoTEN
Cryo-EM structure of the PseTnsAB paired-end complex (right end) in the presence of Mg
VPS34-CII (VPS34 199-REIE-202 to 199-ERIR-202 mutant) bound to RAB5A (Q79L) on the VPS15 subunit, consensus refinement
Cryo-EM structure of the PseTnsAB paired-end complex (left end) in the presence of Mg
Cryo-EM Structure of the N600A Quinol-Dependent Nitric Oxide Reductase
ALECT2 type Ia filament from renal biopsy tissue of an individual with ALECT2 amyloidosis
ALECT2 type IIa filament from renal biopsy tissue of an individual with ALECT2 amyloidosis
ALECT2 type Ib filament from renal biopsy tissue of an individual with ALECT2 amyloidosis
ALECT2 type IIb filament from renal biopsy tissue of an individual with ALECT2 amyloidosis
ALECT2 type III filament from renal biopsy tissue of an individual with ALECT2 amyloidosis
