Dimensionality
Notebook link where you can find all the graphs.
Data created on 17.10.2022 at 19:32:45
Data updated on 17.10.2022 at 19:32:45
Support by Ontologies
Despite the difficulty of finding definitions of NMR Dimensionality in ontologies, the term itself exists ( in nuclear magnetic resonance CV, and in Physico-chemical methods and properties). However, its subclasses are not rich with either definitions or values, with the values mostly being missing in the case of 1D NMR, or provided (as in CHMO), but not classified as 1D. Still, the available ontologies are sufficient for providing the piece of metadata describing the dimensionality as the subclasses can be added in another field in the repository.
Data Sanitisation and Missing Values
Field Type | Field Name | Values Readability | Unit | Missing | Comment | |
---|---|---|---|---|---|---|
MTBLS | semi-dedicated | Pulse sequence name | free text | none | The field is not provided; or the expressions "1D" or "2D" are not found in the field, and also the method mentioned is not available in the mapping between the method and dimensionality made with hard-coding; or the value is provided as N/A or other similar expressions; or the study "assays" value is "null" | Due to providing methods names in free text, much hard coding was needed |
MW | semi-dedicated | NMR Eexperiment Type | free text | none | The field is not provided; or the expressions "1D" and "2D" are not found in the field, and also the method mentioned is not available in the mapping between the method and dimensionality made with hard-coding; or the value is provided as N/A or other similar expressions; or decoding the JSON file that contains the study details has failed due to syntax error there | Due to providing methods names in free text, much hard coding was needed |
CENAPT | semi-dedicated | Description | machine-readable | none | Description field is always available, so missing means only that the expressions "1D" and "2D" are not found there | It was machine-readable as looking for "1D" and "2D" expressions was enough in all the texts |
NMRShiftDB | semi-dedicated | [nmr:assignmentMethod, nmr:OBSERVENUCLEUS] | free text | none | The field is not provided; or the expressions "1D" and "2D" are not found in the field, and also the method mentioned is not available in the mapping between the method and dimensionality made with hard-coding. The value can also be given as "Unreported" or other similar expressions. | Due to providing methods names in free text, much hard coding was needed |
Input Examples | Output | |
---|---|---|
MTBLS | ["1D NOESY with presaturation (tnnoesy)", "2D JRES with homonuclear J-resolved 2D correlation, presaturation during relaxation delay with gradients (jresgpprqf)", "1H_PROTON"] | "1d", "2d", "missing" |
MW | ["2D 1H-13C HSQC-TOCSY", "NOESY", "NOESY PR1D"] | "1d", "2d", "missing" |
CENAPT | ["NMR data of gossypol in DMSOd6 and CDCl3. The dataset contains 1D 1H 13C as well as 2D COSY, HSQC, HMBC, all acquired at 600MHz (Bruker 600MHz spectrometer with CryoProbe (CP DCH 600S3 C/H-D-05 Z)"] | "1d", "2d", "missing" |
NMRShiftDB | ["1D shift positions", "2D INADEQUATE/NMRanalyst", "1H, H,H-COSY, H,H-NOESY, H,C-HMQC, H,C-HMBC, DEPTQ, 1H, 13C, 1H, H,H-COSY"] | "1d", "2d", "missing" |
Results
The Dimensionality was possible to obtain in most studies and repositories. It can be either mentioned explicitly as "1D" or "2D", or it can be implied by the technique name (e.g., HSQC). None of the repositories used ontologies to describe the value.
Most of the studies were only one-dimensional. Here you can see the percentage of studies based on their dimensionalities. If a study has both 1D and 2D spectra, it will appear twice.
CENAPT was the only exception to provide 2D NMR for almost all the studies as one can see below (NMRShiftDB data were not added due to the huge number of studies there).
However, NMRShiftDB similarly has mostly 1D NMR data.
And here you can find the whole view of the four repositories with a logarithmic scale.