Hydrogen Rearrangement Rules: Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software

Hiroshi Tsugawa, Tobias Kind, Ryo Nakabayashi, Daichi Yukihira, Wataru Tanaka, Tomas Cajka, Kazuki Saito, Oliver Fiehn, Masanori Arita

Research output: Contribution to journalArticle

94 Citations (Scopus)

Abstract

Compound identification from accurate mass MS/MS spectra is a bottleneck for untargeted metabolomics. In this study, we propose nine rules of hydrogen rearrangement (HR) during bond cleavages in low-energy collision-induced dissociation (CID). These rules are based on the classic even-electron rule and cover heteroatoms and multistage fragmentation. We evaluated our HR rules by the statistics of MassBank MS/MS spectra in addition to enthalpy calculations, yielding three levels of computational MS/MS annotation: "resolved" (regular HR behavior following HR rules), "semiresolved" (irregular HR behavior), and "formula-assigned" (lacking structure assignment). With this nomenclature, 78.4% of a total of 18506 MS/MS fragment ions in the MassBank database and 84.8% of a total of 36370 MS/MS fragment ions in the GNPS database were (semi-) resolved by predicted bond cleavages. We also introduce the MS-FINDER software for structure elucidation. Molecular formulas of precursor ions are determined from accurate mass, isotope ratio, and product ion information. All isomer structures of the predicted formula are retrieved from metabolome databases, and MS/MS fragmentations are predicted in silico. The structures are ranked by a combined weighting score considering bond dissociation energies, mass accuracies, fragment linkages, and, most importantly, nine HR rules. The program was validated by its ability to correctly calculate molecular formulas with 98.0% accuracy for 5063 MassBank MS/MS records and to yield the correct structural isomer with 82.1% accuracy within the top-3 candidates. In a test with 936 manually identified spectra from an untargeted HILIC-QTOF MS data set of human plasma, formulas were correctly predicted in 90.4% of the cases, and the correct isomer structure was retrieved at 80.4% probability within the top-3 candidates, including for compounds that were absent in mass spectral libraries. The MS-FINDER software is freely available at http://prime.psc.riken.jp/.

Original languageEnglish (US)
Pages (from-to)7946-7958
Number of pages13
JournalAnalytical Chemistry
Volume88
Issue number16
DOIs
StatePublished - Aug 16 2016

Fingerprint

Hydrogen
Isomers
Ions
Plasma (human)
Terminology
Isotopes
Enthalpy
Statistics
Electrons

ASJC Scopus subject areas

  • Analytical Chemistry

Cite this

Tsugawa, H., Kind, T., Nakabayashi, R., Yukihira, D., Tanaka, W., Cajka, T., ... Arita, M. (2016). Hydrogen Rearrangement Rules: Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software. Analytical Chemistry, 88(16), 7946-7958. https://doi.org/10.1021/acs.analchem.6b00770

Hydrogen Rearrangement Rules : Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software. / Tsugawa, Hiroshi; Kind, Tobias; Nakabayashi, Ryo; Yukihira, Daichi; Tanaka, Wataru; Cajka, Tomas; Saito, Kazuki; Fiehn, Oliver; Arita, Masanori.

In: Analytical Chemistry, Vol. 88, No. 16, 16.08.2016, p. 7946-7958.

Research output: Contribution to journalArticle

Tsugawa, H, Kind, T, Nakabayashi, R, Yukihira, D, Tanaka, W, Cajka, T, Saito, K, Fiehn, O & Arita, M 2016, 'Hydrogen Rearrangement Rules: Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software', Analytical Chemistry, vol. 88, no. 16, pp. 7946-7958. https://doi.org/10.1021/acs.analchem.6b00770
Tsugawa, Hiroshi ; Kind, Tobias ; Nakabayashi, Ryo ; Yukihira, Daichi ; Tanaka, Wataru ; Cajka, Tomas ; Saito, Kazuki ; Fiehn, Oliver ; Arita, Masanori. / Hydrogen Rearrangement Rules : Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software. In: Analytical Chemistry. 2016 ; Vol. 88, No. 16. pp. 7946-7958.
@article{96965222bf884b35b3b8ec2deccf2ff1,
title = "Hydrogen Rearrangement Rules: Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software",
abstract = "Compound identification from accurate mass MS/MS spectra is a bottleneck for untargeted metabolomics. In this study, we propose nine rules of hydrogen rearrangement (HR) during bond cleavages in low-energy collision-induced dissociation (CID). These rules are based on the classic even-electron rule and cover heteroatoms and multistage fragmentation. We evaluated our HR rules by the statistics of MassBank MS/MS spectra in addition to enthalpy calculations, yielding three levels of computational MS/MS annotation: {"}resolved{"} (regular HR behavior following HR rules), {"}semiresolved{"} (irregular HR behavior), and {"}formula-assigned{"} (lacking structure assignment). With this nomenclature, 78.4{\%} of a total of 18506 MS/MS fragment ions in the MassBank database and 84.8{\%} of a total of 36370 MS/MS fragment ions in the GNPS database were (semi-) resolved by predicted bond cleavages. We also introduce the MS-FINDER software for structure elucidation. Molecular formulas of precursor ions are determined from accurate mass, isotope ratio, and product ion information. All isomer structures of the predicted formula are retrieved from metabolome databases, and MS/MS fragmentations are predicted in silico. The structures are ranked by a combined weighting score considering bond dissociation energies, mass accuracies, fragment linkages, and, most importantly, nine HR rules. The program was validated by its ability to correctly calculate molecular formulas with 98.0{\%} accuracy for 5063 MassBank MS/MS records and to yield the correct structural isomer with 82.1{\%} accuracy within the top-3 candidates. In a test with 936 manually identified spectra from an untargeted HILIC-QTOF MS data set of human plasma, formulas were correctly predicted in 90.4{\%} of the cases, and the correct isomer structure was retrieved at 80.4{\%} probability within the top-3 candidates, including for compounds that were absent in mass spectral libraries. The MS-FINDER software is freely available at http://prime.psc.riken.jp/.",
author = "Hiroshi Tsugawa and Tobias Kind and Ryo Nakabayashi and Daichi Yukihira and Wataru Tanaka and Tomas Cajka and Kazuki Saito and Oliver Fiehn and Masanori Arita",
year = "2016",
month = "8",
day = "16",
doi = "10.1021/acs.analchem.6b00770",
language = "English (US)",
volume = "88",
pages = "7946--7958",
journal = "Analytical Chemistry",
issn = "0003-2700",
publisher = "American Chemical Society",
number = "16",

}

TY - JOUR

T1 - Hydrogen Rearrangement Rules

T2 - Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software

AU - Tsugawa, Hiroshi

AU - Kind, Tobias

AU - Nakabayashi, Ryo

AU - Yukihira, Daichi

AU - Tanaka, Wataru

AU - Cajka, Tomas

AU - Saito, Kazuki

AU - Fiehn, Oliver

AU - Arita, Masanori

PY - 2016/8/16

Y1 - 2016/8/16

N2 - Compound identification from accurate mass MS/MS spectra is a bottleneck for untargeted metabolomics. In this study, we propose nine rules of hydrogen rearrangement (HR) during bond cleavages in low-energy collision-induced dissociation (CID). These rules are based on the classic even-electron rule and cover heteroatoms and multistage fragmentation. We evaluated our HR rules by the statistics of MassBank MS/MS spectra in addition to enthalpy calculations, yielding three levels of computational MS/MS annotation: "resolved" (regular HR behavior following HR rules), "semiresolved" (irregular HR behavior), and "formula-assigned" (lacking structure assignment). With this nomenclature, 78.4% of a total of 18506 MS/MS fragment ions in the MassBank database and 84.8% of a total of 36370 MS/MS fragment ions in the GNPS database were (semi-) resolved by predicted bond cleavages. We also introduce the MS-FINDER software for structure elucidation. Molecular formulas of precursor ions are determined from accurate mass, isotope ratio, and product ion information. All isomer structures of the predicted formula are retrieved from metabolome databases, and MS/MS fragmentations are predicted in silico. The structures are ranked by a combined weighting score considering bond dissociation energies, mass accuracies, fragment linkages, and, most importantly, nine HR rules. The program was validated by its ability to correctly calculate molecular formulas with 98.0% accuracy for 5063 MassBank MS/MS records and to yield the correct structural isomer with 82.1% accuracy within the top-3 candidates. In a test with 936 manually identified spectra from an untargeted HILIC-QTOF MS data set of human plasma, formulas were correctly predicted in 90.4% of the cases, and the correct isomer structure was retrieved at 80.4% probability within the top-3 candidates, including for compounds that were absent in mass spectral libraries. The MS-FINDER software is freely available at http://prime.psc.riken.jp/.

AB - Compound identification from accurate mass MS/MS spectra is a bottleneck for untargeted metabolomics. In this study, we propose nine rules of hydrogen rearrangement (HR) during bond cleavages in low-energy collision-induced dissociation (CID). These rules are based on the classic even-electron rule and cover heteroatoms and multistage fragmentation. We evaluated our HR rules by the statistics of MassBank MS/MS spectra in addition to enthalpy calculations, yielding three levels of computational MS/MS annotation: "resolved" (regular HR behavior following HR rules), "semiresolved" (irregular HR behavior), and "formula-assigned" (lacking structure assignment). With this nomenclature, 78.4% of a total of 18506 MS/MS fragment ions in the MassBank database and 84.8% of a total of 36370 MS/MS fragment ions in the GNPS database were (semi-) resolved by predicted bond cleavages. We also introduce the MS-FINDER software for structure elucidation. Molecular formulas of precursor ions are determined from accurate mass, isotope ratio, and product ion information. All isomer structures of the predicted formula are retrieved from metabolome databases, and MS/MS fragmentations are predicted in silico. The structures are ranked by a combined weighting score considering bond dissociation energies, mass accuracies, fragment linkages, and, most importantly, nine HR rules. The program was validated by its ability to correctly calculate molecular formulas with 98.0% accuracy for 5063 MassBank MS/MS records and to yield the correct structural isomer with 82.1% accuracy within the top-3 candidates. In a test with 936 manually identified spectra from an untargeted HILIC-QTOF MS data set of human plasma, formulas were correctly predicted in 90.4% of the cases, and the correct isomer structure was retrieved at 80.4% probability within the top-3 candidates, including for compounds that were absent in mass spectral libraries. The MS-FINDER software is freely available at http://prime.psc.riken.jp/.

UR - http://www.scopus.com/inward/record.url?scp=84983048041&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84983048041&partnerID=8YFLogxK

U2 - 10.1021/acs.analchem.6b00770

DO - 10.1021/acs.analchem.6b00770

M3 - Article

AN - SCOPUS:84983048041

VL - 88

SP - 7946

EP - 7958

JO - Analytical Chemistry

JF - Analytical Chemistry

SN - 0003-2700

IS - 16

ER -