A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry

Runmin Yang, Daming Zhu, Qiang Kou, Poornima Bhat-Nakshatri, Harikrishna Nakshatri, Si Wu, Xiaowen Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Database search is the main approach for identifying proteoforms using top-down tandem mass spectra. However, it is extremely slow to align a query spectrum against all protein sequences in a large database when the target proteoform that produced the spectrum contains post-translational modifications and/or mutations. As a result, efficient and sensitive protein sequence filtering algorithms are essential for speeding up database search. In this paper, we propose a novel filtering algorithm, which generates spectrum graphs from subspectra of the query spectrum and searches them against the protein database to find good candidates. Compared with the sequence tag and gaped tag approaches, the proposed method circumvents the step of tag extraction, thus simplifying data processing. Experimental results on real data showed that the proposed method achieved both high speed and high sensitivity in protein sequence filtration.

Original languageEnglish (US)
Title of host publicationProceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages222-229
Number of pages8
Volume2017-January
ISBN (Electronic)9781509030491
DOIs
StatePublished - Dec 15 2017
Event2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017 - Kansas City, United States
Duration: Nov 13 2017Nov 16 2017

Other

Other2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017
CountryUnited States
CityKansas City
Period11/13/1711/16/17

Fingerprint

Mass spectrometry
Mass Spectrometry
Databases
Proteins
Protein Databases
Post Translational Protein Processing
Mutation

Keywords

  • filtering algorithm
  • Mass spectrometry
  • spectrum graph

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics

Cite this

Yang, R., Zhu, D., Kou, Q., Bhat-Nakshatri, P., Nakshatri, H., Wu, S., & Liu, X. (2017). A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry. In Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017 (Vol. 2017-January, pp. 222-229). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BIBM.2017.8217653

A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry. / Yang, Runmin; Zhu, Daming; Kou, Qiang; Bhat-Nakshatri, Poornima; Nakshatri, Harikrishna; Wu, Si; Liu, Xiaowen.

Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017. Vol. 2017-January Institute of Electrical and Electronics Engineers Inc., 2017. p. 222-229.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Yang, R, Zhu, D, Kou, Q, Bhat-Nakshatri, P, Nakshatri, H, Wu, S & Liu, X 2017, A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry. in Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017. vol. 2017-January, Institute of Electrical and Electronics Engineers Inc., pp. 222-229, 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017, Kansas City, United States, 11/13/17. https://doi.org/10.1109/BIBM.2017.8217653
Yang R, Zhu D, Kou Q, Bhat-Nakshatri P, Nakshatri H, Wu S et al. A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry. In Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017. Vol. 2017-January. Institute of Electrical and Electronics Engineers Inc. 2017. p. 222-229 https://doi.org/10.1109/BIBM.2017.8217653
Yang, Runmin ; Zhu, Daming ; Kou, Qiang ; Bhat-Nakshatri, Poornima ; Nakshatri, Harikrishna ; Wu, Si ; Liu, Xiaowen. / A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry. Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017. Vol. 2017-January Institute of Electrical and Electronics Engineers Inc., 2017. pp. 222-229
@inproceedings{972e05122f4c4cd5a90b55b33f8eaad3,
title = "A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry",
abstract = "Database search is the main approach for identifying proteoforms using top-down tandem mass spectra. However, it is extremely slow to align a query spectrum against all protein sequences in a large database when the target proteoform that produced the spectrum contains post-translational modifications and/or mutations. As a result, efficient and sensitive protein sequence filtering algorithms are essential for speeding up database search. In this paper, we propose a novel filtering algorithm, which generates spectrum graphs from subspectra of the query spectrum and searches them against the protein database to find good candidates. Compared with the sequence tag and gaped tag approaches, the proposed method circumvents the step of tag extraction, thus simplifying data processing. Experimental results on real data showed that the proposed method achieved both high speed and high sensitivity in protein sequence filtration.",
keywords = "filtering algorithm, Mass spectrometry, spectrum graph",
author = "Runmin Yang and Daming Zhu and Qiang Kou and Poornima Bhat-Nakshatri and Harikrishna Nakshatri and Si Wu and Xiaowen Liu",
year = "2017",
month = "12",
day = "15",
doi = "10.1109/BIBM.2017.8217653",
language = "English (US)",
volume = "2017-January",
pages = "222--229",
booktitle = "Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry

AU - Yang, Runmin

AU - Zhu, Daming

AU - Kou, Qiang

AU - Bhat-Nakshatri, Poornima

AU - Nakshatri, Harikrishna

AU - Wu, Si

AU - Liu, Xiaowen

PY - 2017/12/15

Y1 - 2017/12/15

N2 - Database search is the main approach for identifying proteoforms using top-down tandem mass spectra. However, it is extremely slow to align a query spectrum against all protein sequences in a large database when the target proteoform that produced the spectrum contains post-translational modifications and/or mutations. As a result, efficient and sensitive protein sequence filtering algorithms are essential for speeding up database search. In this paper, we propose a novel filtering algorithm, which generates spectrum graphs from subspectra of the query spectrum and searches them against the protein database to find good candidates. Compared with the sequence tag and gaped tag approaches, the proposed method circumvents the step of tag extraction, thus simplifying data processing. Experimental results on real data showed that the proposed method achieved both high speed and high sensitivity in protein sequence filtration.

AB - Database search is the main approach for identifying proteoforms using top-down tandem mass spectra. However, it is extremely slow to align a query spectrum against all protein sequences in a large database when the target proteoform that produced the spectrum contains post-translational modifications and/or mutations. As a result, efficient and sensitive protein sequence filtering algorithms are essential for speeding up database search. In this paper, we propose a novel filtering algorithm, which generates spectrum graphs from subspectra of the query spectrum and searches them against the protein database to find good candidates. Compared with the sequence tag and gaped tag approaches, the proposed method circumvents the step of tag extraction, thus simplifying data processing. Experimental results on real data showed that the proposed method achieved both high speed and high sensitivity in protein sequence filtration.

KW - filtering algorithm

KW - Mass spectrometry

KW - spectrum graph

UR - http://www.scopus.com/inward/record.url?scp=85046277713&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85046277713&partnerID=8YFLogxK

U2 - 10.1109/BIBM.2017.8217653

DO - 10.1109/BIBM.2017.8217653

M3 - Conference contribution

AN - SCOPUS:85046277713

VL - 2017-January

SP - 222

EP - 229

BT - Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017

PB - Institute of Electrical and Electronics Engineers Inc.

ER -