Computational framework for identification of intact glycopeptides in complex samples

Anoop Mayampurath, Chuan Yih Yu, Ehwang Song, Jagadheshwar Balan, Yehia Mechref, Haixu Tang

Research output: Contribution to journalArticlepeer-review

82 Scopus citations


Glycosylation is an important protein modification that involves enzymatic attachment of sugars to amino acid residues. Understanding the structure of these sugars and the effects of glycosylation are vital for developing indicators of disease development and progression. Although computational methods based on mass spectrometric data have proven to be effective in monitoring changes in the glycome, developing such methods for the glycoproteome are challenging, largely due to the inherent complexity in simultaneously studying glycan structures with their corresponding glycosylation sites. This paper introduces a computational framework for identifying intact N-linked glycopeptides, i.e. glycopeptides with N-linked glycans attached to their glycosylation sites, in complex proteome samples. Scoring algorithms are presented for tandem mass spectra of glycopeptides resulting from collision-induced dissociation (CID), higher-energy C-trap dissociation (HCD), and electron transfer dissociation (ETD) fragmentation modes. An empirical false-discovery rate estimation method, based on a target-decoy search approach, is derived for assigning confidence. The power of our method is further enhanced when multiple data sets are pooled together to increase identification confidence. Using this framework, 103 highly confident N-linked glycopeptides from 53 sites across 33 glycoproteins were identified in complex human serum proteome samples using conventional proteomic platforms with standard depletion of the 7-most abundant proteins. These results indicate that our method is ready to be used for characterizing site-specific protein glycosylation in complex samples.

Original languageEnglish
Pages (from-to)453-463
Number of pages11
JournalAnalytical Chemistry
Issue number1
StatePublished - Jan 7 2014


Dive into the research topics of 'Computational framework for identification of intact glycopeptides in complex samples'. Together they form a unique fingerprint.

Cite this