Integrated Proteomic Pipeline Using Multiple Search Engines for a Proteogenomic Study with a Controlled Protein False Discovery Rate
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Gun Wook Park | - |
dc.contributor.author | Heeyoun Hwang | - |
dc.contributor.author | Kwang Hoe Kim | - |
dc.contributor.author | Ju Yeon Lee | - |
dc.contributor.author | Hyun Kyoung Lee | - |
dc.contributor.author | Eun Sun Ji | - |
dc.contributor.author | Sung-Kyu Robin Park | - |
dc.contributor.author | John R. Yates | - |
dc.contributor.author | Kyung-Hoon Kwon | - |
dc.contributor.author | Young Mok Park | - |
dc.contributor.author | Hyoung-Joo Lee | - |
dc.contributor.author | Young-Ki Paik | - |
dc.contributor.author | Jin Young Kim | - |
dc.contributor.author | Jong Shin Yoo | - |
dc.date.available | 2017-05-30T05:56:32Z | - |
dc.date.created | 2017-05-22 | - |
dc.date.issued | 2016-08 | - |
dc.identifier.issn | 1535-3893 | - |
dc.identifier.uri | https://pr.ibs.re.kr/handle/8788114/3575 | - |
dc.description.abstract | In the Chromosome-Centric Human Proteome Project (C-HPP), false-positive identification by peptide spectrum matches (PSMs) after database searches is a major issue for proteogenomic studies using liquid-chromatography and mass-spectrometry-based large proteomic profiling. Here we developed a simple strategy for protein identification, with a controlled false discovery rate (FDR) at the protein level, using an integrated proteomic pipeline (IPP) that consists of four engrailed steps as follows. First, using three different search engines, SEQUEST, MASCOT, and MS-GF+, individual proteomic searches were performed against the neXtProt database. Second, the search results from the PSMs were combined using statistical evaluation tools including DTASelect and Percolator. Third, the peptide search scores were converted into E-scores normalized using an in-house program. Last, ProteinInferencer was used to filter the proteins containing two or more peptides with a controlled FDR of 1.0% at the protein level. Finally, we compared the performance of the IPP to a conventional proteomic pipeline (CPP) for protein identification using a controlled FDR of <1% at the protein level. Using the IPP, a total of 5756 proteins (vs 4453 using the CPP) including 477 alternative splicing variants (vs 182 using the CPP) were identified from human hippocampal tissue. In addition, a total of 10 missing proteins (vs 7 using the CPP) were identified with two or more unique peptides, and their tryptic peptides were validated using MS/MS spectral pattern from a repository database or their corresponding synthetic peptides. This study shows that the IPP effectively improved the identification of proteins, including alternative splicing variants and missing proteins, in human hippocampal tissues for the C-HPP. All RAW files used in this study were deposited in ProteomeXchange (PXD000395). © 2016 American Chemical Society. | - |
dc.description.uri | 1 | - |
dc.language | 영어 | - |
dc.publisher | AMER CHEMICAL SOC | - |
dc.subject | null | - |
dc.subject | null | - |
dc.subject | null | - |
dc.subject | false discovery rate | - |
dc.subject | proteogenomics | - |
dc.subject | integrated proteomic pipeline | - |
dc.subject | E-value | - |
dc.subject | E-score | - |
dc.subject | ProteinInferencer | - |
dc.subject | missing protein | - |
dc.subject | alternative splicing variant | - |
dc.title | Integrated Proteomic Pipeline Using Multiple Search Engines for a Proteogenomic Study with a Controlled Protein False Discovery Rate | - |
dc.type | Article | - |
dc.type.rims | ART | - |
dc.identifier.wosid | 000387303100014 | - |
dc.identifier.scopusid | 2-s2.0-84994591691 | - |
dc.identifier.rimsid | 59455 | ko |
dc.date.tcdate | 2018-10-01 | - |
dc.contributor.affiliatedAuthor | Young Mok Park | - |
dc.identifier.doi | 10.1021/acs.jproteome.6b00376 | - |
dc.identifier.bibliographicCitation | JOURNAL OF PROTEOME RESEARCH, v.15, no.11, pp.4082 - 4090 | - |
dc.citation.title | JOURNAL OF PROTEOME RESEARCH | - |
dc.citation.volume | 15 | - |
dc.citation.number | 11 | - |
dc.citation.startPage | 4082 | - |
dc.citation.endPage | 4090 | - |
dc.date.scptcdate | 2018-10-01 | - |
dc.description.wostc | 7 | - |
dc.description.scptc | 7 | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |