IBS Publications Repository: Compact variant-rich customized sequence database and a fast and sensitive database search for efficient proteogenomic analyses

BROWSE

Related Scientist

chae,sehyun's photo.

chae,sehyun: 식물노화·수명연구단

ITEM VIEW & DOWNLOAD

IBS Publications RepositoryCenter for Plant Aging Research (식물 노화·수명 연구단)1. Journal Papers (저널논문)

Compact variant-rich customized sequence database and a fast and sensitive database search for efficient proteogenomic analyses

DC Field	Value	Language
dc.contributor.author	Park, H	-
dc.contributor.author	Bae, J	-
dc.contributor.author	Kim, H	-
dc.contributor.author	Kim, S	-
dc.contributor.author	Kim, H	-
dc.contributor.author	Mun, DG	-
dc.contributor.author	Joh, Y	-
dc.contributor.author	Lee, W	-
dc.contributor.author	Sehyun Chae	-
dc.contributor.author	Lee, S	-
dc.contributor.author	Kim, HK	-
dc.contributor.author	Daehee Hwang	-
dc.contributor.author	Lee, SW	-
dc.contributor.author	Paek, E	-
dc.date.available	2015-04-21T08:55:01Z	-
dc.date.created	2015-01-20	-
dc.date.issued	2014-12	-
dc.identifier.issn	1615-9853	-
dc.identifier.uri	https://pr.ibs.re.kr/handle/8788114/1443	-
dc.description.abstract	In proteogenomic analysis, construction of a compact, customized database from mRNA-seq data and a sensitive search of both reference and customized databases are essential to accurately determine protein abundances and structural variations at the protein level. However, these tasks have not been systematically explored, but rather performed in an ad-hoc fashion. Here, we present an effectivemethod for constructing a compact database containing comprehensive sequences of sample-specific variants—single nucleotide variants, insertions/deletions, and stop-codon mutations derived from Exome-seq and RNA-seq data. It, however, occupies less space by storing variant peptides, not variant proteins. We also present an efficient search method for both customized and reference databases. The separate searches of the two databases increase the search time, and a unified search is less sensitive to identify variant peptides due to the smaller size of the customized database, compared to the reference database, in the target-decoy setting. Our method searches the unified database once, but performs targetdecoy validations separately. Experimental results show that our approach is as fast as the unified search and as sensitive as the separate searches. Our customized database includes mutation information in the headers of variant peptides, thereby facilitating the inspection of peptide-spectrum matches.	-
dc.description.uri	1	-
dc.language	영어	-
dc.publisher	WILEY-BLACKWELL	-
dc.subject	Bioinformatics / Early onset gastric cancer / Peptide identification / Proteogenomics/ / Sequence database	-
dc.title	Compact variant-rich customized sequence database and a fast and sensitive database search for efficient proteogenomic analyses	-
dc.type	Article	-
dc.type.rims	ART	-
dc.identifier.wosid	000345915200012	-
dc.identifier.scopusid	2-s2.0-84913526299	-
dc.identifier.rimsid	16781	ko
dc.date.tcdate	2018-10-01	-
dc.contributor.affiliatedAuthor	Sehyun Chae	-
dc.contributor.affiliatedAuthor	Daehee Hwang	-
dc.identifier.doi	10.1002/pmic.201400225	-
dc.identifier.bibliographicCitation	PROTEOMICS, v.14, no.23-24, pp.2742 - 2749	-
dc.citation.title	PROTEOMICS	-
dc.citation.volume	14	-
dc.citation.number	23-24	-
dc.citation.startPage	2742	-
dc.citation.endPage	2749	-
dc.date.scptcdate	2018-10-01	-
dc.description.wostc	9	-
dc.description.scptc	9	-
dc.description.journalClass	1	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordPlus	RNA-SEQ DATA	-
dc.subject.keywordPlus	PEPTIDE IDENTIFICATION	-
dc.subject.keywordPlus	PROTEIN IDENTIFICATION	-
dc.subject.keywordPlus	CONSTRUCTION	-
dc.subject.keywordPlus	PROTEOMICS	-
dc.subject.keywordPlus	FRAMEWORK	-
dc.subject.keywordPlus	STRATEGY	-
dc.subject.keywordPlus	CANCER	-
dc.subject.keywordPlus	PAIRS	-
dc.subject.keywordAuthor	Bioinformatics	-
dc.subject.keywordAuthor	Early onset gastric cancer	-
dc.subject.keywordAuthor	Peptide identification	-
dc.subject.keywordAuthor	Proteogenomics	-
dc.subject.keywordAuthor	Sequence database	-