日本質量分析学会 第66回質量分析総合討論会

Program

Poster Presentations

Day 4, May 18(Fri.)  Poster

Metadata Curation for fully utilizing raw MS data in jPOST repository

(1Kumamoto Univ., 2Niigata Univ., 3DBCLS, 4Niigata Univ., 5Kyushu Univ., 6Kyoto Univ., 7Kyoto Univ., 8Trans-IT)
oDaiki Kobayashi1, Norie Araki1, Shujiro Okuda2, Yu Watanabe2, Yuki Moriya3, Shin Kawano3, Tadashi Yamamoto4, Masaki Matsumoto5, Tomoyo Takami5, Akiyasu Yoshizawa6, Tsuyoshi Tabata6, Mio Iwasaki7, Naoyuki Sugiyama6, Satoshi Tanaka8, Susumu Goto3, Yasushi Ishihama6

A large amount of mass spectrometry (MS)-based proteomics data has been generated owing to the recent striking advance in MS instruments and methodology. The construction of management system for these MS files, such as a raw data repository, a standardized re-analysis protocol, and integrated database, is much helpful to publicly utilize these data sets. Here we have initiated the jPOST (Japan ProteOme STandard Repository/Database) project1 to construct such kind of data management system. To fully utilize these MS datasets, the metadata annotation of the MS raw file is important to precisely identify the sample sources, the sample preparation methods, and the MS acquisition settings. However, these metadata are sometimes not in detail, or have inappropriate annotations. First, these files have been divided into each reanalysis unit according to these metadata annotation because raw MS files are re-analyzed to identify peptides/proteins with high quality. Then, these metadata has been converted to the controlled vocabularies used in jPOST database. Especially, we have established the classification method of disease-related samples based on sample types, organs, and disease types. In this presentation, we show some cases of the curation workflow for the metadata annotating jPOST MS files.