Oral Sessions
(Day1, Day2, Day3, Day4)
Poster Presentations
(Day1, Day2, Day3, Day4)
Luncheon Seminars
(Day1, Day2, Day3, Day4)
Poster Presentations
- Day 4, May 18(Fri.) Poster
-
4P-36 PDF
Metadata Curation for fully utilizing raw MS data in jPOST repository
A large amount of mass spectrometry (MS)-based proteomics data has been generated owing to the recent striking advance in MS instruments and methodology. The construction of management system for these MS files, such as a raw data repository, a standardized re-analysis protocol, and integrated database, is much helpful to publicly utilize these data sets. Here we have initiated the jPOST (Japan ProteOme STandard Repository/Database) project1 to construct such kind of data management system. To fully utilize these MS datasets, the metadata annotation of the MS raw file is important to precisely identify the sample sources, the sample preparation methods, and the MS acquisition settings. However, these metadata are sometimes not in detail, or have inappropriate annotations. First, these files have been divided into each reanalysis unit according to these metadata annotation because raw MS files are re-analyzed to identify peptides/proteins with high quality. Then, these metadata has been converted to the controlled vocabularies used in jPOST database. Especially, we have established the classification method of disease-related samples based on sample types, organs, and disease types. In this presentation, we show some cases of the curation workflow for the metadata annotating jPOST MS files.