プログラム

招待講演

第3日　5月17日（木）　14:25～15:15　A会場（オービットホール）

3A-IL-P-1425　　PDF

The “big data" age has arrived to proteomics: a world full of possibilities

(EMBL-EBI)
^oVizcaino, Juan Antonio

During the last years we have developed an infrastructure to enable data sharing of mass spectrometry proteomics data in the public domain, including the world-leading PRIDE database and related tools, open data standards and the establishment of the worldwide ProteomeXchange Consortium. Thanks, among other efforts, to the success of PRIDE and ProteomeXchange, the proteomics community is now widely embracing open data policies.
This plethora of data is being increasingly reused by the research community, e.g. in proteogenomics approaches, to build spectral libraries, for tool benchmarking or in innovative meta-analysis studies, among other applications. In that context, I will explain in higher detail an in-house project aiming to build a functional and comprehensive version of the human phospho-proteome, by reusing public datasets.
We also aim to facilitate data reuse by third parties by building open, reproducible, and scalable proteomics data analysis pipelines. As a proof of context, these pipelines are deployed first in the EMBL-EBI “Embassy Cloud", with the idea that in the future they can be made available in other cloud infrastructures, and that can be freely reused by any interested researcher in the community. These pipelines are connected to PRIDE, bringing the analysis tools closer to the data.