Decoding the Proteins Inside Every Cell

BioE Professor Nikolai Slavov, in collaboration with EMBL-EBI, Cambridge, UK, was awarded a £2,000,000 NSF grant for “Making Single-Cell Proteomics data FAIR.”
Abstract Source: NSF
An award is made to Northeastern University to enable the development of advanced cyberinfrastructure that will support the FAIR (Findable, Accessible, Interoperable, Reusable) use of single-cell proteomics (SCP) data. SCP is a rapidly emerging field that allows scientists to directly measure proteins in individual cells, offering insights into biology that cannot be obtained by measuring RNA alone. The project will develop open data standards, bioinformatics tools, and data-sharing platforms that ensure SCP data can be easily reused by researchers worldwide. This work will also provide educational opportunities for students through hands-on training in data science and computational biology, promote open science practices, and deliver outreach programs aimed at improving public scientific literacy. By fostering data sharing and reproducibility, the project contributes to a robust scientific workforce and broader societal goals such as improving human health, protecting the environment, and advancing biological understanding through collaborative research.
The research addresses urgent needs in the scientific community by enabling rigorous and transparent analysis of single-cell proteomics data. As this technology matures, large volumes of complex data are being generated, yet the current infrastructure limits researchers’ ability to standardize, share, and interpret these datasets effectively. The project will formalize open metadata standards, extend open-source pipelines for quality control and analysis, and integrate reanalyzed public datasets into widely used resources such as the PRIDE database, the Single Cell Expression Atlas, and the Chan Zuckerberg Initiative?s CellxGene portal. By supporting reproducible reanalysis and downstream meta-analysis, the project will unlock new biological insights, such as identifying protein covariation patterns that may reveal mechanisms of cellular regulation. This work will empower the scientific community to advance discovery through more transparent, interoperable, and accessible use of proteomics data.