Accurate label-free quantification by directLFQ to compare unlimited numbers of proteomes

Recent advances in mass spectrometry (MS)-based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Whereas peptide identification is already scalable, most label-free quantification (LFQ) algorithm...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors Ammar, Constantin, Schessner, Julia Patricia, Willems, Sander, Andre Clemens Michaelis, Mann, Matthias
Format Paper
LanguageEnglish
Published Cold Spring Harbor Cold Spring Harbor Laboratory Press 19.02.2023
Cold Spring Harbor Laboratory
Edition1.1
Subjects
Online AccessGet full text
ISSN2692-8205
2692-8205
DOI10.1101/2023.02.17.528962

Cover

More Information
Summary:Recent advances in mass spectrometry (MS)-based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Whereas peptide identification is already scalable, most label-free quantification (LFQ) algorithms scale quadratic or cubic with the sample numbers, which may even preclude the analysis of large-scale data. Here we introduce directLFQ, a ratio-based approach for sample normalization and the calculation of protein intensities. It estimates quantities via aligning samples and ion traces by shifting them on top of each other in logarithmic space. Importantly, directLFQ scales linearly with the number of samples, allowing analyses of large studies to finish in minutes instead of days or months. We quantify 10,000 proteomes in 10 minutes and 100,000 proteomes in less than two hours - thousand-fold faster than some implementations of the popular LFQ algorithm MaxLFQ. In-depth characterization of directLFQ reveals excellent normalization properties and benchmark results, comparing favorably to MaxLFQ for both data-dependent acquisition (DDA) and data-independent acquisition (DIA). Additionally, directLFQ provides normalized peptide intensity estimates for peptide-level comparisons. It is available as an open-source Python package and as a GUI with a one-click installer and can be used in the AlphaPept ecosystem as well as downstream of most common computational proteomics pipelines.Competing Interest StatementThe authors have declared no competing interest.
Bibliography:SourceType-Working Papers-1
ObjectType-Working Paper/Pre-Print-1
content type line 50
Competing Interest Statement: The authors have declared no competing interest.
ISSN:2692-8205
2692-8205
DOI:10.1101/2023.02.17.528962