PriLive: privacy-preserving real-time filtering for next-generation sequencing

Abstract Motivation In next-generation sequencing, re-identification of individuals and other privacy-breaching strategies can be applied even for anonymized data. This also holds true for applications in which human DNA is acquired as a by-product, e.g. for viral or metagenomic samples from a human...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 34; no. 14; pp. 2376 - 2383
Main Authors Loka, Tobias P, Tausch, Simon H, Dabrowski, Piotr W, Radonić, Aleksandar, Nitsche, Andreas, Renard, Bernhard Y
Format Journal Article
LanguageEnglish
Published England Oxford University Press 15.07.2018
Online AccessGet full text
ISSN1367-4803
1367-4811
1460-2059
1367-4811
DOI10.1093/bioinformatics/bty128

Cover

More Information
Summary:Abstract Motivation In next-generation sequencing, re-identification of individuals and other privacy-breaching strategies can be applied even for anonymized data. This also holds true for applications in which human DNA is acquired as a by-product, e.g. for viral or metagenomic samples from a human host. Conventional data protection strategies including cryptography and post-hoc filtering are only appropriate for the final and processed sequencing data. This can result in an insufficient level of data protection and a considerable time delay in the further analysis workflow. Results We present PriLive, a novel tool for the automated removal of sensitive data while the sequencing machine is running. Thereby, human sequence information can be detected and removed before being completely produced. This facilitates the compliance with strict data protection regulations. The unique characteristic to cause almost no time delay for further analyses is also a clear benefit for applications other than data protection. Especially if the sequencing data are dominated by known background signals, PriLive considerably accelerates consequent analyses by having only fractions of input data. Besides these conceptual advantages, PriLive achieves filtering results at least as accurate as conventional post-hoc filtering tools. Availability and implementation PriLive is open-source software available at https://gitlab.com/rki_bioinformatics/PriLive. Supplementary information Supplementary data are available at Bioinformatics online.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1367-4803
1367-4811
1460-2059
1367-4811
DOI:10.1093/bioinformatics/bty128