PriLive: privacy-preserving real-time filtering for next-generation sequencing
Published in | Bioinformatics Vol. 34; no. 14; pp. 2376 - 2383 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published | England: Oxford University Press, 15.07.2018 |
ISSN | 1367-4803 1367-4811 1460-2059 |
DOI | 10.1093/bioinformatics/bty128 |
Summary: | Abstract
Motivation
In next-generation sequencing, re-identification of individuals and other privacy-breaching strategies can be applied even to anonymized data. This also holds true for applications in which human DNA is acquired as a by-product, e.g. for viral or metagenomic samples from a human host. Conventional data protection strategies, including cryptography and post-hoc filtering, are only applicable to the final, processed sequencing data. This can result in an insufficient level of data protection and a considerable time delay in the subsequent analysis workflow.
Results
We present PriLive, a novel tool for the automated removal of sensitive data while the sequencing machine is running. Human sequence information can thus be detected and removed before it is completely produced. This facilitates compliance with strict data protection regulations. Because PriLive causes almost no time delay for further analyses, it is also beneficial for applications other than data protection. Especially if the sequencing data are dominated by known background signals, PriLive considerably accelerates subsequent analyses because they operate on only a fraction of the input data. Besides these conceptual advantages, PriLive achieves filtering results at least as accurate as those of conventional post-hoc filtering tools.
Availability and implementation
PriLive is open-source software available at https://gitlab.com/rki_bioinformatics/PriLive.
Supplementary information
Supplementary data are available at Bioinformatics online. |
---|---|