CSV-Filter: a deep learning-based comprehensive structural variant filtering method for both short and long reads

Abstract Motivation Structural variants (SVs) play an important role in genetic research and precision medicine. As existing SV detection methods usually contain a substantial number of false positive calls, approaches to filter the detection results are needed. Results We developed a novel deep lea...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics (Oxford, England) Vol. 40; no. 9
Main Authors Xia, Zeyu, Xiang, Weiming, Wang, Qingzhe, Li, Xingze, Li, Yilin, Gao, Junyu, Tang, Tao, Yang, Canqun, Cui, Yingbo
Format Journal Article
LanguageEnglish
Published England Oxford University Press 02.09.2024
Oxford Publishing Limited (England)
Subjects
Online AccessGet full text
ISSN1367-4811
1367-4803
1367-4811
DOI10.1093/bioinformatics/btae539

Cover

More Information
Summary:Abstract Motivation Structural variants (SVs) play an important role in genetic research and precision medicine. As existing SV detection methods usually contain a substantial number of false positive calls, approaches to filter the detection results are needed. Results We developed a novel deep learning-based SV filtering tool, CSV-Filter, for both short and long reads. CSV-Filter uses a novel multi-level grayscale image encoding method based on CIGAR strings of the alignment results and employs image augmentation techniques to improve SV feature extraction. CSV-Filter also utilizes self-supervised learning networks for transfer as classification models, and employs mixed-precision operations to accelerate training. The experiments showed that the integration of CSV-Filter with popular SV detection tools could considerably reduce false positive SVs for short and long reads, while maintaining true positive SVs almost unchanged. Compared with DeepSVFilter, a SV filtering tool for short reads, CSV-Filter could recognize more false positive calls and support long reads as an additional feature. Availability and implementation https://github.com/xzyschumacher/CSV-Filter
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1367-4811
1367-4803
1367-4811
DOI:10.1093/bioinformatics/btae539