EPMA: Efficient pattern matching algorithm for DNA sequences

•We present a brief introduction to the applications of pattern matching.•We present a novel pattern matching algorithm for DNA sequences.•We present multithreading in pattern matching.•We use Turing machine for pattern matching.•We present comparative results with significance improvements. To solv...

Full description

Saved in:
Bibliographic Details
Published inExpert systems with applications Vol. 80; pp. 162 - 170
Main Authors Tahir, Muhammad, Sardaraz, Muhammad, Ikram, Ataul Aziz
Format Journal Article
LanguageEnglish
Published New York Elsevier Ltd 01.09.2017
Elsevier BV
Subjects
Online AccessGet full text
ISSN0957-4174
1873-6793
DOI10.1016/j.eswa.2017.03.026

Cover

Abstract •We present a brief introduction to the applications of pattern matching.•We present a novel pattern matching algorithm for DNA sequences.•We present multithreading in pattern matching.•We use Turing machine for pattern matching.•We present comparative results with significance improvements. To solve, manage and analyze biological problems using computer technology is called bioinformatics. With the emergent evolution in computing era, the volume of biological data has increased significantly. These large amounts of data have increased the need to analyze it in reasonable space and time. DNA sequences contain basic information of species, and pattern matching between different species is an important and challenging issue to cope with. There exist generalized string matching and some specialized DNA pattern matching algorithms in the literature. There is still need to develop fast and space efficient pattern matching algorithms that consider new hardware development. In this paper, we present a novel DNA sequences pattern matching algorithm called EPMA. The proposed algorithm utilizes fixed length 2-bits binary encoding, segmentation and multi-threading. The idea is to find the pattern with multiple searcher agents concurrently. The proposed algorithm is validated with comparative experimental results. The results show that the new algorithm is a good candidate for DNA sequence pattern matching applications. The algorithm effectively utilizes modern hardware and will help researchers in the sequence alignment, short read error correction, phylogenetic inference etc. Furthermore, the proposed method can be extended to generalized string matching and their applications.
AbstractList To solve, manage and analyze biological problems using computer technology is called bioinformatics. With the emergent evolution in computing era, the volume of biological data has increased significantly. These large amounts of data have increased the need to analyze it in reasonable space and time. DNA sequences contain basic information of species, and pattern matching between different species is an important and challenging issue to cope with. There exist generalized string matching and some specialized DNA pattern matching algorithms in the literature. There is still need to develop fast and space efficient pattern matching algorithms that consider new hardware development. In this paper, we present a novel DNA sequences pattern matching algorithm called EPMA. The proposed algorithm utilizes fixed length 2-bits binary encoding, segmentation and multi-threading. The idea is to find the pattern with multiple searcher agents concurrently. The proposed algorithm is validated with comparative experimental results. The results show that the new algorithm is a good candidate for DNA sequence pattern matching applications. The algorithm effectively utilizes modern hardware and will help researchers in the sequence alignment, short read error correction, phylogenetic inference etc. Furthermore, the proposed method can be extended to generalized string matching and their applications.
•We present a brief introduction to the applications of pattern matching.•We present a novel pattern matching algorithm for DNA sequences.•We present multithreading in pattern matching.•We use Turing machine for pattern matching.•We present comparative results with significance improvements. To solve, manage and analyze biological problems using computer technology is called bioinformatics. With the emergent evolution in computing era, the volume of biological data has increased significantly. These large amounts of data have increased the need to analyze it in reasonable space and time. DNA sequences contain basic information of species, and pattern matching between different species is an important and challenging issue to cope with. There exist generalized string matching and some specialized DNA pattern matching algorithms in the literature. There is still need to develop fast and space efficient pattern matching algorithms that consider new hardware development. In this paper, we present a novel DNA sequences pattern matching algorithm called EPMA. The proposed algorithm utilizes fixed length 2-bits binary encoding, segmentation and multi-threading. The idea is to find the pattern with multiple searcher agents concurrently. The proposed algorithm is validated with comparative experimental results. The results show that the new algorithm is a good candidate for DNA sequence pattern matching applications. The algorithm effectively utilizes modern hardware and will help researchers in the sequence alignment, short read error correction, phylogenetic inference etc. Furthermore, the proposed method can be extended to generalized string matching and their applications.
Author Ikram, Ataul Aziz
Tahir, Muhammad
Sardaraz, Muhammad
Author_xml – sequence: 1
  givenname: Muhammad
  surname: Tahir
  fullname: Tahir, Muhammad
  email: m_tahir@ciit-attock.edu.pk
  organization: Department of Computer Science, COMSATS Institute of Information Technology, Attock Campus, Pakistan
– sequence: 2
  givenname: Muhammad
  surname: Sardaraz
  fullname: Sardaraz, Muhammad
  email: sardaraz@ciit-attock.edu.pk
  organization: Department of Computer Science, COMSATS Institute of Information Technology, Attock Campus, Pakistan
– sequence: 3
  givenname: Ataul Aziz
  surname: Ikram
  fullname: Ikram, Ataul Aziz
  email: ata.ikram@nu.edu.pk
  organization: Department of Electrical Engineering, National University, Islamabad, Pakistan
BookMark eNp9kDtPwzAUhS0EEm3hDzBFYk6w4yR2UJeqlIdUHgPM1o1z0zpqk2K7IP49jsrE0Oku57tH5xuT067vkJArRhNGWXHTJui-IUkpEwnlCU2LEzJiUvC4ECU_JSNa5iLOmMjOydi5loYgpWJEpou359lttGgaow12PtqB92i7aAter023imCz6q3x623U9Da6e5lFDj_32Gl0F-SsgY3Dy787IR_3i_f5Y7x8fXiaz5ax5qn0ccEzSDPdVJIjFFnKgQkJWtS5Bi0rSqu65HWGeQWAAqjIsWlKWnOJwGpZ8gm5Pvzd2T5UO6_afm-7UKlYydMwhbE8pOQhpW3vnMVGaePBm77zFsxGMaoGV6pVgys1uFKUq-AqoOk_dGfNFuzPcWh6gDBM_zJolRsUaqyNRe1V3Ztj-C9GxYSo
CitedBy_id crossref_primary_10_1016_j_vlsi_2024_102310
crossref_primary_10_14778_3447689_3447695
crossref_primary_10_3233_THC_218012
crossref_primary_10_1093_bib_bbab070
crossref_primary_10_3390_info11050248
crossref_primary_10_1016_j_eswa_2018_09_012
crossref_primary_10_7717_peerj_cs_957
crossref_primary_10_1002_cpe_7292
crossref_primary_10_1109_ACCESS_2020_2969038
crossref_primary_10_1007_s11227_022_04673_3
crossref_primary_10_2139_ssrn_4170153
Cites_doi 10.1145/359842.359859
10.1145/79173.79184
10.1145/135239.135243
10.1147/rd.312.0249
10.2174/157489361004150922151409
10.1016/j.ipl.2007.01.002
10.1016/j.jda.2003.12.002
10.1142/S0219720016300021
10.3844/jcssp.2011.644.650
10.1080/00207160212111
10.1016/j.ipl.2009.11.010
10.1145/135239.135244
10.1016/j.ygeno.2014.08.007
10.1002/(SICI)1097-024X(199707)27:7<851::AID-SPE108>3.0.CO;2-D
10.1145/248625.248639
10.3906/elk-1304-165
10.1016/S0020-0190(03)00296-5
10.1145/2431211.2431212
10.1145/351827.384246
10.1137/0206024
ContentType Journal Article
Copyright 2017 Elsevier Ltd
Copyright Elsevier BV Sep 1, 2017
Copyright_xml – notice: 2017 Elsevier Ltd
– notice: Copyright Elsevier BV Sep 1, 2017
DBID AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1016/j.eswa.2017.03.026
DatabaseName CrossRef
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList Computer and Information Systems Abstracts

DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1873-6793
EndPage 170
ExternalDocumentID 10_1016_j_eswa_2017_03_026
S0957417417301811
GroupedDBID --K
--M
.DC
.~1
0R~
13V
1B1
1RT
1~.
1~5
4.4
457
4G.
5GY
5VS
7-5
71M
8P~
9JN
9JO
AAAKF
AABNK
AACTN
AAEDT
AAEDW
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AARIN
AAXUO
AAYFN
ABBOA
ABFNM
ABMAC
ABMVD
ABUCO
ABYKQ
ACDAQ
ACGFS
ACHRH
ACNTT
ACRLP
ACZNC
ADBBV
ADEZE
ADTZH
AEBSH
AECPX
AEKER
AENEX
AFKWA
AFTJW
AGHFR
AGJBL
AGUBO
AGUMN
AGYEJ
AHHHB
AHJVU
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJBFU
AJOXV
ALEQD
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
APLSM
AXJTR
BJAXD
BKOJK
BLXMC
BNSAS
CS3
DU5
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
F5P
FDB
FIRID
FNPLU
FYGXN
G-Q
GBLVA
GBOLZ
HAMUX
IHE
J1W
JJJVA
KOM
LG9
LY1
LY7
M41
MO0
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
PQQKQ
Q38
RIG
ROL
RPZ
SDF
SDG
SDP
SDS
SES
SPC
SPCBC
SSB
SSD
SSL
SST
SSV
SSZ
T5K
TN5
~G-
29G
AAAKG
AAQXK
AATTM
AAXKI
AAYWO
AAYXX
ABJNI
ABKBG
ABWVN
ABXDB
ACLOT
ACNNM
ACRPL
ACVFH
ADCNI
ADJOM
ADMUD
ADNMO
AEIPS
AEUPX
AFJKZ
AFPUW
AGQPQ
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
ASPBG
AVWKF
AZFZN
CITATION
EFKBS
FEDTE
FGOYB
G-2
HLZ
HVGLF
HZ~
R2-
SBC
SET
SEW
WUQ
XPP
ZMT
~HD
7SC
8FD
AFXIZ
AGCQF
AGRNS
JQ2
L7M
L~C
L~D
SSH
ID FETCH-LOGICAL-c328t-634a24cfb83ea6423a178ac7d5cac8b00bd93d4e5baae7a075eff90d38ea1d893
IEDL.DBID .~1
ISSN 0957-4174
IngestDate Mon Jul 14 08:44:37 EDT 2025
Wed Oct 01 03:51:50 EDT 2025
Thu Apr 24 23:03:49 EDT 2025
Fri Feb 23 02:29:04 EST 2024
IsPeerReviewed true
IsScholarly true
Keywords Strings
Algorithm
Multi-threading
DNA;Binary
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c328t-634a24cfb83ea6423a178ac7d5cac8b00bd93d4e5baae7a075eff90d38ea1d893
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
PQID 1932170115
PQPubID 2045477
PageCount 9
ParticipantIDs proquest_journals_1932170115
crossref_citationtrail_10_1016_j_eswa_2017_03_026
crossref_primary_10_1016_j_eswa_2017_03_026
elsevier_sciencedirect_doi_10_1016_j_eswa_2017_03_026
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2017-09-01
2017-09-00
20170901
PublicationDateYYYYMMDD 2017-09-01
PublicationDate_xml – month: 09
  year: 2017
  text: 2017-09-01
  day: 01
PublicationDecade 2010
PublicationPlace New York
PublicationPlace_xml – name: New York
PublicationTitle Expert systems with applications
PublicationYear 2017
Publisher Elsevier Ltd
Elsevier BV
Publisher_xml – name: Elsevier Ltd
– name: Elsevier BV
References Chen, Lu, Ram (bib0009) 2004
Navarro, Raffinot (bib0033) 2004; 2
Fredriksson (bib0016) 2003; 87
Ďurian, Holub, Peltola, Tarhio (bib0010) 2010; 110
Baeza-Yates, Gonnet (bib0005) 1992; 35
Shibata, Matsumoto, Takeda, Shinohara, Arikawa (bib0041) 2000
Zubair, Wahab, Hussain, Zaffar (bib0048) 2010
Klaib, Osborne (bib0022) 2009
Sardaraz, Tahir, Ikram, Bajwa (bib0039) 2014; 104
Boyer, Moore (bib0006) 1977; 20
Peltola, Tarhio (bib0037) 2003
Manber (bib0028) 1997; 15
Faro, S., & Lecroq, T. (2010). The exact string matching problem: A comprehensive experimental evaluation.
Bhukya, Somayajulu (bib0042) 2011; 4
.
Tahir, Sardaraz, Aziz Ikram, Bajwa (bib0044) 2015; 10
Özcan, Ünsal (bib0035) 2015; 23
Kofahi, Abusalama (bib0024) 2012; 9
Navarro, Raffinot (bib0032) 2002
Tarhio, Peltola (bib0045) 1997; 27
Fredriksson, Grabowski (bib0017) 2005
Lecroq (bib0026) 2007; 102
Faro, Lecroq (bib0014) 2013; 45
Michailidis, Margaritis (bib0029) 2002; 79
Wu, Manber (bib0047) 1992; 35
Wu, Manber (bib0046) 1992
Shibata, Kida, Fukamachi, Takeda, Shinohara, Shinohara (bib0040) 2000
Allauzen, Crochemore, Raffinot (bib0001) 2001
Pehlivan, Orhan (bib0036) 2011; 19
Charras, Lecrog, Pehoushek (bib0008) 1998
Navarro (bib0030) 2001; 31
Almazroi (bib0002) 2011; 7
Fan, Yao, Ma (bib0012) 2009
Cao (bib0011) 2004
Külekci (bib0025) 2009
Bucak, Uslan (bib0007) 2011; 19
Sardaraz, Tahir, Ikram (bib0038) 2016; 14
Knuth, Morris, James, Pratt (bib0023) 1977; 6
Li, Lu, Lu (bib0027) 2008
Amir, Benson (bib0004) 1994; Vol. 5
Kim, Kim, Park (bib0021) 2007
Navarro, Raffinot (bib0031) 2000; 5
Navarro, Tarhio (bib0034) 2005; 35
Hassan (bib0018) 2005
Sunday (bib0043) 1990; 33
Horspool (bib0019) 1980; 10
Franek, Jennings, Smyth (bib0015) 2005
Karp, Rabin (bib0020) 1987; 31
Amir, Benson (bib0003) 1992
Hassan (10.1016/j.eswa.2017.03.026_bib0018) 2005
Wu (10.1016/j.eswa.2017.03.026_bib0046) 1992
Navarro (10.1016/j.eswa.2017.03.026_bib0032) 2002
Li (10.1016/j.eswa.2017.03.026_bib0027) 2008
Allauzen (10.1016/j.eswa.2017.03.026_bib0001) 2001
Ďurian (10.1016/j.eswa.2017.03.026_bib0010) 2010; 110
Horspool (10.1016/j.eswa.2017.03.026_bib0019) 1980; 10
Knuth (10.1016/j.eswa.2017.03.026_bib0023) 1977; 6
Shibata (10.1016/j.eswa.2017.03.026_bib0040) 2000
Külekci (10.1016/j.eswa.2017.03.026_bib0025) 2009
Kim (10.1016/j.eswa.2017.03.026_bib0021) 2007
Kofahi (10.1016/j.eswa.2017.03.026_bib0024) 2012; 9
Fredriksson (10.1016/j.eswa.2017.03.026_bib0016) 2003; 87
Navarro (10.1016/j.eswa.2017.03.026_bib0034) 2005; 35
Pehlivan (10.1016/j.eswa.2017.03.026_bib0036) 2011; 19
Tarhio (10.1016/j.eswa.2017.03.026_bib0045) 1997; 27
Bucak (10.1016/j.eswa.2017.03.026_bib0007) 2011; 19
Cao (10.1016/j.eswa.2017.03.026_bib0011) 2004
Navarro (10.1016/j.eswa.2017.03.026_bib0031) 2000; 5
Sardaraz (10.1016/j.eswa.2017.03.026_bib0039) 2014; 104
Sunday (10.1016/j.eswa.2017.03.026_bib0043) 1990; 33
Sardaraz (10.1016/j.eswa.2017.03.026_bib0038) 2016; 14
Bhukya (10.1016/j.eswa.2017.03.026_bib0042) 2011; 4
Chen (10.1016/j.eswa.2017.03.026_bib0009) 2004
Baeza-Yates (10.1016/j.eswa.2017.03.026_bib0005) 1992; 35
Charras (10.1016/j.eswa.2017.03.026_bib0008) 1998
Amir (10.1016/j.eswa.2017.03.026_bib0003) 1992
Klaib (10.1016/j.eswa.2017.03.026_bib0022) 2009
Amir (10.1016/j.eswa.2017.03.026_bib0004) 1994; Vol. 5
Franek (10.1016/j.eswa.2017.03.026_bib0015) 2005
Faro (10.1016/j.eswa.2017.03.026_bib0014) 2013; 45
Manber (10.1016/j.eswa.2017.03.026_bib0028) 1997; 15
Wu (10.1016/j.eswa.2017.03.026_bib0047) 1992; 35
Peltola (10.1016/j.eswa.2017.03.026_bib0037) 2003
Almazroi (10.1016/j.eswa.2017.03.026_bib0002) 2011; 7
Özcan (10.1016/j.eswa.2017.03.026_bib0035) 2015; 23
Tahir (10.1016/j.eswa.2017.03.026_bib0044) 2015; 10
10.1016/j.eswa.2017.03.026_bib0013
Fredriksson (10.1016/j.eswa.2017.03.026_bib0017) 2005
Shibata (10.1016/j.eswa.2017.03.026_bib0041) 2000
Navarro (10.1016/j.eswa.2017.03.026_bib0030) 2001; 31
Lecroq (10.1016/j.eswa.2017.03.026_bib0026) 2007; 102
Fan (10.1016/j.eswa.2017.03.026_bib0012) 2009
Navarro (10.1016/j.eswa.2017.03.026_bib0033) 2004; 2
Boyer (10.1016/j.eswa.2017.03.026_bib0006) 1977; 20
Michailidis (10.1016/j.eswa.2017.03.026_bib0029) 2002; 79
Karp (10.1016/j.eswa.2017.03.026_bib0020) 1987; 31
Zubair (10.1016/j.eswa.2017.03.026_bib0048) 2010
References_xml – year: 2004
  ident: bib0011
  article-title: Fast string matching algorithm and its application in DNA sequence search
– reference: Faro, S., & Lecroq, T. (2010). The exact string matching problem: A comprehensive experimental evaluation.
– volume: 4
  start-page: 431
  year: 2011
  end-page: 441
  ident: bib0042
  article-title: Index based multiple pattern matching algorithm using DNA sequence and pattern count
  publication-title: International Journal of Information Technology
– volume: 79
  start-page: 867
  year: 2002
  end-page: 888
  ident: bib0029
  article-title: On-line approximate string searching algorithms: Survey and experimental results
  publication-title: International Journal of Computer Mathematics
– volume: 102
  start-page: 229
  year: 2007
  end-page: 235
  ident: bib0026
  article-title: Fast exact string matching algorithms
  publication-title: Information Processing Letters
– volume: 7
  start-page: 644
  year: 2011
  ident: bib0002
  article-title: A fast hybrid algorithm approach for the exact string matching problem via berry ravindran and alpha skip search algorithms
  publication-title: Journal of Computer Science
– volume: 31
  start-page: 249
  year: 1987
  end-page: 260
  ident: bib0020
  article-title: Efficient randomized pattern-matching algorithms
  publication-title: IBM Journal of Research and Development
– volume: 35
  start-page: 74
  year: 1992
  end-page: 82
  ident: bib0005
  article-title: A new approach to text searching
  publication-title: Communications of the ACM
– start-page: 288
  year: 2005
  end-page: 297
  ident: bib0015
  article-title: A simple fast hybrid pattern-matching algorithm
  publication-title: Combinatorial pattern matching
– volume: 10
  start-page: 501
  year: 1980
  end-page: 506
  ident: bib0019
  article-title: Practical fast searching in strings
  publication-title: Software: Practice and Experience
– start-page: 118
  year: 2009
  end-page: 128
  ident: bib0025
  article-title: Filter Based fast matching of long patterns by using SIMD instructions
  publication-title: Stringology
– start-page: 257
  year: 2008
  end-page: 266
  ident: bib0027
  article-title: Efficient merging and filtering algorithms for approximate string searches
  publication-title: Data engineering, 2008. ICDE 2008. IEEE 24th international conference on
– volume: 15
  start-page: 124
  year: 1997
  end-page: 136
  ident: bib0028
  article-title: A text compression scheme that allows fast searching directly in the compressed file
  publication-title: ACM Transactions on Information Systems (TOIS)
– start-page: 80
  year: 2003
  end-page: 93
  ident: bib0037
  article-title: Alternative algorithms for bit-parallel string matching
  publication-title: String processing and information retrieval
– start-page: 376
  year: 2005
  end-page: 387
  ident: bib0017
  article-title: Practical and optimal string matching
  publication-title: String processing and information retrieval
– volume: 14
  year: 2016
  ident: bib0038
  article-title: Advances in high throughput DNA sequence data compression
  publication-title: Journal of Bioinformatics and Computational Biology
– volume: 6
  start-page: 323
  year: 1977
  end-page: 350
  ident: bib0023
  article-title: Fast pattern matching in strings
  publication-title: SIAM Journal on Computing
– start-page: 11
  year: 2005
  end-page: 16
  ident: bib0018
  article-title: Mixed heuristic algorithm for intelligent string matching for information retrieval
  publication-title: Computational intelligence and multimedia applications, 2005. Sixth international conference on
– start-page: 51
  year: 2001
  end-page: 72
  ident: bib0001
  article-title: Efficient experimental string matching by weak factor recognition*
  publication-title: Combinatorial pattern matching
– start-page: 75
  year: 2009
  end-page: 80
  ident: bib0022
  article-title: A new string matching algorithm for searching biological sequences
  publication-title: International conference on information and communication systems, ICICS
– start-page: 153
  year: 1992
  end-page: 162
  ident: bib0046
  article-title: Agrep–a fast approximate pattern-matching tool
  publication-title: Usenix
– volume: 31
  start-page: 1265
  year: 2001
  end-page: 1312
  ident: bib0030
  article-title: NR‐grep: A fast and flexible pattern‐matching tool
  publication-title: Software: Practice and Experience
– volume: 5
  start-page: 4
  year: 2000
  ident: bib0031
  article-title: Fast and flexible string matching by combining bit-parallelism and suffix automata
  publication-title: Journal of Experimental Algorithmics (JEA)
– start-page: 56
  year: 2009
  end-page: 59
  ident: bib0012
  article-title: Fast variants of the backward-oracle-marching algorithm
  publication-title: Internet computing for science and engineering (ICICSE), 2009 fourth international conference on
– volume: 19
  start-page: 59
  year: 2011
  end-page: 71
  ident: bib0036
  article-title: Automatic knowledge extraction for filling in biography forms from Turkish texts
  publication-title: Turkish Journal of Electrical Engineering & Computer Sciences
– volume: 45
  start-page: 13
  year: 2013
  ident: bib0014
  article-title: The exact online string matching problem: A review of the most recent results
  publication-title: ACM Computing Surveys (CSUR)
– volume: 20
  start-page: 762
  year: 1977
  end-page: 772
  ident: bib0006
  article-title: A fast string searching algorithm
  publication-title: Communications of the ACM
– start-page: 55
  year: 1998
  end-page: 64
  ident: bib0008
  article-title: A very fast string matching algorithm for small alphabets and long patterns
  publication-title: Combinatorial pattern matching
– volume: 104
  start-page: 225
  year: 2014
  end-page: 228
  ident: bib0039
  article-title: SeqCompress: An algorithm for biological sequence compression
  publication-title: Genomics
– volume: 19
  start-page: 157
  year: 2011
  end-page: 173
  ident: bib0007
  article-title: Sequence alignment from the perspective of stochastic optimization: A survey
  publication-title: Turkish Journal of Electrical Engineering & Computer Sciences
– volume: 35
  start-page: 1107
  year: 2005
  end-page: 1130
  ident: bib0034
  article-title: LZgrep: A Boyer–Moore string matching tool for Ziv–Lempel compressed text
  publication-title: Software: Practice and Experience
– start-page: 181
  year: 2000
  end-page: 194
  ident: bib0041
  article-title: A Boyer–Moore type algorithm for compressed pattern matching
  publication-title: Combinatorial pattern matching
– year: 2002
  ident: bib0032
  article-title: Flexible pattern matching in strings: Practical on-line search algorithms for texts and biological sequences
– volume: 2
  start-page: 347
  year: 2004
  end-page: 371
  ident: bib0033
  article-title: Practical and flexible pattern matching over Ziv–Lempel compressed text
  publication-title: Journal of Discrete Algorithms
– volume: 110
  start-page: 148
  year: 2010
  end-page: 152
  ident: bib0010
  article-title: Improving practical exact string matching
  publication-title: Information Processing Letters
– volume: 23
  start-page: 1405
  year: 2015
  end-page: 1417
  ident: bib0035
  article-title: Fast bitwise pattern-matching algorithm for DNA sequences on modern hardware
  publication-title: Turkish Journal of Electrical Engineering & Computer Sciences
– volume: 27
  start-page: 851
  year: 1997
  end-page: 861
  ident: bib0045
  article-title: String matching in the DNA alphabet
  publication-title: Software-Practice and Experience
– start-page: 62
  year: 2004
  end-page: 68
  ident: bib0009
  article-title: Compressed pattern matching in DNA sequences
  publication-title: Computational systems bioinformatics conference, 2004. CSB 2004. Proceedings. 2004 IEEE
– volume: 87
  start-page: 201
  year: 2003
  end-page: 204
  ident: bib0016
  article-title: Shift-or string matching with super-alphabets
  publication-title: Information Processing Letters
– volume: 10
  start-page: 469
  year: 2015
  end-page: 475
  ident: bib0044
  article-title: HaShRECA: Hadoop based short read error correction algorithm for genome assembly
  publication-title: Current Bioinformatics
– volume: 35
  start-page: 83
  year: 1992
  end-page: 91
  ident: bib0047
  article-title: Fast text searching: Allowing errors
  publication-title: Communications of the ACM
– start-page: 279
  year: 1992
  end-page: 288
  ident: bib0003
  article-title: Efficient two-dimensional compressed matching
  publication-title: Data compression conference, 1992. DCC'92
– volume: 33
  start-page: 132
  year: 1990
  end-page: 142
  ident: bib0043
  article-title: A very fast substring search algorithm
  publication-title: Communications of the ACM
– start-page: 306
  year: 2000
  end-page: 315
  ident: bib0040
  article-title: Speeding up pattern matching by text compression
  publication-title: Algorithms and complexity
– reference: .
– start-page: 271
  year: 2007
  end-page: 281
  ident: bib0021
  article-title: Fast matching method for DNA sequences
  publication-title: Combinatorics, algorithms, probabilistic and experimental methodologies
– volume: Vol. 5
  start-page: 705
  year: 1994
  end-page: 714
  ident: bib0004
  article-title: Let sleeping files lie: Pattern matching in Z-compressed files
  publication-title: 5th Annual ACM-SIAM symposium on discrete algorithms
– volume: 9
  start-page: 30
  year: 2012
  end-page: 38
  ident: bib0024
  article-title: A framework for distributed pattern matching based on multi-threading
  publication-title: The International Arab Journal of Information Technology
– start-page: 1
  year: 2010
  end-page: 5
  ident: bib0048
  article-title: Improved text scanning approach for exact String matching
  publication-title: Information and emerging technologies (ICIET), 2010 international conference on
– volume: 20
  start-page: 762
  year: 1977
  ident: 10.1016/j.eswa.2017.03.026_bib0006
  article-title: A fast string searching algorithm
  publication-title: Communications of the ACM
  doi: 10.1145/359842.359859
– ident: 10.1016/j.eswa.2017.03.026_bib0013
– start-page: 75
  year: 2009
  ident: 10.1016/j.eswa.2017.03.026_bib0022
  article-title: A new string matching algorithm for searching biological sequences
– volume: 33
  start-page: 132
  year: 1990
  ident: 10.1016/j.eswa.2017.03.026_bib0043
  article-title: A very fast substring search algorithm
  publication-title: Communications of the ACM
  doi: 10.1145/79173.79184
– start-page: 1
  year: 2010
  ident: 10.1016/j.eswa.2017.03.026_bib0048
  article-title: Improved text scanning approach for exact String matching
– volume: 35
  start-page: 74
  year: 1992
  ident: 10.1016/j.eswa.2017.03.026_bib0005
  article-title: A new approach to text searching
  publication-title: Communications of the ACM
  doi: 10.1145/135239.135243
– start-page: 55
  year: 1998
  ident: 10.1016/j.eswa.2017.03.026_bib0008
  article-title: A very fast string matching algorithm for small alphabets and long patterns
– volume: 31
  start-page: 249
  year: 1987
  ident: 10.1016/j.eswa.2017.03.026_bib0020
  article-title: Efficient randomized pattern-matching algorithms
  publication-title: IBM Journal of Research and Development
  doi: 10.1147/rd.312.0249
– start-page: 56
  year: 2009
  ident: 10.1016/j.eswa.2017.03.026_bib0012
  article-title: Fast variants of the backward-oracle-marching algorithm
– volume: 10
  start-page: 469
  year: 2015
  ident: 10.1016/j.eswa.2017.03.026_bib0044
  article-title: HaShRECA: Hadoop based short read error correction algorithm for genome assembly
  publication-title: Current Bioinformatics
  doi: 10.2174/157489361004150922151409
– volume: 4
  start-page: 431
  year: 2011
  ident: 10.1016/j.eswa.2017.03.026_bib0042
  article-title: Index based multiple pattern matching algorithm using DNA sequence and pattern count
  publication-title: International Journal of Information Technology
– volume: 102
  start-page: 229
  year: 2007
  ident: 10.1016/j.eswa.2017.03.026_bib0026
  article-title: Fast exact string matching algorithms
  publication-title: Information Processing Letters
  doi: 10.1016/j.ipl.2007.01.002
– volume: 35
  start-page: 1107
  year: 2005
  ident: 10.1016/j.eswa.2017.03.026_bib0034
  article-title: LZgrep: A Boyer–Moore string matching tool for Ziv–Lempel compressed text
  publication-title: Software: Practice and Experience
– start-page: 257
  year: 2008
  ident: 10.1016/j.eswa.2017.03.026_bib0027
  article-title: Efficient merging and filtering algorithms for approximate string searches
– start-page: 271
  year: 2007
  ident: 10.1016/j.eswa.2017.03.026_bib0021
  article-title: Fast matching method for DNA sequences
– volume: 2
  start-page: 347
  year: 2004
  ident: 10.1016/j.eswa.2017.03.026_bib0033
  article-title: Practical and flexible pattern matching over Ziv–Lempel compressed text
  publication-title: Journal of Discrete Algorithms
  doi: 10.1016/j.jda.2003.12.002
– volume: 10
  start-page: 501
  year: 1980
  ident: 10.1016/j.eswa.2017.03.026_bib0019
  article-title: Practical fast searching in strings
  publication-title: Software: Practice and Experience
– start-page: 11
  year: 2005
  ident: 10.1016/j.eswa.2017.03.026_bib0018
  article-title: Mixed heuristic algorithm for intelligent string matching for information retrieval
– volume: 14
  year: 2016
  ident: 10.1016/j.eswa.2017.03.026_bib0038
  article-title: Advances in high throughput DNA sequence data compression
  publication-title: Journal of Bioinformatics and Computational Biology
  doi: 10.1142/S0219720016300021
– start-page: 62
  year: 2004
  ident: 10.1016/j.eswa.2017.03.026_bib0009
  article-title: Compressed pattern matching in DNA sequences
– start-page: 306
  year: 2000
  ident: 10.1016/j.eswa.2017.03.026_bib0040
  article-title: Speeding up pattern matching by text compression
– start-page: 51
  year: 2001
  ident: 10.1016/j.eswa.2017.03.026_bib0001
  article-title: Efficient experimental string matching by weak factor recognition*
– start-page: 279
  year: 1992
  ident: 10.1016/j.eswa.2017.03.026_bib0003
  article-title: Efficient two-dimensional compressed matching
– year: 2002
  ident: 10.1016/j.eswa.2017.03.026_bib0032
– volume: 19
  start-page: 157
  year: 2011
  ident: 10.1016/j.eswa.2017.03.026_bib0007
  article-title: Sequence alignment from the perspective of stochastic optimization: A survey
  publication-title: Turkish Journal of Electrical Engineering & Computer Sciences
– start-page: 376
  year: 2005
  ident: 10.1016/j.eswa.2017.03.026_bib0017
  article-title: Practical and optimal string matching
– start-page: 118
  year: 2009
  ident: 10.1016/j.eswa.2017.03.026_bib0025
  article-title: Filter Based fast matching of long patterns by using SIMD instructions
– volume: 7
  start-page: 644
  year: 2011
  ident: 10.1016/j.eswa.2017.03.026_bib0002
  article-title: A fast hybrid algorithm approach for the exact string matching problem via berry ravindran and alpha skip search algorithms
  publication-title: Journal of Computer Science
  doi: 10.3844/jcssp.2011.644.650
– volume: Vol. 5
  start-page: 705
  year: 1994
  ident: 10.1016/j.eswa.2017.03.026_bib0004
  article-title: Let sleeping files lie: Pattern matching in Z-compressed files
– volume: 79
  start-page: 867
  year: 2002
  ident: 10.1016/j.eswa.2017.03.026_bib0029
  article-title: On-line approximate string searching algorithms: Survey and experimental results
  publication-title: International Journal of Computer Mathematics
  doi: 10.1080/00207160212111
– volume: 110
  start-page: 148
  year: 2010
  ident: 10.1016/j.eswa.2017.03.026_bib0010
  article-title: Improving practical exact string matching
  publication-title: Information Processing Letters
  doi: 10.1016/j.ipl.2009.11.010
– volume: 19
  start-page: 59
  year: 2011
  ident: 10.1016/j.eswa.2017.03.026_bib0036
  article-title: Automatic knowledge extraction for filling in biography forms from Turkish texts
  publication-title: Turkish Journal of Electrical Engineering & Computer Sciences
– volume: 35
  start-page: 83
  year: 1992
  ident: 10.1016/j.eswa.2017.03.026_bib0047
  article-title: Fast text searching: Allowing errors
  publication-title: Communications of the ACM
  doi: 10.1145/135239.135244
– start-page: 181
  year: 2000
  ident: 10.1016/j.eswa.2017.03.026_bib0041
  article-title: A Boyer–Moore type algorithm for compressed pattern matching
– volume: 104
  start-page: 225
  year: 2014
  ident: 10.1016/j.eswa.2017.03.026_bib0039
  article-title: SeqCompress: An algorithm for biological sequence compression
  publication-title: Genomics
  doi: 10.1016/j.ygeno.2014.08.007
– volume: 27
  start-page: 851
  year: 1997
  ident: 10.1016/j.eswa.2017.03.026_bib0045
  article-title: String matching in the DNA alphabet
  publication-title: Software-Practice and Experience
  doi: 10.1002/(SICI)1097-024X(199707)27:7<851::AID-SPE108>3.0.CO;2-D
– volume: 15
  start-page: 124
  year: 1997
  ident: 10.1016/j.eswa.2017.03.026_bib0028
  article-title: A text compression scheme that allows fast searching directly in the compressed file
  publication-title: ACM Transactions on Information Systems (TOIS)
  doi: 10.1145/248625.248639
– volume: 31
  start-page: 1265
  year: 2001
  ident: 10.1016/j.eswa.2017.03.026_bib0030
  article-title: NR‐grep: A fast and flexible pattern‐matching tool
  publication-title: Software: Practice and Experience
– start-page: 153
  year: 1992
  ident: 10.1016/j.eswa.2017.03.026_bib0046
  article-title: Agrep–a fast approximate pattern-matching tool
– volume: 23
  start-page: 1405
  year: 2015
  ident: 10.1016/j.eswa.2017.03.026_bib0035
  article-title: Fast bitwise pattern-matching algorithm for DNA sequences on modern hardware
  publication-title: Turkish Journal of Electrical Engineering & Computer Sciences
  doi: 10.3906/elk-1304-165
– year: 2004
  ident: 10.1016/j.eswa.2017.03.026_bib0011
– volume: 9
  start-page: 30
  year: 2012
  ident: 10.1016/j.eswa.2017.03.026_bib0024
  article-title: A framework for distributed pattern matching based on multi-threading
  publication-title: The International Arab Journal of Information Technology
– volume: 87
  start-page: 201
  year: 2003
  ident: 10.1016/j.eswa.2017.03.026_bib0016
  article-title: Shift-or string matching with super-alphabets
  publication-title: Information Processing Letters
  doi: 10.1016/S0020-0190(03)00296-5
– volume: 45
  start-page: 13
  year: 2013
  ident: 10.1016/j.eswa.2017.03.026_bib0014
  article-title: The exact online string matching problem: A review of the most recent results
  publication-title: ACM Computing Surveys (CSUR)
  doi: 10.1145/2431211.2431212
– start-page: 80
  year: 2003
  ident: 10.1016/j.eswa.2017.03.026_bib0037
  article-title: Alternative algorithms for bit-parallel string matching
– start-page: 288
  year: 2005
  ident: 10.1016/j.eswa.2017.03.026_bib0015
  article-title: A simple fast hybrid pattern-matching algorithm
– volume: 5
  start-page: 4
  year: 2000
  ident: 10.1016/j.eswa.2017.03.026_bib0031
  article-title: Fast and flexible string matching by combining bit-parallelism and suffix automata
  publication-title: Journal of Experimental Algorithmics (JEA)
  doi: 10.1145/351827.384246
– volume: 6
  start-page: 323
  year: 1977
  ident: 10.1016/j.eswa.2017.03.026_bib0023
  article-title: Fast pattern matching in strings
  publication-title: SIAM Journal on Computing
  doi: 10.1137/0206024
SSID ssj0017007
Score 2.3192031
Snippet •We present a brief introduction to the applications of pattern matching.•We present a novel pattern matching algorithm for DNA sequences.•We present...
To solve, manage and analyze biological problems using computer technology is called bioinformatics. With the emergent evolution in computing era, the volume...
SourceID proquest
crossref
elsevier
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 162
SubjectTerms Algorithm
Algorithms
Bioinformatics
Biological evolution
Deoxyribonucleic acid
DNA
DNA;Binary
Error correction
Gene sequencing
Hardware
Multi-threading
Pattern matching
String matching
Strings
Title EPMA: Efficient pattern matching algorithm for DNA sequences
URI https://dx.doi.org/10.1016/j.eswa.2017.03.026
https://www.proquest.com/docview/1932170115
Volume 80
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Baden-Württemberg Complete Freedom Collection (Elsevier)
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: GBLVA
  dateStart: 20110101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: Elsevier SD Complete Freedom Collection [SCCMFC]
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: ACRLP
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals [SCFCJ]
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: AIKHN
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVESC
  databaseName: ScienceDirect (Elsevier)
  customDbUrl:
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: .~1
  dateStart: 19950101
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier
– providerCode: PRVLSH
  databaseName: Elsevier Journals
  customDbUrl:
  mediaType: online
  eissn: 1873-6793
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017007
  issn: 0957-4174
  databaseCode: AKRWK
  dateStart: 19900101
  isFulltext: true
  providerName: Library Specific Holdings
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PS8MwFA5jXrz4W5zOkYM3qWubdEnFy5gbU9kQdLBbSJtEJ3MbW8Wbf7svbTpQZAehFFKSUr40732H730PoYtWbAhPtfESbiKPJpp5EmIl3KivEsh_NLTVyINhqz-i9-NoXEGdshbGyipd7C9ieh6t3ZOmQ7O5mEyaT0AOIB3ai1jXqbyCnTLbxeDqay3zsPZzrPDbY56d7QpnCo2XXn1a76GA5Uan1mDh7-T0K0znuae3h3YcacTt4rv2UUXPDtBu2ZABu_N5iG66j4P2Ne7mthCQTfAid8-cYaCluWYSy-nLfDnJXt8xkFV8O2zjtZb6CI163edO33PtEbyUhDzzWoTKkKYm4URLwJTIgHGZMhWlMuVwnBIVE0V1lEipmQRuoI2JfUW4loECnnKMqrP5TJ8grCLOEsJVrIihyvixz2HYAm4X69iEYQ0FJS4idd7htoXFVJQisTdhsRQWS-ETAVjW0OV6zaJwztg4OyrhFj_2X0Bo37iuXu6NcKdvJSwptT7zQXT6z9eeoW07KsRkdVTNlh_6HNhHljTy36uBttp3D_3hN3eB2Gg
linkProvider Elsevier
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PS8MwGA1jHvTib3H-zMGb1LVNuqTiZejG1G0IbrBbSJtEJzrHVvHm3-6XNh0osoNQCm2TUl6T73uBl_chdNaIDeGpNl7CTeTRRDNPQqyEE_VVAvmPhnY3cq_f6Azp3SgaVdB1uRfGyipd7C9ieh6t3Z26Q7M-HY_rj0AOIB3ag1jXKVgCrdAoZHYFdvG10HlY_zlWGO4xzzZ3O2cKkZeef1rzoYDlTqfWYeHv7PQrTufJp72J1h1rxM3iw7ZQRU-20UZZkQG7CbqDrloPveYlbuW-EJBO8DS3z5xg4KW5aBLL16f32Th7fsPAVvFNv4kXYupdNGy3Btcdz9VH8FIS8sxrECpDmpqEEy0BVCIDxmXKVJTKlMN8SlRMFNVRIqVmEsiBNib2FeFaBgqIyh6qTt4neh9hFXGWEK5iRQxVxo99DpcNIHexjk0Y1lBQ4iJSZx5ua1i8ilIl9iIslsJiKXwiAMsaOl_0mRbWGUtbRyXc4scAEBDbl_Y7Kv-NcNNvLiwrtUbzQXTwz9eeotXOoNcV3dv-_SFas08KZdkRqmazD30MVCRLTvKh9g2eIdn9
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=EPMA%3A+Efficient+pattern+matching+algorithm+for+DNA+sequences&rft.jtitle=Expert+systems+with+applications&rft.au=Tahir%2C+Muhammad&rft.au=Sardaraz%2C+Muhammad&rft.au=Ikram%2C+Ataul+Aziz&rft.date=2017-09-01&rft.pub=Elsevier+Ltd&rft.issn=0957-4174&rft.eissn=1873-6793&rft.volume=80&rft.spage=162&rft.epage=170&rft_id=info:doi/10.1016%2Fj.eswa.2017.03.026&rft.externalDocID=S0957417417301811
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0957-4174&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0957-4174&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0957-4174&client=summon