EPMA: Efficient pattern matching algorithm for DNA sequences
•We present a brief introduction to the applications of pattern matching.•We present a novel pattern matching algorithm for DNA sequences.•We present multithreading in pattern matching.•We use Turing machine for pattern matching.•We present comparative results with significance improvements. To solv...
Saved in:
| Published in | Expert systems with applications Vol. 80; pp. 162 - 170 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
Elsevier Ltd
01.09.2017
Elsevier BV |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0957-4174 1873-6793 |
| DOI | 10.1016/j.eswa.2017.03.026 |
Cover
| Abstract | •We present a brief introduction to the applications of pattern matching.•We present a novel pattern matching algorithm for DNA sequences.•We present multithreading in pattern matching.•We use Turing machine for pattern matching.•We present comparative results with significance improvements.
To solve, manage and analyze biological problems using computer technology is called bioinformatics. With the emergent evolution in computing era, the volume of biological data has increased significantly. These large amounts of data have increased the need to analyze it in reasonable space and time. DNA sequences contain basic information of species, and pattern matching between different species is an important and challenging issue to cope with. There exist generalized string matching and some specialized DNA pattern matching algorithms in the literature. There is still need to develop fast and space efficient pattern matching algorithms that consider new hardware development. In this paper, we present a novel DNA sequences pattern matching algorithm called EPMA. The proposed algorithm utilizes fixed length 2-bits binary encoding, segmentation and multi-threading. The idea is to find the pattern with multiple searcher agents concurrently. The proposed algorithm is validated with comparative experimental results. The results show that the new algorithm is a good candidate for DNA sequence pattern matching applications. The algorithm effectively utilizes modern hardware and will help researchers in the sequence alignment, short read error correction, phylogenetic inference etc. Furthermore, the proposed method can be extended to generalized string matching and their applications. |
|---|---|
| AbstractList | To solve, manage and analyze biological problems using computer technology is called bioinformatics. With the emergent evolution in computing era, the volume of biological data has increased significantly. These large amounts of data have increased the need to analyze it in reasonable space and time. DNA sequences contain basic information of species, and pattern matching between different species is an important and challenging issue to cope with. There exist generalized string matching and some specialized DNA pattern matching algorithms in the literature. There is still need to develop fast and space efficient pattern matching algorithms that consider new hardware development. In this paper, we present a novel DNA sequences pattern matching algorithm called EPMA. The proposed algorithm utilizes fixed length 2-bits binary encoding, segmentation and multi-threading. The idea is to find the pattern with multiple searcher agents concurrently. The proposed algorithm is validated with comparative experimental results. The results show that the new algorithm is a good candidate for DNA sequence pattern matching applications. The algorithm effectively utilizes modern hardware and will help researchers in the sequence alignment, short read error correction, phylogenetic inference etc. Furthermore, the proposed method can be extended to generalized string matching and their applications. •We present a brief introduction to the applications of pattern matching.•We present a novel pattern matching algorithm for DNA sequences.•We present multithreading in pattern matching.•We use Turing machine for pattern matching.•We present comparative results with significance improvements. To solve, manage and analyze biological problems using computer technology is called bioinformatics. With the emergent evolution in computing era, the volume of biological data has increased significantly. These large amounts of data have increased the need to analyze it in reasonable space and time. DNA sequences contain basic information of species, and pattern matching between different species is an important and challenging issue to cope with. There exist generalized string matching and some specialized DNA pattern matching algorithms in the literature. There is still need to develop fast and space efficient pattern matching algorithms that consider new hardware development. In this paper, we present a novel DNA sequences pattern matching algorithm called EPMA. The proposed algorithm utilizes fixed length 2-bits binary encoding, segmentation and multi-threading. The idea is to find the pattern with multiple searcher agents concurrently. The proposed algorithm is validated with comparative experimental results. The results show that the new algorithm is a good candidate for DNA sequence pattern matching applications. The algorithm effectively utilizes modern hardware and will help researchers in the sequence alignment, short read error correction, phylogenetic inference etc. Furthermore, the proposed method can be extended to generalized string matching and their applications. |
| Author | Ikram, Ataul Aziz Tahir, Muhammad Sardaraz, Muhammad |
| Author_xml | – sequence: 1 givenname: Muhammad surname: Tahir fullname: Tahir, Muhammad email: m_tahir@ciit-attock.edu.pk organization: Department of Computer Science, COMSATS Institute of Information Technology, Attock Campus, Pakistan – sequence: 2 givenname: Muhammad surname: Sardaraz fullname: Sardaraz, Muhammad email: sardaraz@ciit-attock.edu.pk organization: Department of Computer Science, COMSATS Institute of Information Technology, Attock Campus, Pakistan – sequence: 3 givenname: Ataul Aziz surname: Ikram fullname: Ikram, Ataul Aziz email: ata.ikram@nu.edu.pk organization: Department of Electrical Engineering, National University, Islamabad, Pakistan |
| BookMark | eNp9kDtPwzAUhS0EEm3hDzBFYk6w4yR2UJeqlIdUHgPM1o1z0zpqk2K7IP49jsrE0Oku57tH5xuT067vkJArRhNGWXHTJui-IUkpEwnlCU2LEzJiUvC4ECU_JSNa5iLOmMjOydi5loYgpWJEpou359lttGgaow12PtqB92i7aAter023imCz6q3x623U9Da6e5lFDj_32Gl0F-SsgY3Dy787IR_3i_f5Y7x8fXiaz5ax5qn0ccEzSDPdVJIjFFnKgQkJWtS5Bi0rSqu65HWGeQWAAqjIsWlKWnOJwGpZ8gm5Pvzd2T5UO6_afm-7UKlYydMwhbE8pOQhpW3vnMVGaePBm77zFsxGMaoGV6pVgys1uFKUq-AqoOk_dGfNFuzPcWh6gDBM_zJolRsUaqyNRe1V3Ztj-C9GxYSo |
| CitedBy_id | crossref_primary_10_1016_j_vlsi_2024_102310 crossref_primary_10_14778_3447689_3447695 crossref_primary_10_3233_THC_218012 crossref_primary_10_1093_bib_bbab070 crossref_primary_10_3390_info11050248 crossref_primary_10_1016_j_eswa_2018_09_012 crossref_primary_10_7717_peerj_cs_957 crossref_primary_10_1002_cpe_7292 crossref_primary_10_1109_ACCESS_2020_2969038 crossref_primary_10_1007_s11227_022_04673_3 crossref_primary_10_2139_ssrn_4170153 |
| Cites_doi | 10.1145/359842.359859 10.1145/79173.79184 10.1145/135239.135243 10.1147/rd.312.0249 10.2174/157489361004150922151409 10.1016/j.ipl.2007.01.002 10.1016/j.jda.2003.12.002 10.1142/S0219720016300021 10.3844/jcssp.2011.644.650 10.1080/00207160212111 10.1016/j.ipl.2009.11.010 10.1145/135239.135244 10.1016/j.ygeno.2014.08.007 10.1002/(SICI)1097-024X(199707)27:7<851::AID-SPE108>3.0.CO;2-D 10.1145/248625.248639 10.3906/elk-1304-165 10.1016/S0020-0190(03)00296-5 10.1145/2431211.2431212 10.1145/351827.384246 10.1137/0206024 |
| ContentType | Journal Article |
| Copyright | 2017 Elsevier Ltd Copyright Elsevier BV Sep 1, 2017 |
| Copyright_xml | – notice: 2017 Elsevier Ltd – notice: Copyright Elsevier BV Sep 1, 2017 |
| DBID | AAYXX CITATION 7SC 8FD JQ2 L7M L~C L~D |
| DOI | 10.1016/j.eswa.2017.03.026 |
| DatabaseName | CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
| DatabaseTitle | CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | Computer and Information Systems Abstracts |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1873-6793 |
| EndPage | 170 |
| ExternalDocumentID | 10_1016_j_eswa_2017_03_026 S0957417417301811 |
| GroupedDBID | --K --M .DC .~1 0R~ 13V 1B1 1RT 1~. 1~5 4.4 457 4G. 5GY 5VS 7-5 71M 8P~ 9JN 9JO AAAKF AABNK AACTN AAEDT AAEDW AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AARIN AAXUO AAYFN ABBOA ABFNM ABMAC ABMVD ABUCO ABYKQ ACDAQ ACGFS ACHRH ACNTT ACRLP ACZNC ADBBV ADEZE ADTZH AEBSH AECPX AEKER AENEX AFKWA AFTJW AGHFR AGJBL AGUBO AGUMN AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV ALEQD ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD APLSM AXJTR BJAXD BKOJK BLXMC BNSAS CS3 DU5 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 F5P FDB FIRID FNPLU FYGXN G-Q GBLVA GBOLZ HAMUX IHE J1W JJJVA KOM LG9 LY1 LY7 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. PQQKQ Q38 RIG ROL RPZ SDF SDG SDP SDS SES SPC SPCBC SSB SSD SSL SST SSV SSZ T5K TN5 ~G- 29G AAAKG AAQXK AATTM AAXKI AAYWO AAYXX ABJNI ABKBG ABWVN ABXDB ACLOT ACNNM ACRPL ACVFH ADCNI ADJOM ADMUD ADNMO AEIPS AEUPX AFJKZ AFPUW AGQPQ AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP ASPBG AVWKF AZFZN CITATION EFKBS FEDTE FGOYB G-2 HLZ HVGLF HZ~ R2- SBC SET SEW WUQ XPP ZMT ~HD 7SC 8FD AFXIZ AGCQF AGRNS JQ2 L7M L~C L~D SSH |
| ID | FETCH-LOGICAL-c328t-634a24cfb83ea6423a178ac7d5cac8b00bd93d4e5baae7a075eff90d38ea1d893 |
| IEDL.DBID | .~1 |
| ISSN | 0957-4174 |
| IngestDate | Mon Jul 14 08:44:37 EDT 2025 Wed Oct 01 03:51:50 EDT 2025 Thu Apr 24 23:03:49 EDT 2025 Fri Feb 23 02:29:04 EST 2024 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Keywords | Strings Algorithm Multi-threading DNA;Binary |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c328t-634a24cfb83ea6423a178ac7d5cac8b00bd93d4e5baae7a075eff90d38ea1d893 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| PQID | 1932170115 |
| PQPubID | 2045477 |
| PageCount | 9 |
| ParticipantIDs | proquest_journals_1932170115 crossref_citationtrail_10_1016_j_eswa_2017_03_026 crossref_primary_10_1016_j_eswa_2017_03_026 elsevier_sciencedirect_doi_10_1016_j_eswa_2017_03_026 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2017-09-01 2017-09-00 20170901 |
| PublicationDateYYYYMMDD | 2017-09-01 |
| PublicationDate_xml | – month: 09 year: 2017 text: 2017-09-01 day: 01 |
| PublicationDecade | 2010 |
| PublicationPlace | New York |
| PublicationPlace_xml | – name: New York |
| PublicationTitle | Expert systems with applications |
| PublicationYear | 2017 |
| Publisher | Elsevier Ltd Elsevier BV |
| Publisher_xml | – name: Elsevier Ltd – name: Elsevier BV |
| References | Chen, Lu, Ram (bib0009) 2004 Navarro, Raffinot (bib0033) 2004; 2 Fredriksson (bib0016) 2003; 87 Ďurian, Holub, Peltola, Tarhio (bib0010) 2010; 110 Baeza-Yates, Gonnet (bib0005) 1992; 35 Shibata, Matsumoto, Takeda, Shinohara, Arikawa (bib0041) 2000 Zubair, Wahab, Hussain, Zaffar (bib0048) 2010 Klaib, Osborne (bib0022) 2009 Sardaraz, Tahir, Ikram, Bajwa (bib0039) 2014; 104 Boyer, Moore (bib0006) 1977; 20 Peltola, Tarhio (bib0037) 2003 Manber (bib0028) 1997; 15 Faro, S., & Lecroq, T. (2010). The exact string matching problem: A comprehensive experimental evaluation. Bhukya, Somayajulu (bib0042) 2011; 4 . Tahir, Sardaraz, Aziz Ikram, Bajwa (bib0044) 2015; 10 Özcan, Ünsal (bib0035) 2015; 23 Kofahi, Abusalama (bib0024) 2012; 9 Navarro, Raffinot (bib0032) 2002 Tarhio, Peltola (bib0045) 1997; 27 Fredriksson, Grabowski (bib0017) 2005 Lecroq (bib0026) 2007; 102 Faro, Lecroq (bib0014) 2013; 45 Michailidis, Margaritis (bib0029) 2002; 79 Wu, Manber (bib0047) 1992; 35 Wu, Manber (bib0046) 1992 Shibata, Kida, Fukamachi, Takeda, Shinohara, Shinohara (bib0040) 2000 Allauzen, Crochemore, Raffinot (bib0001) 2001 Pehlivan, Orhan (bib0036) 2011; 19 Charras, Lecrog, Pehoushek (bib0008) 1998 Navarro (bib0030) 2001; 31 Almazroi (bib0002) 2011; 7 Fan, Yao, Ma (bib0012) 2009 Cao (bib0011) 2004 Külekci (bib0025) 2009 Bucak, Uslan (bib0007) 2011; 19 Sardaraz, Tahir, Ikram (bib0038) 2016; 14 Knuth, Morris, James, Pratt (bib0023) 1977; 6 Li, Lu, Lu (bib0027) 2008 Amir, Benson (bib0004) 1994; Vol. 5 Kim, Kim, Park (bib0021) 2007 Navarro, Raffinot (bib0031) 2000; 5 Navarro, Tarhio (bib0034) 2005; 35 Hassan (bib0018) 2005 Sunday (bib0043) 1990; 33 Horspool (bib0019) 1980; 10 Franek, Jennings, Smyth (bib0015) 2005 Karp, Rabin (bib0020) 1987; 31 Amir, Benson (bib0003) 1992 Hassan (10.1016/j.eswa.2017.03.026_bib0018) 2005 Wu (10.1016/j.eswa.2017.03.026_bib0046) 1992 Navarro (10.1016/j.eswa.2017.03.026_bib0032) 2002 Li (10.1016/j.eswa.2017.03.026_bib0027) 2008 Allauzen (10.1016/j.eswa.2017.03.026_bib0001) 2001 Ďurian (10.1016/j.eswa.2017.03.026_bib0010) 2010; 110 Horspool (10.1016/j.eswa.2017.03.026_bib0019) 1980; 10 Knuth (10.1016/j.eswa.2017.03.026_bib0023) 1977; 6 Shibata (10.1016/j.eswa.2017.03.026_bib0040) 2000 Külekci (10.1016/j.eswa.2017.03.026_bib0025) 2009 Kim (10.1016/j.eswa.2017.03.026_bib0021) 2007 Kofahi (10.1016/j.eswa.2017.03.026_bib0024) 2012; 9 Fredriksson (10.1016/j.eswa.2017.03.026_bib0016) 2003; 87 Navarro (10.1016/j.eswa.2017.03.026_bib0034) 2005; 35 Pehlivan (10.1016/j.eswa.2017.03.026_bib0036) 2011; 19 Tarhio (10.1016/j.eswa.2017.03.026_bib0045) 1997; 27 Bucak (10.1016/j.eswa.2017.03.026_bib0007) 2011; 19 Cao (10.1016/j.eswa.2017.03.026_bib0011) 2004 Navarro (10.1016/j.eswa.2017.03.026_bib0031) 2000; 5 Sardaraz (10.1016/j.eswa.2017.03.026_bib0039) 2014; 104 Sunday (10.1016/j.eswa.2017.03.026_bib0043) 1990; 33 Sardaraz (10.1016/j.eswa.2017.03.026_bib0038) 2016; 14 Bhukya (10.1016/j.eswa.2017.03.026_bib0042) 2011; 4 Chen (10.1016/j.eswa.2017.03.026_bib0009) 2004 Baeza-Yates (10.1016/j.eswa.2017.03.026_bib0005) 1992; 35 Charras (10.1016/j.eswa.2017.03.026_bib0008) 1998 Amir (10.1016/j.eswa.2017.03.026_bib0003) 1992 Klaib (10.1016/j.eswa.2017.03.026_bib0022) 2009 Amir (10.1016/j.eswa.2017.03.026_bib0004) 1994; Vol. 5 Franek (10.1016/j.eswa.2017.03.026_bib0015) 2005 Faro (10.1016/j.eswa.2017.03.026_bib0014) 2013; 45 Manber (10.1016/j.eswa.2017.03.026_bib0028) 1997; 15 Wu (10.1016/j.eswa.2017.03.026_bib0047) 1992; 35 Peltola (10.1016/j.eswa.2017.03.026_bib0037) 2003 Almazroi (10.1016/j.eswa.2017.03.026_bib0002) 2011; 7 Özcan (10.1016/j.eswa.2017.03.026_bib0035) 2015; 23 Tahir (10.1016/j.eswa.2017.03.026_bib0044) 2015; 10 10.1016/j.eswa.2017.03.026_bib0013 Fredriksson (10.1016/j.eswa.2017.03.026_bib0017) 2005 Shibata (10.1016/j.eswa.2017.03.026_bib0041) 2000 Navarro (10.1016/j.eswa.2017.03.026_bib0030) 2001; 31 Lecroq (10.1016/j.eswa.2017.03.026_bib0026) 2007; 102 Fan (10.1016/j.eswa.2017.03.026_bib0012) 2009 Navarro (10.1016/j.eswa.2017.03.026_bib0033) 2004; 2 Boyer (10.1016/j.eswa.2017.03.026_bib0006) 1977; 20 Michailidis (10.1016/j.eswa.2017.03.026_bib0029) 2002; 79 Karp (10.1016/j.eswa.2017.03.026_bib0020) 1987; 31 Zubair (10.1016/j.eswa.2017.03.026_bib0048) 2010 |
| References_xml | – year: 2004 ident: bib0011 article-title: Fast string matching algorithm and its application in DNA sequence search – reference: Faro, S., & Lecroq, T. (2010). The exact string matching problem: A comprehensive experimental evaluation. – volume: 4 start-page: 431 year: 2011 end-page: 441 ident: bib0042 article-title: Index based multiple pattern matching algorithm using DNA sequence and pattern count publication-title: International Journal of Information Technology – volume: 79 start-page: 867 year: 2002 end-page: 888 ident: bib0029 article-title: On-line approximate string searching algorithms: Survey and experimental results publication-title: International Journal of Computer Mathematics – volume: 102 start-page: 229 year: 2007 end-page: 235 ident: bib0026 article-title: Fast exact string matching algorithms publication-title: Information Processing Letters – volume: 7 start-page: 644 year: 2011 ident: bib0002 article-title: A fast hybrid algorithm approach for the exact string matching problem via berry ravindran and alpha skip search algorithms publication-title: Journal of Computer Science – volume: 31 start-page: 249 year: 1987 end-page: 260 ident: bib0020 article-title: Efficient randomized pattern-matching algorithms publication-title: IBM Journal of Research and Development – volume: 35 start-page: 74 year: 1992 end-page: 82 ident: bib0005 article-title: A new approach to text searching publication-title: Communications of the ACM – start-page: 288 year: 2005 end-page: 297 ident: bib0015 article-title: A simple fast hybrid pattern-matching algorithm publication-title: Combinatorial pattern matching – volume: 10 start-page: 501 year: 1980 end-page: 506 ident: bib0019 article-title: Practical fast searching in strings publication-title: Software: Practice and Experience – start-page: 118 year: 2009 end-page: 128 ident: bib0025 article-title: Filter Based fast matching of long patterns by using SIMD instructions publication-title: Stringology – start-page: 257 year: 2008 end-page: 266 ident: bib0027 article-title: Efficient merging and filtering algorithms for approximate string searches publication-title: Data engineering, 2008. ICDE 2008. IEEE 24th international conference on – volume: 15 start-page: 124 year: 1997 end-page: 136 ident: bib0028 article-title: A text compression scheme that allows fast searching directly in the compressed file publication-title: ACM Transactions on Information Systems (TOIS) – start-page: 80 year: 2003 end-page: 93 ident: bib0037 article-title: Alternative algorithms for bit-parallel string matching publication-title: String processing and information retrieval – start-page: 376 year: 2005 end-page: 387 ident: bib0017 article-title: Practical and optimal string matching publication-title: String processing and information retrieval – volume: 14 year: 2016 ident: bib0038 article-title: Advances in high throughput DNA sequence data compression publication-title: Journal of Bioinformatics and Computational Biology – volume: 6 start-page: 323 year: 1977 end-page: 350 ident: bib0023 article-title: Fast pattern matching in strings publication-title: SIAM Journal on Computing – start-page: 11 year: 2005 end-page: 16 ident: bib0018 article-title: Mixed heuristic algorithm for intelligent string matching for information retrieval publication-title: Computational intelligence and multimedia applications, 2005. Sixth international conference on – start-page: 51 year: 2001 end-page: 72 ident: bib0001 article-title: Efficient experimental string matching by weak factor recognition* publication-title: Combinatorial pattern matching – start-page: 75 year: 2009 end-page: 80 ident: bib0022 article-title: A new string matching algorithm for searching biological sequences publication-title: International conference on information and communication systems, ICICS – start-page: 153 year: 1992 end-page: 162 ident: bib0046 article-title: Agrep–a fast approximate pattern-matching tool publication-title: Usenix – volume: 31 start-page: 1265 year: 2001 end-page: 1312 ident: bib0030 article-title: NR‐grep: A fast and flexible pattern‐matching tool publication-title: Software: Practice and Experience – volume: 5 start-page: 4 year: 2000 ident: bib0031 article-title: Fast and flexible string matching by combining bit-parallelism and suffix automata publication-title: Journal of Experimental Algorithmics (JEA) – start-page: 56 year: 2009 end-page: 59 ident: bib0012 article-title: Fast variants of the backward-oracle-marching algorithm publication-title: Internet computing for science and engineering (ICICSE), 2009 fourth international conference on – volume: 19 start-page: 59 year: 2011 end-page: 71 ident: bib0036 article-title: Automatic knowledge extraction for filling in biography forms from Turkish texts publication-title: Turkish Journal of Electrical Engineering & Computer Sciences – volume: 45 start-page: 13 year: 2013 ident: bib0014 article-title: The exact online string matching problem: A review of the most recent results publication-title: ACM Computing Surveys (CSUR) – volume: 20 start-page: 762 year: 1977 end-page: 772 ident: bib0006 article-title: A fast string searching algorithm publication-title: Communications of the ACM – start-page: 55 year: 1998 end-page: 64 ident: bib0008 article-title: A very fast string matching algorithm for small alphabets and long patterns publication-title: Combinatorial pattern matching – volume: 104 start-page: 225 year: 2014 end-page: 228 ident: bib0039 article-title: SeqCompress: An algorithm for biological sequence compression publication-title: Genomics – volume: 19 start-page: 157 year: 2011 end-page: 173 ident: bib0007 article-title: Sequence alignment from the perspective of stochastic optimization: A survey publication-title: Turkish Journal of Electrical Engineering & Computer Sciences – volume: 35 start-page: 1107 year: 2005 end-page: 1130 ident: bib0034 article-title: LZgrep: A Boyer–Moore string matching tool for Ziv–Lempel compressed text publication-title: Software: Practice and Experience – start-page: 181 year: 2000 end-page: 194 ident: bib0041 article-title: A Boyer–Moore type algorithm for compressed pattern matching publication-title: Combinatorial pattern matching – year: 2002 ident: bib0032 article-title: Flexible pattern matching in strings: Practical on-line search algorithms for texts and biological sequences – volume: 2 start-page: 347 year: 2004 end-page: 371 ident: bib0033 article-title: Practical and flexible pattern matching over Ziv–Lempel compressed text publication-title: Journal of Discrete Algorithms – volume: 110 start-page: 148 year: 2010 end-page: 152 ident: bib0010 article-title: Improving practical exact string matching publication-title: Information Processing Letters – volume: 23 start-page: 1405 year: 2015 end-page: 1417 ident: bib0035 article-title: Fast bitwise pattern-matching algorithm for DNA sequences on modern hardware publication-title: Turkish Journal of Electrical Engineering & Computer Sciences – volume: 27 start-page: 851 year: 1997 end-page: 861 ident: bib0045 article-title: String matching in the DNA alphabet publication-title: Software-Practice and Experience – start-page: 62 year: 2004 end-page: 68 ident: bib0009 article-title: Compressed pattern matching in DNA sequences publication-title: Computational systems bioinformatics conference, 2004. CSB 2004. Proceedings. 2004 IEEE – volume: 87 start-page: 201 year: 2003 end-page: 204 ident: bib0016 article-title: Shift-or string matching with super-alphabets publication-title: Information Processing Letters – volume: 10 start-page: 469 year: 2015 end-page: 475 ident: bib0044 article-title: HaShRECA: Hadoop based short read error correction algorithm for genome assembly publication-title: Current Bioinformatics – volume: 35 start-page: 83 year: 1992 end-page: 91 ident: bib0047 article-title: Fast text searching: Allowing errors publication-title: Communications of the ACM – start-page: 279 year: 1992 end-page: 288 ident: bib0003 article-title: Efficient two-dimensional compressed matching publication-title: Data compression conference, 1992. DCC'92 – volume: 33 start-page: 132 year: 1990 end-page: 142 ident: bib0043 article-title: A very fast substring search algorithm publication-title: Communications of the ACM – start-page: 306 year: 2000 end-page: 315 ident: bib0040 article-title: Speeding up pattern matching by text compression publication-title: Algorithms and complexity – reference: . – start-page: 271 year: 2007 end-page: 281 ident: bib0021 article-title: Fast matching method for DNA sequences publication-title: Combinatorics, algorithms, probabilistic and experimental methodologies – volume: Vol. 5 start-page: 705 year: 1994 end-page: 714 ident: bib0004 article-title: Let sleeping files lie: Pattern matching in Z-compressed files publication-title: 5th Annual ACM-SIAM symposium on discrete algorithms – volume: 9 start-page: 30 year: 2012 end-page: 38 ident: bib0024 article-title: A framework for distributed pattern matching based on multi-threading publication-title: The International Arab Journal of Information Technology – start-page: 1 year: 2010 end-page: 5 ident: bib0048 article-title: Improved text scanning approach for exact String matching publication-title: Information and emerging technologies (ICIET), 2010 international conference on – volume: 20 start-page: 762 year: 1977 ident: 10.1016/j.eswa.2017.03.026_bib0006 article-title: A fast string searching algorithm publication-title: Communications of the ACM doi: 10.1145/359842.359859 – ident: 10.1016/j.eswa.2017.03.026_bib0013 – start-page: 75 year: 2009 ident: 10.1016/j.eswa.2017.03.026_bib0022 article-title: A new string matching algorithm for searching biological sequences – volume: 33 start-page: 132 year: 1990 ident: 10.1016/j.eswa.2017.03.026_bib0043 article-title: A very fast substring search algorithm publication-title: Communications of the ACM doi: 10.1145/79173.79184 – start-page: 1 year: 2010 ident: 10.1016/j.eswa.2017.03.026_bib0048 article-title: Improved text scanning approach for exact String matching – volume: 35 start-page: 74 year: 1992 ident: 10.1016/j.eswa.2017.03.026_bib0005 article-title: A new approach to text searching publication-title: Communications of the ACM doi: 10.1145/135239.135243 – start-page: 55 year: 1998 ident: 10.1016/j.eswa.2017.03.026_bib0008 article-title: A very fast string matching algorithm for small alphabets and long patterns – volume: 31 start-page: 249 year: 1987 ident: 10.1016/j.eswa.2017.03.026_bib0020 article-title: Efficient randomized pattern-matching algorithms publication-title: IBM Journal of Research and Development doi: 10.1147/rd.312.0249 – start-page: 56 year: 2009 ident: 10.1016/j.eswa.2017.03.026_bib0012 article-title: Fast variants of the backward-oracle-marching algorithm – volume: 10 start-page: 469 year: 2015 ident: 10.1016/j.eswa.2017.03.026_bib0044 article-title: HaShRECA: Hadoop based short read error correction algorithm for genome assembly publication-title: Current Bioinformatics doi: 10.2174/157489361004150922151409 – volume: 4 start-page: 431 year: 2011 ident: 10.1016/j.eswa.2017.03.026_bib0042 article-title: Index based multiple pattern matching algorithm using DNA sequence and pattern count publication-title: International Journal of Information Technology – volume: 102 start-page: 229 year: 2007 ident: 10.1016/j.eswa.2017.03.026_bib0026 article-title: Fast exact string matching algorithms publication-title: Information Processing Letters doi: 10.1016/j.ipl.2007.01.002 – volume: 35 start-page: 1107 year: 2005 ident: 10.1016/j.eswa.2017.03.026_bib0034 article-title: LZgrep: A Boyer–Moore string matching tool for Ziv–Lempel compressed text publication-title: Software: Practice and Experience – start-page: 257 year: 2008 ident: 10.1016/j.eswa.2017.03.026_bib0027 article-title: Efficient merging and filtering algorithms for approximate string searches – start-page: 271 year: 2007 ident: 10.1016/j.eswa.2017.03.026_bib0021 article-title: Fast matching method for DNA sequences – volume: 2 start-page: 347 year: 2004 ident: 10.1016/j.eswa.2017.03.026_bib0033 article-title: Practical and flexible pattern matching over Ziv–Lempel compressed text publication-title: Journal of Discrete Algorithms doi: 10.1016/j.jda.2003.12.002 – volume: 10 start-page: 501 year: 1980 ident: 10.1016/j.eswa.2017.03.026_bib0019 article-title: Practical fast searching in strings publication-title: Software: Practice and Experience – start-page: 11 year: 2005 ident: 10.1016/j.eswa.2017.03.026_bib0018 article-title: Mixed heuristic algorithm for intelligent string matching for information retrieval – volume: 14 year: 2016 ident: 10.1016/j.eswa.2017.03.026_bib0038 article-title: Advances in high throughput DNA sequence data compression publication-title: Journal of Bioinformatics and Computational Biology doi: 10.1142/S0219720016300021 – start-page: 62 year: 2004 ident: 10.1016/j.eswa.2017.03.026_bib0009 article-title: Compressed pattern matching in DNA sequences – start-page: 306 year: 2000 ident: 10.1016/j.eswa.2017.03.026_bib0040 article-title: Speeding up pattern matching by text compression – start-page: 51 year: 2001 ident: 10.1016/j.eswa.2017.03.026_bib0001 article-title: Efficient experimental string matching by weak factor recognition* – start-page: 279 year: 1992 ident: 10.1016/j.eswa.2017.03.026_bib0003 article-title: Efficient two-dimensional compressed matching – year: 2002 ident: 10.1016/j.eswa.2017.03.026_bib0032 – volume: 19 start-page: 157 year: 2011 ident: 10.1016/j.eswa.2017.03.026_bib0007 article-title: Sequence alignment from the perspective of stochastic optimization: A survey publication-title: Turkish Journal of Electrical Engineering & Computer Sciences – start-page: 376 year: 2005 ident: 10.1016/j.eswa.2017.03.026_bib0017 article-title: Practical and optimal string matching – start-page: 118 year: 2009 ident: 10.1016/j.eswa.2017.03.026_bib0025 article-title: Filter Based fast matching of long patterns by using SIMD instructions – volume: 7 start-page: 644 year: 2011 ident: 10.1016/j.eswa.2017.03.026_bib0002 article-title: A fast hybrid algorithm approach for the exact string matching problem via berry ravindran and alpha skip search algorithms publication-title: Journal of Computer Science doi: 10.3844/jcssp.2011.644.650 – volume: Vol. 5 start-page: 705 year: 1994 ident: 10.1016/j.eswa.2017.03.026_bib0004 article-title: Let sleeping files lie: Pattern matching in Z-compressed files – volume: 79 start-page: 867 year: 2002 ident: 10.1016/j.eswa.2017.03.026_bib0029 article-title: On-line approximate string searching algorithms: Survey and experimental results publication-title: International Journal of Computer Mathematics doi: 10.1080/00207160212111 – volume: 110 start-page: 148 year: 2010 ident: 10.1016/j.eswa.2017.03.026_bib0010 article-title: Improving practical exact string matching publication-title: Information Processing Letters doi: 10.1016/j.ipl.2009.11.010 – volume: 19 start-page: 59 year: 2011 ident: 10.1016/j.eswa.2017.03.026_bib0036 article-title: Automatic knowledge extraction for filling in biography forms from Turkish texts publication-title: Turkish Journal of Electrical Engineering & Computer Sciences – volume: 35 start-page: 83 year: 1992 ident: 10.1016/j.eswa.2017.03.026_bib0047 article-title: Fast text searching: Allowing errors publication-title: Communications of the ACM doi: 10.1145/135239.135244 – start-page: 181 year: 2000 ident: 10.1016/j.eswa.2017.03.026_bib0041 article-title: A Boyer–Moore type algorithm for compressed pattern matching – volume: 104 start-page: 225 year: 2014 ident: 10.1016/j.eswa.2017.03.026_bib0039 article-title: SeqCompress: An algorithm for biological sequence compression publication-title: Genomics doi: 10.1016/j.ygeno.2014.08.007 – volume: 27 start-page: 851 year: 1997 ident: 10.1016/j.eswa.2017.03.026_bib0045 article-title: String matching in the DNA alphabet publication-title: Software-Practice and Experience doi: 10.1002/(SICI)1097-024X(199707)27:7<851::AID-SPE108>3.0.CO;2-D – volume: 15 start-page: 124 year: 1997 ident: 10.1016/j.eswa.2017.03.026_bib0028 article-title: A text compression scheme that allows fast searching directly in the compressed file publication-title: ACM Transactions on Information Systems (TOIS) doi: 10.1145/248625.248639 – volume: 31 start-page: 1265 year: 2001 ident: 10.1016/j.eswa.2017.03.026_bib0030 article-title: NR‐grep: A fast and flexible pattern‐matching tool publication-title: Software: Practice and Experience – start-page: 153 year: 1992 ident: 10.1016/j.eswa.2017.03.026_bib0046 article-title: Agrep–a fast approximate pattern-matching tool – volume: 23 start-page: 1405 year: 2015 ident: 10.1016/j.eswa.2017.03.026_bib0035 article-title: Fast bitwise pattern-matching algorithm for DNA sequences on modern hardware publication-title: Turkish Journal of Electrical Engineering & Computer Sciences doi: 10.3906/elk-1304-165 – year: 2004 ident: 10.1016/j.eswa.2017.03.026_bib0011 – volume: 9 start-page: 30 year: 2012 ident: 10.1016/j.eswa.2017.03.026_bib0024 article-title: A framework for distributed pattern matching based on multi-threading publication-title: The International Arab Journal of Information Technology – volume: 87 start-page: 201 year: 2003 ident: 10.1016/j.eswa.2017.03.026_bib0016 article-title: Shift-or string matching with super-alphabets publication-title: Information Processing Letters doi: 10.1016/S0020-0190(03)00296-5 – volume: 45 start-page: 13 year: 2013 ident: 10.1016/j.eswa.2017.03.026_bib0014 article-title: The exact online string matching problem: A review of the most recent results publication-title: ACM Computing Surveys (CSUR) doi: 10.1145/2431211.2431212 – start-page: 80 year: 2003 ident: 10.1016/j.eswa.2017.03.026_bib0037 article-title: Alternative algorithms for bit-parallel string matching – start-page: 288 year: 2005 ident: 10.1016/j.eswa.2017.03.026_bib0015 article-title: A simple fast hybrid pattern-matching algorithm – volume: 5 start-page: 4 year: 2000 ident: 10.1016/j.eswa.2017.03.026_bib0031 article-title: Fast and flexible string matching by combining bit-parallelism and suffix automata publication-title: Journal of Experimental Algorithmics (JEA) doi: 10.1145/351827.384246 – volume: 6 start-page: 323 year: 1977 ident: 10.1016/j.eswa.2017.03.026_bib0023 article-title: Fast pattern matching in strings publication-title: SIAM Journal on Computing doi: 10.1137/0206024 |
| SSID | ssj0017007 |
| Score | 2.3192031 |
| Snippet | •We present a brief introduction to the applications of pattern matching.•We present a novel pattern matching algorithm for DNA sequences.•We present... To solve, manage and analyze biological problems using computer technology is called bioinformatics. With the emergent evolution in computing era, the volume... |
| SourceID | proquest crossref elsevier |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 162 |
| SubjectTerms | Algorithm Algorithms Bioinformatics Biological evolution Deoxyribonucleic acid DNA DNA;Binary Error correction Gene sequencing Hardware Multi-threading Pattern matching String matching Strings |
| Title | EPMA: Efficient pattern matching algorithm for DNA sequences |
| URI | https://dx.doi.org/10.1016/j.eswa.2017.03.026 https://www.proquest.com/docview/1932170115 |
| Volume | 80 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: Baden-Württemberg Complete Freedom Collection (Elsevier) customDbUrl: eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: GBLVA dateStart: 20110101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: Elsevier SD Complete Freedom Collection [SCCMFC] customDbUrl: eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: ACRLP dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals [SCFCJ] customDbUrl: eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: AIKHN dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVESC databaseName: ScienceDirect (Elsevier) customDbUrl: eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: .~1 dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier – providerCode: PRVLSH databaseName: Elsevier Journals customDbUrl: mediaType: online eissn: 1873-6793 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0017007 issn: 0957-4174 databaseCode: AKRWK dateStart: 19900101 isFulltext: true providerName: Library Specific Holdings |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PS8MwFA5jXrz4W5zOkYM3qWubdEnFy5gbU9kQdLBbSJtEJ3MbW8Wbf7svbTpQZAehFFKSUr40732H730PoYtWbAhPtfESbiKPJpp5EmIl3KivEsh_NLTVyINhqz-i9-NoXEGdshbGyipd7C9ieh6t3ZOmQ7O5mEyaT0AOIB3ai1jXqbyCnTLbxeDqay3zsPZzrPDbY56d7QpnCo2XXn1a76GA5Uan1mDh7-T0K0znuae3h3YcacTt4rv2UUXPDtBu2ZABu_N5iG66j4P2Ne7mthCQTfAid8-cYaCluWYSy-nLfDnJXt8xkFV8O2zjtZb6CI163edO33PtEbyUhDzzWoTKkKYm4URLwJTIgHGZMhWlMuVwnBIVE0V1lEipmQRuoI2JfUW4loECnnKMqrP5TJ8grCLOEsJVrIihyvixz2HYAm4X69iEYQ0FJS4idd7htoXFVJQisTdhsRQWS-ETAVjW0OV6zaJwztg4OyrhFj_2X0Bo37iuXu6NcKdvJSwptT7zQXT6z9eeoW07KsRkdVTNlh_6HNhHljTy36uBttp3D_3hN3eB2Gg |
| linkProvider | Elsevier |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PS8MwGA1jHvTib3H-zMGb1LVNuqTiZejG1G0IbrBbSJtEJzrHVvHm3-6XNh0osoNQCm2TUl6T73uBl_chdNaIDeGpNl7CTeTRRDNPQqyEE_VVAvmPhnY3cq_f6Azp3SgaVdB1uRfGyipd7C9ieh6t3Z26Q7M-HY_rj0AOIB3ag1jXKVgCrdAoZHYFdvG10HlY_zlWGO4xzzZ3O2cKkZeef1rzoYDlTqfWYeHv7PQrTufJp72J1h1rxM3iw7ZQRU-20UZZkQG7CbqDrloPveYlbuW-EJBO8DS3z5xg4KW5aBLL16f32Th7fsPAVvFNv4kXYupdNGy3Btcdz9VH8FIS8sxrECpDmpqEEy0BVCIDxmXKVJTKlMN8SlRMFNVRIqVmEsiBNib2FeFaBgqIyh6qTt4neh9hFXGWEK5iRQxVxo99DpcNIHexjk0Y1lBQ4iJSZx5ua1i8ilIl9iIslsJiKXwiAMsaOl_0mRbWGUtbRyXc4scAEBDbl_Y7Kv-NcNNvLiwrtUbzQXTwz9eeotXOoNcV3dv-_SFas08KZdkRqmazD30MVCRLTvKh9g2eIdn9 |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=EPMA%3A+Efficient+pattern+matching+algorithm+for+DNA+sequences&rft.jtitle=Expert+systems+with+applications&rft.au=Tahir%2C+Muhammad&rft.au=Sardaraz%2C+Muhammad&rft.au=Ikram%2C+Ataul+Aziz&rft.date=2017-09-01&rft.pub=Elsevier+Ltd&rft.issn=0957-4174&rft.eissn=1873-6793&rft.volume=80&rft.spage=162&rft.epage=170&rft_id=info:doi/10.1016%2Fj.eswa.2017.03.026&rft.externalDocID=S0957417417301811 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0957-4174&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0957-4174&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0957-4174&client=summon |