Advancing DNA Signal Processing: Integrating Digital and Biological Nuances for Enhanced Identification of Coding Regions

DOI:

https://doi.org/10.36371/port.2023.4.8

Authors

  • Ammar A. Sakran Informatics Institute for Postgraduate Studies, Iraqi Commission for Computer and Informatics. Iraq
  • Suha. M. Hadi Informatics Institute for Postgraduate Studies, Iraqi Commission for Computer and Informatics. Iraq
  • Waleed A. Mahmoud Al-Jawher College of Engineering, Uruk University, Baghdad, Iraq.

Within the complex realm of DNA sequencing, discerning protein coding areas from non-coding segments proves challenging due to the pervasive 1/f background disturbance. Traditional digital signal processing (DSP) methodologies, while widely adopted, may inadvertently overlook the inherent nuances and intricacies of DNA sequences. This paper critically examines these established DSP-centric methodologies, underscoring their potential inadequacies in capturing the salient characteristics intrinsic to DNA. Notably, nucleotides within the DNA exhibit distinct attributes, such as their triadic configurations, specific structural significance, and particularized density distributions in codons, among other characteristics. By harnessing these inherent features of nucleotides, computational approaches can effectively counteract signal disruptions, enhancing the precision in identifying protein coding regions.

Keywords:

DNA Signal Processing, Coding Region Identification, Digital-Biological Integration, 1/f Noise Mitigation, 3-Base Periodicity Analysis, Advanced DSP Techniques

[1] L. Galleani and R. Garello, “The minimum entropy mapping spectrum of a DNA sequence,” IEEE Trans. Inf. Theory,

[2] vol. 56, no. 2, pp. 771–783, 2010.

[3] J. Mena-Chalco, H. Carrer, Y. Zana, and R. M. Cesar Jr, “Identification of protein-coding regions using the modified

[4] Gabor-wavelet transform,” IEEE/ACM Trans. Comput. Biol. Bioinforma., vol. 5, no. 2, pp. 198–207, 2008.

[5] M. Akhtar, J. Epps, and E. Ambikairajah, “Signal processing in sequence analysis: advances in eukaryotic gene

[6] prediction,” IEEE J. Sel. Top. Signal Process., vol. 2, no. 3, pp. 310–321, 2008.

[7] S. S. Sahu and G. Panda, “Identification of protein-coding regions in DNA sequences using a time-frequency filtering

[8] approach,” Genomics. Proteomics Bioinformatics, vol. 9, no. 1, pp. 45–55, 2011.

[9] S. Barman, M. Roy, S. Biswas, and S. Saha, “Prediction of cancer cell using digital signal processing,” Ann. Fac. Eng.

[10] Hunedoara, vol. 9, no. 3, p. 91, 2011.

[11] W.-F. Zhang and H. Yan, “Coding sequence prediction using empirical mode decomposition and Fourier transform of

[12] structural profiles of DNA sequences,” Pattern Recognit., vol. 45, no. 3, pp. 947–955, 2012.

[13] M. Ahmad, L. T. Jung, and A.-A. Bhuiyan, “A biological inspired fuzzy adaptive window median filter (FAWMF) for

[14] enhancing DNA signal processing,” Comput. Methods Programs Biomed., vol. 149, pp. 11–17, 2017.

[15] P. Ramachandran, W.-S. Lu, and A. Antoniou, “Filter-based methodology for the location of hot spots in proteins and

[16] coding sequences in DNA,” IEEE Trans. Biomed. Eng., vol. 59, no. 6, pp. 1598–1609, 2012.

[17] T. Meng et al., “Wavelet analysis in current cancer genome research: a survey,” IEEE/ACM Trans. Comput. Biol.

[18] Bioinforma., vol. 10, no. 6, pp. 1442–14359, 2013.

[19] T. M. Inbamalar and R. Sivakumar, “Study of DNA sequence analysis using DSP techniques,” J. Autom. Control Eng.

[20] Vol, vol. 1, no. 4, 2013.

[21] D. K. Shakya, R. Saxena, and S. N. Sharma, “An adaptive window length strategy for eukaryotic CDS prediction,”

[22] IEEE/ACM Trans. Comput. Biol. Bioinforma., vol. 10, no. 5, pp. 1241–1252, 2013.

[23] L. Das, J. K. Das, and S. Nanda, “Identification of coding sequence location applying Kaiser window and DFT

[24] techniques,” in Convergence in Technology (I2CT), 2017 2nd International Conference for, 2017, pp. 211–216.

[25] S. A. Marhon and S. C. Kremer, “Prediction of protein coding regions using a wide-range wavelet window method,”

[26] IEEE/ACM Trans. Comput. Biol. Bioinforma., vol. 13, no. 4, pp. 742–753, 2016.

[27] M. Roy and S. Barman, “Improved gene prediction by principal component analysis based autoregressive Yule- Walker

[28] method,” Gene, vol. 575, no. 2, pp. 488–497, 2016.

[29] M. Ahmad, L. T. Jung, and A.-A. Bhuiyan, “From DNA to protein: Why genetic code context of nucleotides for DNA

[30] signal processing? A review,” Biomed. Signal Process. Control, vol. 34, pp. 44–63, 2017.

[31] B. Das and I. Turkoglu, “A novel numerical mapping method based on entropy for digitizing DNA sequences,” Neural

[32] Comput. Appl., pp. 1–9, 2017.

[33] M. S. Mabrouk, S. M. Naeem, and M. A. Eldosoky, “DIFFERENT GENOMIC SIGNAL PROCESSING METHODS

[34] FOR EUKARYOTIC GENE PREDICTION: A SYSTEMATIC REVIEW,” Biomed. Eng. Appl. Basis Commun., vol. 29, no. 01,

[35] p. 1730001, 2017.

[36] L. Das, J. K. Das, and S. Nanda, “Advanced protein coding region prediction applying robust SVD algorithm,” in Man

[37] and Machine Interfacing (MAMI), 2017 2nd International Conference on,

[38] D. Anastassiou, “Genomic signal processing,” IEEE Signal Process. Mag., vol. 18, no. 4, pp. 8–20, 2001.

[39] M. Akhtar, E. Ambikairajah, and J. Epps, “Detection of period-3 behavior in genomic sequences using singular value

[40] decomposition,” in Emerging Technologies, 2005. Proceedings of the IEEE Symposium on, 2005, pp. 13–17.

[41] A.K. Singh, V.K. Srivastava, Improved filtering approach for identification of protein-coding regions in eukaryotes by

[42] background noise reduction using S- G filter, Netw. Model. Anal. Health Inform. Bioinform. 10 (19) (2021) 1–16

[43] L. Das, S. Nanda, and J. K. Das, “A novel DNA mapping scheme for improved coding sequence prediction using digital

[44] filters,” in Man and 2017Machine Interfacing (MAMI), 2nd International Conference on, 2017, pp. 1–6.

[45] P.K. Kumari, J.B. Seventline, A Novel Approach For Identification Of Exon Locations In DNA Sequences Using GLC

[46] Window, Int. J. Biol. Biomed. Eng. 15 (2021) 47–60.

[47] A. Piovesan, F. Antonaros, L. Vitale, P. Strippoli, M.C. Pelleri, M. Caracausi, Human protein-coding genes and gene

[48] feature statistics, BMC Res. Notes 12 (2019).

[49] S.A. Marhon, S.C. Kremer, Prediction of protein-coding regions using a wide-range wavelet window method, IEEE/ACM

[50] Trans. Comput. Biol. Bioinform. 13 (4) (2016) 742–753.

[51] S.D. Sharma, S.N. Sharma, R. Saxena, Identification of short exons disunited by a short intron in eukaryotic DNA

[52] regions, IEEE/ACM Trans. Comput. Biol. Bio in-form. 17 (5) (2020) 1660–1670.

[53] Q. Zheng, T. Chen, W. Zhou, L. Xie, H. Su, Gene prediction by the noise-assisted MEMD and wavelet transform for

[54] identifying the protein-coding regions, Bio- cybern. Biomed. Eng. 41 (1) (2021) 196–210.

[55] M. Raman Kumar, N.K. Vaegae, Walsh code based numerical mapping method for the identification of protein-coding

[56] regions in eukaryotes, Biomed. Signal Process. Control 58 (2020) 1–11.

[57] M. Raman Kumar, N.K. Vaegae, A new numerical approach for DNA representation using modified Gabor wavelet

[58] transform for the identification of protein-coding regions, Biocybern. Biomed. Eng. 40 (2) (2020) 836–848.

[59] M.K. Hota, Empirical mode decomposition based adaptive noise canceller for improved identification of exons in

[60] eukaryotes, Netw. Model. Anal. Health Inform. Bioinform. 10 (2021) 60.

[61] P. Garg, S.D. Sharma, S.N. Sharma, Tandem repeats detection in DNA sequences using p-spectrum based algorithm, in

[62] The Proceedings of Conference on Information and Communication Technology (CICT 2017), 2017, pp. 1–5.

[63] N. Yu, Z. Li, Z. Yu, Survey on encoding schemes for genomic data representation and feature learning-from signal

[64] processing to machine learning, Big Data Min.Anal. 1(3)(2018) 191–210.

[65] P. Garg, S.D. Sharma, Identification of CpG Islands in DNA sequences using short-time Fourier transform, Interdiscip.

[66] Sci. Comput. Life Sci. 12 (3) (2020) 355–367.

[67] N. Yu, Z. Li, Z. Yu, Survey on encoding schemes for genomic data representation and feature learning—from signal

[68] processing to machine learning, Big Data Min. Anal. 1 (2018) 191–210.

[69] H. Wassfy, M. Elnaby, M. Salem, M. Mabrouk, A.-A. Zidan, Advanced DNA Mapping Schemes for Exon Prediction

[70] Using Digital Filters, Am. J. Biomed. Eng. (6) 1 (2016) 25–31

[71] A.K. Singh, V.K. Srivastava, Improved filtering approach for identification of protein-coding regions in eukaryotes by

[72] background noise reduction using S-G filter, Netw. Model. Anal. Health Inform. Bioinforma. (10) 1 (2021)

[73] J. Baranowski, P. Piątek, Fractional Band-Pass Filters: Design, Implementation, and Application to EEG Signal

[74] Processing, J. Circuits Syst. Comput. 26 (11) (2017) 1750170

[75] Y. Ferdi, A Fractional Digital High-Pass Notch Filter, in 2020 10th International Symposium on Signal, Image, Video and

[76] Communications (ISIVC), 2021, pp. 1–4.

[77] A. Gupta, S. Kumar, Design of Atangana–Baleanu–Caputo fractional-order digital filter, ISA Trans. 112 (2021) 74–88

[78] A.K. Shukla, R.K. Pandey, R.B. Pachori, A fractional filter based efficient algorithm for retinal blood vessel

[79] segmentation, Biomed. Signal Process. Control 59 (2020)

[80] Ammar AbdRaba Sakran, Waleed A. Mahmoud Al-Jawher and Suha M Hadi “ A new approach for DNA sequence

[81] analysis using multiwavelet transform (MWT)” , 2023 J. Phys.: Conf. Ser. 2432 012022.

[82] Q. Zheng, T. Chen, W. Zhou, S.A. Marhon, L. Xie, H. Su, SAVMD: An adaptive signal processing method for

[83] identifying protein-coding regions, Biomed. Signal Process, Control 70 (2021)

[84] Q. Zheng, T. Chen, W. Zhou, L. Xie, H. Su, Gene prediction by the noise-assisted MEMD and wavelet transform for

[85] identifying the protein-coding regions, Biocybern. Biomed. 41 (1) (2021) 196–210

[86] V. Pathak, S.J. Nanda, A.M. Joshi, S.S. Sahu, VLSI implementation of the anti-notch lattice structure for identification of

[87] exon regions in Eukaryotic genes, IET Comput. Digit. Tech. 14 (5) (2020) 217–229

[88] S. Kar, M. Ganguly, S. Das, Using DIT-FFT algorithm for identification of the protein-coding region in the eukaryotic

[89] gene, Biomed. Eng. Appl. Basis Commun. 31 (01) (2019)

[90] A. Singh, V. Srivastava, Bidirectional filtering approach for the improved protein coding region identification in

[91] eukaryotes, Netw. Model. Anal. Health Inform. Bioinforma. 11 (2022)

[92] S. Kar, M. Ganguly, S. Das, Using DIT-FFT algorithm for identification of the protein-coding region in the eukaryotic

[93] gene, Biomed. Eng.: Appl., Basis Commun. 31 (01) (2019).

[94] S. Putluri, M.Z.U. Rahman, C.S. Amara, N. Putluri, New Exon Prediction Techniques Using Adaptive Signal Processing

[95] Algorithms for Genomic Analysis, IEEE Access 7 (2019) 80800–80812.

[96] P.K. Kumari, J.B. Seventline, A Novel Approach For Identification Of Exon Locations In DNA Sequences Using GLC

[97] Window, Int. J. Biol. Biomed. Eng. 15 (2021) 47–60.

[98] R.K. M, N.K. Vaegae, Walsh code based numerical mapping method for the identification of protein-coding regions in

[99] eukaryotes, Biomed. Signal Process. Control. 58 (2020) 101859,

[100] S. Kar, M. Ganguly, S. Ghosal, Prediction of coding region and mutations in Human DNA by effective numerical coding

[101] and DSP technique, in 2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS), 2021.

[102] National center for biotechnology information. https://www.ncbi.nlm.nih.gov

[103] Sanja Rogic, HMR195. https://srogic.wordpress.com/datasets/hmr195-dataset.

[104] SMR Taha, WA Mahmood “New techniques for Daubechies wavelets and multiwavelets implementation using quantum

[105] computing” Facta universitatis-series: Electronics and Energetics 26 (2), 145-156, 2013.

[106] WAM Al-Jawher, SH Awad “A proposed brain tumor detection algorithm using Multi wavelet Transform (MWT)”

[107] Materials Today: Proceedings 65, 2731-2737, 2022.

[108] WA Mahmoud “Computation of Wavelet and Multiwavelet Transforms Using Fast Fourier Transform” Journal Port

[109] Science Research 4 (2), 111-117, 2021.

[110] WAM Al-Jawher, SAA SHABAN “CLUSTERING OF MEDICAL IMAGES USING MULTIWAVELET TRANSFORM

[111] AND K-MEANS ALGORITHMS” Journal Port Science Research 5 (1), 35-42, 2022.

[112] SM Saadi, WAM Al-Jawher “ Proposed DeepFake Detection Method Using Multiwavelet Transform” International Journal

[113] of Innovative Computing 13 (1-2), 61-66, 2022.

[114] AH Salman, WAM Al-Jawher “A Hybrid Multiwavelet Transform with Grey Wolf Optimization Used for an Efficient

[115] Classification of Documents” International Journal of Innovative Computing 13 (1-2), 55-60, 2022.

[116] WA Mahmoud, AI Abbas, NAS Alwan “Face Identification Using Back-Propagation Adaptive Multiwavelet” Journal of

[117] Engineering 18 (3), 2012.

[118] Waleed A Mahmoud, MR Shaker “3D Ear Print Authentication using 3D Radon Transform” proceeding of 2nd

[119] International Conference on Information & Communication Technologies, Pages 1052-1056, 2006.

[120] Waleed A Mahmoud, Afrah Loay Mohammed Rasheed “3D Image Denoising by Using 3D Multiwavelet” ALMustansiriya J. Sci, vol 21, issue 7, pp. 108-136, 2010.

[121] W. A. Mahmoud & I.K. Ibraheem “Image Denoising Using Stationary Wavelet Transform” Signals, Inf. Patt. Proc. &

[122] Class., vol 46, issue 4, pp. 1-18, 2003.

[123] Waleed A Mahmoud, Ali A Ali, & Saad N Abdul Majed “Wavelet Based Multi Carrier Code Division Multiple Access”

[124] proceeding of 5th International Multi-Conference on Systems, Signals and Devices, pages 1-6, 2008.

[125] Abbas Hasan Kattoush, Waleed Ameen Mahmoud Al-Jawher, Sulaiman M Abbas, Ali Tweij Shaheen “A N-Radon Based

[126] OFDM Trasceivers Design and Performance Simulation Over Different Channel Models” Journal of Wireless Personal

[127] Communications, Volume 58, Pages 695-711, 2011.

[128] AHM Al-Heladi, W. A. Mahmoud, HA Hali, AF Fadhel “Multispectral Image Fusion using Walidlet Transform”

[129] Advances in Modelling and Analysis B, vol 52, issue 1-2, pp. 1-20, 2009.

[130] Hamid M Hasan, Waleed A. Mahmoud Al- Jawher, Majid A Alwan “3-d face recognition using improved 3d mixed

[131] transform” Journal International Journal of Biometrics and Bioinformatics (IJBB), Volume 6, Issue 1, Pages 278-290, 2012.

[132] Waleed Ameen Mahmoud “A Smart Single Matrix Realization of Fast Walidlet Transform” Journal International Journal

[133] of Research and Reviews in Computer Science, Volume 2, Issue, 1, Pages 144-151, 2011.

[134] Waleed A. Mahmud Al-Jawhar “Feature Combination & Mapping Using Multiwavelet Transform” Al-Rafidain Univ.

[135] College journal, Issue 19, Pages: 13-34, 2006.

[136] Abbas Hasan Kattoush, Waleed Ameen Mahmoud Al-Jawher, Sulaiman M Abbas, Ali Tweij Shaheen “A N-Radon Based

[137] OFDM Trasceivers Design and Performance Simulation Over Different Channel Models” Journal of Wireless Personal

[138] Communications, Volume 58, Pages 695-711, 2011.

[139] Walid A Mahmoud, Majed E Alneby, Wael H Zayer “2D-multiwavelet transform 2D-two activation function wavelet

[140] network-based face recognition” J. Appl. Sci. Res, vol. 6, issue 8, 1019-1028, 2010.

[141] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher, “Image Encryption Algorithm Based on Arnold

[142] Transform and Chaos Theory in the Multi-wavelet Domain”, International Journal of Computers and Applications, Vol. 45, Issue 4,

[143] pp. 306-322, 2023.

[144] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher ”Hybrid image encryption algorithm based on

[145] compressive sensing, gray wolf optimization, and chaos” , Journal of Electronic Imaging, Volume 32, Issue 4, Pages 043038-

[146] , 2023.

[147] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher, ”URUK 4D DISCRETE CHAOTIC MAP FOR

[148] SECURE COMMUNICATION APPLICATIONS” Journal Port Science Research, Vol. 5, Issue 3, PP. 131-141, 2023.

[149] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher, “Image Encryption Algorithm Based on Arnold

[150] Transform and Chaos Theory in the Multi-wavelet Domain” International Journal of Computers and Applications, Volume 45, Issue

[151] , Pages 306-322, 2023.

[152] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher, “WAM 3D discrete chaotic map for secure

[153] communication applications” International Journal of Innovative Computing, Volume 13, Issue 1-2, Pages 45-54, 2022.

[154] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher” Hybrid image encryption algorithm based on

[155] compressive sensing, gray wolf optimization, and chaos” Journal of Electronic Imaging, Volume 32, Issue 4, Pages 043038-043038,

[156] Zahraa A Hasan, Suha M Hadi, Waleed A Mahmoud, “Speech scrambler with multiwavelet, Arnold Transform and particle

[157] swarm optimization” Journal Pollack Periodica, Volume 18, Issue 3, Pages 125-131, 2023.

[158] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher “A Hybrid Domain Medical Image Encryption Scheme

[159] Using URUK and WAM Chaotic Maps with Wavelet–Fourier Transforms” Journal of Cyber Security and Mobility, Pages 435–464-

[160] –464, 2023.

[161] W. A. Mahmoud Al-Jawher, Zahraa A Hasan, Suha M. Hadi,” Time Domain Speech Scrambler Based on Particle Swarm

[162] Optimization” International Journal for Engineering and Information Sciences, Vol. 18, Issue 1, PP. 161-166, 2023.

Sakran, A. A. . ., Hadi, S. M. ., & Al-Jawher, W. A. M. . (2023). Advancing DNA Signal Processing: Integrating Digital and Biological Nuances for Enhanced Identification of Coding Regions. Journal Port Science Research, 6(4), 374–387. https://doi.org/10.36371/port.2023.4.8

Downloads

Download data is not yet available.

Most read articles by the same author(s)

1 2 > >>