Periodic power spectrum with applications in detection of latent periodicities in DNA sequences
Publication:728537
DOI: 10.1007/S00285-016-0982-8; zbMATH Open: 1354.42009; arXiv: 1504.02367; OpenAlex: W3101164530; Wikidata: Q40357196; Scholia: Q40357196; MaRDI QID: Q728537; FDO: Q728537
Publication date: 20 December 2016
Published in: Journal of Mathematical Biology
Abstract: Latent periodic elements in genomes play important roles in genomic functions. Many complex periodic elements in genomes are difficult to detect with commonly used digital signal processing (DSP) methods. We present a novel method to compute the periodic power spectrum of a DNA sequence based on the nucleotide distributions on periodic positions of the sequence. The method directly calculates the full periodic spectrum of a DNA sequence rather than the frequency spectrum obtained by Fourier transform. The magnitude of the periodic power spectrum reflects the strength of the periodicity signals; thus, the algorithm can capture all the latent periodicities in DNA sequences. We apply this method to the detection of latent periodicities in different genome elements, including exons and microsatellite DNA sequences. The results show that the method minimizes the impact of spectral leakage, captures a much broader range of latent periodicities in genomes, and outperforms the conventional Fourier transform.
Full work available at URL: https://arxiv.org/abs/1504.02367
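Illustration: the abstract describes scoring each candidate period from the nucleotide distributions at periodic positions of the sequence. The Python sketch below shows that general idea only. It is not the paper's periodic power spectrum formula: the function names `position_counts`, `periodicity_score`, and `period_scan` are hypothetical, and the score used here (the mean sum of squared nucleotide frequencies per residue class) is a simple stand-in chosen for illustration.

```python
from collections import Counter


def position_counts(seq, period):
    """Count each nucleotide in every residue class (position mod period)."""
    if period < 1:
        raise ValueError("period must be a positive integer")
    counts = [Counter() for _ in range(period)]
    for i, base in enumerate(seq):
        counts[i % period][base] += 1
    return counts


def periodicity_score(seq, period):
    """Illustrative score, NOT the paper's formula.

    For each residue class, sum the squared nucleotide frequencies,
    then average over the classes. A perfect repeat of the given
    period scores 1.0; evenly mixed classes score lower.
    """
    total = 0.0
    for class_counts in position_counts(seq, period):
        n = sum(class_counts.values())
        if n:  # skip empty classes (period longer than the sequence)
            total += sum((v / n) ** 2 for v in class_counts.values())
    return total / period


def period_scan(seq, max_period):
    """Score every candidate period from 2 up to max_period."""
    return {p: periodicity_score(seq, p) for p in range(2, max_period + 1)}


if __name__ == "__main__":
    scores = period_scan("ACG" * 20, 5)
    for p, s in scores.items():
        print(p, round(s, 3))
```

Under this stand-in score, multiples of a true period also score highly, and long periods on short sequences are biased upward because each residue class holds few bases, so a practical method needs a normalization that this sketch omits.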
Recommendations
- Spectral sum rules and search for periodicities in DNA sequences
- An efficient algorithm for prediction of genes of genomic sequences based on Fourier analysis
- Revisiting the relationship between compositional sequence complexity and periodicity
- The problem of power spectrum and the signal-to-noise ratio and its fast algorithm implementation in gene identification
- Identification of repeats in DNA sequences using nucleotide distribution uniformity
- Protein sequences, DNA sequences (92D20)
- Fourier and Fourier-Stieltjes transforms and other transforms of Fourier type (42A38)
Cites Work
- Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence
- A new method to cluster DNA sequences using Fourier power spectrum
- An improved model for whole genome phylogenetic analysis by Fourier transform
- Information decomposition method to analyze symbolical sequences
- A measure of DNA sequence similarity by Fourier transform with applications on hierarchical clustering
- Some features of Fourier spectrum for symbolic sequences
- A converse coding theorem for mismatched decoding at the output of binary-input memoryless channels
- Detection and visualization of tandem repeats in DNA sequences
Cited In (5)
- Investigating some attributes of periodicity in DNA sequences via semi-Markov modelling
- Chromosome-specific spatial periodicities in gene expression revealed by spectral analysis
- Latent periodicity-2 in coronavirus SARS-CoV-2 genome: evolutionary implications
- Model of perfect tandem repeat with random pattern and empirical homogeneity testing poly-criteria for latent periodicity revelation in biological sequences
- Identification of repeats in DNA sequences using nucleotide distribution uniformity