On distribution of runs and patterns in four state trials

From MaRDI portal
Publication:6506941

arXiv2211.00774MaRDI QIDQ6506941FDOQ6506941


Authors: Jung Taek Oh Edit this on Wikidata



Abstract: From a mathematical and statistical point of view, a segment of a DNA strand can be viewed as a sequence of four-state (A, C, G, T) trials. We consider distributions of runs and patterns related to run lengths of multi-state sequences, especially for four states (A, B, C, D). Let X1,X2,ldots be a sequence of four state i.i.d. trials taking values in the set mathscrS=A,B,C,D of four symbols with probability P(A)=Pa, P(B)=Pb, P(C)=Pc and P(D)=Pd, respectively. In this paper, we obtain exact formulae for the probability distribution function for runs of B's the discrete distribution of order k, longest run statistics, shortest run statistics, waiting time distribution and the distribution of run lengths.













This page was built for publication: On distribution of runs and patterns in four state trials

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6506941)