Bioinformatics and phylogenetics. Seminal contributions of Bernard Moret (Q2312623)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Bioinformatics and phylogenetics. Seminal contributions of Bernard Moret
scientific article

    Statements

    Bioinformatics and phylogenetics. Seminal contributions of Bernard Moret (English)
    0 references
    17 July 2019
    0 references
    This book is a collection of fifteen chapters, each focusing on one aspect of phylogenetic analyses. The style of the book is comprehensive, balancing the thorough explanation of fundamental concepts with suggestions and references that support further research. The first four chapters focus on likelihood calculations; in Chapter 1, the author reviews the approaches for optimising phylogenetic likelihood calculations focusing on sequential PLF via algorithmic means and technical means. Also discussed are partial and full terraces in the tree space, as well as details related to the parallelisation of computational aspects. In the second chapter, the author presents matrix-based calculations of the likelihood function and its applications on the pruning algorithm using only vector operations, on inferring edge lengths, and on one-length and all-length optimisations. In the third chapter, the author discussess high-performance, faster, likelihood-based calculations and the resilience of the approaches to multi-mode parallelisation. In the next chapter, the author presents yet another angle on the maximum likelihood-based estimation, one that takes into account sequence-length requirements. The problem of optimising the branch lengths and balancing depth versus branching are discussed in detail. In the fifth chapter, the author analyses the gene family evolution, from an algorithmic perspective. Following an overview on binary and non-binary tree reconciliation, the inference of a gene tree from a set of trees, using amalgamation and the super-tree method, are also presented. In the next chapter, the author presents a series of approaches based on the divide-and-conquer estimations of super-trees: the bipartition-based, quartet-based and distance-based methods. The accuracy and scalability of super-tree methods and approaches to improve these (e.g. the SuperFine implementation) are also discussed. In the seventh chapter, the author presents methods for constructing taxonomic super-trees focusing the examples on Incertae sedis. The focus is on rooted splits and split-based semantics coupled with examples that highlight a pipeline based on subproblem decomposition. In the eighth chapter, the aim is to present the changes in interpretation of the evolutionary rate change led by the transformation from additive to ultra-metric (Farris transform, non-parametric rate smoothing and penalised likelihood). The examples include fish, Solanacae and Malvaceae. Continuing the trend, in the ninth chapter, the author presents approaches for the reconstruction of ancestral genomes focusing on the difficulties in solving the median problem when multiple genomes are involved. With the tenth chapter, the author approaches yet another direction, the genome rearrangement problem for genes with single and multiple copies; for the former, that standard distances such as breakpoint, reversal and translocation, DCJ and ScoJ are presented, for the latter, the concept of polyploidy and the models with indels, duplications and deletions, are discussed in detail. All examples focus on cancers. The study of cancer phylogenetics continues in Chapter 11 where the author presents approaches for estimating evolutionary distances based on single nucleotide variants (SNVs) and copy number aberrations (CNAs) and incorporating these into the Steiner tree problems; applications of deconvolution methods on SNVs and CNAs conclude the chapter. The twelfth chapter also focuses on integrating phylogenies into networks, namely the identification of clusters in rooted phylogenetic networks. Particular examples on the node visibility property, on reticulation visible networks and on nearly stable networks are also included. In the next chapter, the author overviews the computational difficulties in relation to the identification and description of phylogenetic networks in the presence of hybridisation. Following a discussion on approaches for displaying and summarising networks, the generative models based on sequences and approaches based on maximum likelihood approaches and Bayesian Inference are discussed in detail. In the fourteenth chapter, the author overviews, side-by-side, comparative and functional genomic approaches with examples on seven Amniote genomes. In the last chapter, the author puts forward yet another angle based on integer linear programming. Standard problems such as the handling of high-density subgraphs, the identification of a maximum clique and of the maximum independent set are discussed from the ILP perspective. The author also embeds the traveling salesman problem into biological problems e.g. the DNA assembly problem and the marker-ordering problem. Although written in an accessible language, the book is mainly aimed at researchers with pre-existing knowledge of phylogenetics and computational approaches related to it (e.g. graph theory). The numerous examples make this collection of chapters comprehensible for undergraduate, graduate and postgraduate researchers from computer science, mathematics and biology; the extensive set of references that accompany every chapter recommend the book as a reliable starting point for further studies.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    phylogenetics
    0 references
    likelihood calculation
    0 references
    tree space
    0 references
    maximum likelihood tree inference
    0 references
    pruning inference
    0 references
    high-performance phylogenetic inference
    0 references
    sequence-length requirements
    0 references
    phylogenetic signal
    0 references
    optimisation of branch lengths
    0 references
    gene family evolution
    0 references
    binary gene tree
    0 references
    binary species tree
    0 references
    non-binary species tree
    0 references
    non-binary gene tree
    0 references
    set of trees
    0 references
    super-tree
    0 references
    divide-and-conquer tree estimation
    0 references
    scalability of super-tree methods
    0 references
    tree estimation
    0 references
    taxonomic super-tree construction
    0 references
    rooted splits
    0 references
    evolutionary rate change
    0 references
    modal similarity of orthologs
    0 references
    phylo-genomics
    0 references
    Farris transform
    0 references
    non-parametric rate smoothing (NPRS)
    0 references
    penalised likelihood
    0 references
    ancestral genome reconstruction
    0 references
    multiple genomes
    0 references
    genome rearrangement
    0 references
    single gene copies
    0 references
    multiple gene copies
    0 references
    breakpoint distance
    0 references
    multi-copy model
    0 references
    cancer phylogenetics
    0 references
    Steiner tree problem
    0 references
    deconvolutional phylogenies
    0 references
    cluster networks
    0 references
    regular networks
    0 references
    rooted phylogenetic networks
    0 references
    reticulation visible phylogenetic networks
    0 references
    parametrisation of network topology
    0 references
    Bayesian inference of phylogenetic networks
    0 references
    population genetics
    0 references
    phylogenetic invariant models
    0 references
    functional genomics
    0 references
    comparative genomics
    0 references
    integer linear programming
    0 references
    application of travelling salesman problem
    0 references
    maximum clique
    0 references
    maximum independent set
    0 references
    high density subgraphs
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references