Hydrozip: how hydrological knowledge can be used to improve compression of hydrological data (Q742695): Difference between revisions

From MaRDI portal
ReferenceBot (talk | contribs)
Changed an Item
Import241208061232 (talk | contribs)
Normalize DOI.
 
(One intermediate revision by one other user not shown)
Property / DOI
 
Property / DOI: 10.3390/e15041289 / rank
Normal rank
 
Property / DOI
 
Property / DOI: 10.3390/E15041289 / rank
 
Normal rank

Latest revision as of 02:46, 10 December 2024

scientific article
Language Label Description Also known as
English
Hydrozip: how hydrological knowledge can be used to improve compression of hydrological data
scientific article

    Statements

    Hydrozip: how hydrological knowledge can be used to improve compression of hydrological data (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    19 September 2014
    0 references
    Summary: From algorithmic information theory, which connects the information content of a data set to the shortest computer program that can produce it, it is known that there are strong analogies between compression, knowledge, inference and prediction. The more we know about a data generating process, the better we can predict and compress the data. A model that is inferred from data should ideally be a compact description of those data. In theory, this means that hydrological knowledge could be incorporated into compression algorithms to more efficiently compress hydrological data and to outperform general purpose compression algorithms. In this study, we develop such a hydrological data compressor, named HydroZIP, and test in practice whether it can outperform general purpose compression algorithms on hydrological data from 431 river basins from the Model Parameter Estimation Experiment (MOPEX) data set. HydroZIP compresses using temporal dependencies and parametric distributions. Resulting file sizes are interpreted as measures of information content, complexity and model adequacy. These results are discussed to illustrate points related to learning from data, overfitting and model complexity.
    0 references
    data compression
    0 references
    algorithmic information theory
    0 references
    hydrology
    0 references
    inference
    0 references
    streamflow
    0 references
    MOPEX
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references