Hailfinder (Q4834873): Difference between revisions

From MaRDI portal
Changed an Item
CorrectionBot (talk | contribs)
Changed an Item
 
(8 intermediate revisions by 2 users not shown)
label / enlabel / en
bnlearn datasets
Hailfinder
description / endescription / en
A collection of various structure learning datasets from the Bayesian Network Repository with description files.
Dataset published at Zenodo repository.
Property / instance of
 
Property / instance of: Statistical data / rank
Normal rank
 
Property / author
 
Property / author: MaRDI / rank
Preferred rank
 
Property / author: MaRDI / qualifier
 
Property / Zenodo IDProperty / Zenodo ID
Property / copyright licenseProperty / copyright license
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI dataset profile / rank
Normal rank
 
Property / description
The bnlearn dataset collection consists of five structure learning datasets from the Bayesian Network Repository. These datasets are intended for studying causal discovery algorithms, with known or estimated ground truth structures and exclusively discrete variables. The Alarm dataset is a medical diagnosis dataset with 37 variables, including central venous pressure (CVP), pulmonary capillary wedge pressure (PCWP), and total peripheral resistance (TPR). The Asia dataset is a synthetic binary dataset modeling respiratory diseases, containing variables such as tuberculosis, lung cancer, smoking, and chest X-ray results. The Coronary dataset focuses on risk factors for coronary heart disease, including smoking, strenuous physical and mental work, systolic blood pressure, and family history. The Hailfinder dataset includes 56 meteorological variables related to hail forecasting, such as vertical motion, meso-alpha area, and satellite moisture contributions. The Lizards dataset contains three variables describing lizard species, perch height, and perch diameter. Each dataset provides a well-defined structure for evaluating causal discovery methods, making them valuable for Bayesian network research and algorithm development.
 
Property / description: The bnlearn dataset collection consists of five structure learning datasets from the Bayesian Network Repository. These datasets are intended for studying causal discovery algorithms, with known or estimated ground truth structures and exclusively discrete variables. The Alarm dataset is a medical diagnosis dataset with 37 variables, including central venous pressure (CVP), pulmonary capillary wedge pressure (PCWP), and total peripheral resistance (TPR). The Asia dataset is a synthetic binary dataset modeling respiratory diseases, containing variables such as tuberculosis, lung cancer, smoking, and chest X-ray results. The Coronary dataset focuses on risk factors for coronary heart disease, including smoking, strenuous physical and mental work, systolic blood pressure, and family history. The Hailfinder dataset includes 56 meteorological variables related to hail forecasting, such as vertical motion, meso-alpha area, and satellite moisture contributions. The Lizards dataset contains three variables describing lizard species, perch height, and perch diameter. Each dataset provides a well-defined structure for evaluating causal discovery methods, making them valuable for Bayesian network research and algorithm development. / rank
Normal rank
 
Property / DOI
 
Property / DOI: 10.5281/ZENODO.7676616 / rank
Normal rank
 
Property / description
 
This synthetic datset is set about severe summer hail in northeastern Colorado. It was introduced in Abramson et al. (1996). Task: The dataset can be used to study causal discovery algorithms. Summary: Size of dataset: 20,000 x 56 Task: Causal Discovery Problem Data Type: Categorical Data Dataset Scope: Standalone Dataset Ground Truth: Known Graph Temporal Structure: Static Data License: CC0 (generated for bnlearn) Missing Values: No Missing Data Missingness Statement: There are no missing values. Features: N07muVerMo: 10.7mu vertical motion (StrongUp, WeakUp, Neutral, and Down) SubjVertMo: subjective judgment of vertical motion (StrongUp, WeakUp, Neutral, and Down) QGVertMotion: quasigeostrophic vertical motion (StrongUp, WeakUp, Neutral, and Down) CombVerMo: combined vertical motion (StrongUp, WeakUp, Neutral, and Down) AreaMesoALS: area of meso-alpha (StrongUp, WeakUp, Neutral, and Down) SatContMoist: satellite contribution to moisture (VeryWet, Wet, Neutral, and Dry) RaoContMoist: reading at the forecast center for moisture (VeryWet, Wet, Neutral, and Dry) CombMoisture: combined moisture (VeryWet, Wet, Neutral, and Dry) AreaMoDryAir: area of moisture and dry air (VeryWet, Wet, Neutral, and Dry) VISCloudCov: visible cloud cover (Cloudy, PC, and Clear) RCloudCover: infrared cloud cover (Cloudy, PC, and Clear) CombClouds: combined cloud cover (Cloudy, PC, and Clear) CldShadeOth: cloud shading, other (Cloudy, PC, and Clear) AMInstabMt: AM instability in the mountains (None, Weak, and Strong) InsInMt: instability in the mountains (None, Weak, and Strong) WndHodograph: wind hodograph (DCVZFavor, StrongWest, Westerly, and Other) OutflowFrMt: outflow from mountains (None, Weak, and Strong) MorningBound: morning boundaries (None, Weak, and Strong) Boundaries: boundaries (None, Weak, and Strong) CldShadeConv: cloud shading, convection (None, Some, and Marked) CompPlFcst: composite plains forecast (IncCapDecIns, LittleChange, and DecCapIncIns) CapChange: capping change (Decreasing, LittleChange, and Increasing) LoLevMoistAd: low-level moisture advection (StrongPos, WeakPos, Neutral, and Negative) InsChange: instability change (Decreasing, LittleChange, and Increasing) MountainFcst: mountains (region 1) forecast (XNIL, SIG, and SVR) Date: date (May15_Jun14, Jun15_Jul1, Jul2_Jul15, Jul16_Aug10, Aug11_Aug20, and Aug20_Sep15) Scenario: scenario (A, B, C, D, E, F, G, H, I, J, and K) ScenRelAMCIN: scenario relevant to AM convective inhibition (AB and CThruK) MorningCIN: morning convective inhibition (None, PartInhibit, Stifling, and TotalInhibit) AMCINInScen: AM convective inhibition in scenario (LessThanAve, Average, and MoreThanAve) CapInScen: capping within scenario (LessThanAve, Average, and MoreThanAve) ScenRelAMIns: scenario relevant to AM instability (ABI, CDEJ, F, G, H, and K) LIfr12ZDENSd: LI from 12Z DEN sounding (LIGt0, N1GtLIGt_4, N5GtLIGt_8, and LILt_8) AMDewptCalPl: AM dewpoint calculations, plains (Instability, Neutral, and Stability) AMInsWliScen: AM instability within scenario (LessUnstable, Average, and MoreUnstable) InsSclInScen: instability scaling within scenario (LessUnstable, Average, and MoreUnstable) ScenRel34: scenario relevant to regions 2/3/4 (ACEFK, B, D, GJ, and HI) LatestCIN: latest convective inhibition (None, PartInhibit, Stifling, and TotalInhibit) LLIW: LLIW severe weather index (Unfavorable, Weak, Moderate, and Strong) CurPropConv: current propensity to convection (None, Slight, Moderate, and Strong) ScnRelPlFcst: scenario relevant to plains forecast (A, B, C, D, E, F, G, H, I, J, and K) PlainsFcst: plains forecast (XNIL, SIG, and SVR) N34StarFcst: regions 2/3/4 forecast (XNIL, SIG, and SVR) R5Fcst: region 5 forecast (XNIL, SIG, and SVR) Dewpoints: dewpoints (LowEverywhere, LowAtStation, LowSHighN, LowNHighS, LowMtsHighPl, HighEverywhere, and Other) LowLLapse: low-level lapse rate (CloseToDryAd, Steep, ModerateOrLe, and Stable) MeanRH: mean relative humidity (VeryMoist, Average, and Dry) MidLLapse: mid-level lapse rate (CloseToDryAd, Steep, and ModerateOrLe) MvmtFeatures: movement of features (StrongFront, MarkedUpper, OtherRapid, and NoMajor) RHRatio: relative humidity ratio (MoistMDryL, DryMMoistL, and Other) SfcWndShfDis: surface wind shifts and discontinuities (DenvCyclone, E_W_N, E_W_S, MovigFtorOt, DryLine, None, and Other) SynForcng: synoptic forcing (SigNegative, NegToPos, SigPositive, PosToNeg, and LittleChange) TempDis: temperature discontinuities (QStationary, Moving, None, and Other) WindAloft: wind aloft (LV, SWQuad, NWQuad, and AllElse) WindFieldMt: wind fields, mountains (Westerly and LVorOther) WindFieldPln: wind fields, plains (LV, DenvCyclone, LongAnticyc, E_NE, SEquad, and WidespdDnsl) Files: hailfinder.csv: dataset hailfinder.bif:Bayesian Network from (Scutari (2010), License CC BY-SA 3.0). The network was used for data generation in Abramson et al. (1996) and was designed on meteorological data and expert judgment, based on both experience and physical understanding.
Property / description: This synthetic datset is set about severe summer hail in northeastern Colorado. It was introduced in Abramson et al. (1996). Task: The dataset can be used to study causal discovery algorithms. Summary: Size of dataset: 20,000 x 56 Task: Causal Discovery Problem Data Type: Categorical Data Dataset Scope: Standalone Dataset Ground Truth: Known Graph Temporal Structure: Static Data License: CC0 (generated for bnlearn) Missing Values: No Missing Data Missingness Statement: There are no missing values. Features: N07muVerMo: 10.7mu vertical motion (StrongUp, WeakUp, Neutral, and Down) SubjVertMo: subjective judgment of vertical motion (StrongUp, WeakUp, Neutral, and Down) QGVertMotion: quasigeostrophic vertical motion (StrongUp, WeakUp, Neutral, and Down) CombVerMo: combined vertical motion (StrongUp, WeakUp, Neutral, and Down) AreaMesoALS: area of meso-alpha (StrongUp, WeakUp, Neutral, and Down) SatContMoist: satellite contribution to moisture (VeryWet, Wet, Neutral, and Dry) RaoContMoist: reading at the forecast center for moisture (VeryWet, Wet, Neutral, and Dry) CombMoisture: combined moisture (VeryWet, Wet, Neutral, and Dry) AreaMoDryAir: area of moisture and dry air (VeryWet, Wet, Neutral, and Dry) VISCloudCov: visible cloud cover (Cloudy, PC, and Clear) RCloudCover: infrared cloud cover (Cloudy, PC, and Clear) CombClouds: combined cloud cover (Cloudy, PC, and Clear) CldShadeOth: cloud shading, other (Cloudy, PC, and Clear) AMInstabMt: AM instability in the mountains (None, Weak, and Strong) InsInMt: instability in the mountains (None, Weak, and Strong) WndHodograph: wind hodograph (DCVZFavor, StrongWest, Westerly, and Other) OutflowFrMt: outflow from mountains (None, Weak, and Strong) MorningBound: morning boundaries (None, Weak, and Strong) Boundaries: boundaries (None, Weak, and Strong) CldShadeConv: cloud shading, convection (None, Some, and Marked) CompPlFcst: composite plains forecast (IncCapDecIns, LittleChange, and DecCapIncIns) CapChange: capping change (Decreasing, LittleChange, and Increasing) LoLevMoistAd: low-level moisture advection (StrongPos, WeakPos, Neutral, and Negative) InsChange: instability change (Decreasing, LittleChange, and Increasing) MountainFcst: mountains (region 1) forecast (XNIL, SIG, and SVR) Date: date (May15_Jun14, Jun15_Jul1, Jul2_Jul15, Jul16_Aug10, Aug11_Aug20, and Aug20_Sep15) Scenario: scenario (A, B, C, D, E, F, G, H, I, J, and K) ScenRelAMCIN: scenario relevant to AM convective inhibition (AB and CThruK) MorningCIN: morning convective inhibition (None, PartInhibit, Stifling, and TotalInhibit) AMCINInScen: AM convective inhibition in scenario (LessThanAve, Average, and MoreThanAve) CapInScen: capping within scenario (LessThanAve, Average, and MoreThanAve) ScenRelAMIns: scenario relevant to AM instability (ABI, CDEJ, F, G, H, and K) LIfr12ZDENSd: LI from 12Z DEN sounding (LIGt0, N1GtLIGt_4, N5GtLIGt_8, and LILt_8) AMDewptCalPl: AM dewpoint calculations, plains (Instability, Neutral, and Stability) AMInsWliScen: AM instability within scenario (LessUnstable, Average, and MoreUnstable) InsSclInScen: instability scaling within scenario (LessUnstable, Average, and MoreUnstable) ScenRel34: scenario relevant to regions 2/3/4 (ACEFK, B, D, GJ, and HI) LatestCIN: latest convective inhibition (None, PartInhibit, Stifling, and TotalInhibit) LLIW: LLIW severe weather index (Unfavorable, Weak, Moderate, and Strong) CurPropConv: current propensity to convection (None, Slight, Moderate, and Strong) ScnRelPlFcst: scenario relevant to plains forecast (A, B, C, D, E, F, G, H, I, J, and K) PlainsFcst: plains forecast (XNIL, SIG, and SVR) N34StarFcst: regions 2/3/4 forecast (XNIL, SIG, and SVR) R5Fcst: region 5 forecast (XNIL, SIG, and SVR) Dewpoints: dewpoints (LowEverywhere, LowAtStation, LowSHighN, LowNHighS, LowMtsHighPl, HighEverywhere, and Other) LowLLapse: low-level lapse rate (CloseToDryAd, Steep, ModerateOrLe, and Stable) MeanRH: mean relative humidity (VeryMoist, Average, and Dry) MidLLapse: mid-level lapse rate (CloseToDryAd, Steep, and ModerateOrLe) MvmtFeatures: movement of features (StrongFront, MarkedUpper, OtherRapid, and NoMajor) RHRatio: relative humidity ratio (MoistMDryL, DryMMoistL, and Other) SfcWndShfDis: surface wind shifts and discontinuities (DenvCyclone, E_W_N, E_W_S, MovigFtorOt, DryLine, None, and Other) SynForcng: synoptic forcing (SigNegative, NegToPos, SigPositive, PosToNeg, and LittleChange) TempDis: temperature discontinuities (QStationary, Moving, None, and Other) WindAloft: wind aloft (LV, SWQuad, NWQuad, and AllElse) WindFieldMt: wind fields, mountains (Westerly and LVorOther) WindFieldPln: wind fields, plains (LV, DenvCyclone, LongAnticyc, E_NE, SEquad, and WidespdDnsl) Files: hailfinder.csv: dataset hailfinder.bif:Bayesian Network from (Scutari (2010), License CC BY-SA 3.0). The network was used for data generation in Abramson et al. (1996) and was designed on meteorological data and expert judgment, based on both experience and physical understanding. / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.5281/zenodo.14793201 / rank
 
Normal rank
Property / author
 
Property / author: Mathematical Research Data Initiative / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI dataset profile / rank
 
Normal rank

Latest revision as of 10:23, 19 May 2025

Dataset published at Zenodo repository.
Language Label Description Also known as
English
Hailfinder
Dataset published at Zenodo repository.

    Statements

    0 references
    25 February 2023
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    This synthetic datset is set about severe summer hail in northeastern Colorado. It was introduced in Abramson et al. (1996). Task: The dataset can be used to study causal discovery algorithms. Summary: Size of dataset: 20,000 x 56 Task: Causal Discovery Problem Data Type: Categorical Data Dataset Scope: Standalone Dataset Ground Truth: Known Graph Temporal Structure: Static Data License: CC0 (generated for bnlearn) Missing Values: No Missing Data Missingness Statement: There are no missing values. Features: N07muVerMo: 10.7mu vertical motion (StrongUp, WeakUp, Neutral, and Down) SubjVertMo: subjective judgment of vertical motion (StrongUp, WeakUp, Neutral, and Down) QGVertMotion: quasigeostrophic vertical motion (StrongUp, WeakUp, Neutral, and Down) CombVerMo: combined vertical motion (StrongUp, WeakUp, Neutral, and Down) AreaMesoALS: area of meso-alpha (StrongUp, WeakUp, Neutral, and Down) SatContMoist: satellite contribution to moisture (VeryWet, Wet, Neutral, and Dry) RaoContMoist: reading at the forecast center for moisture (VeryWet, Wet, Neutral, and Dry) CombMoisture: combined moisture (VeryWet, Wet, Neutral, and Dry) AreaMoDryAir: area of moisture and dry air (VeryWet, Wet, Neutral, and Dry) VISCloudCov: visible cloud cover (Cloudy, PC, and Clear) RCloudCover: infrared cloud cover (Cloudy, PC, and Clear) CombClouds: combined cloud cover (Cloudy, PC, and Clear) CldShadeOth: cloud shading, other (Cloudy, PC, and Clear) AMInstabMt: AM instability in the mountains (None, Weak, and Strong) InsInMt: instability in the mountains (None, Weak, and Strong) WndHodograph: wind hodograph (DCVZFavor, StrongWest, Westerly, and Other) OutflowFrMt: outflow from mountains (None, Weak, and Strong) MorningBound: morning boundaries (None, Weak, and Strong) Boundaries: boundaries (None, Weak, and Strong) CldShadeConv: cloud shading, convection (None, Some, and Marked) CompPlFcst: composite plains forecast (IncCapDecIns, LittleChange, and DecCapIncIns) CapChange: capping change (Decreasing, LittleChange, and Increasing) LoLevMoistAd: low-level moisture advection (StrongPos, WeakPos, Neutral, and Negative) InsChange: instability change (Decreasing, LittleChange, and Increasing) MountainFcst: mountains (region 1) forecast (XNIL, SIG, and SVR) Date: date (May15_Jun14, Jun15_Jul1, Jul2_Jul15, Jul16_Aug10, Aug11_Aug20, and Aug20_Sep15) Scenario: scenario (A, B, C, D, E, F, G, H, I, J, and K) ScenRelAMCIN: scenario relevant to AM convective inhibition (AB and CThruK) MorningCIN: morning convective inhibition (None, PartInhibit, Stifling, and TotalInhibit) AMCINInScen: AM convective inhibition in scenario (LessThanAve, Average, and MoreThanAve) CapInScen: capping within scenario (LessThanAve, Average, and MoreThanAve) ScenRelAMIns: scenario relevant to AM instability (ABI, CDEJ, F, G, H, and K) LIfr12ZDENSd: LI from 12Z DEN sounding (LIGt0, N1GtLIGt_4, N5GtLIGt_8, and LILt_8) AMDewptCalPl: AM dewpoint calculations, plains (Instability, Neutral, and Stability) AMInsWliScen: AM instability within scenario (LessUnstable, Average, and MoreUnstable) InsSclInScen: instability scaling within scenario (LessUnstable, Average, and MoreUnstable) ScenRel34: scenario relevant to regions 2/3/4 (ACEFK, B, D, GJ, and HI) LatestCIN: latest convective inhibition (None, PartInhibit, Stifling, and TotalInhibit) LLIW: LLIW severe weather index (Unfavorable, Weak, Moderate, and Strong) CurPropConv: current propensity to convection (None, Slight, Moderate, and Strong) ScnRelPlFcst: scenario relevant to plains forecast (A, B, C, D, E, F, G, H, I, J, and K) PlainsFcst: plains forecast (XNIL, SIG, and SVR) N34StarFcst: regions 2/3/4 forecast (XNIL, SIG, and SVR) R5Fcst: region 5 forecast (XNIL, SIG, and SVR) Dewpoints: dewpoints (LowEverywhere, LowAtStation, LowSHighN, LowNHighS, LowMtsHighPl, HighEverywhere, and Other) LowLLapse: low-level lapse rate (CloseToDryAd, Steep, ModerateOrLe, and Stable) MeanRH: mean relative humidity (VeryMoist, Average, and Dry) MidLLapse: mid-level lapse rate (CloseToDryAd, Steep, and ModerateOrLe) MvmtFeatures: movement of features (StrongFront, MarkedUpper, OtherRapid, and NoMajor) RHRatio: relative humidity ratio (MoistMDryL, DryMMoistL, and Other) SfcWndShfDis: surface wind shifts and discontinuities (DenvCyclone, E_W_N, E_W_S, MovigFtorOt, DryLine, None, and Other) SynForcng: synoptic forcing (SigNegative, NegToPos, SigPositive, PosToNeg, and LittleChange) TempDis: temperature discontinuities (QStationary, Moving, None, and Other) WindAloft: wind aloft (LV, SWQuad, NWQuad, and AllElse) WindFieldMt: wind fields, mountains (Westerly and LVorOther) WindFieldPln: wind fields, plains (LV, DenvCyclone, LongAnticyc, E_NE, SEquad, and WidespdDnsl) Files: hailfinder.csv: dataset hailfinder.bif:Bayesian Network from (Scutari (2010), License CC BY-SA 3.0). The network was used for data generation in Abramson et al. (1996) and was designed on meteorological data and expert judgment, based on both experience and physical understanding.
    0 references

    Identifiers

    0 references