AgrImOnIA: Open Access dataset correlating livestock and air quality in the Lombardy region, Italy
DOI10.5281/zenodo.7956006Zenodo7956006MaRDI QIDQ6686053FDOQ6686053
Dataset published at Zenodo repository.
Philipp Otto, Alessandro Fassò, Alessandro Fusta Moro, Francesco Finazzi, Paolo Maranzano, Marco Vinciguerra, Jacopo Rodeschini, Michela Cameletti, Rosaria Ignaccolo, Qendrim Shaboviq, Natalia Golini
Publication date: 31 May 2023
Copyright license: Creative Commons Attribution 4.0 International
The AgrImOnIA dataset is a comprehensive dataset relating air quality and livestock (expressed as thedensity of bovines and swine bred) along with weather and other variables. The AgrImOnIA Dataset represents the first step of the AgrImOnIA project. The purpose of this dataset is to give the opportunity to assess the impact of agriculture on air quality in Lombardy through statistical techniques capable of highlighting the relationship between the livestock sector and air pollutants concentrations. The building process of the dataset is detailed in the companion paper: A. Fass, J. Rodeschini, A. Fusta Moro, Q. Shaboviq, P. Maranzano, M. Cameletti, F. Finazzi, N. Golini, R. Ignaccolo, and P. Otto(2023). Agrimonia: a dataset on livestock, meteorology and air quality in the Lombardy region, Italy.SCIENTIFIC DATA, 1-19. available here. This dataset is a collection of estimated daily values for a range of measurements of different dimensions as: air quality, meteorology, emissions, livestock animals and land use. Data are related to Lombardy and the surrounding area for2016-2021, inclusive. The surrounding area is obtained by applying a 0.3 buffer on Lombardy borders. The data uses several aggregation and interpolation methods to estimate the measurement for all days. The files in the record, renamed according to their version (es. .._v_3_0_0),are: Agrimonia_Dataset.csv(.mat and .Rdata) which is built by joining the daily time series related to the AQ, WE, EM, LI and LA variables. In order to simplify access to variables in the Agrimonia dataset, the variable name starts with the dimension of the variable, i.e., the name of the variables related to the AQ dimension start with AQ_. This file is archived also in theformat for MATLAB and R software. Metadata_Agrimonia.csv which provides further information about the Agrimonia variables: e.g. sources used, original names of the variables imported, transformations applied. Metadata_AQ_imputation_uncertainty.csv which contains the daily uncertainty estimate of the imputed observation for the AQ to mitigate missing data in the hourly time series. Metadata_LA_CORINE_labels.csv which contains the label and the description associated with the CLC class. Metadata_monitoring_network_registry.csv which contains all details about the AQ monitoring station used to build the dataset. Information about air quality monitoring stations include: station type, municipality code, environment type, altitude, pollutants sampled and other. Each row represents a single sensor. Metadata_LA_SIARL_labels.csv which contains the label and the description associated with the SIARL class. AGC_Dataset.csv(.mat and .Rdata)thatincludes daily data of almost all variables available inthe AgrimoniaDataset (excluding AQ variables)on anequidistant grid covering the Lombardy region and its surrounding area. The Agrimonia dataset can be reproduced usingthe code available at the GitHub page: https://github.com/AgrImOnIA-project/AgrImOnIA_Data UPDATE 31/05/2023- NEW RELEASE - V 3.0.0 A new version of the dataset is released:Agrimonia_Dataset_v_3_0_0.csv (.Rdata and .mat), where variableWE_rh_min, WE_rh_mean and WE_rh_maxhave been recomputed due to some bugs. In addition, two new columns are added, they areLI_pigs_v2 and LI_bovine_v2and represents the density of the pigs and bovine (expressed as animals per kilometer squared) of a square of size ~ 10 x 10 km centered at the station localisation. A new dataset is released: the Agrimonia Grid Covariates (AGC) that includes daily information for the period from 2016 to 2020 of almost all variables within the Agrimonia Dataset on a equidistant grid containing the Lombardy region and its surrounding area. The AGC does not include AQ variables as they come fromthe monitoring stations that are irregularly spread over the area considered. UPDATE 11/03/2023- NEW RELEASE - V 2.0.2 A new version of the dataset is released:Agrimonia_Dataset_v_2_0_2.csv (.Rdata), where variableWE_tot_precipitationhave been recomputed due to some bugs. A new version of the metadata is available:Metadata_Agrimonia_v_2_0_2.csv where the spatial resolution of the variable WE_precipitation_tis corrected. UPDATE 24/01/2023- NEW RELEASE - V 2.0.1 minor bug fixed UPDATE 16/01/2023- NEW RELEASE - V 2.0.0 A new version of the dataset is released, Agrimonia_Dataset_v_2_0_0.csv (.Rdata) and Metadata_monitoring_network_registry_v_2_0_0.csv.Some minor points have been addressed: Addedvalues for LA_land_use variable for Switzerland stations (in Agrimonia Dataset_v_2_0_0.csv) Deletedincorrect values for LA_soil_use variable for stations outside Lombardy region during 2018 (in Agrimonia Dataset_v_2_0_0.csv) Fixed duplicatesensors correspondingto the same pollutant within the samestation(inMetadata_monitoring_network_registry_v_2_0_0.csv)
This page was built for dataset: AgrImOnIA: Open Access dataset correlating livestock and air quality in the Lombardy region, Italy