Texas-Winter-Storm-2021-Tweets

From MaRDI portal
Dataset:6036775



OpenML: 43680, MaRDI QID: Q6036775, FDO: Q6036775, RO-Crate: Q6036775

OpenML dataset with id 43680

Author name not available

Full work available at URL: https://api.openml.org/data/v1/download/22102505/Texas-Winter-Storm-2021-Tweets.arff

Upload date: 24 March 2022



Dataset Characteristics

Number of features: 14 (numeric: 4, symbolic: 0, binary: 0)
Number of instances: 23,358
Number of instances with missing values: 23,197
Number of missing values: 53,699
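The three counts above can be recomputed once the ARFF file has been parsed into rows. The following is a minimal sketch, assuming that missing entries arrive as `None`, as the ARFF `?` marker, or as float NaN, depending on the parser used. The helper names `is_missing` and `summarize_missing` and the toy rows are illustrative and are not part of the dataset or of OpenML.

```python
import math
from typing import Any, Iterable, Sequence


def is_missing(value: Any) -> bool:
    """Treat None, the ARFF '?' marker, and float NaN as missing (an assumption)."""
    if value is None or value == "?":
        return True
    return isinstance(value, float) and math.isnan(value)


def summarize_missing(rows: Iterable[Sequence[Any]]) -> dict:
    """Count instances, instances with at least one missing value, and missing values."""
    summary = {"instances": 0, "instances_with_missing": 0, "missing_values": 0}
    for row in rows:
        missing_in_row = sum(1 for value in row if is_missing(value))
        summary["instances"] += 1
        summary["instances_with_missing"] += 1 if missing_in_row else 0
        summary["missing_values"] += missing_in_row
    return summary


# Hypothetical toy rows, not taken from the dataset.
toy_rows = [
    ("tweet a", 5200.0, None),
    ("tweet b", "?", float("nan")),
    ("tweet c", 100.0, "Austin"),
]
summary = summarize_missing(toy_rows)
print(summary)
```

Run on the full file, the same function should yield figures comparable to those listed above, provided the parser's notion of a missing value matches the one OpenML uses.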

Context

Winter Storm Uri in February 2021 caused havoc across the United States, and in Texas in particular, bringing mass power outages, water and food shortages, and dangerous weather conditions. This dataset consists of more than 23,000 tweets posted during the crisis week. The data is filtered to include mostly tweets from influencers (users with more than 5,000 followers), although a small subset of tweets from other users is included as well.

My notebook: https://www.kaggle.com/rajsengo/eda-texas-winterstrom-2021-tweets

Acknowledgements

https://www.kaggle.com/gpreda/pfizer-vaccine-tweets - for the inspiration
https://github.com/dataquestio/twitter-scrape - reference utility for scraping Twitter

Inspiration

Apply NLP techniques to understand user sentiment about the crisis management.





RO-Crate

What is an RO-Crate?

An RO-Crate is a standardized research object package used to bundle data together with rich machine-readable metadata. Each RO-Crate contains:

  • the files belonging to the dataset (e.g. CSVs, images, code, documentation)
  • a ro-crate-metadata.json file describing the content, provenance, and context
  • persistent identifiers and references to related research objects (e.g. software, publications)

This ensures that the dataset can be easily reused, cited, validated, and interpreted in a reproducible manner. More information can be found here.
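As a rough illustration of how the metadata file ties the package together, the sketch below parses a small ro-crate-metadata.json document and follows the metadata descriptor's `about` link to the root dataset entity. The JSON shown is a hand-written example in the general shape of RO-Crate 1.1 metadata; the crate actually generated for this dataset may contain different entities and properties. The function names `find_root_dataset` and `list_parts` are hypothetical helpers.

```python
import json

# Illustrative metadata only; the actual crate for this dataset may differ.
EXAMPLE_METADATA = """
{
  "@context": "https://w3id.org/ro/crate/1.1/context",
  "@graph": [
    {
      "@id": "ro-crate-metadata.json",
      "@type": "CreativeWork",
      "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
      "about": {"@id": "./"}
    },
    {
      "@id": "./",
      "@type": "Dataset",
      "name": "Texas-Winter-Storm-2021-Tweets",
      "hasPart": [{"@id": "Texas-Winter-Storm-2021-Tweets.arff"}]
    },
    {
      "@id": "Texas-Winter-Storm-2021-Tweets.arff",
      "@type": "File"
    }
  ]
}
"""


def find_root_dataset(metadata: dict) -> dict:
    """Return the entity that the metadata descriptor's 'about' property points to."""
    entities = {entity["@id"]: entity for entity in metadata["@graph"]}
    descriptor = entities["ro-crate-metadata.json"]
    return entities[descriptor["about"]["@id"]]


def list_parts(root: dict) -> list:
    """Return the identifiers of the files listed under the root's 'hasPart'."""
    return [part["@id"] for part in root.get("hasPart", [])]


metadata = json.loads(EXAMPLE_METADATA)
root = find_root_dataset(metadata)
parts = list_parts(root)
print(root["name"], parts)
```

Looking up entities by `@id` keeps the sketch independent of the order in which they appear in `@graph`.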

Download

You can download an RO-Crate for this dataset here: Download RO-Crate

HINT: The RO-Crate is created dynamically, so it can take up to 30 seconds for the download to start.

