Datasets for "Auditing Elon Musk's Impact on Hate Speech and Bots"

From MaRDI portal
(Redirected from Dataset:6707472)



DOI10.5281/zenodo.10407968Zenodo10407968MaRDI QIDQ6707472FDOQ6707472

Dataset published at Zenodo repository.

Paul E. Smaldino, Goran Muric, Daniel Fessler, Daniel Hickey, Keith Burghardt, Matheus Schmitz

Publication date: 19 December 2023

Copyright license: Creative Commons Attribution 4.0 International



Datasets for the publication "Auditing Elon Musk's Impact on Hate Speech and Bots" [1]. File information: baseline_tweet_ids_2022.csv, hate_tweet_ids_2022.csv: List of IDs and their corresponding dates from the "baseline" and "hate" samples of tweets used in the publication, respectively.The former is used to create the number of baseline tweets each day (baseline_freq.csv) while the latter is used to create the number of hate tweets each day (hate_freq.csv'). We share the date a tweet was made as well as its tweet ID from which you can find the original tweets URL with the help of this web page. As you explore these data, you may notice in a minority of cases hate tweets that are not hateful or, alternatively, baseline tweets that are hateful. This is a product of our filtering method used to collect and analyze tweets at scale. We always look forward to hearing your suggestions to improve the tweet filtering process. baseline_freq.csv, hate_freq.csv: Number of collected tweets per day for the baseline and hate samples, respectively. The file 'freq_data.py' is used to calculate these frequencies from the raw data. Feel free to consult this if you have questions about how the frequencies are calculated (or if you want to change how the data are aggregated).Use these to recreate Figure 2 from Hickey et al [1]. user_hate_levels_per_day.csv: CSV file with dates (YYYY-MM-DD format) and the mean proportion of slurs used by hateful users each day from October 1st to November 30th, 2022.Use these data to recreate Figure 1 from Hickey et al [1].See the Methods section of Hickey et al [1]. for details. hate_keywords.txt: Words used to query the Twitter Academic API for hate tweets. unfiltered_tweets_containing_hate_words.csv: All tweets with hate words collected with values for Perspective API attributes. Reference: 1. Hickey, D., Schmitz, M., Fessler, D.M.T, Smaldino, P., Muric, G., Burghardt, K. Auditing Elon Musk's Impact on Hate Speech and Bots. In Proceedings of the 17th International AAAI Conference on Web and Social Media, (2023).







This page was built for dataset: Datasets for "Auditing Elon Musk's Impact on Hate Speech and Bots"