The-Social-Dilemma-Tweets---Text-Classification

From MaRDI portal
Dataset:6036629



OpenML43532MaRDI QIDQ6036629

OpenML dataset with id 43532

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/22102357/The-Social-Dilemma-Tweets---Text-Classification.arff

Upload date: 23 March 2022


Dataset Characteristics

Number of features: 14 (numeric: 3, symbolic: 2 and in total binary: 2 )
Number of instances: 20,068
Number of instances with missing values: 8,638
Number of missing values: 10,429

Context The Social Dilemma, a documentary-drama hybrid explores the dangerous human impact of social networking, with tech experts sounding the alarm on their own creations as the tech experts sound the alarm on the dangerous human impact of social networking. Initial release: January 2020 Director: Jeff Orlowski Producer: Larissa Rhodes Music director: Mark A. Crawford Screenplay: Jeff Orlowski, Vickie Curtis, Davis Coombe Content This dataset brings you the twitter responses made with the TheSocialDilemma hashtag after watching the eye-opening documentary "The Social Dilemma" released in an OTT platform(Netflix) on September 9th, 2020. The dataset was extracted using TwitterAPI, consisting of nearly 10,526 tweets from twitter users all over the globe!


No Columns Descriptions



1 user_name The name of the user, as theyve defined it.


2 user_location The user-defined location for this accounts profile.


3 user_description The user-defined UTF-8 string describing their account.


4 user_created Time and date, when the account was created.


5 user_followers The number of followers an account currently has.


6 user_friends The number of friends an account currently has.


7 user_favourites The number of favorites a account currently has


8 user_verified When true, indicates that the user has a verified account


9 date UTC time and date when the Tweet was created


10 text The actual UTF-8 text of the Tweet


11 hashtags All the other hashtags posted in the tweet along with TheSocialDilemma


12 source Utility used to post the Tweet, Tweets from the Twitter website have a source value - web


13 is_retweet Indicates whether this Tweet has been Retweeted by the authenticating user.


14 Sentiment(Target variable) Indicates the sentiment of the tweet, consists of three categories: Positive, neutral, and negative


Inspiration You can use this data to dive into the subjects that use this hashtag, look to the geographical distribution, evaluate sentiments, looks to trends.