Goodreads-Computer-Books

From MaRDI portal
Dataset:6036878



OpenML43785MaRDI QIDQ6036878

OpenML dataset with id 43785

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/22102610/Goodreads-Computer-Books.arff

Upload date: 24 March 2022


Dataset Characteristics

Number of classes: 0
Number of features: 9 (numeric: 6, symbolic: 0 and in total binary: 0 )
Number of instances: 1,234
Number of instances with missing values: 0
Number of missing values: 0

Context The reason for creating this dataset is the requirement of a good clean dataset of computer books. I had searched for datasets on books in Kaggle and I found out that while most of the datasets had a good amount of books listed, there were either major columns missing or grossly unclean data. I mean, you can't determine how good a book is just from a few text reviews. So I collected this data from the Goodreads website from the "Computer" category to help people who are like this type of book. Acknowledgements This data was entirely scraped via the Webdriver Inspiration The reason behind creating this dataset is pretty straightforward, I'm listing the books for all who need computer books, irrespective of the language and publication and all of that. So go ahead and use it to your liking, find out what book you should be reading next, all possible approaches to exploring this dataset are welcome. I started creating this dataset on Jan 18, 2021, and intend to update it frequently. P.S. If you like this, please don't forget to give an upvote! Notes The missing values are imputed in this data by the creator.