Medium-Articles
OpenML43677MaRDI QIDQ6036772FDOQ6036772RO-CrateQ6036772
OpenML dataset with id 43677
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/22102502/Medium-Articles.arff
Upload date: 24 March 2022
Dataset Characteristics
Number of features: 5 (numeric: 1, symbolic: 0 and in total binary: 0 )
Number of instances: 337
Number of instances with missing values: 0
Number of missing values: 0
Context
Medium is one of the most famous tools for spreading knowledge about almost any field. It is widely used to published articles on ML, AI, and data science. This dataset is the collection of about 350 articles in such fields.
Content
The dataset contains articles, their title, number of claps it has received, their links and their reading time.
Acknowledgements
This dataset was scraped from Medium. I created a Python script to scrap all the required articles using just their tags from Medium. Check out the script here
Inspiration
How to write a good article? How to inform the reader in an interesting way? What sort of title attracts more crowd? How long an article should be?
ROCrate
What is a RO-Crate?
A RO-Crate is a standardized research object package used to bundle data together with rich machine-readable metadata. Each RO-Crate contains:
- the files belonging to the dataset (e.g. CSVs, images, code, documentation)
- a ro-crate-metadata.json file describing the content, provenance, and context
- persistent identifiers and references to related research objects (e.g. software, publications)
This ensures that the dataset can be easily reused, cited, validated, and interpreted in a reproducible manner. More information can be found here.
Download
You can download a RO-Crate for this dataset here: Download RO-Crate
HINT: The RO-Crate is created dynamically, so it could take up to 30 seconds until the downloads starts.
This page was built for dataset: Medium-Articles