MADOC: Multi-Platform Aggregated Dataset of Online Communities

From MaRDI portal
Dataset:6702622



DOI10.5281/zenodo.14637314Zenodo14637314MaRDI QIDQ6702622FDOQ6702622

Dataset published at Zenodo repository.

Darja Cvetkovic, Aleksandar Bogojević, Aleksandar Tomašević, Miroslav Andjelkovic, Marija Mitrović, Dusan Vudragovic, Boris Stupovski, Slobodan Maletic, Sara Major, Ana Vranic

Publication date: 13 January 2025

Copyright license: Creative Commons Attribution 4.0 International



The Multi-platform Aggregated Dataset of Online Communities (MADOC) is a comprehensive dataset that facilitates computational social science research by providing a unified, standardized dataset for cross-platform analysis of online social dynamics. MADOC aggregates and standardizes data from four distinct platforms: Bluesky, Koo, Reddit, and Voat, spanning from 2012 to 2024. The dataset includes 18.9 million posts, 236 million comments, and data from 23.1 million unique users across all platforms, with a particular focus on understanding community dynamics, user migration patterns, and the evolution of toxic behavior across platforms. By providing standardized data structures and FAIR-compliant access through Zenodo, MADOC enables researchers to conduct comparative analyses of user behavior, interaction networks, and content sentiment across diverse social media environments. The dataset's unique value lies in its cross-platform scope, standardized structure, and rich metadata, making it particularly suitable for studying societal phenomena such as community formation, toxic behavior propagation, and user migration patterns in response to platform moderation policies.







This page was built for dataset: MADOC: Multi-Platform Aggregated Dataset of Online Communities