Meta_Album_PRT_Micro

From MaRDI portal
Revision as of 12:27, 16 April 2024 by Import240416010454 (created automatically from import240416010454)

Dataset:6037273



OpenML: 44278 | MaRDI QID: Q6037273 | FDO: Q6037273 | RO-Crate: Q6037273

OpenML dataset with id 44278

Ihsan Ullah

Full work available at URL: https://api.openml.org/data/v1/download/22110978/Meta_Album_PRT_Micro.arff

Upload date: 28 October 2022
Copyright license: Creative Commons Attribution-NonCommercial 4.0 International



Dataset Characteristics

Number of classes: 20
Number of features: 3 (numeric: 1, symbolic: 0, binary: 0)
Number of instances: 800
Number of instances with missing values: 800
Number of missing values: 800

Meta-Album Subcellular Human Protein Dataset (Micro)

This dataset is a subset of the Subcellular dataset in the Protein Atlas project (https://www.proteinatlas.org/). The original dataset, which stems from the Human Protein Atlas Image Classification Kaggle competition (https://www.kaggle.com/competitions/human-protein-atlas-image-classification), comprises 31 072 RGBY images of size 512x512 px, each of which belongs to one or more of 28 classes. The labels correspond to protein organelle localizations. For Meta-Album, we made two modifications: (1) to turn the dataset into a multi-class dataset, we dropped all images belonging to more than one class, as well as images belonging to classes with fewer than 40 members; (2) we converted the remaining images to RGB by dropping the yellow channel, which was also common practice in the competition. Finally, as for all datasets in Meta-Album, the images were resized to 128x128 px.
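The two modifications described above can be illustrated with a minimal Python sketch. It is not the Meta-Album preprocessing code: the record layout `(image_id, labels)`, the function names, and the R, G, B, Y channel order are assumptions made for illustration, and the sketch assumes class sizes are counted after multi-label images have been dropped. Resizing to 128x128 is omitted because it would need an image library.

```python
from collections import Counter


def filter_single_label(records, min_members=40):
    """Keep images with exactly one label whose class has >= min_members images.

    `records` is a list of (image_id, labels) pairs (hypothetical layout).
    Assumption: class sizes are counted after multi-label images are dropped.
    """
    single = [(img, labels[0]) for img, labels in records if len(labels) == 1]
    counts = Counter(label for _, label in single)
    return [(img, label) for img, label in single if counts[label] >= min_members]


def drop_yellow(pixel):
    """Convert one RGBY pixel (r, g, b, y) to RGB by discarding the yellow channel."""
    r, g, b, _y = pixel
    return (r, g, b)


# Toy data: "b" is multi-label, and class 5 has too few members for min_members=2.
records = [("a", [0]), ("b", [0, 3]), ("c", [0]), ("d", [5])]
kept = filter_single_label(records, min_members=2)
rgb = drop_yellow((10, 20, 30, 40))
```

With the toy records, only images "a" and "c" survive; the dataset description uses a threshold of 40 rather than 2.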


Dataset Details


Meta Album ID: MCR.PRT
Meta Album URL: https://meta-album.github.io/datasets/PRT.html
Domain ID: MCR
Domain Name: Microscopy
Dataset ID: PRT
Dataset Name: Subcellular Human Protein
Short Description: Subcellular protein patterns in human cells
# Classes: 20
# Images: 800
Keywords: human protein, subcellular
Data Format: images
Image size: 128x128

License (original data release): CC BY-SA 3.0
License URL (original data release): https://www.proteinatlas.org/about/licence

License (Meta-Album data release): CC BY-SA 3.0
License URL (Meta-Album data release): https://www.proteinatlas.org/about/licence

Source: The Human Protein Atlas
Source URL: https://proteinatlas.org, https://www.kaggle.com/c/human-protein-atlas-image-classification

Original Author: Peter J Thul, Lovisa Akesson, Mikaela Wiking, Diana Mahdessian, Aikaterini Geladaki, Hammou Ait Blal, Tove Alm, Anna Asplund, Lars Bjork, Lisa Breckels, and others
Original contact: contact@proteinatlas.org

Meta Album author: Felix Mohr
Created Date: 01 June 2022
Contact Name: Felix Mohr
Contact Email: meta-album@chalearn.org
Contact URL: https://meta-album.github.io/


Cite this dataset

```
@article{thul2017subcellular,
  title={A subcellular map of the human proteome},
  author={Thul, Peter J and Akesson, Lovisa and Wiking, Mikaela and Mahdessian, Diana and Geladaki, Aikaterini and Ait Blal, Hammou and Alm, Tove and Asplund, Anna and Bjork, Lars and Breckels, Lisa M},
  journal={Science},
  volume={356},
  number={6340},
  year={2017},
  publisher={American Association for the Advancement of Science}
}
```


Cite Meta-Album

```
@inproceedings{meta-album-2022,
  title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},
  author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},
  booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
  url={https://meta-album.github.io/},
  year={2022}
}
```


More

For more information on the Meta-Album dataset, please see the [NeurIPS 2022 paper].
For details on the dataset preprocessing, please see the [supplementary materials].
Supporting code can be found on our [GitHub repo].
Meta-Album on Papers with Code: [Meta-Album].


Other versions of this dataset

[Mini] [Extended]





RO-Crate

What is a RO-Crate?

A RO-Crate is a standardized research object package used to bundle data together with rich machine-readable metadata. Each RO-Crate contains:

  • the files belonging to the dataset (e.g. CSVs, images, code, documentation)
  • a ro-crate-metadata.json file describing the content, provenance, and context
  • persistent identifiers and references to related research objects (e.g. software, publications)

This ensures that the dataset can be easily reused, cited, validated, and interpreted in a reproducible manner. More information can be found here.
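As a rough illustration of how the `ro-crate-metadata.json` file can be read, the Python sketch below locates the root dataset entity by following the metadata descriptor's `about` reference, as laid out in the RO-Crate 1.1 specification. The embedded JSON is a hypothetical, heavily reduced example, not the actual metadata of this crate, and `find_root_entity` is a helper name introduced here.

```python
import json

# Hypothetical, minimal metadata document; a real crate lists many more entities.
EXAMPLE_METADATA = """
{
  "@context": "https://w3id.org/ro/crate/1.1/context",
  "@graph": [
    {"@id": "ro-crate-metadata.json", "@type": "CreativeWork",
     "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
     "about": {"@id": "./"}},
    {"@id": "./", "@type": "Dataset", "name": "Meta_Album_PRT_Micro"}
  ]
}
"""


def find_root_entity(metadata):
    """Return the root data entity referenced by the metadata descriptor."""
    # Index all entities in the flattened JSON-LD graph by their identifier.
    entities = {entity["@id"]: entity for entity in metadata["@graph"]}
    descriptor = entities["ro-crate-metadata.json"]
    return entities[descriptor["about"]["@id"]]


root = find_root_entity(json.loads(EXAMPLE_METADATA))
print(root["name"])
```

For a downloaded crate, the same function could be applied to the result of `json.load` on the crate's own `ro-crate-metadata.json`, whose entities and properties will differ from this reduced example.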

Download

You can download a RO-Crate for this dataset here: Download RO-Crate

HINT: The RO-Crate is created dynamically, so it can take up to 30 seconds until the download starts.

