Meta_Album_MD_MIX_Mini (Q6037282)

From MaRDI portal





OpenML dataset with id 44287
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Meta_Album_MD_MIX_Mini | OpenML dataset with id 44287 | |

    Statements

## **Meta-Album OmniPrint-MD-mix Dataset (Mini)**
***
The OmniPrint-MD-mix dataset consists of 28,240 images (128x128, RGB) from 706 categories. The images were synthesized with OmniPrint, with no further processing. The following OmniPrint synthesis parameters are fixed:

- font size: 192
- image size: 128
- strength of the random perspective transformation: 0.04
- left/right/top/bottom margins: each 20% of the image size
- strength of the pre-rasterization elastic transformation: 0.035
- random translation: activated both horizontally and vertically
- rotation: between -60 and 60 degrees
- horizontal shear: between -0.5 and 0.5
- brightness: between 0.8333 and 1.2
- contrast: between 0.8333 and 1.2
- color enhancement: between 0.8333 and 1.2

The other parameters vary between images. We designed 20 settings, each of which is used to synthesize 2 images. All images/textures consist of photos taken with a personal mobile phone.

### **Dataset Details**
![](https://meta-album.github.io/assets/img/samples/MD_MIX.png)

**Meta Album ID**: OCR.MD_MIX
**Meta Album URL**: [https://meta-album.github.io/datasets/MD_MIX.html](https://meta-album.github.io/datasets/MD_MIX.html)
**Domain ID**: OCR
**Domain Name**: Optical Character Recognition
**Dataset ID**: MD_MIX
**Dataset Name**: OmniPrint-MD-mix
**Short Description**: Character images with a specific set of nuisance parameters
**\# Classes**: 706
**\# Images**: 28240
**Keywords**: ocr
**Data Format**: images
**Image size**: 128x128

**License (original data release)**: CC BY 4.0
**License URL (original data release)**: [https://creativecommons.org/licenses/by/4.0/](https://creativecommons.org/licenses/by/4.0/)

**License (Meta-Album data release)**: CC BY 4.0
**License URL (Meta-Album data release)**: [https://creativecommons.org/licenses/by/4.0/](https://creativecommons.org/licenses/by/4.0/)

**Source**: OmniPrint
**Source URL**: https://github.com/SunHaozhe/OmniPrint

**Original Author**: Haozhe Sun
**Original contact**: sunhaozhe275940200@gmail.com

**Meta Album author**: Haozhe Sun
**Created Date**: 25 June 2021
**Contact Name**: Haozhe Sun
**Contact Email**: meta-album@chalearn.org
**Contact URL**: [https://meta-album.github.io/](https://meta-album.github.io/)

### **Cite this dataset**
```
@inproceedings{sun2021omniprint,
  title={OmniPrint: A Configurable Printed Character Synthesizer},
  author={Haozhe Sun and Wei-Wei Tu and Isabelle M Guyon},
  booktitle={Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1)},
  year={2021},
  url={https://openreview.net/forum?id=R07XwJPmgpl}
}
```

### **Cite Meta-Album**
```
@inproceedings{meta-album-2022,
  title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},
  author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},
  booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
  url={https://meta-album.github.io/},
  year={2022}
}
```

### **More**
For more information on the Meta-Album dataset, please see the [[NeurIPS 2022 paper]](https://meta-album.github.io/paper/Meta-Album.pdf)
For details on the dataset preprocessing, please see the [[supplementary materials]](https://openreview.net/attachment?id=70_Wx-dON3q&name=supplementary_material)
Supporting code can be found on our [[GitHub repo]](https://github.com/ihsaan-ullah/meta-album)
Meta-Album on Papers with Code [[Meta-Album]](https://paperswithcode.com/dataset/meta-album)

### **Other versions of this dataset**
[[Micro]](https://www.openml.org/d/44243)
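The fixed synthesis parameters quoted in the description can be gathered into one record, and the stated image count can be checked against the stated number of categories and settings. The following is a minimal sketch only: the class name, field names, and helper function are hypothetical labels chosen for illustration (they are not OmniPrint's actual option names), and it assumes that the 2 images per setting are synthesized for each of the 706 categories, which the description does not state explicitly but which matches the total of 28,240 images.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MdMixSynthesisParams:
    """Fixed values quoted in the dataset description.

    Field names are hypothetical; they are NOT OmniPrint's real option names.
    """
    font_size: int = 192
    image_size: int = 128                     # images are 128x128, RGB
    perspective_strength: float = 0.04        # random perspective transformation
    margin_fraction: float = 0.20             # left/right/top/bottom margins
    elastic_strength: float = 0.035           # pre-rasterization elastic transformation
    random_translation_x: bool = True
    random_translation_y: bool = True
    rotation_deg: tuple = (-60.0, 60.0)
    horizontal_shear: tuple = (-0.5, 0.5)
    brightness: tuple = (0.8333, 1.2)
    contrast: tuple = (0.8333, 1.2)
    color_enhancement: tuple = (0.8333, 1.2)


NUM_CLASSES = 706          # stated in the description
NUM_SETTINGS = 20          # stated in the description
IMAGES_PER_SETTING = 2     # stated; assumed here to apply per category
STATED_NUM_IMAGES = 28240  # stated in the description


def expected_image_count(num_classes: int = NUM_CLASSES,
                         num_settings: int = NUM_SETTINGS,
                         images_per_setting: int = IMAGES_PER_SETTING) -> int:
    """Images implied by classes x settings x images per setting."""
    return num_classes * num_settings * images_per_setting


if __name__ == "__main__":
    params = MdMixSynthesisParams()
    print(params)
    # Under the per-category assumption the product matches the stated total.
    print(expected_image_count() == STATED_NUM_IMAGES)
```

Under that assumption each category contributes 20 x 2 = 40 images, which agrees with the figures given in the description.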
    Ihsan Ullah
    30-09-2022
    28 October 2022
    CATEGORY
    9a1d7251bc9917ef30c23ef40c80074c
    1
    0
    69
    28,240
    665,053
    46

    Identifiers
