{"entities":{"Q6699604":{"pageid":14418577,"ns":120,"title":"Item:Q6699604","lastrevid":54708911,"modified":"2026-01-29T20:00:38Z","type":"item","id":"Q6699604","labels":{"en":{"language":"en","value":"MediaText: a media industry-based dataset for scene text detetcion"}},"descriptions":{"en":{"language":"en","value":"Dataset published at Zenodo repository."}},"aliases":{},"claims":{"P31":[{"mainsnak":{"snaktype":"value","property":"P31","hash":"dae155fd0809a7906855cd4fa50dd7d71bed552b","datavalue":{"value":{"entity-type":"item","numeric-id":56885,"id":"Q56885"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6699604$0317A555-9160-4333-B726-AEDA6EC1B55E","rank":"normal"}],"P1459":[{"mainsnak":{"snaktype":"value","property":"P1459","hash":"02be529e3cc87030348762c2ba03387507ffa278","datavalue":{"value":"Media-Text Media-Text dataset comprising images of banners, posters, covers and another images characterised for media industry. Full paper is available here: Media-Text: a Media Industry-Based Dataset for Scene Text Detection DATASET DESCRIPTION  400 images 7 744 annotated text instances 973 annotations have been marked as illegible for the task of text recognition 659 texts have been markes as do not care (###) for scene text detection. Images are represented by 193 unique resolutions.  Annotation Format - Each image has corresponding gt_*.txt file, which contains annotations in bounding box format (defined by 4 courners), transcription, and bool flag which determines that text is illegible for OCR. Proposed format is similar to ICDAR15 annotations. x1, x2, ..., x4, y4, transcription, OCR Flag Example:37,68,198,49,214,181,52,200,LADIES,False ACKNOWLEDGMENT This work was supported by the Silesian University of Technology (SUT) through the subsidy for maintaining and developing research potential grant in 2024 for young researchers, No. 2/070/BKM24/0058, and by the Ministry of Science and Higher Education \"Implementation Doctorate\" No. DWD/5/0511/2021. Thanks to the graphic department of media-press group for the preparation and possibility of sharing graphics thematically related to the prepared dataset.  LICENSE Annotations created by authors are licesned under CC-BY-4.0 license.Images from the Open-Image-V7 dataset and are licensed according to their source information. Source information is defined in a file metadata.csv file that defines all the metadata of each file (File name corresponds to the ImageID column). Images whose name corresponds to the media_press pattern are provided for academic use.  CITING THE RELATED WORKS    Please cite the related works in your publications if it helps your research:  ``` @inproceedings{inproceedings, author = {Kalisz, Seweryn and Marczyk, Micha\u0142 and Polanska, Joanna}, booktitle = {Modelling and simulation 2024. The 2024 European Simulation and Modelling Conference} editor = {Manuel Graa; J. David Nuez-Gonzalez} year = {2024}, month = {10}, pages = {138-144}, publisher = {EUROSIS-ETI}, title = {Media-Text: a Media Industry-Based Dataset for Scene Text Detection} } ```","type":"string"},"datatype":"string"},"type":"statement","id":"Q6699604$5B2FA93B-ED84-4CE9-B631-306FD4A1610A","rank":"normal"}],"P28":[{"mainsnak":{"snaktype":"value","property":"P28","hash":"46a5f992d4c06418175512c68a930d9d4d0bceaa","datavalue":{"value":{"time":"+2024-07-22T00:00:00Z","timezone":0,"before":0,"after":0,"precision":11,"calendarmodel":"http://www.wikidata.org/entity/Q1985727"},"type":"time"},"datatype":"time"},"type":"statement","id":"Q6699604$9DB88807-0375-41E8-858A-F5A90E6FCC42","rank":"normal"}],"P16":[{"mainsnak":{"snaktype":"value","property":"P16","hash":"2ab344bd06e08110231eaadb0f60d62736c98d37","datavalue":{"value":{"entity-type":"item","numeric-id":6699602,"id":"Q6699602"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6699604$78F2DFE3-8F47-4DBB-B486-491198FE7788","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"209e1d72810100314905bff143cfb13b60c8485d","datavalue":{"value":{"entity-type":"item","numeric-id":488370,"id":"Q488370"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6699604$F338C75A-827D-46FA-912E-C5E3CF28B30D","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"9db7dc84b2c09f716b743f7c75e3e0f82e257462","datavalue":{"value":{"entity-type":"item","numeric-id":6699603,"id":"Q6699603"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6699604$C250E951-AFBC-4D9F-84AB-A77544A8C0AF","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"f42f2ecfc6b1a11c7313aa82ebf264c246899204","datavalue":{"value":{"entity-type":"item","numeric-id":2314537,"id":"Q2314537"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6699604$C5CFE5D7-479A-4312-92FA-356D88D9EDB2","rank":"normal"}],"P227":[{"mainsnak":{"snaktype":"value","property":"P227","hash":"81f339d201c61ca2528d88be0ce893c90468d930","datavalue":{"value":"12796380","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q6699604$8E88E076-E4ED-4B1C-88C6-F4E5136FC90A","rank":"normal"}],"P27":[{"mainsnak":{"snaktype":"value","property":"P27","hash":"687e7ac1b7b7c3d4584c0ddc80e65fc3e7410f4f","datavalue":{"value":"10.5281/zenodo.12796380","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q6699604$42D03768-7463-401B-A2C4-EF7BE26EC8F2","rank":"normal"}],"P163":[{"mainsnak":{"snaktype":"value","property":"P163","hash":"45fcd4163b5f33e6e8c784f5522d7246c0a1a61e","datavalue":{"value":{"entity-type":"item","numeric-id":57056,"id":"Q57056"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6699604$B38B3AE0-574C-4E16-A3EB-0F3A2E65A0DC","rank":"normal"}],"P1460":[{"mainsnak":{"snaktype":"value","property":"P1460","hash":"d1e8073b72a070520efd3d14d4b3d2d3d03859e2","datavalue":{"value":{"entity-type":"item","numeric-id":5984635,"id":"Q5984635"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6699604$E3E902EA-CC04-4837-8317-47EBC85A9B22","rank":"normal"}]},"sitelinks":{"mardi":{"site":"mardi","title":"MediaText: a media industry-based dataset for scene text detetcion","badges":[],"url":"https://portal.mardi4nfdi.de/wiki/MediaText:_a_media_industry-based_dataset_for_scene_text_detetcion"}}}}}