{"entities":{"Q6710101":{"pageid":14429051,"ns":120,"title":"Item:Q6710101","lastrevid":54665825,"modified":"2026-01-29T19:03:45Z","type":"item","id":"Q6710101","labels":{"en":{"language":"en","value":"Large-scale and fine-grained phenological stage annotation of herbarium specimens datasets"}},"descriptions":{"en":{"language":"en","value":"Dataset published at Zenodo repository."}},"aliases":{},"claims":{"P31":[{"mainsnak":{"snaktype":"value","property":"P31","hash":"dae155fd0809a7906855cd4fa50dd7d71bed552b","datavalue":{"value":{"entity-type":"item","numeric-id":56885,"id":"Q56885"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$CAB7EE95-1B04-4F45-8A7F-A1CF75EE6DB6","rank":"normal"}],"P1459":[{"mainsnak":{"snaktype":"value","property":"P1459","hash":"ea625b282321b5faaff9338790969185b836b459","datavalue":{"value":"This upload is constituted of four datasets of specimens from American herbaria covering different levels of information precision and different floras - from temperate to equatorial.  Three of these datasets consist of selected specimens from herbaria located in different geographic and environmental regions. Each specimen of these three datasets was annotated with the following fields: family, genus, species name, fertile / non-fertile, presence / absence of flower(s), presence / absence of fruit(s). The resulting dataset was composed of 163,233 herbarium specimens belonging to 7,782 species, 1,906 genera, and 236 families. Specimens were annotated as fertile if any reproductive structures were present, such as sporangia (ferns), cones (gymnosperms), flowers, or fruits (angiosperms). Non-fertile specimens were those that lacked any reproductive structures.  The fourth dataset consists of 20,371 herbarium specimens from 11 genera in the sunflower family (Asteraceae). 
The main difference in this dataset is that it is annotated with fine-grained phenophase scores rather than presence/absence attributes (see description below).  Each of these datasets is described below:      NEVP: this dataset of New England vascular plant (NEVP) specimens was produced by members of the Consortium of Northeastern Herbaria. The dataset comprises 42,658 digitized specimens that belong to 1,375 species and come from several North American institutions. Most of the specimens in this dataset are from the north-temperate region of the northeastern United States.      FSU: this dataset was produced by the Florida State University's Robert K. Godfrey Herbarium (FSU), a collection that focuses on northern Florida and the U.S. Southeast Coastal Plain, one of North America's biodiversity hotspots. This dataset contains 54,263 digitized herbarium specimen records that belong to 3,870 species, making it the taxonomically richest dataset in this study. Most species in this dataset grow under subtropical or warm temperate conditions in the southeastern region of the United States.      CAY: this dataset comes from the IRD's Herbarium of French Guiana (CAY). CAY is dedicated to the Guayana Shield flora, with a strong focus on tropical tree species. This dataset is composed of 66,312 herbarium specimens that belong to 3,024 species. All digitized specimens of this herbarium are accessible online. Most specimens were collected in the tropical rainforests of French Guiana, with the remaining specimens coming mostly from Suriname and Guyana.      PHENO: this dataset includes 20,371 herbarium specimens of 139 species in the Asteraceae produced in a study of phenological trends in the U.S. Southeast Coastal Plain. The dataset is composed of specimen records from 57 herbaria. Each recorded specimen was annotated for quartile percentages (0, 25, 50, 75, or 100%) of (i) closed buds, (ii) buds transformed into flowers, and (iii) fruits. 
According to the distribution of these three categories for each specimen, a phenophase code was computed.       Datasets format  These datasets are grouped in 3 tasks:    fertility detection  flowers and/or fruit detection  phenophase classification   The first 2 tasks are carried out on the first 3 datasets above and thus are based on the same set of images, unlike the third task, which has its own disjoint set of images. This is why the dataset is split into two separate files, one for each set of images.  Fertility detection  flower/fruit detection  These tasks are contained in the herbarium_fertility_annotations.zip archive. It consists of 3 files:    metadata.csv: general information about all the herbarium specimens for these tasks      id: specimen identifier   collection: which of NEVP, FSU or CAY the specimen comes from   herbarium: institution of origin of the specimen, especially for NEVP collection   clade, family, genus, species: classification of the specimen   URL: URL of the scan      fertility_task.csv: specific information regarding the fertility detection task     id: specimen identifier   is_fertile: True if the specimen has an expression of fertility, False otherwise   train_test_set: which subset the specimen belongs to; possible values are: train, random_test, species_test and herbarium_test      flower_fruit_task.csv: specific information regarding the flower/fruit detection task     id: specimen identifier, note that in this case not all the specimens described in metadata.csv are included in this task   has_flower: True if the specimen has at least one flower, False otherwise   has_fruit: True if the specimen has at least one fruit, False otherwise   train_test_set: which subset the specimen belongs to; possible values are: train, random_test, species_test and herbarium_test       Phenophase classification  This task is contained in the herbarium_asteraceae_phenophase_annotations.zip archive. 
It consists of a single file:    annotations.csv:      id: specimen identifier   URL: URL of the scan   genus: genus of the specimen   phenophase: integer from 1 to 9 describing the phenophase of the specimen   train_test_set: which subset the specimen belongs to; possible values are: train and test         Additional resources  More information can be found in the related paper: Lorieul, T., K. D. Pearson, E. R. Ellwood, H. Goëau, J.-F. Molino, P. W. Sweeney, J. M. Yost, J. Sachs, E. Mata-Montero, G. Nelson, P. S. Soltis, P. Bonnet, and A. Joly. 2019. Toward a large-scale and deep phenological stage annotation of herbarium specimens: Case studies from temperate, tropical, and equatorial floras. Applications in Plant Sciences 7(3): e1233.  For an example of usage of these datasets as well as a baseline, see: http://doi.org/10.5281/zenodo.2549996","type":"string"},"datatype":"string"},"type":"statement","id":"Q6710101$F0786139-B0EA-4228-ADD1-D0FDCE2CE6E2","rank":"normal"}],"P28":[{"mainsnak":{"snaktype":"value","property":"P28","hash":"131839c99eab3d4b3e32038ae3e6b5905496559f","datavalue":{"value":{"time":"+2019-01-25T00:00:00Z","timezone":0,"before":0,"after":0,"precision":11,"calendarmodel":"http://www.wikidata.org/entity/Q1985727"},"type":"time"},"datatype":"time"},"type":"statement","id":"Q6710101$C1746824-620D-4F63-A4E3-1DC201836D5B","rank":"normal"}],"P16":[{"mainsnak":{"snaktype":"value","property":"P16","hash":"af69a9cc91ac4dbfc0e6bc71549715bf96965d25","datavalue":{"value":{"entity-type":"item","numeric-id":6710090,"id":"Q6710090"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$8BBB013F-6581-4DC5-80F0-F0AFBB0CDF88","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"ffa9d3367758d1299df4b28d89bd98fb2fbda339","datavalue":{"value":{"entity-type":"item","numeric-id":6710091,"id":"Q6710091"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$0E195B74-E40D-41
52-BED8-4303DE6157AD","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"95c990186981d986ac33db43483bb8dc61e6d85b","datavalue":{"value":{"entity-type":"item","numeric-id":6710092,"id":"Q6710092"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$CE887115-F47D-4734-87FD-9B7C91D0B031","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"f52a51855cb2079cff72bc2a794be3eece0e861d","datavalue":{"value":{"entity-type":"item","numeric-id":6710093,"id":"Q6710093"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$FD4902B8-2E2B-48A6-B914-B022E253980F","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"be0d8fb80bf5b4840b3168cd0942bcc90c2a4e06","datavalue":{"value":{"entity-type":"item","numeric-id":6710094,"id":"Q6710094"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$BAC96F6C-8B8E-4717-89DC-3528B5850E9A","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"2529177b9902c6282d6f87aaf190391f8c2268ce","datavalue":{"value":{"entity-type":"item","numeric-id":6710095,"id":"Q6710095"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$21A56F6B-9979-43B0-864E-CF44A9BFE9DC","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"37ca84e66cd7d68a6c853df9f501e52c66f9a2f7","datavalue":{"value":{"entity-type":"item","numeric-id":6710096,"id":"Q6710096"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$8230E7A7-63A6-421A-A7F3-54160FB23225","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"098a9ea94b7b1dc5964253cf17a2f67269ac9ab0","datavalue":{"value":{"entity-type":"item","numeric-id":6710097,"id":"Q6710097"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$1C104AAA-79FC-4760-986F-51B43E1A3F5C","rank":"normal"},{"mainsnak"
:{"snaktype":"value","property":"P16","hash":"bfe0b276b6638563325c755b7cb017ad0d7d5c3e","datavalue":{"value":{"entity-type":"item","numeric-id":6710098,"id":"Q6710098"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$AD177B73-C2DB-4F65-A527-A3AF8B5F2858","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"0f470e9894789da217bc71d6e45eae71df598c5f","datavalue":{"value":{"entity-type":"item","numeric-id":6710099,"id":"Q6710099"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$7926C7C1-404D-4BFD-86D1-7C05FBF51DBE","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"36b69cc01b0d5ea534aa954f0522319fd0387f9b","datavalue":{"value":{"entity-type":"item","numeric-id":251603,"id":"Q251603"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$C5B8A66D-7FF0-4C2F-BE12-1D1D94115DF9","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"875923a969af504adb6ba8d69c1824f3ec152afa","datavalue":{"value":{"entity-type":"item","numeric-id":6682287,"id":"Q6682287"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$9CCD019C-D8C0-4468-99CC-BEC37505025A","rank":"normal"},{"mainsnak":{"snaktype":"value","property":"P16","hash":"1d3ea60da91b36e19733b7daf544425ed95f3a58","datavalue":{"value":{"entity-type":"item","numeric-id":6710100,"id":"Q6710100"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$03354661-44EF-4E5D-8235-F9445E97565F","rank":"normal"}],"P227":[{"mainsnak":{"snaktype":"value","property":"P227","hash":"992942cadec571c0031f87d4414828c86b77187e","datavalue":{"value":"2548630","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q6710101$96548C8A-96E4-4208-90D3-C238F77C2357","rank":"normal"}],"P27":[{"mainsnak":{"snaktype":"value","property":"P27","hash":"017bb143d6eb7581da1a673ef2c27abd471a212f","datavalu
e":{"value":"10.5281/zenodo.2548630","type":"string"},"datatype":"external-id"},"type":"statement","id":"Q6710101$917C0279-49DE-4FDA-AD8A-257AAA72A0C2","rank":"normal"}],"P163":[{"mainsnak":{"snaktype":"value","property":"P163","hash":"45fcd4163b5f33e6e8c784f5522d7246c0a1a61e","datavalue":{"value":{"entity-type":"item","numeric-id":57056,"id":"Q57056"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$F9F1698B-685F-4608-A4A4-2132978EDAB0","rank":"normal"}],"P1474":[{"mainsnak":{"snaktype":"value","property":"P1474","hash":"9828656774b0531597a635a46438869928ecc18d","datavalue":{"value":"1.0.0","type":"string"},"datatype":"string"},"type":"statement","id":"Q6710101$F79B1F59-DA0E-4207-BA75-BF70F7FA6668","rank":"normal"}],"P1460":[{"mainsnak":{"snaktype":"value","property":"P1460","hash":"d1e8073b72a070520efd3d14d4b3d2d3d03859e2","datavalue":{"value":{"entity-type":"item","numeric-id":5984635,"id":"Q5984635"},"type":"wikibase-entityid"},"datatype":"wikibase-item"},"type":"statement","id":"Q6710101$598E312E-AB2D-43F4-808A-95C694ED6575","rank":"normal"}]},"sitelinks":{"mardi":{"site":"mardi","title":"Large-scale and fine-grained phenological stage annotation of herbarium specimens datasets","badges":[],"url":"https://portal.mardi4nfdi.de/wiki/Large-scale_and_fine-grained_phenological_stage_annotation_of_herbarium_specimens_datasets"}}}}}