User:Daniel/Queries/Graph profiling

From MaRDI portal

About

This page assists in the collection and curation of queries that

  • can be used with no or minimal modification on multiple graphs
  • provide insights into the graph structure

Generic queries

Number of triples in the graph

SELECT (COUNT(*) AS ?count) WHERE {
  ?s ?p ?o
}
Try it!

Number of distinct entities in the graph

Number of distinct relationships in the graph

Examples of distinct relationships in the graph

SELECT DISTINCT ?p WHERE {
 ?s ?p ?o .
}
LIMIT 100
Try it!

Types of entities in the graph

This requires what the graph's local equivalent of "instance of" is. For instance, in the EU knowledge graph, this would be P35, leading to the following query.'

SELECT (count(DISTINCT ?type ) as ?count) WHERE {
  ?s <https://linkedopendata.eu/entity/P35> ?type . 
}

External entities equivalent to entities in the graph

External entities equivalent to relationships in the graph

Entities with the largest number of triples

Entities with the lowest number of triples

Languages used in the graph

Queries for Wikibase instances

# Properties by frequency in a random sample 
SELECT DISTINCT ?property ?propertyLabel (COUNT(*) AS ?count) WHERE {
  SERVICE bd:sample { ?item ?p [ ] . bd:serviceParam bd:sample.limit 1000 }
  
  ?property a wikibase:Property;
              wikibase:claim ?p.
  SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
}
GROUP BY ?property ?propertyLabel
ORDER BY DESC(?count)
Try it!

Queries for a specific set of KGs

* Relative coverage of zbMATH authors in MaRDI vs. Wikidata

See also

* NFDI Core Ontology * SPARQL Endpoint Metadata Check ** all fine with https://sparql.uniprot.org/ ** warnings for https://query.portal.mardi4nfdi.de/proxy/wdqs/bigdata/namespace/wdq/sparql ** warnings for https://database.factgrid.de/sparql * Vocabulary of Interlinked Datasets (VoID) ** VoID vocabulary definition ** Describing Linked Datasets with the VoID Vocabulary ** VoID generator * LLM-based SPARQL Query Generation from Natural Language over Federated Knowledge Graphs