Project:MilestonesMeeting/20240223

From MaRDI portal

MaRDI TA5 Milestones Meeting 23.02.2024 @ ZIB

Goals of the meeting

  • We have an idea about how to reach the official milestones
  • Everybody is aware of the personal milestones
  • Some (all) technical points are discussed / solved
  • We have a plan about how to have better documentation

Agenda

  1. Welcome
  2. Mission clarification
    1. What IS our mission in TA5? Connect papers/software AND data-sets?
      --> Make it easy to access and find the data produced by MaRDI TAs 1-4
  3. Milestone Planning
    1. What are our 2024 goals for MaRDI?
      --> Bring in content from TAs 1-4
    2. What are the official 2024 milestones?
    3. Who is doing what? (-> Personal Milestone Planning) PART1 - Presentations
  4. Open (technical) topics (see below)
  5. Documentation
    1. How to improve internal documentation?
    2. How to improve documentation for external?
  6. Outreach (to other SFBs, Math+, Libraries, ...)
  7. [If time permits] Who is doing what? (-> Personal Milestone Planning) PART2 - Refinement

(Technical) Topics to discuss

  • How to define items?
    • Formulae
      • Which "instance of" to use?
      • Which properties to use?
    • Papers
      • Which "instance of" to use? ("scholary article"?)
      • Current way of selecting papers in SPARQL queries by "has zbMath ID"?
      • How to link to a paper, as in "This data-set / software was used in this paper" (Now: in software-item we use "is described in" and in )
      • How to link from a paper, as in "cites software"? / "uses dataset"?
    • Datasets
      • Which "instance of" to use?
      • Which properties to use?
      • How to link to a paper, as in "was used in paper"? (Is this necessary?)
    • Software items (How can we query all of them - "instance of X" - what is X?)
      • Which "instance of" to use?
      • "instance of software" is violating the WikiData hierarchy? (Software is quite high-level)
  • arXiv Importer [ELOI]
    • What is the plan?
      • Import of formulae (can we use an LLM to describe a particular formula? parameters etc.?)
      • Import of paper-meta-data? (->Disambiguation)
    • What is the status?
  • LLMs for MaRDI portal [ELOI/MORITZ/LITESH]
    • What is the overall plan?
    • What is the status?
      • Chat-Bot (LLM to query the portal)
  • How to integrate more of the cool Scholia stuff? (Simple example: number of citations of a paper, see e.g. https://scholia.portal.mardi4nfdi.de/work/Q25938997)
  • Zenodo importer (for Math+ integration)
    • What is the plan?
  • Workflows for periodic updates (for any source we have)
  • zbMath MSC Keyword import? (We only have the IDs)
  • Wikidata graph split
  • Licensing
  • Author disambiguation
  • OKMaps
  • environmental footprint