Skip to content

Introduction

To be able to harvest dataset metadata from a repository a machine acionable way to access the metadata is needed. To avoid a researchdata.se-specific method we recomends estabilised standards like DataCite XML via OAI-PMH and/or schema.org json-ld one the landing page via sitemap.xml.

Harvesting flow

(work in progress)

graph
  RDA[(RDA \n elasticsearch)]
  RDAC[researchdata.se]

  DS[(DataCite)]
  DSOAI[DataCite\nOAI-PMH endpoint]
  DSAPI[DataCite API]
  DS --> DSOAI
  DSAPI --> DS

  DSOAI --> RDAH


  RP1[(Repository 1)]
  OAI1[OAI-PMH endpoint]

  RP3[(Repository 3)]
  RP3 --> DSAPI

  RP4[(Repository 4)]
  RP4 --> DSAPI

  RP2[(Repository 2)]
  LDSM2[sitemap.xml]
  LDLP2[Landing page \nJsonLD]

  RDAH[RDA harvester]

  RP1 --> OAI1
  OAI1 --> RDAH

  RP2 --> LDSM2
  LDSM2 --> LDLP2
  LDSM2 --> RDAH
  LDLP2 --> RDAH

  RDAH --> RDA
  RDA --> RDAC