PSDI Vocabulary Description
Summary
The PSDI vocabulary is part of the metadata for the Physical Sciences Data Infrastructure (PSDI) and it defines key terms used within the project, infrastructure and platform. The latest released version is available at: https://metadata.psdi.ac.uk/psdi-voc.jsonld.
This document outlines guidelines specific to this vocabulary profile in the PSDI metadata - there are general guidelines for the PSDI metadata as a whole in PSDI Metadata General Guidelines which will not be repeated here.
This vocabulary corresponds to the "Core Controlled Vocabularies" profile in Cross-Domain Interoperability Framework.
A human-readable version of the definitions in the "PSDI top-level terms collection" is available at PSDI Terminology.
Purpose of PSDI vocabulary
- consistent vocabulary for the PSDI team
- understanding of terminology for PSDI contributors and users
- discoverability and interpretation of PSDI applications and components for machines and indexes
What is included?
The initial scope of the SKOS vocabulary is to define the "PSDI top-level terms collection", whose terms all belong to the skos:Collection with @id [psdiVoc:psdiOverviewCollection].
Background reading
- Cross-Domain Interoperability Framework
- SKOS Simple Knowledge Organization System Primer
- SKOS Simple Knowledge Organization System Reference
- "Ten simple rules for making a vocabulary FAIR" Cox SJD, Gonzalez-Beltran AN, Magagna B, Marinescu MC (2021) Ten simple rules for making a vocabulary FAIR. PLOS Computational Biology 17(6): e1009041. https://doi.org/10.1371/journal.pcbi.1009041
- Check against A checklist for complete vocabulary metadata
Overall Format
- The output is a SKOS Simple Knowledge Organization System thesaurus in JSON-LD format.
- It is a thesaurus rather than a controlled vocabulary or taxonomy because it contains more information about each concept including: preferred and alternative terms, and relationships to related concepts.
- According to the SKOS schema, each defined term is presented as a skos:Concept within the skos:ConceptScheme which represents the whole vocabulary.
- The default namespace is ["psdiVoc": "http://metadata.psdi.ac.uk/psdi-voc#"].
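As a rough illustration of the format described above, the following sketch builds a minimal JSON-LD document containing one skos:ConceptScheme and one skos:Concept. The identifiers psdiVoc:Vocabulary and psdiVoc:psdiPlatform and the psdiVoc namespace are taken from this document; the label and definition text are placeholders, and the overall layout (a flat @graph) is an assumption rather than the structure of the released file.

```python
import json

# Namespace prefixes: "psdiVoc" is the default namespace stated in this
# document; "skos" is the standard W3C SKOS namespace.
CONTEXT = {
    "psdiVoc": "http://metadata.psdi.ac.uk/psdi-voc#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
}


def build_skeleton():
    """Return a minimal JSON-LD document with one scheme and one concept."""
    scheme = {
        "@id": "psdiVoc:Vocabulary",
        "@type": "skos:ConceptScheme",
    }
    concept = {
        "@id": "psdiVoc:psdiPlatform",
        "@type": "skos:Concept",
        # Placeholder text, not the released label or definition.
        "skos:prefLabel": "PSDI platform",
        "skos:definition": "Placeholder definition.",
        "skos:inScheme": {"@id": "psdiVoc:Vocabulary"},
    }
    # Assumption: nodes are listed in a flat @graph.
    return {"@context": CONTEXT, "@graph": [scheme, concept]}


if __name__ == "__main__":
    print(json.dumps(build_skeleton(), indent=2))
```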
Top-level vocabulary metadata fields
- These are the top-level fields that are captured for the SKOS vocabulary - see A checklist for complete vocabulary metadata for more details:
- [@id] - term ids defined in this vocabulary are human readable, based on their names, shortened if possible while retaining meaning, and converted to camelCase (e.g. communityActivity). When the "psdiVoc" namespace prefix of an [@id] within this vocabulary is expanded, the resulting IRI takes the form: {http://} + {metadata.psdi.ac.uk/} + {psdi-voc} + [#] + {id} (e.g. http://metadata.psdi.ac.uk/psdi-voc#psdiPlatform). See PSDI Metadata General Guidelines > PIDs (Persistent Identifiers) for more general information.
- [rdfs:label] - a short, human-readable label for the vocabulary
- [dcterms:title] - a more complete title for the vocabulary
- [dcterms:description] - a small paragraph indicating what the vocabulary does and why someone should use it
- [dce:Version] - uses semantic versioning - see PSDI Metadata General Guidelines > Version numbers for more details
- [dcterms:created] - date created - see PSDI Metadata General Guidelines > Dates for more details
- [dcterms:modified] - date modified - see PSDI Metadata General Guidelines > Dates for more details
- [dcterms:creator] and [dcterms:contributor] - the people who created and contributed to this vocabulary - see PSDI Metadata General Guidelines > Person, Organization and Agent for guidelines about capturing people
- [dcterms:publisher] - the publisher of this vocabulary - see PSDI Metadata General Guidelines > Publisher for more details
- [foaf:fundedBy] - the organization that funded this vocabulary - see PSDI Metadata General Guidelines > Person, Organization and Agent for more details
- [dc11:subject] - resources within the PSDI catalogue are classified using EuroSciVoc - see PSDI Metadata General Guidelines > Subjects and themes for more details
- [dcterms:license] - see PSDI Metadata General Guidelines > Licenses for more details
- Note that [dcterms:language] was not included, since we have set the overall language in the context rather than in this field - see "Language" section of PSDI Metadata General Guidelines for more details
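The [@id] convention above (camelCase ids that expand under the psdiVoc namespace) can be sketched as follows. This is a minimal illustration, not the project's tooling: the helper names to_camel_case and expand_id are hypothetical, and the "shortened if possible while retaining meaning" step is an editorial judgement that the sketch does not attempt to automate.

```python
import re

# Default namespace as stated in this document.
PREFIXES = {"psdiVoc": "http://metadata.psdi.ac.uk/psdi-voc#"}


def to_camel_case(name: str) -> str:
    """Convert a term name to a camelCase id (hypothetical helper)."""
    words = re.findall(r"[A-Za-z0-9]+", name)
    if not words:
        raise ValueError("name contains no alphanumeric characters")
    first, rest = words[0], words[1:]
    # Lowercase the whole first word; capitalise the first letter of the rest.
    return first.lower() + "".join(w[:1].upper() + w[1:] for w in rest)


def expand_id(compact: str, prefixes=PREFIXES) -> str:
    """Expand a compact id such as 'psdiVoc:psdiPlatform' to a full IRI."""
    prefix, sep, local = compact.partition(":")
    if not sep or prefix not in prefixes:
        return compact  # unknown prefix or already a full IRI: leave as-is
    return prefixes[prefix] + local


if __name__ == "__main__":
    term_id = to_camel_case("PSDI platform")
    print(term_id)
    print(expand_id("psdiVoc:" + term_id))
```

Keeping the expansion in one small function makes it easy to check that every id in the file resolves under the same namespace.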
Concepts
- Each [skos:Concept] concept within the vocabulary attempts to:
- say what the term "is"
- give a short clear definition
- be well connected to other concepts where needed via [skos:related], [skos:broader] and [skos:narrower] relationships
- where possible and appropriate, be assigned an ontological type via the rdf:type property - however, these need to be logically consistent with the ontological commitments of the terms used
- Required fields for this vocabulary are:
- skos:prefLabel
- @id - prefLabel converted to an iri if there are any non-standard characters in it
- skos:definition
- skos:Collection
- skos:inScheme with value "psdiVoc:Vocabulary"
- Optional fields are:
- skos:altLabel
- skos:related - we use this to highlight related terms (those mentioned in the definition) and aim for these to be reciprocal.
- skos:broader - we only currently use this for the terms in relation to hasTopConcept
- skos:narrower - we only currently use this for the terms in relation to hasTopConcept
- skos:example
- [rdf:hasType] with a value that links to the PID of an ontology term
- After modification, follow the steps in the Validation section
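The required fields and the aim of reciprocal skos:related links lend themselves to a quick pre-validation check. The sketch below assumes each concept is a plain dictionary keyed by compact property names, as in a parsed JSON-LD file; the function names and the example ids psdiVoc:termA and psdiVoc:termB are hypothetical. Collection membership is not checked, because how it is recorded on a concept is not specified here.

```python
REQUIRED_KEYS = ("@id", "skos:prefLabel", "skos:definition", "skos:inScheme")
SCHEME_ID = "psdiVoc:Vocabulary"


def _ids(value):
    """Normalise a JSON-LD reference value to a set of @id strings."""
    if value is None:
        return set()
    items = value if isinstance(value, list) else [value]
    return {i["@id"] if isinstance(i, dict) else i for i in items}


def missing_required(concept):
    """List required keys that are absent or empty, plus a wrong scheme."""
    problems = [k for k in REQUIRED_KEYS if not concept.get(k)]
    in_scheme = concept.get("skos:inScheme")
    if in_scheme and SCHEME_ID not in _ids(in_scheme):
        problems.append("skos:inScheme != " + SCHEME_ID)
    return problems


def non_reciprocal_related(concepts):
    """Return (a, b) pairs where a is related to b but b is not related to a."""
    related = {c["@id"]: _ids(c.get("skos:related")) for c in concepts}
    return sorted(
        (a, b)
        for a, targets in related.items()
        for b in targets
        if a not in related.get(b, set())
    )


if __name__ == "__main__":
    term_a = {
        "@id": "psdiVoc:termA",
        "skos:prefLabel": "term A",
        "skos:definition": "Placeholder definition.",
        "skos:inScheme": {"@id": "psdiVoc:Vocabulary"},
        "skos:related": [{"@id": "psdiVoc:termB"}],
    }
    term_b = {"@id": "psdiVoc:termB", "skos:prefLabel": "term B"}
    print(missing_required(term_b))
    print(non_reciprocal_related([term_a, term_b]))
```

A check like this complements, rather than replaces, the SHACL validation described in the Validation section.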
Versioning
- After changes are made to this file, the following top-level metadata fields for the file are checked and updated if necessary - see "Rule 10" of Ten simple rules for making a vocabulary FAIR:
- [dc:modified] - date that the vocabulary or term was last updated
- [dc:Version] - increment using semantic versioning
- Also consider use of the following terms for altered concepts where appropriate:
- [dcterms:isReplacedBy] - to point to a superseding vocabulary or term
- [dcterms:replaces] - to point to a prior version of a vocabulary or term
- [skos:changeNote] - modifications to a term relative to prior versions
- [skos:historyNote] - past state/use/meaning of a term
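The version and modified-date update can be sketched as a small helper. This is an illustration under assumptions: the version is taken to be a plain MAJOR.MINOR.PATCH string, dates are written in ISO 8601 form (YYYY-MM-DD), and the default keys "dce:Version" and "dcterms:modified" follow the top-level field list; adjust them if the file uses different property names. The function names are hypothetical.

```python
from datetime import date


def bump_version(version: str, part: str = "patch") -> str:
    """Increment one part of a MAJOR.MINOR.PATCH version string."""
    major, minor, patch = (int(n) for n in version.split("."))
    if part == "major":
        return f"{major + 1}.0.0"
    if part == "minor":
        return f"{major}.{minor + 1}.0"
    if part == "patch":
        return f"{major}.{minor}.{patch + 1}"
    raise ValueError(f"unknown version part: {part!r}")


def record_change(metadata, part="patch", today=None,
                  version_key="dce:Version", modified_key="dcterms:modified"):
    """Return a copy of the top-level metadata with version and date updated."""
    updated = dict(metadata)
    updated[version_key] = bump_version(metadata[version_key], part)
    # Assumption: dates are stored as ISO 8601 strings.
    updated[modified_key] = (today or date.today()).isoformat()
    return updated


if __name__ == "__main__":
    meta = {"dce:Version": "0.9.1", "dcterms:modified": "2025-01-01"}
    print(record_change(meta, part="minor"))
```

Returning a copy rather than mutating the input makes it straightforward to compare the old and new metadata before writing the file.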
Validation
See general JSON-LD metadata validation outlined in PSDI Metadata Checking Guidelines. Additional validation specific to this profile:
- Automated SHACL validation against the following SHACL shapes:
- skosShapes.shapes.ttl contains SKOS-XL SHACL definitions published by the Publications Office of the European Union and downloaded from Application Profiles - EU Vocabularies on 20251002
- skosClassAndPropertyDefinitions.ttl is a SKOS SHACL shape built from the ground up based on the SKOS Reference, as described in Development of SKOS SHACL shape blog post, and downloaded from skosClassAndPropertyDefinitions.ttl file in the skohub-shapes github repository on 20251002
- skos.shacl.ttl is from Shape Repository For SkoHub downloaded on 250401
- Additional validation:
- Structure and quality checker for SKOS vocabularies: qSKOS - note that:
- this raises only one Fail, "No Common Languages", which claims "Concepts are not described in a common language"; this is not being actioned because all languages are set to @en
- SKOSify
- SKOS testing tool
Future development (beyond PSDI version 1)
- [dcterms:language] - if other languages are required at a later point we will revisit this
- In time, if this ConceptScheme is used for describing enumerations appearing in the data, we will need to have a [skos:Notation], and not have more than one, according to p72 in D2.3: Cross-Domain Interoperability Framework (CDIF)
- Creator: Aileen Day
- Last modified date: 2025-04-09