2016-12-07
dataid
http://dataid.dbpedia.org/ns/core#
The DataID core ontology is the central part of the DataID multi-layered metadata system.
Its goal is to extensively describe datasets and their different manifestations in a uniform way,
as well as relations with agents such as persons or organisations, in regard to their rights and responsibilities.
Established vocabularies such as DCAT, VoID, Prov-O are used for maximum compatibility.
We introduced a multilayer concept of supporting ontologies extending DataID core to fit different
use case scenarios (for Linked Data, providing statistical data about datasets etc.).
Describing datasets, including their content, authors, licenses and provenance, via RDF, and thereby making data
accessible to others, is the basic idea of Linked Open Data and is your contribution to a
universal data give and take. You can create your own DataID files for any type of data at http://dataid.dbpedia.org.
Validating and delivering metadata files in a uniform way, as well as providing the ability to query for datasets,
are main objectives of the DBpedia/DataID project.
DataID
2.0.0
Agents are real or legal persons, groups of persons, programs, organisations etc.
Agent
AgentRoles are defined by a set of rights and responsibilities (see AuthorizedAction) that an Agent assigned this Role has to address or can execute.
Agent Role
The AgentRole scheme (hierarchy) provided by DataID core, depicting Roles commonly used in the context of a file or document management system. This scheme should be replaced in other use cases.
agent role scheme
The responsibility (ResponsibleAction) to supervise other Agents.
agent supervision
agent supervision
Represents all entitlements (EntitledActions). The top concept of the AuthorizedActionScheme hierarchy.
all entitlements
all entitlements
Encompasses all responsibilities (ResponsibleActions). The top concept of the AuthorizedActionScheme hierarchy.
all responsibilities
all responsibilities
This concept is a mediator between a set of Entities (or scope) defined by the property dataid:authorizedFor, a set of AgentRoles
defining rights and responsibilities (dataid:authorityAgentRole) and Agents (dataid:associatedAgent), fulfilling the AgentRoles,
in regard to the collection of Entities. This subclass of prov:Attribution qualifies the relation prov:wasAttributedTo between prov:Entity
and prov:Agent with the AgentRoles and AuthorizedActions of the DataID-domain.
Authorization
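The mediation described above can be sketched in Python with plain subject-predicate-object tuples standing in for RDF triples. The property names dataid:authorizedFor, dataid:authorityAgentRole and dataid:associatedAgent are taken from the description; all ex: resources, the role identifier dataid:Maintainer and the helper functions are hypothetical and only illustrate the shape of the data.

```python
# Plain (subject, predicate, object) tuples stand in for RDF triples.
# All ex: identifiers and dataid:Maintainer are assumptions for illustration.
TRIPLES = [
    ("ex:auth1", "rdf:type", "dataid:Authorization"),
    ("ex:auth1", "dataid:authorizedFor", "ex:dataset1"),
    ("ex:auth1", "dataid:authorityAgentRole", "dataid:Maintainer"),
    ("ex:auth1", "dataid:associatedAgent", "ex:alice"),
]


def objects(subject, predicate, triples=TRIPLES):
    """Return all objects of triples matching subject and predicate."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def agents_for(entity, triples=TRIPLES):
    """Map each Agent to the AgentRoles it holds for the given Entity."""
    result = {}
    for s, p, o in triples:
        # Every Authorization whose scope names the entity contributes.
        if p == "dataid:authorizedFor" and o == entity:
            roles = objects(s, "dataid:authorityAgentRole", triples)
            for agent in objects(s, "dataid:associatedAgent", triples):
                result.setdefault(agent, []).extend(roles)
    return result
```

Calling agents_for("ex:dataset1") walks from the Entity through the Authorization to the Agent and its roles, which is the path the Authorization class is meant to qualify.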
AuthorizedActions consist either of activities an Agent can execute in regard to a collection of Entities (dataid:EntitledAction),
or of responsibilities an Agent should act upon (dataid:ResponsibleAction) (e.g. make certain decisions if necessary or act upon certain events).
Authorized Action
Description of the AuthorizedAction hierarchy.
authorized action scheme
An Agent that can be contacted for general requests about the resource.
contact
contact
Contributor to the resource. An Agent that was involved in creating or maintaining the resource but did not have the main part in this activity.
contributor
contributor
This role specifies a dataid:DatasetRelationship where the origin dataset is an exact copy of the target dataset (e.g. when republished under a different domain).
Copy Role
Creator of the resource. An AgentRole that is credited with the main part in the initial creation of the resource.
creator
creator
The dataid:DataId class is the most generic entity in a DataID graph about one or more datasets.
As a subclass of dcat:CatalogRecord / void:DatasetDescription it describes not a dataset itself but provides metadata about
its entry in a dcat:Catalog and/or its relations to repositories or other data collections.
DataId
A collection of data, available for access in one or more formats.
Dataset resources describe the concept of the dataset, not its manifestation (the data itself), which can be acquired
as a Distribution. Datasets are prov:Entities and can be generated by prov:Activities.
Dataset
Provides the role of a dataid:DatasetRelationship (e.g. Linkset, Source, Derivate, Similarity etc.).
Dataset Relation Role
Portrays a generic relation between two or more datasets.
Dataset Relationship
EntitledAction to delete some content of an entity.
delete content right
delete content right
This role specifies a dataid:DatasetRelationship where one dataset points out a second dataset, which is a derivate of the first.
Derivate Role
A dedicated file system directory holding (multiple) files of the same Dataset, which, when put together, make up the whole Dataset.
Directory
Distributions describe the technical details of a single manifestation of the pertaining Dataset
(for example, its data format/serialisation or a ServiceEndpoint including the procedure required to access the data).
Distribution
EntitledActions describe actions an Agent is allowed to perform when holding a certain AgentRole. These Actions may address restricted matters like access, modification rights and others.
Entitled Action
Multiple files constituting one (complete) Dataset (files in different directories or servers are allowed, as opposed to Directory).
File Collection
This role specifies a dataid:DatasetRelationship between two datasets which have a relation of an unknown quality.
Generic Relation
A visitor or anonymous Agent which only has the right to read public documents.
guest
guest
Use this Agent as a stand in for any Agent not specifically defined in a domain, granting public access to an Entity.
guest agent
Guest Agent
true
This Authorization can be used to point out that the content of an entity is public and can be read by anyone (see also dataid:Guest).
guest authority
Uniquely identifies any resource, given an identifier as literal (see dataid:literal) and a corresponding identifier scheme (e.g. an aid/pid scheme such as ORCID, ResearcherID etc., pointed out via datacite:usesIdentifierScheme). Optionally, it can point out a reference document on the web (about this Identifier) using dct:references.
Identifier
Maintainer of the Dataset. An Agent that ensures the technical correctness, accessibility and up-to-dateness of a Dataset.
maintainer
maintainer
Extends the dct:MediaTypeOrExtent class of Dublin Core, providing the IANA media type description (or mime type) and common file extensions used.
A pointer to an inner media type (dataid:innerMediaType) can describe (multiple) layers of compression and all internal media types.
Media Type
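The layering expressed by dataid:innerMediaType can be pictured as a chain that is followed from the outermost file inward. The following Python sketch assumes a simple lookup table keyed by media type name; the table entries and the function name media_layers are hypothetical, and in an actual DataID the pointer would sit on individual MediaType resources.

```python
# Each entry maps a media type name to the name of its inner media type
# (None when there is no further layer). The entries are illustrative only.
INNER_MEDIA_TYPE = {
    "application/gzip": "application/x-tar",
    "application/x-tar": "application/n-triples",
    "application/n-triples": None,
}


def media_layers(outer, inner_of=INNER_MEDIA_TYPE):
    """Follow inner-media-type pointers from the outermost type inward."""
    layers = []
    seen = set()
    current = outer
    # The 'seen' set guards against accidental cycles in the table.
    while current is not None and current not in seen:
        layers.append(current)
        seen.add(current)
        current = inner_of.get(current)
    return layers
```

For a gzipped tar archive of N-Triples files, media_layers("application/gzip") lists the compression layer, the archive layer and the serialisation in that order.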
EntitledAction to modify the role of Agents on certain Entities.
modify agent roles
modify agent roles
EntitledAction to modify an Authorization.
modify authorization
modify authorization
EntitledAction to modify the content of an Entity.
modify content right
modify content right
An Agent that makes the Dataset accessible online on a server or repository without necessarily being involved in its creation and decides on all dissemination related tasks as well (e.g. data portal entries).
publisher
publisher
The final decision on whether an Entity (or a version of it) should be published.
publishing decision
publishing decision
EntitledAction to read the content of an Entity.
read content right
read content right
EntitledAction to read the DataID dataset metadata.
read DataID right
read DataID right
The responsibility (ResponsibleAction) to respond to contact attempts by external Agents. A contact point for the Entity.
response to contact attempt
response to contact attempt
The responsibility to manage changes and react to any event related to the lifecycle of a Dataset (e.g. issue tracker entries, unavailable download URL, etc.).
response to life-cycle event
response to life-cycle event
AgentRoles provide rights as well as responsibilities an agent has to attend to in order to fulfil this Role.
ResponsibleActions should describe the actions an agent is expected to carry out when holding a specific AgentRole.
Responsible Action
A specific Distribution, which is accessible via an access URL and provides data as a web service in a certain format.
Service Endpoint
This role specifies a dataid:DatasetRelationship where the origin dataset has a significant similarity to the target dataset (without any assertion as to dimension of similarity).
Similarity Role
A SimpleStatement is intended as a tool for conveying a statement, definition or point of view about a certain topic, either by providing a quotation as a simple literal (using dataid:literal) or by referencing a web resource that provides or represents the statement in any given medium (picture, text, video etc.). This class implements several classes of Dublin Core which are not further specified within DC.
Simple Statement
A single data dump file representing the whole Dataset, in a certain format/serialisation.
Single File
This role specifies a dataid:DatasetRelationship where the origin dataset is created by transforming/collecting data from the target dataset.
Source Role
This dedicated version of a dataid:Dataset has exactly one purpose: to point out all its sub-datasets with void:subset.
A dataid:Superset has no data itself and therefore must not point out Distributions with dcat:distribution.
It can be used in a dataset hierarchy (e.g. as a root dataset), or as a container for other datasets.
Superset
0
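The constraints stated for dataid:Superset (sub-datasets via void:subset, no dcat:distribution) lend themselves to a simple check. The Python sketch below works on plain subject-predicate-object tuples; the function name check_superset and the ex: resources are hypothetical, and treating a missing void:subset as a problem is an assumption derived from the stated purpose of the class.

```python
def check_superset(resource, triples):
    """Return a list of problems found for a resource typed as a Superset."""
    predicates = [p for s, p, o in triples if s == resource]
    problems = []
    if "dcat:distribution" in predicates:
        problems.append("Superset must not have dcat:distribution")
    if "void:subset" not in predicates:
        # Assumption: a Superset without sub-datasets serves no purpose.
        problems.append("Superset should point to sub-datasets via void:subset")
    return problems


GOOD = [
    ("ex:root", "rdf:type", "dataid:Superset"),
    ("ex:root", "void:subset", "ex:part1"),
    ("ex:root", "void:subset", "ex:part2"),
]
BAD = GOOD + [("ex:root", "dcat:distribution", "ex:dump")]
```

Running check_superset("ex:root", BAD) reports the forbidden distribution, while the GOOD graph yields no problems.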
The responsibility (ResponsibleAction) to update dataset metadata.
update DataId
update DataId
Describes the steps which have to be taken to gain access to the described data at the location of a Distribution (e.g. register an account to gain dct:accessRights).
access procedure
An AgentRole allows an Agent to execute certain AuthorizedActions.
allows for
An Agent which is generally connected to the Dataset. Their exact function in regard to the Entity has to be specified by the qualification of an Authorization.
associated agent
Assigns an AgentRole for an Authorization to an Agent, thereby allowing for certain AuthorizedActions this Agent can execute on the Entities defined by the scope of the Authorization.
authority agent role
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 1 property
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 2 property
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 3 property
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 4 property
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 5 property
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 6 property
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 7 property
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 8 property
With the help of this property we can extend the reach of dataid:authorizationScope over multiple instances of dataid:Dataset and dataid:Distribution.
Authorization Chain 9 property
This property defines the scope of an Authorization. An Agent has the right to execute AuthorizedActions for all Entities of this scope.
Together with its sub-properties (dataid:authorizedFor, dataid:authorizationChainX), it defines how the scope of an Authorization is extended to other Entities.
Since this property can be inferred from its sub-properties, it shall not be instantiated in a DataID document.
authorization scope
Points out AuthorizedActions Agents are allowed to execute under a given Authorization (and its chosen AgentRoles).
authorized action
Points out an Agent which holds the rights granted by an Authorization (e.g. to modify the metadata of a Dataset).
authorized agent
Points out the Entities which are under the direct influence of an Authorization.
An Agent has the right to execute AuthorizedActions (as defined by the pertaining AgentRole) for these Entities.
This property shall be used to point out the initial entities for which an Authorization is valid. Inference via
authorizationChain properties may extend the scope of this Authorization further over multiple Entities along the hierarchical structure of a DataID.
authorized for
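The way an initial scope grows along the hierarchy can be illustrated with a small reachability computation. The sketch below is not the ontology's actual inference mechanism: which links the authorizationChain properties follow is not spelled out here, so the code assumes a generic, hypothetical children mapping (for example DataId to Dataset to Distribution) and a made-up function name.

```python
from collections import deque


def authorization_scope(initial, children):
    """Return all Entities reachable from the initial authorizedFor Entities.

    'children' is an assumed mapping from an Entity to the Entities directly
    below it in the DataID hierarchy.
    """
    scope = set(initial)
    queue = deque(initial)
    while queue:
        entity = queue.popleft()
        for child in children.get(entity, ()):
            if child not in scope:
                scope.add(child)
                queue.append(child)
    return scope


# Hypothetical hierarchy: one DataId, one Dataset, two Distributions.
HIERARCHY = {
    "ex:dataid": ["ex:dataset"],
    "ex:dataset": ["ex:dist1", "ex:dist2"],
}
```

Starting from ["ex:dataset"], the computed scope covers the Dataset and both Distributions but not the DataId above it, mirroring the idea that only the initial Entities need to be stated explicitly.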
The checksum value allows the contents of a file to be authenticated, since it changes even with small modifications to the file.
checksum
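The sensitivity of a checksum to small changes is easy to demonstrate. The Python sketch below uses SHA-256 from the standard library; the choice of hash algorithm is an assumption, since the property description does not prescribe one, and the sample data is made up.

```python
import hashlib


def checksum(data: bytes) -> str:
    """Return the hexadecimal SHA-256 digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


# Two hypothetical file contents differing in a single character.
original = b"<ex:s> <ex:p> <ex:o> .\n"
modified = b"<ex:s> <ex:p> <ex:O> .\n"

# Identical content gives an identical checksum; any change alters it.
same = checksum(original) == checksum(bytes(original))
changed = checksum(original) != checksum(modified)
```

A consumer would compare the checksum recorded in the metadata with one computed over the downloaded file and reject the file on a mismatch.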
Provides a detailed description of the data represented by this Dataset.
data description
The role which qualifies a dataid:DatasetRelationship. It specifies which relationship datasets (pointed out with dataid:qualifiedRelationTo) have in regard to the dataset referred to with dataid:qualifiedRelationOf (e.g. source datasets when dataid:SourceRole is in place).
dataset relation role
Indicates the approximate final volume of the Dataset.
growth
Provides an Agent with the Authorization for a given scope of Entities (e.g. Dataset, Distribution etc.), granting rights to execute certain AuthorizedActions.
has authorization
A unique identifier for an Agent or Entity (for other, non DataID related identifiers).
identifier
Points out which scheme of identifiers is used to uniquely identify resources inside a Dataset.
identifier scheme
The inverse property of dcat:record, pointing back to the dcat:Catalog in which this DataId is recorded.
in catalog
Points out the MediaType of the (compressed) file inside another file (relevant for archive files).
inner media type
Inverse property of dcat:distribution, linking a Distribution to a Dataset.
is distribution of
Indicates whether this Authorization will be valid after an update to the DataID or a scope element.
is inheritable
Latest version of a DataId/Dataset/Distribution.
latest version
An actual textual statement or literal (string).
statement
Points out an Authorization which grants some degree of authority over this resource to the exclusion of other Authorizations which are not referred with this property.
needs special authorization
Next version of a DataId/Dataset/Distribution.
next version
General description of how data will be shared. For example embargo periods (if any), outlines of technical mechanisms for dissemination or a definition of whether access will be widely open or restricted to specific groups. In case the Dataset cannot be shared, the reasons for this should be mentioned (e.g. ethical, rules of personal data, intellectual property, commercial, privacy-related, security-related).
openness
Provides the URL of a short preview of the data provided by a Distribution, helpful when conveying type and format of the data provided as an example.
preview
Previous version of a DataId/Dataset/Distribution.
previous version
Qualifies the dataid:relatedDataset property with a DatasetRelationship.
relation of
Inverse property of dataid:qualifiedDatasetRelation, pointing out a dataset which has related datasets referred to via dataid:qualifiedRelationTo.
qualified relation of
Points out datasets which are somehow related to the dataset referred to via dataid:qualifiedRelationOf.
qualified relation to
Points to other Datasets containing related data. Can be qualified with dataid:DatasetRelationship. (note: while this property is symmetric, its qualification is not)
related dataset
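The difference between the symmetric plain relation and its directional qualification can be sketched as follows. The Python below is only an illustration: the ex: datasets, the dictionary layout and the function names are hypothetical, while dataid:SourceRole and the reading of qualifiedRelationOf and qualifiedRelationTo follow the descriptions in this vocabulary.

```python
# The plain relation is symmetric, so one stored pair covers both directions.
RELATED = {("ex:derived", "ex:source")}


def related(a, b, pairs=RELATED):
    """True if the two datasets are related, regardless of order."""
    return (a, b) in pairs or (b, a) in pairs


# A qualification is directional: relation_of is the dataset being described,
# relation_to lists the datasets that play the given role for it.
RELATIONSHIP = {
    "role": "dataid:SourceRole",
    "relation_of": "ex:derived",
    "relation_to": ["ex:source"],
}


def sources_of(dataset, relationship=RELATIONSHIP):
    """Return the source datasets of 'dataset' according to the relationship."""
    if (relationship["role"] == "dataid:SourceRole"
            and relationship["relation_of"] == dataset):
        return list(relationship["relation_to"])
    return []
```

Both related("ex:derived", "ex:source") and the reversed call hold, whereas sources_of only returns a result for the derived dataset, not for its source.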
Information on the possibilities for integration and reuse of the Dataset.
reuse and integration
Information on the existence (or absence) of similar data (see also dataid:similarDataset).
similar data
Software needed to access the data provided via this Distribution or otherwise relevant for consuming the data.
software requirement
Lists file extensions commonly used with this MediaType.
media type file extension
Names the MediaType described.
media type name
Refers to a standard document of the MediaType described.
media type reference
The template (or mime type string) of a MediaType.
media type template
Records the byte size of the uncompressed content of an archive file.
uncompressed byte size
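How such a value might be obtained for a gzip-compressed file can be sketched with the Python standard library. The sample payload and the function name are hypothetical, and decompressing fully in memory is a simplification that would not suit very large archives.

```python
import gzip

# Hypothetical file content, repeated so that compression has an effect.
payload = b"<ex:s> <ex:p> <ex:o> .\n" * 1000
archive = gzip.compress(payload)


def uncompressed_byte_size(gz_bytes: bytes) -> int:
    """Return the number of bytes obtained after decompressing gzip data."""
    return len(gzip.decompress(gz_bytes))
```

The result of uncompressed_byte_size(archive) equals the length of the original payload, while the length of the archive itself would correspond to the compressed size of the file.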
Points out an Authorization which grants some degree of authority over this resource (inverse property of dataid:authorizationScope).
under authorization
Description of to whom the Dataset could be useful, and whether it underpins a scientific publication.
usefulness
The influence an Agent has over an Entity is valid from (inclusive) a certain point in time.
valid from
The influence an Agent has over an Entity is valid until (exclusive) a certain point in time.
valid until
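Taken together, the inclusive start and exclusive end describe a half-open time interval. A minimal Python sketch of the resulting check is given below; the function name is hypothetical, and treating a missing bound as unbounded is an assumption not stated in the property descriptions.

```python
from datetime import datetime


def is_valid_at(moment, valid_from=None, valid_until=None):
    """Check whether 'moment' lies in the interval [valid_from, valid_until)."""
    if valid_from is not None and moment < valid_from:
        return False
    # The end point is exclusive, so equality already counts as expired.
    if valid_until is not None and moment >= valid_until:
        return False
    return True


start = datetime(2016, 1, 1)
end = datetime(2017, 1, 1)
```

With these bounds, the start instant itself is valid while the end instant is not, so two consecutive periods sharing a boundary never overlap.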
freudenberg@informatik.uni-leipzig.de
Markus Freudenberg