Class: Dataset
Dataset generated within the CRC 1451
classDiagram
class Dataset
Dataset : author
Dataset --|> Author : author
Dataset : crc_project
Dataset : data_controller
Dataset --|> DataController : data_controller
Dataset : description
Dataset : doi
Dataset : funding
Dataset --|> Funding : funding
Dataset : hasPart
Dataset --|> File : hasPart
Dataset : homepage
Dataset : keywords
Dataset : last_updated
Dataset : license
Dataset : name
Dataset : publication
Dataset --|> Publication : publication
Dataset : sample[organism_part]
Dataset : sample[organism]
Dataset : title
Dataset : used_for
Dataset --|> Usage : used_for
Dataset : version
Slots
Name | Cardinality and Range | Description | Inheritance |
---|---|---|---|
name | 1..1 String |
Identifies the dataset uniquely within the scope of a CRC project, i.e. the respective project must not have two different datasets of the same name. The name should be suitable for a directory/folder name. Spaces and special characters should be avoided. | direct |
title | 1..1 String |
Title to be displayed on the catalog landing page for the dataset. Language must be English. | direct |
description | 1..1 String |
General description of the dataset. It may summarize its purpose, scope, content, and potential applications. If a long description need to be split into paragraphs, each paragraph can be put into a dedicated column in this row. Language must be English. | direct |
doi | 0..1 String |
A DOI the dataset got assigned (e.g., from a data portal it was published in). The DOI should preferably point to the dataset version described in the catalog record. | direct |
crc_project | 1..1 String |
One or more CRC project this dataset is attributed to (typically the project responsible for acquisition). The project is identified by its CRC project code (e.g., Z02). If multiple project have been involved, additional project codes can be given in subsequent columns, one per column. | direct |
version | 1..1 String |
A label that identifies the version of the dataset the catalog record is describing. If a dataset is unversioned, it is acceptable to state latest. Otherwise any numerical label (e.g., 1.2), or text label (e.g., GITSHA 7db210fb5) can be provided here. The version should change when the content of the dataset changes. | direct |
sample[organism] | 1..* String |
Classification of organism(s) associated with, or studied for the dataset. One or more organisms can be given, one per column. Organisms must be identified by their ID in the NCBI organismal taxonomy, which can be searched at https://www.ebi.ac.uk/ols4/ontologies/ncbitaxon. For example, the identifier for human or homo sapiens is NCBITaxon:9606. The column value should be NCBITaxon:9606 in this case. | direct |
sample[organism_part] | 1..* String |
Classification of organism part(s) associated with, or studied for the dataset. One or more organism parts can be given, one per column. Organism parts must be identified by their ID in the Uber-anatomy ontology (UBERON), which can be searched at https://www.ebi.ac.uk/ols4/ontologies/uberon. For example, the identifier for upper limb segment is UBERON:0008785. The column value should be UBERON:0008785 in this case. The identifier for the brain is UBERON:0000955, but more precise definitions for individual brain structures are available. | direct |
keywords | 0..* String |
Keywords describing the major topical themes of the dataset. Any number of keywords can be given, one keyword per column. Keyword aid the discoverability of a dataset. | direct |
license | 0..1 String |
A license document (URL) that applies to the dataset and defines the terms and conditions for use. | direct |
homepage | 0..1 String |
A URL the catalog should advertise as the primary source of information/data on this dataset. This could be a dataset page in a data portal. | direct |
last_updated | 0..1 String |
Date of the last modification of the described dataset (version), for example a release date. Must be given in ISO 8601 format (i.e., YYYY-MM-DD). | direct |
data_controller | 1..* DataController |
One or more entities (natural persons or organizations) that are (legally) responsible for a dataset, and serve as an official contact point regarding collaboration inquiries. For datasets involving personal data (as defined in the European General Data Protection Regulation; GDPR) these are data controllers. For CRC datasets, these are typically the PIs of the involved CRC project(s). | direct |
author | 1..* Author |
One or more entities (natural persons or organizations) that are considered authors of the dataset. These authors need not be identical to an author list of an associated publication. Any entity listed in this table will be credited on the catalog page of the dataset. | direct |
funding | 1..* Funding |
One or more funding sources that are associated with the dataset and shall be credited on the dataset's catalog page. | direct |
publication | 0..* Publication |
One or more publications which are associated with the dataset and shall be credited on the dataset's catalog page. | direct |
hasPart | 0..* File |
One or more files that form the dataset. | direct |
used_for | 0..* Usage |
One or more activities/projects that the dataset has been or is presently being used for. | direct |
Identifier and Mapping Information
Schema Source
- from schema: https://example.org/sfb1451/sfb1451-schema
Mappings
Mapping Type | Mapped Value |
---|---|
self | sfb1451_schema:Dataset |
native | sfb1451_schema:Dataset |
LinkML Source
Direct
name: Dataset
description: Dataset generated within the CRC 1451
from_schema: https://example.org/sfb1451/sfb1451-schema
slots:
- name
- title
- description
- doi
slot_usage:
name:
name: name
description: Identifies the dataset uniquely within the scope of a CRC project,
i.e. the respective project must not have two different datasets of the same
name. The name should be suitable for a directory/folder name. Spaces and special
characters should be avoided.
domain_of:
- Dataset
- DataController
- Author
required: true
title:
name: title
description: Title to be displayed on the catalog landing page for the dataset.
Language must be English.
domain_of:
- Dataset
- Usage
required: true
description:
name: description
description: General description of the dataset. It may summarize its purpose,
scope, content, and potential applications. If a long description need to be
split into paragraphs, each paragraph can be put into a dedicated column in
this row. Language must be English.
domain_of:
- Dataset
- Usage
required: true
doi:
name: doi
description: A DOI the dataset got assigned (e.g., from a data portal it was published
in). The DOI should preferably point to the dataset version described in the
catalog record.
domain_of:
- Dataset
- Publication
required: false
pattern: https://doi.org/.*
attributes:
crc-project:
name: crc-project
description: One or more CRC project this dataset is attributed to (typically
the project responsible for acquisition). The project is identified by its CRC
project code (e.g., Z02). If multiple project have been involved, additional
project codes can be given in subsequent columns, one per column.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:ResearchProject
required: true
version:
name: version
description: A label that identifies the version of the dataset the catalog record
is describing. If a dataset is unversioned, it is acceptable to state latest.
Otherwise any numerical label (e.g., 1.2), or text label (e.g., GITSHA 7db210fb5)
can be provided here. The version should change when the content of the dataset
changes.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:version
required: true
sample[organism]:
name: sample[organism]
description: Classification of organism(s) associated with, or studied for the
dataset. One or more organisms can be given, one per column. Organisms must
be identified by their ID in the NCBI organismal taxonomy, which can be searched
at https://www.ebi.ac.uk/ols4/ontologies/ncbitaxon. For example, the identifier
for human or homo sapiens is NCBITaxon:9606. The column value should be NCBITaxon:9606
in this case.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: openminds:Species
multivalued: true
required: true
sample[organism-part]:
name: sample[organism-part]
description: Classification of organism part(s) associated with, or studied for
the dataset. One or more organism parts can be given, one per column. Organism
parts must be identified by their ID in the Uber-anatomy ontology (UBERON),
which can be searched at https://www.ebi.ac.uk/ols4/ontologies/uberon. For example,
the identifier for upper limb segment is UBERON:0008785. The column value should
be UBERON:0008785 in this case. The identifier for the brain is UBERON:0000955,
but more precise definitions for individual brain structures are available.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: openminds:UBERONParcellation
multivalued: true
required: true
keywords:
name: keywords
description: Keywords describing the major topical themes of the dataset. Any
number of keywords can be given, one keyword per column. Keyword aid the discoverability
of a dataset.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:keywords
multivalued: true
required: false
license:
name: license
description: A license document (URL) that applies to the dataset and defines
the terms and conditions for use.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:license
required: false
homepage:
name: homepage
description: A URL the catalog should advertise as the primary source of information/data
on this dataset. This could be a dataset page in a data portal.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:mainEntityOfPage
required: false
last-updated:
name: last-updated
description: Date of the last modification of the described dataset (version),
for example a release date. Must be given in ISO 8601 format (i.e., YYYY-MM-DD).
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:dateModified
required: false
data-controller:
name: data-controller
description: One or more entities (natural persons or organizations) that are
(legally) responsible for a dataset, and serve as an official contact point
regarding collaboration inquiries. For datasets involving personal data (as
defined in the European General Data Protection Regulation; GDPR) these are
data controllers. For CRC datasets, these are typically the PIs of the involved
CRC project(s).
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: dpv:hasDataController
multivalued: true
range: DataController
required: true
author:
name: author
description: One or more entities (natural persons or organizations) that are
considered authors of the dataset. These authors need not be identical to an
author list of an associated publication. Any entity listed in this table will
be credited on the catalog page of the dataset.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:author
multivalued: true
range: Author
required: true
funding:
name: funding
description: One or more funding sources that are associated with the dataset
and shall be credited on the dataset's catalog page.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:funding
multivalued: true
range: Funding
required: true
publication:
name: publication
description: One or more publications which are associated with the dataset and
shall be credited on the dataset's catalog page.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:citation
multivalued: true
range: Publication
required: false
hasPart:
name: hasPart
description: One or more files that form the dataset.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: dcterms:hasPart
multivalued: true
range: File
required: false
used-for:
name: used-for
description: One or more activities/projects that the dataset has been or is presently
being used for.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: prov:hadUsage
multivalued: true
range: Usage
required: false
Induced
name: Dataset
description: Dataset generated within the CRC 1451
from_schema: https://example.org/sfb1451/sfb1451-schema
slot_usage:
name:
name: name
description: Identifies the dataset uniquely within the scope of a CRC project,
i.e. the respective project must not have two different datasets of the same
name. The name should be suitable for a directory/folder name. Spaces and special
characters should be avoided.
domain_of:
- Dataset
- DataController
- Author
required: true
title:
name: title
description: Title to be displayed on the catalog landing page for the dataset.
Language must be English.
domain_of:
- Dataset
- Usage
required: true
description:
name: description
description: General description of the dataset. It may summarize its purpose,
scope, content, and potential applications. If a long description need to be
split into paragraphs, each paragraph can be put into a dedicated column in
this row. Language must be English.
domain_of:
- Dataset
- Usage
required: true
doi:
name: doi
description: A DOI the dataset got assigned (e.g., from a data portal it was published
in). The DOI should preferably point to the dataset version described in the
catalog record.
domain_of:
- Dataset
- Publication
required: false
pattern: https://doi.org/.*
attributes:
crc-project:
name: crc-project
description: One or more CRC project this dataset is attributed to (typically
the project responsible for acquisition). The project is identified by its CRC
project code (e.g., Z02). If multiple project have been involved, additional
project codes can be given in subsequent columns, one per column.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:ResearchProject
alias: crc_project
owner: Dataset
domain_of:
- Dataset
range: string
required: true
version:
name: version
description: A label that identifies the version of the dataset the catalog record
is describing. If a dataset is unversioned, it is acceptable to state latest.
Otherwise any numerical label (e.g., 1.2), or text label (e.g., GITSHA 7db210fb5)
can be provided here. The version should change when the content of the dataset
changes.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:version
alias: version
owner: Dataset
domain_of:
- Dataset
range: string
required: true
sample[organism]:
name: sample[organism]
description: Classification of organism(s) associated with, or studied for the
dataset. One or more organisms can be given, one per column. Organisms must
be identified by their ID in the NCBI organismal taxonomy, which can be searched
at https://www.ebi.ac.uk/ols4/ontologies/ncbitaxon. For example, the identifier
for human or homo sapiens is NCBITaxon:9606. The column value should be NCBITaxon:9606
in this case.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: openminds:Species
multivalued: true
alias: sample[organism]
owner: Dataset
domain_of:
- Dataset
range: string
required: true
sample[organism-part]:
name: sample[organism-part]
description: Classification of organism part(s) associated with, or studied for
the dataset. One or more organism parts can be given, one per column. Organism
parts must be identified by their ID in the Uber-anatomy ontology (UBERON),
which can be searched at https://www.ebi.ac.uk/ols4/ontologies/uberon. For example,
the identifier for upper limb segment is UBERON:0008785. The column value should
be UBERON:0008785 in this case. The identifier for the brain is UBERON:0000955,
but more precise definitions for individual brain structures are available.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: openminds:UBERONParcellation
multivalued: true
alias: sample[organism_part]
owner: Dataset
domain_of:
- Dataset
range: string
required: true
keywords:
name: keywords
description: Keywords describing the major topical themes of the dataset. Any
number of keywords can be given, one keyword per column. Keyword aid the discoverability
of a dataset.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:keywords
multivalued: true
alias: keywords
owner: Dataset
domain_of:
- Dataset
range: string
required: false
license:
name: license
description: A license document (URL) that applies to the dataset and defines
the terms and conditions for use.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:license
alias: license
owner: Dataset
domain_of:
- Dataset
range: string
required: false
homepage:
name: homepage
description: A URL the catalog should advertise as the primary source of information/data
on this dataset. This could be a dataset page in a data portal.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:mainEntityOfPage
alias: homepage
owner: Dataset
domain_of:
- Dataset
range: string
required: false
last-updated:
name: last-updated
description: Date of the last modification of the described dataset (version),
for example a release date. Must be given in ISO 8601 format (i.e., YYYY-MM-DD).
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:dateModified
alias: last_updated
owner: Dataset
domain_of:
- Dataset
range: string
required: false
data-controller:
name: data-controller
description: One or more entities (natural persons or organizations) that are
(legally) responsible for a dataset, and serve as an official contact point
regarding collaboration inquiries. For datasets involving personal data (as
defined in the European General Data Protection Regulation; GDPR) these are
data controllers. For CRC datasets, these are typically the PIs of the involved
CRC project(s).
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: dpv:hasDataController
multivalued: true
alias: data_controller
owner: Dataset
domain_of:
- Dataset
range: DataController
required: true
author:
name: author
description: One or more entities (natural persons or organizations) that are
considered authors of the dataset. These authors need not be identical to an
author list of an associated publication. Any entity listed in this table will
be credited on the catalog page of the dataset.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:author
multivalued: true
alias: author
owner: Dataset
domain_of:
- Dataset
range: Author
required: true
funding:
name: funding
description: One or more funding sources that are associated with the dataset
and shall be credited on the dataset's catalog page.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:funding
multivalued: true
alias: funding
owner: Dataset
domain_of:
- Dataset
range: Funding
required: true
publication:
name: publication
description: One or more publications which are associated with the dataset and
shall be credited on the dataset's catalog page.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:citation
multivalued: true
alias: publication
owner: Dataset
domain_of:
- Dataset
range: Publication
required: false
hasPart:
name: hasPart
description: One or more files that form the dataset.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: dcterms:hasPart
multivalued: true
alias: hasPart
owner: Dataset
domain_of:
- Dataset
range: File
required: false
used-for:
name: used-for
description: One or more activities/projects that the dataset has been or is presently
being used for.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: prov:hadUsage
multivalued: true
alias: used_for
owner: Dataset
domain_of:
- Dataset
range: Usage
required: false
name:
name: name
description: Identifies the dataset uniquely within the scope of a CRC project,
i.e. the respective project must not have two different datasets of the same
name. The name should be suitable for a directory/folder name. Spaces and special
characters should be avoided.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:name
alias: name
owner: Dataset
domain_of:
- Dataset
- DataController
- Author
range: string
required: true
title:
name: title
description: Title to be displayed on the catalog landing page for the dataset.
Language must be English.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:title
alias: title
owner: Dataset
domain_of:
- Dataset
- Usage
range: string
required: true
description:
name: description
description: General description of the dataset. It may summarize its purpose,
scope, content, and potential applications. If a long description need to be
split into paragraphs, each paragraph can be put into a dedicated column in
this row. Language must be English.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: schema:description
alias: description
owner: Dataset
domain_of:
- Dataset
- Usage
range: string
required: true
doi:
name: doi
description: A DOI the dataset got assigned (e.g., from a data portal it was published
in). The DOI should preferably point to the dataset version described in the
catalog record.
from_schema: https://example.org/sfb1451/sfb1451-schema
rank: 1000
slot_uri: bibo:doi
alias: doi
owner: Dataset
domain_of:
- Dataset
- Publication
range: string
required: false
pattern: https://doi.org/.*