Skip to content

Class: Dataset

Dataset generated within the CRC 1451

URI: sfb1451_schema:Dataset

classDiagram class Dataset Dataset : author Dataset --|> Author : author Dataset : crc_project Dataset : data_controller Dataset --|> DataController : data_controller Dataset : description Dataset : doi Dataset : funding Dataset --|> Funding : funding Dataset : hasPart Dataset --|> File : hasPart Dataset : homepage Dataset : keywords Dataset : last_updated Dataset : license Dataset : name Dataset : publication Dataset --|> Publication : publication Dataset : sample[organism_part] Dataset : sample[organism] Dataset : title Dataset : used_for Dataset --|> Usage : used_for Dataset : version

Slots

Name Cardinality and Range Description Inheritance
name 1..1
String
Identifies the dataset uniquely within the scope of a CRC project, i.e. the respective project must not have two different datasets of the same name. The name should be suitable for a directory/folder name. Spaces and special characters should be avoided. direct
title 1..1
String
Title to be displayed on the catalog landing page for the dataset. Language must be English. direct
description 1..1
String
General description of the dataset. It may summarize its purpose, scope, content, and potential applications. If a long description need to be split into paragraphs, each paragraph can be put into a dedicated column in this row. Language must be English. direct
doi 0..1
String
A DOI the dataset got assigned (e.g., from a data portal it was published in). The DOI should preferably point to the dataset version described in the catalog record. direct
crc_project 1..1
String
One or more CRC project this dataset is attributed to (typically the project responsible for acquisition). The project is identified by its CRC project code (e.g., Z02). If multiple project have been involved, additional project codes can be given in subsequent columns, one per column. direct
version 1..1
String
A label that identifies the version of the dataset the catalog record is describing. If a dataset is unversioned, it is acceptable to state latest. Otherwise any numerical label (e.g., 1.2), or text label (e.g., GITSHA 7db210fb5) can be provided here. The version should change when the content of the dataset changes. direct
sample[organism] 1..*
String
Classification of organism(s) associated with, or studied for the dataset. One or more organisms can be given, one per column. Organisms must be identified by their ID in the NCBI organismal taxonomy, which can be searched at https://www.ebi.ac.uk/ols4/ontologies/ncbitaxon. For example, the identifier for human or homo sapiens is NCBITaxon:9606. The column value should be NCBITaxon:9606 in this case. direct
sample[organism_part] 1..*
String
Classification of organism part(s) associated with, or studied for the dataset. One or more organism parts can be given, one per column. Organism parts must be identified by their ID in the Uber-anatomy ontology (UBERON), which can be searched at https://www.ebi.ac.uk/ols4/ontologies/uberon. For example, the identifier for upper limb segment is UBERON:0008785. The column value should be UBERON:0008785 in this case. The identifier for the brain is UBERON:0000955, but more precise definitions for individual brain structures are available. direct
keywords 0..*
String
Keywords describing the major topical themes of the dataset. Any number of keywords can be given, one keyword per column. Keyword aid the discoverability of a dataset. direct
license 0..1
String
A license document (URL) that applies to the dataset and defines the terms and conditions for use. direct
homepage 0..1
String
A URL the catalog should advertise as the primary source of information/data on this dataset. This could be a dataset page in a data portal. direct
last_updated 0..1
String
Date of the last modification of the described dataset (version), for example a release date. Must be given in ISO 8601 format (i.e., YYYY-MM-DD). direct
data_controller 1..*
DataController
One or more entities (natural persons or organizations) that are (legally) responsible for a dataset, and serve as an official contact point regarding collaboration inquiries. For datasets involving personal data (as defined in the European General Data Protection Regulation; GDPR) these are data controllers. For CRC datasets, these are typically the PIs of the involved CRC project(s). direct
author 1..*
Author
One or more entities (natural persons or organizations) that are considered authors of the dataset. These authors need not be identical to an author list of an associated publication. Any entity listed in this table will be credited on the catalog page of the dataset. direct
funding 1..*
Funding
One or more funding sources that are associated with the dataset and shall be credited on the dataset's catalog page. direct
publication 0..*
Publication
One or more publications which are associated with the dataset and shall be credited on the dataset's catalog page. direct
hasPart 0..*
File
One or more files that form the dataset. direct
used_for 0..*
Usage
One or more activities/projects that the dataset has been or is presently being used for. direct

Identifier and Mapping Information

Schema Source

  • from schema: https://example.org/sfb1451/sfb1451-schema

Mappings

Mapping Type Mapped Value
self sfb1451_schema:Dataset
native sfb1451_schema:Dataset

LinkML Source

Direct

name: Dataset
description: Dataset generated within the CRC 1451
from_schema: https://example.org/sfb1451/sfb1451-schema
slots:
- name
- title
- description
- doi
slot_usage:
  name:
    name: name
    description: Identifies the dataset uniquely within the scope of a CRC project,
      i.e. the respective project must not have two different datasets of the same
      name. The name should be suitable for a directory/folder name. Spaces and special
      characters should be avoided.
    domain_of:
    - Dataset
    - DataController
    - Author
    required: true
  title:
    name: title
    description: Title to be displayed on the catalog landing page for the dataset.
      Language must be English.
    domain_of:
    - Dataset
    - Usage
    required: true
  description:
    name: description
    description: General description of the dataset. It may summarize its purpose,
      scope, content, and potential applications. If a long description need to be
      split into paragraphs, each paragraph can be put into a dedicated column in
      this row. Language must be English.
    domain_of:
    - Dataset
    - Usage
    required: true
  doi:
    name: doi
    description: A DOI the dataset got assigned (e.g., from a data portal it was published
      in). The DOI should preferably point to the dataset version described in the
      catalog record.
    domain_of:
    - Dataset
    - Publication
    required: false
    pattern: https://doi.org/.*
attributes:
  crc-project:
    name: crc-project
    description: One or more CRC project this dataset is attributed to (typically
      the project responsible for acquisition). The project is identified by its CRC
      project code (e.g., Z02). If multiple project have been involved, additional
      project codes can be given in subsequent columns, one per column.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:ResearchProject
    required: true
  version:
    name: version
    description: A label that identifies the version of the dataset the catalog record
      is describing. If a dataset is unversioned, it is acceptable to state latest.
      Otherwise any numerical label (e.g., 1.2), or text label (e.g., GITSHA 7db210fb5)
      can be provided here. The version should change when the content of the dataset
      changes.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:version
    required: true
  sample[organism]:
    name: sample[organism]
    description: Classification of organism(s) associated with, or studied for the
      dataset. One or more organisms can be given, one per column.  Organisms must
      be identified by their ID in the NCBI organismal taxonomy, which can be searched
      at https://www.ebi.ac.uk/ols4/ontologies/ncbitaxon.  For example, the identifier
      for human or homo sapiens is NCBITaxon:9606. The column value should be NCBITaxon:9606
      in this case.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: openminds:Species
    multivalued: true
    required: true
  sample[organism-part]:
    name: sample[organism-part]
    description: Classification of organism part(s) associated with, or studied for
      the dataset. One or more organism parts can be given, one per column.  Organism
      parts must be identified by their ID in the Uber-anatomy ontology (UBERON),
      which can be searched at https://www.ebi.ac.uk/ols4/ontologies/uberon. For example,
      the identifier for upper limb segment is UBERON:0008785. The column value should
      be UBERON:0008785 in this case.  The identifier for the brain is UBERON:0000955,
      but more precise definitions for individual brain structures are available.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: openminds:UBERONParcellation
    multivalued: true
    required: true
  keywords:
    name: keywords
    description: Keywords describing the major topical themes of the dataset. Any
      number of keywords can be given, one keyword per column. Keyword aid the discoverability
      of a dataset.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:keywords
    multivalued: true
    required: false
  license:
    name: license
    description: A license document (URL) that applies to the dataset and defines
      the terms and conditions for use.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:license
    required: false
  homepage:
    name: homepage
    description: A URL the catalog should advertise as the primary source of information/data
      on this dataset. This could be a dataset page in a data portal.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:mainEntityOfPage
    required: false
  last-updated:
    name: last-updated
    description: Date of the last modification of the described dataset (version),
      for example a release date. Must be given in ISO 8601 format (i.e., YYYY-MM-DD).
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:dateModified
    required: false
  data-controller:
    name: data-controller
    description: One or more entities (natural persons or organizations) that are
      (legally) responsible for a dataset, and serve as an official contact point
      regarding collaboration inquiries. For datasets involving personal data (as
      defined in the European General Data Protection Regulation; GDPR) these are
      data controllers. For CRC datasets, these are typically the PIs of the involved
      CRC project(s).
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: dpv:hasDataController
    multivalued: true
    range: DataController
    required: true
  author:
    name: author
    description: One or more entities (natural persons or organizations) that are
      considered authors of the dataset. These authors need not be identical to an
      author list of an associated publication. Any entity listed in this table will
      be credited on the catalog page of the dataset.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:author
    multivalued: true
    range: Author
    required: true
  funding:
    name: funding
    description: One or more funding sources that are associated with the dataset
      and shall be credited on the dataset's catalog page.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:funding
    multivalued: true
    range: Funding
    required: true
  publication:
    name: publication
    description: One or more publications which are associated with the dataset and
      shall be credited on the dataset's catalog page.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:citation
    multivalued: true
    range: Publication
    required: false
  hasPart:
    name: hasPart
    description: One or more files that form the dataset.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: dcterms:hasPart
    multivalued: true
    range: File
    required: false
  used-for:
    name: used-for
    description: One or more activities/projects that the dataset has been or is presently
      being used for.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: prov:hadUsage
    multivalued: true
    range: Usage
    required: false

Induced

name: Dataset
description: Dataset generated within the CRC 1451
from_schema: https://example.org/sfb1451/sfb1451-schema
slot_usage:
  name:
    name: name
    description: Identifies the dataset uniquely within the scope of a CRC project,
      i.e. the respective project must not have two different datasets of the same
      name. The name should be suitable for a directory/folder name. Spaces and special
      characters should be avoided.
    domain_of:
    - Dataset
    - DataController
    - Author
    required: true
  title:
    name: title
    description: Title to be displayed on the catalog landing page for the dataset.
      Language must be English.
    domain_of:
    - Dataset
    - Usage
    required: true
  description:
    name: description
    description: General description of the dataset. It may summarize its purpose,
      scope, content, and potential applications. If a long description need to be
      split into paragraphs, each paragraph can be put into a dedicated column in
      this row. Language must be English.
    domain_of:
    - Dataset
    - Usage
    required: true
  doi:
    name: doi
    description: A DOI the dataset got assigned (e.g., from a data portal it was published
      in). The DOI should preferably point to the dataset version described in the
      catalog record.
    domain_of:
    - Dataset
    - Publication
    required: false
    pattern: https://doi.org/.*
attributes:
  crc-project:
    name: crc-project
    description: One or more CRC project this dataset is attributed to (typically
      the project responsible for acquisition). The project is identified by its CRC
      project code (e.g., Z02). If multiple project have been involved, additional
      project codes can be given in subsequent columns, one per column.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:ResearchProject
    alias: crc_project
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    required: true
  version:
    name: version
    description: A label that identifies the version of the dataset the catalog record
      is describing. If a dataset is unversioned, it is acceptable to state latest.
      Otherwise any numerical label (e.g., 1.2), or text label (e.g., GITSHA 7db210fb5)
      can be provided here. The version should change when the content of the dataset
      changes.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:version
    alias: version
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    required: true
  sample[organism]:
    name: sample[organism]
    description: Classification of organism(s) associated with, or studied for the
      dataset. One or more organisms can be given, one per column.  Organisms must
      be identified by their ID in the NCBI organismal taxonomy, which can be searched
      at https://www.ebi.ac.uk/ols4/ontologies/ncbitaxon.  For example, the identifier
      for human or homo sapiens is NCBITaxon:9606. The column value should be NCBITaxon:9606
      in this case.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: openminds:Species
    multivalued: true
    alias: sample[organism]
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    required: true
  sample[organism-part]:
    name: sample[organism-part]
    description: Classification of organism part(s) associated with, or studied for
      the dataset. One or more organism parts can be given, one per column.  Organism
      parts must be identified by their ID in the Uber-anatomy ontology (UBERON),
      which can be searched at https://www.ebi.ac.uk/ols4/ontologies/uberon. For example,
      the identifier for upper limb segment is UBERON:0008785. The column value should
      be UBERON:0008785 in this case.  The identifier for the brain is UBERON:0000955,
      but more precise definitions for individual brain structures are available.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: openminds:UBERONParcellation
    multivalued: true
    alias: sample[organism_part]
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    required: true
  keywords:
    name: keywords
    description: Keywords describing the major topical themes of the dataset. Any
      number of keywords can be given, one keyword per column. Keyword aid the discoverability
      of a dataset.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:keywords
    multivalued: true
    alias: keywords
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    required: false
  license:
    name: license
    description: A license document (URL) that applies to the dataset and defines
      the terms and conditions for use.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:license
    alias: license
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    required: false
  homepage:
    name: homepage
    description: A URL the catalog should advertise as the primary source of information/data
      on this dataset. This could be a dataset page in a data portal.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:mainEntityOfPage
    alias: homepage
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    required: false
  last-updated:
    name: last-updated
    description: Date of the last modification of the described dataset (version),
      for example a release date. Must be given in ISO 8601 format (i.e., YYYY-MM-DD).
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:dateModified
    alias: last_updated
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    required: false
  data-controller:
    name: data-controller
    description: One or more entities (natural persons or organizations) that are
      (legally) responsible for a dataset, and serve as an official contact point
      regarding collaboration inquiries. For datasets involving personal data (as
      defined in the European General Data Protection Regulation; GDPR) these are
      data controllers. For CRC datasets, these are typically the PIs of the involved
      CRC project(s).
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: dpv:hasDataController
    multivalued: true
    alias: data_controller
    owner: Dataset
    domain_of:
    - Dataset
    range: DataController
    required: true
  author:
    name: author
    description: One or more entities (natural persons or organizations) that are
      considered authors of the dataset. These authors need not be identical to an
      author list of an associated publication. Any entity listed in this table will
      be credited on the catalog page of the dataset.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:author
    multivalued: true
    alias: author
    owner: Dataset
    domain_of:
    - Dataset
    range: Author
    required: true
  funding:
    name: funding
    description: One or more funding sources that are associated with the dataset
      and shall be credited on the dataset's catalog page.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:funding
    multivalued: true
    alias: funding
    owner: Dataset
    domain_of:
    - Dataset
    range: Funding
    required: true
  publication:
    name: publication
    description: One or more publications which are associated with the dataset and
      shall be credited on the dataset's catalog page.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:citation
    multivalued: true
    alias: publication
    owner: Dataset
    domain_of:
    - Dataset
    range: Publication
    required: false
  hasPart:
    name: hasPart
    description: One or more files that form the dataset.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: dcterms:hasPart
    multivalued: true
    alias: hasPart
    owner: Dataset
    domain_of:
    - Dataset
    range: File
    required: false
  used-for:
    name: used-for
    description: One or more activities/projects that the dataset has been or is presently
      being used for.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: prov:hadUsage
    multivalued: true
    alias: used_for
    owner: Dataset
    domain_of:
    - Dataset
    range: Usage
    required: false
  name:
    name: name
    description: Identifies the dataset uniquely within the scope of a CRC project,
      i.e. the respective project must not have two different datasets of the same
      name. The name should be suitable for a directory/folder name. Spaces and special
      characters should be avoided.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:name
    alias: name
    owner: Dataset
    domain_of:
    - Dataset
    - DataController
    - Author
    range: string
    required: true
  title:
    name: title
    description: Title to be displayed on the catalog landing page for the dataset.
      Language must be English.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:title
    alias: title
    owner: Dataset
    domain_of:
    - Dataset
    - Usage
    range: string
    required: true
  description:
    name: description
    description: General description of the dataset. It may summarize its purpose,
      scope, content, and potential applications. If a long description need to be
      split into paragraphs, each paragraph can be put into a dedicated column in
      this row. Language must be English.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: schema:description
    alias: description
    owner: Dataset
    domain_of:
    - Dataset
    - Usage
    range: string
    required: true
  doi:
    name: doi
    description: A DOI the dataset got assigned (e.g., from a data portal it was published
      in). The DOI should preferably point to the dataset version described in the
      catalog record.
    from_schema: https://example.org/sfb1451/sfb1451-schema
    rank: 1000
    slot_uri: bibo:doi
    alias: doi
    owner: Dataset
    domain_of:
    - Dataset
    - Publication
    range: string
    required: false
    pattern: https://doi.org/.*