Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents
maxLevel2

What is “Structural Metadata”?

Simply put, structural metadata describes the variables within a dataset and how they are organised.

What do I need to include?

We recommend you provide a data dictionary of structural metadata that includes the following information:

...

Metadata Field

...

Completion Guidance

...

Constraints

...

Mandatory?

...

Table Name 

...

Name of the table in the dataset. Use a fully qualified name if appropriate.

...

255 characters max

...

Yes

...

Table Description

...

Description of the table in the dataset.

...

20,000 characters max

...

No

...

Column Name

...

Name of the column in the table dataset

...

255 characters max

...

Yes

...

Column Description 

...

Description of the column in the table dataset.

...

20,000 characters max

...

No

...

Data Type Name

...

Type of data contained in the column.

...

255 characters max

...

Yes

...

Sensitive 

...

Please indicate (True / False) whether the information must be treated as sensitive and may need additional constraints / removal / anonymisation / masking through the data access request process.

...

True / False only.

...

Yes

Where can I see some examples of valid Structural Metadata?

Please see the following example for your reference.

If you require further clarification, we also have several fully populated submissions from other sites that we would be happy to walk through with you. Please contact us via Service Desk to arrange a session.

...

Structural Metadata Example

...

Table Name

...

Table Description

...

Column Name

...

Column Description

...

Data Type

...

Sensitive

...

Summary

...

The summary

...

Identifier

...

The identifier

...

String

...

TRUE

...

Summary

...

The summary

...

Title

...

The title

...

String

...

FALSE

...

Summary

...

The summary

...

Description

...

The description

...

String

...

TRUE

...

Required

...

The required elements

...

Release Date

...

The release date

...

Date

...

FALSE

...

Required

...

The required elements

...

Data Controller

...

The controller

...

String

...

FALSE

...

Required

...

The required elements

...

Data Processor

...

The data processor

...

String

...

TRUE

...

Required

...

The required elements

...

License

...

The license

...

String

...

FALSE

...

Recommended

...

The recommended elements

...

Periodicity

...

The periodicity

...

Integer

...

TRUE

...

Recommended

...

The recommended elements

...

Frequency

...

The frequency

...

String

...

FALSE

What format should this be in?

The standard format required is an excel file which includes the Metadata Fields listed in the table above (Table Name, Table Description, Column Name, Column Description, Data Type, Sensitive).

Please see this template file, which is also available on the Onboarder:

View file
nameStructural_Metadata_Template.xlsx

Is an Excel spreadsheet the only way I can upload metadata?

Definitely not!

This simple excel template is used as the low common denominator to make our interface as accessible as possible.

However, there are several different methods that supervisors can use to consume and update metadata. We've already used a few of these different formats, on request, with other clients:

  1. REST API that accepts well-formed XML, such as these:

View file
nameMDC dataModel_2.2 (2).xsd
View file
nameHDRUK6482603468091225922.xml

...

In DSRC Data Standards this takes the form of a Data Dictionary.

The Data Dictionary is organised into Data Classes.

image-20240207-223401.pngImage Added


A Data Class, or a table, is a collection of one or more Data Elements and/or child Data Classes.

The definitions, including rules and constraints for valid values for all Data Elements are defined using the following metamodel:

  • Name: the name of the data class

  • Description: a description of the data class

  • Data Elements: data elements contained with the class

  • Child Classes: data classes contained within the class

  • Cardinality: the number of times a data class can occur within a group

  • Level: Could, Should, Must

  • Metadata: any additional metadata associated with the data class

  • Relationships: related data classes or elements (which may reside in another standard)

image-20240207-225650.pngImage Added


A Data Element is the logical definition of a unit of data such as:

  • a column in a database table 

  • a field in a form 

  • an element in an XML schema

The definitions, including rules and constraints for valid values for all Data Elements are defined using the following metamodel:

  • Name: the name of the data element

  • Description: a description of the data class

  • Data type: the constraint on the data element e.g. integer, string

  • Cardinality: the number of times an lement can occur within a group

  • Level: Could, Should, Must

  • Metadata: any additional metadata associated with the data element

  • Relationships: related data classes or elements (which may reside in another standard), including child/parent relationships

image-20240207-225747.pngImage Added


Data Elements contain a Data Type which specifies the input value of the element e.g. varchar or datetime.

  • Name: the name of the data type

  • Description: a description of the data type, if applicable

  • Type

    • Simple - e.g. varchar, string, integer

    • Enumerated - a list of possible values

    • Reference

  • Relationships: to other data types or data element

image-20240207-164201.pngImage Added