Skip to Main Content

Guide to Project Data Management

How do these instructions work?

Laurea Data Management Plan detailed instructions - How do these instructions work?

On this page you will find instructions on how to fill in the Data Management Plan template. First take a cursory look at the guide and then use it to create the actual DMP.

You can make a DMP using DMPTuuli tool. Check instructions for the tool here. If you prefer, you can also use simple MS Word document instead of DMPTuuli tool.

Your DMP is a living document where you describe how you will manage your data throughout the research life cycle. Update the plan when your project progresses.

NOTE: For EU-funded projects, especially for Horizon Europe, the Data Management Plan differs from the basic format. Use the Horizon template for Horizon projects.

1. General description of the data

1.1. What kinds of data is your research based on? What data will be collected, produced or reused? What file formats will the data be in? Additionally, give a rough estimate of the size of the data produced/collected.

Briefly describe what types of data you are collecting or producing. In addition, explain what kinds of already existing data you will (re)use. List, for example, the types of texts, images, photographs, measurements, statistics, physical samples or codes.

Categorise your data in a table or with a clear list, for example:
A) previously collected existing data which is being reused in this project,
B) data collected for this project,
C) data produced as an outcome of the research process.
The categorisation can form a general structure for the rest of the DMP.

List the file formats for each data set. In some cases, the file formats used during the research project may differ from those used in archiving the data after the project. List both. The file format is a primary factor in the accessibility and reusability of your data in the future.

In the DMP, what is important is to describe the required disk space, not how many informants participated in the project. A rough estimation of the size of the data is sufficient, for example, less than 100 GB, approx. 1 TB or several petabytes.

Tips for best practices

  • Use a table or bullet points for a concise way to present data types, file formats, the software used and the size of the data.
  • Examples of file formats are .csv, .txt, .docx, .xslx and .tif.
  • Make sure to describe any special or uncommon software necessary to view or use the data, especially if the software is coded in your project.
  • You can also estimate the increase in data production or collection during the project for a specific time period: "The project is producing/collecting approximately 100 GB of data per week."

AVOID OVERLAPS WITH THE RESEARCH PLAN! Data analysis and methodological issues related to data and materials should be described in your research plan.

1.2 How will the consistency and quality of data be controlled?

Explain how the data collection, analysis and processing methods used may affect the quality of the data and how you will minimise the risks related to data accuracy.

Data quality control ensures that no data is accidentally changed and that the accuracy of the data is maintained over its entire life cycle. Quality problems can emerge due to the technical handling, converting, or transferring of data, or during its contextual processing and analysis.

Tips for best practices

  • Adopt and enforce formal version control processes. This can mean e.g. simply shared and documented file naming conventions, or everyone in team working in Git repositories.
  • Transcriptions of audio or video interviews should be checked by someone other than the transcriber.
  • Analog material should be digitised in the highest resolution possible for accuracy.
  • In all conversions, maintaining the original information content should be ensured.
  • Organise training sessions and set guidelines to ensure that everyone in your research group can implement quality control and anticipate the risks related to the quality of the data.

AVOID OVERLAPS WITH THE RESEARCH PLAN! Issues related to data analysis, methods and tools should be described in your research plan, that is, do not include, for example, instrument calibration descriptions here.

2.1 What legal issues are related to your data management? (For example, GDPR and other legislation affecting data processing.)

2.1 What legal issues are related to your data management? (For example, GDPR and other legislation affecting data processing.)

These question are usually most crucial for you project. Address each of these themes (2.1.1 - 2.1.7) separately.

2.1.1. Personal data and personal data protection

Tell first whether you collect personal data and, if so, what kind of personal data you collect. Is the data identifiable or is it special or sensitive personal data? If your data contains personal data, you must comply with EU data protection regulations and the Finnish Data Protection Act.

Personal data are data that allow a person to be identified, directly or indirectly, for example by combining data. Personal data can be direct (e.g. name, personal identification number, e-mail address, telephone number) or indirect (e.g. image, voice, a particular feature or even a particular hobby). What is direct or indirect personal data? Read more information on personal data.

If your data contains sensitive personal data or other data whose processing may pose risks to individuals, you may need to prepare a Data Protection Impact Assessment (DPIA). If applicable, describe how you will assess the necessity of the personal data you collect and how you will prevent risks related to the processing of personal data.

2.1.2 Primary responsibility for the processing of personal data, i.e. the controller

When collecting personal data for a project, it must be determined who is responsible for the processing of personal data and the methods of protection for the research project. This is the controller, who is responsible for ensuring that the processing of personal data is lawful.

In general, the data controller for a project is the organisation responsible for collecting the data for the project. As a general rule, the controller is therefore the person responsible for the project as a whole. If Laurea is collecting the data, Laurea is the controller. If the processing of personal data has been jointly agreed among other organizations participating in the project, add their contact information as well. Use the instructions of Laurea Privacy notice

2.1.3 Privacy notice

If your data contains personal data, you must prepare a privacy notice explaining what and how you process personal data and the basis for the processing. Use the Laurea Privacy notice template

If research data are collected from subjects or they otherwise participate in the research, a research note and informed consent (section 2.4) are always required. Consent is usually requested in writing and in some cases orally as part of the interview. If the data are collected by questionnaire, the first question of the questionnaire concerns informed consent.

If the processing of personal data is based on consent, consent must also be asked for the processing of personal data as part of the participant's consent. Click here to find out more about what is meant by processing based on personal data.

Describe here what data protection-related documents and practices are needed in your study and how you will implement them. Make use of Laurea's 1) the Laurea Privacy notice template and 2) consent form.

2.1.4 Informing research participants and consent to participate

Explain how you will inform your research participants and ask their consent to participate.

Read more about informing and informed concent.

2.1.5 Reseach permit

Do you need a research permit? The research permit is obtained from the organisation to which the research relates.

Whenever the research concerns either Laurea as an organisation, Laurea students or staff, or part of them, a research permit is required from Laurea.  Read more on research permit in Laurea intranet.

2.1.6 Ethical review

Does your research or development project need an ethical review? If necessary, address this in your data management plan. Read more here.

 

 

2.2 How will you manage the rights of the data you use, produce and share?

2.2  How will you manage the rights of the data you use, produce and share?

Describe how you will agree upon the rights of use related to your research data – including the collected, produced and (re)used data of your project. Here, you can employ your categorisation in the first question. Each of these categories involves different rights and licenses. Describe the transfer of rights procedures relevant to your project. Describe confidentiality issues if applicable in your project.

Tips for best practices

  • Agreements on rights of use should be made as early as possible in the project life cycle.
  • Have you gained consent for data preservation and sharing?
  • Follow the funder's or publisher's policies.

It is recommended to make all of the research data, code and software created within a research project available for reuse, e.g., under a Creative Commons (https://creativecommons.org/choose/Opens in a new window), GNU (https://www.gnu.org/licenses/gpl-3.0.en.htmlOpens in a new window) or MIT license (https://opensource.org/licenses/MITOpens in a new window), or under another relevant license.

3 Documentation and metadata

3. Documentation and metadata

How will you document your data in order to make it findable, accessible, interoperable and re-usable for you and others?  What kind of metadata standards, README files or other documentation will you use to help others to understand and use your data?

Data documentation enables data sets and files to be discovered, used and properly cited by other users (human or computer). Without sufficient documentation the data cannot be reused.

Documentation includes essential information regarding the data, for example a) core metadata (for discovery and identification) where, when, why and how the data were collected as well as b) descriptive information how the data is interpreted correctly using metadata standards, vocabularies and e.g. readme-files.

Tips for best practices

  • Describe all the types of documentation (README files, metadata standards, vocabularies etc.) you will provide to help secondary users to understand and reuse your data. Repositories often require the use of a specific metadata standard. Check whether a discipline-specific metadata schema or standard exists that can be adopted.
  • Consider how the data will be organised during the project. Describe, for example, your file-naming conventions, version control and folder structure.
  • Use research instruments, which create standardised metadata formats automatically.

Identify the types of information that should be captured to enable other researchers to discover, access, interpret, use and cite your data. See for example Qvain requirements (https://www.fairdata.fi/en/user-guides/qvain-user-guide/#QvainDataset)

 

4.1 Where will your data be stored, and how will the data be backed up?

4.1 Where will your data be stored, and how will the data be backed up?

Describe where you will store and back up your data during your research project. Consider who will be responsible for backup and recovery. If there are several researchers involved, create a plan with your collaborators and ensure safe transfer between participants.

Show that you are aware of the storing solutions provided by Laurea or your organisation. Do not merely refer to IT services. In the end, you are responsible for your data, not the IT department or the organisation.

Explain the methods for preserving and sharing your data after your research project has ended in more detail in Section 5.

Tips for best practices

  • The use of a safe and secure storage provided and maintained by Laurea’s IT support or other reliable IT provider such as CSC is preferable. Read the Laurea Guide for Processing information materials.
  • Do NOT USE external hard drives as the main storing option.
  • Follow Laurea's data security requirements

4.2 Who will be responsible for controlling access to your data, and how will secured access be controlled?

4.2 Who will be responsible for controlling access to your data, and how will secured access be controlled?

It is essential to consider data security issues, especially if your data include sensitive data, personal data, politically sensitive information or trade secrets. Describe who has access to your data, what they are authorised to do with the data, or how you will ensure the safe transfer of data to your collaborators.

Tips for best practices

  • Access controls should always be in line with the level of confidentiality involved.

5. Opening, publishing and archiving the data after the project

5.1 What part of the data can be made openly available or published? Where and when will the data, or its metadata, be made available?

Describe how you will make data available and findable for reuse. If your data or parts of the data cannot be opened, explain why you publish only metadata.

In the case of sensitive data, which cannot be opened, describe the opening of its metadata. Describe the secured preservation procedure of sensitive data in Section 5.2.

The openness of research data promotes its reuse.

Tips for best practices

5.2 Where will data with long-term value be preserved, and for how long?

Briefly describe what part of your data you will preserve, where it is preserved, and for how long. Long term preservation means that data is preserved for as long as necessary, for several decades or even centuries.

You can categorise your data sets according to the anticipated preservation period:

A) Data to be destroyed upon the ending of the project
B) Data to be archived for a verification period, which varies across disciplines, e.g., 5–15 years
C) Data to be archived for potential re-use, e.g., for 25 years
D) Data with long-term value to be archived by a curated facility for future generations for tens or hundreds of years

You will need to decide which of your research data to preserve and dispose of. Data that is unique or difficult to replicate might have long-term value and be fit for preservation. Special long term data repositories should be used for digital preservation. 

Tips for best practices

  • Decisions about preserving data should begin during the data management planning stage, and should take into account e.g. institutional guidance and requirements.
  • Use data repositories with a commitment to long-term curation, e.g. Fairdata Digital Preservation ServiceOpens in a new window is dedicated for research datasets that have significant value to the organization or on a national level currently and especially also in the future. Contact your home organisation for further information.

6 Data management responsibilities and resources

6.1 Who (for example role, position, and institution) will be responsible for data management?

Summarise here all the roles and responsibilities described in the previous answers.

Tips for best practices

  • Outline the roles and responsibilities for data management/stewardship activities, for example, data capture, metadata production, data quality, storage and backup, data archiving, and data sharing. Name the responsible individual(s) where possible.
  • For collaborative projects, explain the co-ordination of data management responsibilities across partners.
  • Indicate who is responsible for implementing the DMP and for ensuring that it is reviewed and, if necessary, revised.
  • Consider scheduling regular updates of the DMP.

Finally, consider who will be responsible for the data resulting from your project after your project has ended.

6.2 What resources will be required for your data management procedures to ensure that the data can be opened and preserved according to FAIR principles (Findable, Accessible, Interoperable, Re-usable)?

Estimate the resources, such as time and financial costs, needed to manage, share and preserve the data. These may include storage costs, hardware, staff time, the costs of preparing data for deposit and repository charges.

Tips for best practices

  • Consider, if there will be additional costs from computational facilities or resources that need to be accessed.
  • Account for resources, time and money, needed to prepare the data for sharing it and preservation (data curation).
  • Remember to specify your data management costs in the budget, according to funder requirements.

 

Laurea-kirjasto | Saavutettavuusseloste | Laurea Library | Accessibility statement