Open Data Licenses¶
The development of tutorials for the Open Energy Family takes place publicly in a dedicated tutorial repository. Please report bugs and suggestions as new issues. If Jupyter Notebooks are new to you and you'd like to get an introduction, have a look at this less than 10 minute introduction video. Official installation instructions are available on jupyter's readthedocs page.
license: GNU Affero General Public License Version 3 (AGPL-3.0)
copyright: Reiner Lemoine Institut
authors: Ludwig Hülk
Takeaways¶
- Overview of existing open data licenses
- Suitable and allowed open licenses for the OEP
Introduction¶
Licensing your data is important. Copyright affects all digital information and without context it's unclear if our data can be used by others or not. Supplying a license will automate the steps of asking for permission and allowing, so it makes life better for the giving and the receiving end. Note that in some countries, like Germany, it is not possible to waive one's copyright. Data that don't come with any license are not open. A license will provide legal certainty for usage. Choosing an established license will take care of questionable edge cases without umbiguity as well.
In the field of information technology there are many licenses. Some cover multiple materials and domains, i.e. software, data or content while others only cover specific domains.
This tutorial aims to collect and describe existing and commonly used open data licenses.
Please note that there are different definitions of free, open, libre and what falls into open science data.
Open Licenses¶
The Open Knowledge Foundation provides a definition for open data:
In general, open licenses grant the four freedoms, i.e. rights, that were originally defined for free software. These freedoms are:
- to run the program as wished and for any purpose
- to study how the program works, and change it so it does your computing as wished. Access to the source code is a precondition for this.
- to redistribute copies so you can help others
- to distribute copies of modified versions to others
These freedoms can be translated to open data. So they become the freedoms to freely use, analyse, share and modify the data. The main difference between open data licenes and free or open source software licenses, lies in the definition of the obligations that must be met in order to obtain the rights. There are already lists of open source licenses that guarantee the four freedoms. For example the Open Source Initiative (OSI) offers a list of approved licenses.
Licenses which are not open¶
There are certain obligations that a license may formalise, which will prevent the data from being considered open! The Open Knowledge Foundation has a section on non-conformant clauses. In brief, the main relevant clauses are:
- Non-commercial (NC)
You may not use the material for commercial purposes. As there is no definition of what is considered commercial, and an NGO or a non-profit organization may also be considered commercial entities, this clause renders data licensed under it not open. - No Derivatives (ND)
You may not alter, transform, or build upon this work. This limits your freedom to modify the data and is therefore not open.
Open Data Licenses¶
The open data licenses below are grouped by the three main obligations. For a broader list of licenses, also see: https://github.com/ifrOSS/ifrOSS/blob/master/OpenDataLicenses.md and https://opendefinition.org/licenses/ .
Public Domain¶
Public domain is a concept in which no copyright or other protection like patents and trademark exists and thus no obligations are defined. In order to label and communicate this, information about being public domain is useful! In some countries, like Germany, it is not possible to waive one's copyright. Specifically declaring an unrestrictive open license will ensure a legal status that is equivalent to public domain and that works worldwide.
For more information see the international definition of public domain and the German/European Gemeinfreiheit.
Usecase: Data for which no attribution to authors is required, e.g. "helper tables" for conversions.
Openness: As open as possible, but others can create non-open derivatives of it.
id | name | spdx | link | comment |
---|---|---|---|---|
1.1 | Creative Commons Zero v1.0 Universal | CC0-1.0 | https://creativecommons.org/publicdomain/zero/1.0/legalcode | recommended |
1.2 | ODC Public Domain Dedication & License 1.0 | PDDL-1.0 | https://opendatacommons.org/licenses/pddl/1-0/ | recommended |
1.3 | Datenlizenz Deutschland – Zero – Version 2.0 | dl-de/zero-2-0 | https://www.govdata.de/dl-de/zero-2-0 | granted only by the German government |
1.4 | Public Domain Mark | CC-PDDC | https://creativecommons.org/licenses/publicdomain/ | not recommended |
Attribution¶
This obligation prescribes creating and placing an attribution string close to the created work. The string must contain the name of the author and/or copyright holder and has sometimes additional requirements, such as a link to the license text or the original source. Be aware that this attribution string is different from the scientific citation string, which is necessary for good scientific practice.
Licenses that merely require an attribution are often referred to as permissive licenses
For more information, see Best practices for attribution and Attribution Generator.
Usecase: Data for which naming of the authors and/or copyright holders is required, e.g. "input and result data". Note: The holder of database rights under EU law is the investor, not the author. The same is true for works made for hire in the US.
Openness: Open and suitable for acedemic work.
id | name | spdx | link | comment |
---|---|---|---|---|
2.1 | Creative Commons Attribution 4.0 International | CC-BY-4.0 | https://creativecommons.org/licenses/by/4.0/legalcode | recommended, prior versions OK if necessary |
2.2 | Open Data Commons Attribution License v1.0 | ODC-By-1.0 | https://opendatacommons.org/licenses/by/1-0/ | recommended |
2.3 | Community Data License Agreement – Permissive – Version 1.0 | CDLA-Permissive-1.0 | https://cdla.io/permissive-1-0/ | |
2.4 | Datenlizenz Deutschland – Namensnennung – Version 2.0 | dl-de/by-2-0 (no official spdx-id) | https://www.govdata.de/dl-de/by-2-0 | granted only by the German government |
2.5 | Gesetz über den Zugang zu digitalen Geodaten (Geodatenzugangsgesetz - GeoZG) | GeoZG (no official spdx-id) | https://www.gesetze-im-internet.de/geozg/index.html | quasi-license, granted only by the German government |
2.6 | Verordnung zur Festlegung der Nutzungsbestimmungen für die Bereitstellung von Geodaten des Bundes (GeoNutzV) | GeoNutzV | https://www.gesetze-im-internet.de/geonutzv/index.html | quasi-license, granted only by the German government. Requires citation instead of attribution |
Share-Alike (Copyleft)¶
The Share-Alike obligation requires that you choose the exact same license for aggregated or modified data. It can be described as reciprocal, hereditary or (negatively connotated) virulent.
For more information see copyleft on Wikipedia.
Usecase: Data where naming of the authors is required and all further updates must be open as well, e.g. "manual collection of technical parameters from factsheets".
Openness: Open and will be open forever.
id | name | spdx | link | comment |
---|---|---|---|---|
3.1 | ODC Open Database License v1.0 | ODbL-1.0 | https://opendatacommons.org/licenses/odbl/1-0/ | recommended |
3.2 | CC-BY-SA-4.0 | https://creativecommons.org/licenses/by-sa/4.0/legalcode | not recommended! |
Legal bases for use of data¶
There are a few legal bases with the goal of providing an open access and free use of public data. For example there is a directive by the EU parliament and the council of the EU for the creation of a geodata-infrastructure, also called the INSPIRE directive. Another example is the directive on the reuse of public sector information, or PSI directive.
German law contains something called "Gesetz über den Zugang zu digitalen Geodaten" (Geodatenzugangsgesetz, or GeoZG for short)". GeoZG in § 11 introduces an obligation to allow the use of the country's geodata, however it doesn't itself allow use. Asking for attribution is not required by this law. If you're feeling particularly good about your skills in legal German language: „Geodaten und Metadaten sind über Geodatendienste für die kommerzielle und nicht kommerzielle Nutzung geldleistungsfrei zur Verfügung zu stellen, soweit durch besondere Rechtsvorschrift nichts anderes bestimmt ist oder vertragliche oder gesetzliche Rechte Dritter dem nicht entgegenstehen.“