Difference between revisions of "Data License Agreement"

(17 intermediate revisions by 2 users not shown)
Line 1: Line 1:
'''Data license agreements (DLA)''', or data use agreements (DUA) are documents that describe what kind of data is being shared with recipients. '''DLAs''' specify clearly the purpose and duration of access being provided to the recipient, along with restrictions and security protocols that the recipient of the data must follow. Other contracts, such as a '''non-disclosure agreement (NDA)''', may be used to guarantee [[De-identification|confidentiality]] of sensitive data.  
'''Data license agreements (DLAs)''', also called data use agreements (DUAs), are documents that describe what kind of data is being shared with recipients. A '''DLA''' clearly states the purpose and duration of the access provided to the recipient, along with the restrictions and security protocols that the recipient must follow. Other contracts, such as a '''non-disclosure agreement (NDA)''', may also be used to guarantee [[De-identification|confidentiality]] of sensitive data.


== Read First ==
== Read First ==
* A '''non-disclosure agreement (NDA)''' is an agreement signed by a member of the [[Impact Evaluation Team|research team]] stating clearly that they will not share any sensitive information with anyone outside the '''research team'''.
* A '''non-disclosure agreement (NDA)''' is an agreement signed by a member of the [[Impact Evaluation Team|research team]] stating clearly that they will not share any sensitive information with anyone outside the '''research team'''.
* A '''data license agreement (DLA)''' usually has two parties - one who provides the data, and the other who requests access to that data.
* A '''data license agreement (DLA)''' usually has two parties - one who provides the data ('''data provider'''), and the other who requests access to that data ('''data requestor''').
* Projects that have multiple sources of data will require multiple '''data license agreements'''.
* In a '''DLA''', the '''data provider''' retains full ownership of the dataset that is requested.
* Members of the [[Impact Evaluation Team|research team]] must follow certain best practices when developing a '''DLA'''.
* Projects that have multiple sources of data will require multiple '''DLAs'''.
* Members of the [[Impact Evaluation Team|research team]] must follow certain best practices when drafting a '''DLA'''.
* The [https://www.worldbank.org/en/home World Bank] uses the following '''DLA''' [https://worldbankgroup.sharepoint.com/teams/ddh/SiteAssets/SitePages/ddh/DataLicenseAgreementTemplate_v4.pdf?cid=68a54269-bbff-4b47-846d-cab248ad7de1 template].
* The [https://www.worldbank.org/en/home World Bank] uses the following '''DLA''' [https://worldbankgroup.sharepoint.com/teams/ddh/SiteAssets/SitePages/ddh/DataLicenseAgreementTemplate_v4.pdf?cid=68a54269-bbff-4b47-846d-cab248ad7de1 template].
== Overview ==
== Overview ==
Every '''data license agreement (DLA)''' involves at least two sides: one, the researcher or organization providing the data (data provider), and two, the researcher or organization requesting access to the data (data requestor).
Every '''data license agreement (DLA)''' involves at least two parties: the '''data provider''', which is the researcher or organization providing the data, and the '''data requestor''', which is the researcher or organization requesting access to the data. The '''data provider''' is responsible for permitting access to the data as a representative of the [[Protecting Human Research Subjects|research subjects]] or the agency responsible for [[Primary Data Collection|data collection]]. The data provider is also bound by laws and policies that govern access to [[Personally Identifiable Information (PII)|PII data]] and other sensitive data such as test scores and health conditions. In some cases, the data provider can enlist a '''data intermediary''' or a '''data custodian''' to share the data on their behalf. '''Custodians''' and '''intermediaries''' support access to data for the requestors. They also reduce the burden on data providers by ensuring compliance with laws and by coordinating between multiple requestors and providers.
The '''data provider''' is responsible for permitting access to the data as a representative of the [[Protecting Human Research Subjects|research subjects]] or the [[Primary Data Collection|data collecting agency]]. The data provider is also bound by laws and policies that deal with providing access to [[Personally Identifiable Information (PII)|PII data]] and other sensitive data such as test scores, health conditions, and so on. In some cases, the data provider can use a third-party like a '''data intermediary''' or '''data custodian''' to offer data on their behalf. Custodians and intermediaries support access to data and reduce the burden on data providers by ensuring compliance with laws, and coordinating between multiple requestors and providers.  


== Scope ==  
== Scope ==  
Line 20: Line 21:
'''NOTE:''' In addition to this, the '''data requestors''' must also submit their '''DLA''' to their organization's information security specialists for an internal review. In general, it is a good practice for the data requestor to prepare some form of '''documentation''' for data access, even if the '''data provider''' does not require it.
'''NOTE:''' In addition to this, the '''data requestors''' must also submit their '''DLA''' to their organization's information security specialists for an internal review. In general, it is a good practice for the data requestor to prepare some form of '''documentation''' for data access, even if the '''data provider''' does not require it.


== Timeline and Costs ==
== Drafting ==
While discussing the '''data license agreement (DLA)''', the '''data provider''' and the '''data requestor''' should carefully discuss the '''timelines''' and '''costs''' involved for both parties. This includes the following:
The process of '''drafting''' a '''data license agreement (DLA)''' can take up a lot of time. In some cases, negotiations between '''data providers''' and '''data requestors''' can also fall apart after months (or even years) of discussion. Therefore, advance planning can help both '''requestors''' and '''providers''' achieve secure agreements that ensure safety of [[Protecting Human Research Subjects|subjects]], as well as of the data and the study for which the data is required. Keep the following points in mind about preparing a secure '''DLA''':
* '''Time of data delivery''' to the requestor
# '''Initiation:''' DLAs can be initiated by either the provider or the requestor.
* '''Method of extracting data''' that is to be shared with the requestor
# '''Established procedures:''' Data providers may already have established procedures for sharing data with a requestor, or with an organization working on behalf of the requestor.
* '''Expenses''' that might arise while the data-sharing agreement is in effect, including fees charged by administrative agencies for facilitating the agreement
# '''Review:''' If a data provider has an established process for sharing data with requestors, the requesting organization must review the terms of these procedures. The requestor should offer additions or edits wherever they consider appropriate. However, requestors should note that providers may not always be able to modify their data-sharing procedures in response to this feedback because of their own policies.
 
# '''Legal framework:''' Data providers should be aware of the laws, regulations, and policies permitting use of their data. As soon as the providers receive a request, they should determine whether they already have established procedures for sharing data, and whether these procedures are legally valid.  
In addition to this, both parties should keep in mind the following with regards to timeline and cost of preparing and carrying out a data-sharing arrangement:
# '''Public procedures:''' While procedures for providing access to data may not always be publicly available, some agencies and organizations post their established procedures on their websites. This can significantly speed up and simplify the process of fulfilling the request for data access.
* '''Negotiations:''' Negotiating agreements like the '''DLA''' can take up to a year, from the drafting stage to the execution stage, especially if there is no history of the two parties exchanging data before.
* '''Pending requests:''' Even if the requestor and the provider have previously exchanged data before, there could be pending requests with the provider that can cause delays.  
* '''Interim:''' Even after signing a '''DLA''', requestors should aaccount for the time after approval, but before data delivery, since the processes for fulfilling the request can take a significant amount of time to be completed. For example, data providers will need time to [[Data Documentation|document]] and [[Data Cleaning|format]] the requested data. They may also need time to extract the data from multiple databases, or carefully [[Encryption|encrypted storage]]
* '''Publication:''' The data requestor may also need to notify the data providers before any releasing any outputs or [[Publishing Data|publications]]. This would take additional time.
* '''Transparency:''' Finally, if both parties are transparent about the timeline and costs, and focus on making the '''DLA''' as clear as possible, then it makes it easier to carry out the data-sharing arrangement.
 
== Drafting DLAs ==
Creating DUAs can be time-intensive. In some cases, negotiations fall apart after months or years of discussions. Advance planning can help both researchers and data providers achieve sound DUAs. DUAs can be initiated by the researcher or data provider. Data providers may have different or expedited procedures when sharing data with a researcher, an evaluator, or contractor working on their behalf.


If a data provider has an established data request process, a researcher must review their terms and requirements, offering additions or edits as appropriate. Data providers should be aware of the laws, regulations, and policies permitting use of their data, and, upon receiving a first request, determine whether data request procedures already exist in their organization. Data providers (such as government agencies or private companies) may have Offices of General Counsel that have preferred templates or formats. Some data providers will be reluctant or unable to modify their request processes. Data request and access procedures may not always be publicly available, though some agencies and organizations have data request procedures on their websites, and this can significantly speed up and simplify the request process.
== Logistics ==
While discussing the '''data license agreement (DLA)''', the '''data provider''' and the '''data requestor''' should carefully discuss the '''logistics'''. This includes the following:
* '''Time of data delivery:''' That is, when the requestor will get access to the data.
* '''Method of extracting data:''' That is, how the data will be extracted from [[Encryption|storage]] before being shared with the requestor.
* '''Expenses involved:''' That is, the expenses that might arise for both the requestor and the provider while the data-sharing agreement is in effect. This includes fees charged by administrative agencies and intermediaries for ensuring the agreement is successfully completed.
In addition to this, both parties should keep in mind the following with regard to the '''timeline''' and '''cost''' of drafting and carrying out a data-sharing arrangement:
* '''Negotiations:''' Negotiating agreements like the '''DLA''' can take up to a year from the drafting stage to the execution stage, especially if there is no history of the two parties exchanging data in the past.
* '''Pending requests:''' Even if the requestor and the provider have exchanged data in the past, there could be pending requests with the provider that can cause delays.
* '''Interim steps:''' After signing a '''DLA''', requestors should allocate additional time after approval, but before data delivery, since the processes for fulfilling the request can take a significant amount of time to complete. For example, data providers will need time to [[Data Documentation|document]] and [[Data Cleaning|format]] the requested data. The providers may also need time to extract the data from multiple databases, or from carefully [[Encryption|encrypted storage]].
* '''Publication:''' The data requestor may also need to notify the data providers before releasing any outputs or [[Publishing Data|publications]], and therefore additional time should be allocated to allow for these approvals.
* '''Transparency:''' Finally, if both parties are transparent about the timeline and costs, and focus on making the '''DLA''' as clear as possible, the data-sharing arrangement becomes significantly easier to carry out.
In conclusion, no matter how big the project is, or how large the requested dataset is, both parties should invest time and effort into preparing a sound '''data license agreement'''.


== Related Pages ==
== Related Pages ==
Line 42: Line 46:


== Additional Resources ==
== Additional Resources ==
* JPAL, [https://admindatahandbook.mit.edu/book/testing/index.html Handbook on Using Administrative Data for Research and Evidence-based Policy]]
* JPAL, [https://admindatahandbook.mit.edu/book/testing/index.html Handbook on Using Administrative Data for Research and Evidence-based Policy]
* World Bank, [https://worldbankgroup.sharepoint.com/teams/ddh/SiteAssets/SitePages/ddh/DataLicenseAgreementTemplate_v4.pdf?cid=68a54269-bbff-4b47-846d-cab248ad7de1 World Bank's Data License Agreement]
* World Bank, [https://worldbankgroup.sharepoint.com/teams/ddh/SiteAssets/SitePages/ddh/DataLicenseAgreementTemplate_v4.pdf?cid=68a54269-bbff-4b47-846d-cab248ad7de1 World Bank's Data License Agreement]
[[Category: Research Design]]

Revision as of 15:03, 13 April 2021

Data license agreements (DLAs), also called data use agreements (DUAs), are documents that describe what kind of data is being shared with recipients. A DLA clearly states the purpose and duration of the access provided to the recipient, along with the restrictions and security protocols that the recipient must follow. Other contracts, such as a non-disclosure agreement (NDA), may also be used to guarantee confidentiality of sensitive data.

Read First

  • A non-disclosure agreement (NDA) is an agreement signed by a member of the research team stating clearly that they will not share any sensitive information with anyone outside the research team.
  • A data license agreement (DLA) usually has two parties - one who provides the data (data provider), and the other who requests access to that data (data requestor).
  • In a DLA, the data provider retains full ownership of the dataset that is requested.
  • Projects that have multiple sources of data will require multiple DLAs.
  • Members of the research team must follow certain best practices when drafting a DLA.
  • The World Bank uses the following DLA template.

Overview

Every data license agreement (DLA) involves at least two parties: the data provider, which is the researcher or organization providing the data, and the data requestor, which is the researcher or organization requesting access to the data. The data provider is responsible for permitting access to the data as a representative of the research subjects or the agency responsible for data collection. The data provider is also bound by laws and policies that govern access to PII data and other sensitive data such as test scores and health conditions. In some cases, the data provider can enlist a data intermediary or a data custodian to share the data on their behalf. Custodians and intermediaries support access to data for the requestors. They also reduce the burden on data providers by ensuring compliance with laws and by coordinating between multiple requestors and providers.

Scope

Data license agreements (DLAs) can differ in terms of structure and the level of detail depending on the context. However, they must clearly cover the following aspects:

  • The legal framework within which access to the data will be provided
  • Scope of the study for which requestor needs access to the data
  • Restrictions on what the requestor can do with the data
  • Constraints on publishing sensitive information and PII data

NOTE: In addition to this, the data requestors must also submit their DLA to their organization's information security specialists for an internal review. In general, it is a good practice for the data requestor to prepare some form of documentation for data access, even if the data provider does not require it.

Drafting

The process of drafting a data license agreement (DLA) can take up a lot of time. In some cases, negotiations between data providers and data requestors can also fall apart after months (or even years) of discussion. Therefore, advance planning can help both requestors and providers achieve secure agreements that ensure safety of subjects, as well as of the data and the study for which the data is required. Keep the following points in mind about preparing a secure DLA:

  1. Initiation: DLAs can be initiated by either the provider or the requestor.
  2. Established procedures: Data providers may already have established procedures for sharing data with a requestor, or with an organization working on behalf of the requestor.
  3. Review: If a data provider has an established process for sharing data with requestors, the requesting organization must review the terms of these procedures. The requestor should offer additions or edits wherever they consider appropriate. However, requestors should note that providers may not always be able to modify their data-sharing procedures in response to this feedback because of their own policies.
  4. Legal framework: Data providers should be aware of the laws, regulations, and policies permitting use of their data. As soon as the providers receive a request, they should determine whether they already have established procedures for sharing data, and whether these procedures are legally valid.
  5. Public procedures: While procedures for providing access to data may not always be publicly available, some agencies and organizations post their established procedures on their websites. This can significantly speed up and simplify the process of fulfilling the request for data access.

Logistics

While discussing the data license agreement (DLA), the data provider and the data requestor should carefully discuss the logistics. This includes the following:

  • Time of data delivery: That is, when the requestor will get access to the data.
  • Method of extracting data: That is, how the data will be extracted from storage before being shared with the requestor.
  • Expenses involved: That is, the expenses that might arise for both the requestor and the provider while the data-sharing agreement is in effect. This includes fees charged by administrative agencies and intermediaries for ensuring the agreement is successfully completed.

In addition to this, both parties should keep in mind the following with regard to the timeline and cost of drafting and carrying out a data-sharing arrangement:

  • Negotiations: Negotiating agreements like the DLA can take up to a year from the drafting stage to the execution stage, especially if there is no history of the two parties exchanging data in the past.
  • Pending requests: Even if the requestor and the provider have exchanged data in the past, there could be pending requests with the provider that can cause delays.
  • Interim steps: After signing a DLA, requestors should allocate additional time after approval, but before data delivery, since the processes for fulfilling the request can take a significant amount of time to complete. For example, data providers will need time to document and format the requested data. The providers may also need time to extract the data from multiple databases, or from carefully encrypted storage.
  • Publication: The data requestor may also need to notify the data providers before releasing any outputs or publications, and therefore additional time should be allocated to allow for these approvals.
  • Transparency: Finally, if both parties are transparent about the timeline and costs, and focus on making the DLA as clear as possible, the data-sharing arrangement becomes significantly easier to carry out.

In conclusion, no matter how big the project is, or how large the requested dataset is, both parties should invest time and effort into preparing a sound data license agreement.

Related Pages

Click here for pages that link to this topic.

Additional Resources