Skip to main content

Advertisement

Responsible data sharing in international health research: a systematic review of principles and norms

Article metrics

Abstract

Background

Large-scale linkage of international clinical datasets could lead to unique insights into disease aetiology and facilitate treatment evaluation and drug development. Hereto, multi-stakeholder consortia are currently designing several disease-specific translational research platforms to enable international health data sharing. Despite the recent adoption of the EU General Data Protection Regulation (GDPR), the procedures for how to govern responsible data sharing in such projects are not at all spelled out yet. In search of a first, basic outline of an ethical governance framework, we set out to explore relevant ethical principles and norms.

Methods

We performed a systematic review of literature and ethical guidelines for principles and norms pertaining to data sharing for international health research.

Results

We observed an abundance of principles and norms with considerable convergence at the aggregate level of four overarching themes: societal benefits and value; distribution of risks, benefits and burdens; respect for individuals and groups; and public trust and engagement. However, at the level of principles and norms we identified substantial variation in the phrasing and level of detail, the number and content of norms considered necessary to protect a principle, and the contextual approaches in which principles and norms are used.

Conclusions

While providing some helpful leads for further work on a coherent governance framework for data sharing, the current collection of principles and norms prompts important questions about how to streamline terminology regarding de-identification and how to harmonise the identified principles and norms into a coherent governance framework that promotes data sharing while securing public trust.

Background

Recently, a number of multi-stakeholder initiatives have been funded to develop data-driven translational research platforms to improve patient outcomes and reduce the societal burden of specific disease areas in the European Union (EU) [1, 2]. The Innovative Medicines Initiative’s (IMI) BigData@Heart is an example of a consortium that is currently designing an international data sharing platform to stimulate drug development and personalised medicine for cardiovascular disease. To ensure responsible use of data in BigData@Heart as well as similar research projects, good governance of data sharing and data access is critical [1].

So far, no blueprint of a broadly accepted governance framework exists. The recently adopted General Data Protection Regulation (GDPR) (Regulation (EU) 2016/679) will not be able to provide for the necessary guidance in full, since specific provisions for scientific research may still be formulated at the level of national jurisdictions within the EU [3]. Moreover, compliance with the law does not always guarantee that data is used in morally acceptable ways, or that public trust is secured [4]. The evolving landscape of big health data raises new questions about both familiar ethical concepts (such as privacy, confidentiality and informed consent), as well as novel ones. These developments indicate that innovative and adaptable governance models are highly needed to establish a practice of truly responsible data sharing.

To identify what elements are considered inherent to a governance structure for responsible data sharing within (consortium-wide) platforms for international health research, we reviewed frameworks for data sharing as described in the academic literature and in ethical guidelines. This study was driven by the question: What are the ethically relevant principles and norms so far developed by (international) working groups or professional organisations with respect to international data sharing in health research?

Methods

Search and selection

We performed a systematic review of principles and norms for responsible health data sharing as identified from the academic, peer-reviewed literature. In addition, we reviewed the principles and norms as developed in a selection of relevant ethical guidelines.

Search

Relevant literature was identified through a systematic search in three databases for scientific, peer-reviewed literature, covering the PubMed, EMBASE and Scopus databases (See Appendix 1 for a breakdown of search terms). Search strings were adjusted to the type of database to restrict superfluous results to a minimum (See Appendix 2). Google Scholar was searched for additional sources, including grey literature. Relevant guidelines and policy documents on data sharing in international health research were identified with help from six academic and industry partners from the IMI BigData@Heart consortium with expertise in health law (n = 2), regulatory science (n = 2) and research ethics (n = 2). Experts were asked to list the ethical guidelines they found most relevant to policy and practice.

Selection

For inclusion, publications were required to present a coherent set of principles and/or norms that could potentially function as or at least be construed as part of a model or framework for ethically responsible data sharing (Table 1). Here, we understand “principles” to constitute propositions that together serve as the foundation for a system of governing norms. We use “norms” to refer to standards for ethically responsible behaviour or actions. We depart from the assumption that principles inform norms, and that no governance framework can go without these elements. Documents were included if norms and principles were discussed along with tangible measures to facilitate implementation into policy and if the content was developed by or in collaboration with (international) working groups or professional organisations active in the field of health data sharing. Since we were also specifically interested in developments over recent years, we limited our eligibility to sources published between 2006 and 6 August 2018. Only sources published in English were eligible.

Table 1 Inclusion and exclusion criteria

National and EU laws were excluded from this study because we were primarily interested in elements of a governance framework that provides comprehensive moral guidance, not only enforces legal compliance. Even though the law does require the implementation of a number of organisational and technical measures, what a governance framework exactly looks like is ultimately to be developed in practice [5]. Publications that were limited to a discussion of benefits, imperatives or challenges for health data sharing or IT infrastructures for Big Data research were not deemed relevant to the purpose of this review. All sources that were not of relevance to the European context were also excluded (e.g., practice guidelines for low and middle income countries).

Data extraction and analysis

From all included references and guidelines we extracted the following data: author names, year of publication, organisation or working group, countries the recommendations apply to (EU/US/international), and the status of the recommendation. By ‘status’ we mean whether the recommendation, for example, has a legal basis, is an ethical guideline, comprises lessons learned, or is an academic proposal. Qualitative content analysis was performed for principles and norms by two independent assessors using the Covidence online support tool for systematic reviews and NVivo qualitative data analysis software (QSR International, Version 11).

Results

Selection and data extraction

The literature database searches resulted in a total of 1083 unique records (Fig. 1). Ultimately, we included 31 articles for final review (Table 2). The expert consultation resulted in the inclusion of 10 ethical guidelines by 7 different organisations or working groups (Table 3). The selected guidelines were published between 2007 and 2017. Identified principles and norms were grouped in themes as a means to structure the research findings. Descriptive themes were established through an iterative method and with consensus of all study authors.

Fig. 1
figure1

Flow diagram of the selection and inclusion of publications

Table 2 Academic publications included for review
Table 3 Selected ethical guidelines and recommendations

Themes, principles and norms

Following data extraction from all sources, the identified principles (and the respective norms promoting those principles) could be grouped among four overarching themes: (1) Societal benefits and value; (2) distribution of risks, benefits and burdens; (3) respect for individuals and groups; and (4) public trust and engagement (Table 4).

Table 4 Themes and principles

Societal benefits and value

In most sources, data sharing activities were required to be governed by principles that overall maximise health benefits or wellbeing (both public and individual) and that serve ends of social value. To realise the potential benefits, sources underpin the importance of the quality and comprehensiveness of the data to be shared, and the scientific validity and social value of the study protocols submitted by researchers in order to use the data. Once quality and validity have been established, many sources demand a data sharing infrastructure that is accessible, enables efficient use, is highly interoperable and sustainable for the future (See Table 4).

In terms of how to bring the principles into practice, sources rely on a wide range of norms, rules and recommendations. First, sources deduce from the potential benefits that there in fact exists a duty to share data for scientific research, or a right to science [6]. The Institute of Medicine (IOM), the International Committee of Medical Journal Editors and the European Federation of Pharmaceutical Industries and Associations (EFPIA) have come forward with statements about researchers’ and companies’ duty to share their clinical trial data [7,8,9]. To effectuate the duty to share, sources state that awareness about the benefits of data sharing should be raised among stakeholders, and that collaborative partnerships and data sharing practices should be promoted [10]. Other recommendations include devoting efforts and resources to alleviate disincentives for data sharing, such as publication moratoria [11]. The sharing of well-managed datasets and commitments to disseminate the results generated from the data (mostly through reports and supporting scientific publications) are considered an equally important element of maximising benefits of data sharing [12,13,14].

Continuous efforts are considered necessary to improve and maintain data quality and reproducibility [15, 16]. Demands with respect to data management and curation include cooperatively developing and implementing quality standards or quality threshold metrics that are submitted to continuous renewal and improvement [17,18,19,20]. Sources emphasise the need for data control, compliance with quality standards and feedback mechanisms [10, 18] at every stage of data processing [19]. The use of central repositories is recommended for deposition of data [20]. To maximise scientific and social value, data access requests will need to be submitted by qualified researchers who are able to justify the research purposes [21,22,23,24], and attest to the use of rigorous scientific methods [9, 25, 26]. Those providing access for secondary use should in turn secure comprehensiveness of the data [20].

Accessibility of the data is considered a shared responsibility of researchers, sponsors, research ethics committees and other stakeholders. These actors should work together to (deliver reasonable efforts to) maximise accessibility, and encourage each other to do so too [10, 12, 16, 17, 19, 22]. Accessibility is further enhanced through harmonisation of data access conditions and procedures [13, 27], and by communicating these to stakeholders [10, 18, 28]. One source speaks of the need to establish a “healthy ecosystem” that relies on stakeholder-informed principles and policies that ensure that the needs and concerns of key stakeholders are addressed across different data initiatives [27]. In such an ecosystem there is less emphasis on uniformity of approaches given that some initiatives already have their own governance structures for data sharing in place. Stakeholder-specific incentives to share data and low-cost access to the international research community are ways to increase accessibility [14, 26]. Many sources consider the development of strategies, processes and/or systems that help secure long-term accessibility and sustainability of the organisation of great importance (e.g., through funding) [10, 18, 19, 23,24,25, 29, 30]. It should be made clear how the data will be dealt with in the event of discontinuation of the data holder [17, 19], or a change of ownership [31]. Uniform policy is required with respect to the duration of storage [19], and the disposal and destruction of data [31].

Interoperability is enhanced by cataloguing data in a consistent manner [14], according to internationally accepted standards and norms [10, 18, 19], by incorporating standardised design elements that provide for compatibility [19], and through harmonisation of regulatory frameworks for data sharing in Europe [13]. Documentation of data quality and origin should be readily available, verifiable [19], accurate, unbiased and proportionate [10]. For those who have been granted access to data, validation exercises should be allowed whenever possible [18].

Distribution of risks, benefits and burdens

Many sources require that the burdens and benefits of data sharing are fairly allocated. In other words, data sharing efforts should adhere to principles of distributive justice (See Table 4). Benefits to individuals and society should be maximised and harms should be minimised and thus should also be proportional [24, 29, 30]. Benefit sharing and reciprocity is distinguished between participants and researchers, as well as between researchers, secondary users, communities and funders [12, 22, 32]. One source states that it should be assured that benefits are shared “as broadly as possible” [19], especially when data is collected from vulnerable communities [31]. Equitable access is ensured by transparency rules, fair access fees and a balance between the needs of data holders, secondary users and the communities who expect health benefits to arise from the research [19, 22, 26]. Commercial interest is generally not considered a reason to restrict access to data. One source states that the criterion of commercial versus non-commercial research is actually not very helpful, since research carried out for commercial reasons or by commercial companies can in fact be very beneficial to society [33]. Instead, access should be based on balanced arrangements between public and private parties [18] and whether or not these parties are “bona fide”, meaning that their research serves the ultimate goal to discover “new knowledge intended for the general interest in health and to be made publicly available without undue delay” [33].

Sources also emphasise the need for establishing adequate systems for recognition, ownership and attribution, that are designed in such a way that due credit and acknowledgment is given to all who contributed to the results. To these principles between data holders and secondary users, sources call upon the application of intellectual property (IP) laws to data access arrangements [18, 31, 34]. According to some sources, policy should make sure to cover benefit sharing and IP issues as transparently as possible, and for it to be communicated appropriately [19, 31]. Researchers are required to report back to the relevant data holders a list of publications and patent issues arising from the database’s resources [10, 19, 35]. However, others sources point out that exclusive ownership runs counter to the goals of data sharing initiatives [27, 33]. This would hold for individuals whose data is being shared but also for other actors involved in data sharing activities. A solution recommended includes inserting a “perpetuity” clause as a condition for making data available in a data sharing platform [33]. The clause would only allow withdrawal of the data in case the grounds for making them available have changed.

Respect for individuals and groups

Respect for individuals and groups is represented by an abundance of identified principles, norms, rules and recommendations (See Table 4). From the principle of respect for autonomy it follows that the purposes to which data is shared should be consistent with the (scope of the original) informed consent [9, 17, 19, 25, 35]. Some sources differentiate between ‘specific informed consent’ and ‘broad informed consent’ or ‘dynamic informed consent’ for a range of future data uses [16, 17, 19, 31]. When future use is specified at the time of data collection, or the data are collected for a given research project, specific informed consent from individuals is required [17, 31]. When this is not the case, some sources permit the conditional use of broad consent models [16, 17, 19, 22, 31]. Valid broad informed consent relies on certain (additional) safeguards, such as a proper governance framework and the provision of sufficient information to participants [16, 17, 19, 31]. During and/or after the informed consent process, sources state that participants should be informed about the topics as listed in Table 5. Clear and easy-to-use processes should remove barriers for participants to withdraw their consent for the use of their data at any time [17, 19, 22, 31]. Rights that are considered relevant for participants are listed in Table 5. Furthermore, policies and procedures are recommended for when and how to re-contact participants [16, 17, 19, 31, 36], in particular with respect to the return of unsolicited findings, and how participants can request access to their data [15, 19].

Table 5 Informing and enabling participants and the public

If informed consent for data access cannot reasonably be obtained (“impossible” or “impracticable”), waivers of informed consent may potentially be issued [17, 19, 31, 37, 38]. Some of the sources state that waivers of informed consent for data (re-)use should be issued after approval of a research ethics committee (REC) only, and “in accordance with applicable law” and “ethical principles” [19, 39]. The Declaration of Taipei restricts waivers to the event of a “clearly identified, serious and immediate threat (...) to protect the health of the population” [31], while the Council for International Organizations of Medical Sciences (CIOMS) guidelines demand that the study has important social value and poses “no more than minimal risks” [17]. An alternative is to have RECs allow the conditional use of an ‘informed opt-out’ procedure [17]. Even in cases where no express consent has been given, however, individuals should be able to express preferences regarding the use of their data—at least to the extent practicable [22].

Norms that help protect privacy and confidentiality include the establishment and periodical updating of security measures, protocols and other protective safeguards [15, 16, 18, 19, 21, 22, 31, 40], which are proportionate to the use and nature of the data [10, 37]. Substantial support was observed among sources for the requirement to only store and share data that is de-identified (anonymised or coded) [19, 21, 41, 42]. At the same time, the limits of anonymity and confidentiality are acknowledged and should be anticipated [17, 19]. One source states that use of anonymised data should generally be avoided because it makes it impossible to add patient-level data and/or to re-contact participants [35]. In all cases, researchers are said to have the obligation to inform individuals that complete confidentiality can never be guaranteed [35]. There is agreement among sources on the rule that the sharing of identifiable data or permission for re-identification should only be allowed for research purposes (unless ordered by law) and after approval “conform applicable procedures” [19, 22]. Terms include access limitation to those with a need-to-know [21], and restrictions on who may have (third party) access to (potentially) identifiable data [15, 17, 19].

Data security is further enhanced if technical alternatives for physical transfer of data are explored, such as the use of secure data access centres and remote data access facilities [22, 35]. To prevent unauthorised access or any other misuse, robust infrastructures will need to arrange for identity verification and authentication before access is granted [19, 21, 22]. Infrastructures should also monitor and document any access to identifiable data [19], and implement feedback mechanisms for data security [10]. Policy should include statements about how confidentiality is practically maintained [17], and that users must refrain from any attempt to (re-)identify participants [10, 16]. Essential to secured sharing is education and training of researchers on issues such as data security and privacy compliance [14, 43].

Public trust and engagement

Many sources report on principles and norms that relate to maintaining public trust and engaging in public and patient involvement and/or participation. Public trust and engagement constitute a theme that has instrumental value to maximise benefits, promote respect for persons, minimise harms and to protects principles of social justice. Nevertheless, we treat public trust and engagement as a separate moral category to illustrate the emphasis that it has been given in the reviewed sources [14, 43]. Key principles reported by the sources that foster public trust and engagement are shown in Table 4.

According to reviewed sources, strategies used by data sharing initiatives should be built upon trust, which is gained by being trustworthy [27]. Sources emphasise the need to develop formats and mechanisms that enable effective deliberation with relevant stakeholders—including participants, the public, funders and the research community—about important issues of data sharing [10, 13, 16, 17, 19, 25, 28]. More specifically, participation should be increased in the design, governance and review of data initiatives—of which the results should eventually translate into policy [8]. Preferably, a regular process of reviewing and modifying data access policies, protocols and procedures should be in place [18, 19], which pays heed to relevant issues that may change over time (e.g., IT, legal and/or cultural issues) [18]. Other opportunities for patient and public involvement include events and workshops to disseminate research findings, as well as organising lay presentations or panels, steering committees and working groups to give participants a meaningful voice in governance regarding their data [14, 21, 26]. One source explicitly places the participant at the center of the data sharing infrastructure, so that individuals whose data is shared are more meaningfully empowered to make decisions about access and use [27]. Through trusted intermediaries and easy-to-use tools individuals would be able to more easily contribute and control use of their data [26].

The principle of transparency can be brought into practice through different mechanisms. First and foremost, transparency needs to exist in all workflow of data sharing activities and transactions (including documentation) [15, 23, 29, 30, 44]. Especially transparency in data sharing transactions is flagged as an essential component of responsible data sharing. The principle is also effectuated through the dissemination of public information about ongoing data sharing activities [43]. Items that are proposed to be included in such public information are listed in Table 5. At the same time, researchers and institutions will need to raise awareness and increase understanding among the public towards the need for data sharing to democratise health research [21, 29, 43].

Special consideration was given to the importance of effective governance systems as a means to promote integrity, solidarity and accountability in data sharing activities [15,16,17, 22, 43, 45]. Each international collaborative data research initiative is expected to operate “within an explicit public ethics and governance framework” [16]. The governance structure should clearly outline the responsibilities of designated individuals or entities [21], establish measures for accountability (e.g., whether secondary use has met the intended purposes and sanctions for breaches) [21], and install mechanisms for monitoring, audits and general oversight (e.g., good stewardship of stored data) [16, 17, 19, 21, 22]. A more specific recommendation is to establish a governance committee to oversee policy developments [11]. Compliance with existing legal requirements, ethical principles and collaborative agreements is considered paramount [19, 21, 24, 25, 33, 39]. Particularly, investments need to be made in fostering professionalism—which involves education and training of professionals and other staff—and communication with participants and the public [14, 19, 43]. Social accountability arises from engagement of individuals in society, supported by organisations that communicate to individuals and society about the expectations and failures of data governance [16].

In most sources, review and approval procedures by an independent REC (or comparable review body) play an important part in discussions about responsible data sharing for health research [14, 39, 46]. Some sources state that an REC (or comparable body) must review and approve every study using collected data [17, 19, 31, 39]. Some aspects of REC review have already been discussed in the context of respect for individuals and groups. The full list of items or situations that are considered subject to ethics review and approval can be found in Table 6. Data access should be based on the legitimacy of the research purpose [9], objective and clearly articulated criteria (as recorded in policy documents) [24], and restricted to researchers who have received adequate data security training [14], and who are subject to institutional oversight and effective sanctioning [16, 19, 21]. When access to data is granted, agreements should specify the terms of access [19, 22, 31]. Transactions can be responsibly facilitated through the use of binding data access agreements (DAAs), such as data transfer agreements (DTAs) [7, 29, 32, 36, 42]. Ideally, these DAAs follow a standardised format to regulate access uniformly and consistently. DAAs should include arrangements to promote good practices to enable quality control [18], arrangements for a secure transfer [22], and appropriate and effective means to sanction misuse or non-compliance [22, 27].

Table 6 Items subject to ethical review

Discussion

This systematic review of the academic literature and research ethical guidelines provides a unique overview of principles and norms that are considered inherent to a governance framework for responsible data sharing. Content of 31 international academic publications and ten guidelines was qualitatively analysed. We observed an abundance of principles and norms with considerable convergence at the aggregate level of four overarching themes: societal benefits and value; distribution of risks, benefits and burdens; respect for individuals and groups; and public trust and engagement.

In terms of societal benefits and value, it is considered necessary by some to raise awareness about the duty to share health data, and to secure that only high-quality data is shared for scientifically valid proposals. Systems for data sharing should allow for efficient use, and be highly interoperable and accessible, as well as sustainable for the future. To ensure fair distribution of risks, benefits and burdens, effective mechanisms for benefit sharing will need to be in place. Collective evidence generation requires governance that has systems for recognition, attribution and ownership built in. Respect for individuals and groups covered a range of identified principles and norms, among which the principles to respect privacy and confidentiality were by far the most prominent. There is a growing consensus that absolute anonymity or confidentiality cannot be guaranteed, despite the common requirement to de-identify data to protect privacy. Moreover, because of the nature of data sharing activities, it is acknowledged that alternatives will need to be devised for traditional, specific informed consent. What is more, it is recommended in most of the sources that an ethics committee (or a comparable body) reviews and approves data access requests. Lastly, public trust is crucial to responsible data sharing. In this relation, accountability, transparency, integrity and professionalism are key principles. Continued stakeholder engagement, from study design to the dissemination of research findings, can and should be facilitated using different methods.

At the level of principles and norms we observed substantial variation in: (1) the phrasing and level of detail of principles and norms, (2) the number and content of norms considered necessary to protect a principle, and (3) the contextual approaches in which principles and norms are used. An example of point (1) is that some sources reported only in very general terms on relevant principles (e.g., “data sharing should be transparent” or “access should be ensured”), while others provided more detailed descriptions (e.g., “the public should be continuously updated about ongoing data sharing activities” or “ensure low data access fees”). Point (2) is exemplified by the diversity of norms related to informed consent and exemptions from (specific) consent requirements. Only some of the sources explicitly allow the conditional use of broad informed consent models or opt-out procedures. With respect to point (3), whereas one source would discourage the use of anonymised data other sources would actually demand complete de-identification. While the identified principles and norms provide helpful guidance on an impressive range of items, these three points indicate that the current collection of principles and norms still requires further work on how to exactly incorporate principles and norms into a coherent yet adaptable governance framework for health data sharing. Although different collaborative partnerships have already undertaken steps towards the latter [47, 48], we stress the need for continued efforts to further develop and implement such a governance framework for international data sharing projects [5].

A particular issue of importance we wish to address here is that our analysis also points to a confusion in the meaning of terms used to describe the degree and type of data de-identification. This could affect both the sharing, security and confidentiality of that data. Our findings support the notion of what Phillips and Knoppers have labelled a “Babel-like lexicon for de-identified data” [49]. While funders and research organisations push towards increased data sharing, there are legal duties that require ‘de-identification’ to protect privacy. We found that one fairly undisputed recommendation is to inform participants about the limits of anonymity and confidentiality. However, the extent to which principles and norms apply to data with varying degrees of de-identification remains largely unclear. The GDPR provides no guidance for sharing of de-identified data because it only applies to the use of personal data. Our findings lead us to suspect that reviewed authors define the terms ‘anonymisation’ and ‘pseudonymisation’ in different ways. For example, the terms ‘anonymous’, ‘anonymised’ and ‘de-identified’ seemed to be used interchangeably. Yet there is a moral difference between collecting data without direct identifiers and removing those direct identifiers later on [49]. We recommend that a governance framework (1) clearly defines the terms it uses and 2) goes beyond simply acknowledging the limits of anonymity and/or requiring de-identification at all costs (at the expense of data quality). The key to resolving limitations in anonymity lies in the explicit connection with public trust.

The themes we have identified share considerable similarities with the moral considerations of a framework for public health ethics [50]. This suggests that the ethics of international data sharing is probably best captured by moral duties that arise from the interactions and relationships between health care professionals, various public and private actors and the public. We hasten to mention that our thematic categorisation is not intended as a new governance framework in itself. Rather, our thematisation helps to identify common grounds and to structure various principles and norms in such a way that the basic structure of a governance framework becomes visible. We acknowledge that certain principles could be categorised as belonging to more than one theme, and norms and recommendations as serving more than one principle. With respect to our search strategy, the terms ‘principles’ and ‘norms’ may have been used differently in the literature and guidelines than the way in which we defined them. We could have missed sources that have not used these terms but in fact do refer to notions that fit our description of principles and norms. Nonetheless, we believe that the reviewed sources are informative to the establishment of a governance framework for data sharing.

This review was also limited to expert-selected guidelines and a selection of peer-reviewed literature on the topic of data sharing for health research. We are aware that our findings, particularly the body of sources identified by experts, cannot make any claims to comprehensiveness. A plethora of policy statements on data access and data sharing exists at the level of governmental bodies, industry [51], regulatory agencies (such as the European Medicines Agency [52]), and public and private institutions. A recent publication analysed data sharing guidelines to explore why data is not shared more broadly in the medical sciences [53]. Blassime and colleagues found that three themes were referred to much more frequently than others, namely: data subjects’ autonomy and privacy, and data quality and curation, though these themes were not given the same appreciation by the different organisations. At the same time, the authors observed substantial fragmentation in the landscape of data sharing policies. The findings of Blassime and colleagues [53] support the results of our review in the sense that central themes (or ‘principles’) were uncovered but their contextual use varied and thus leads to under- and sometimes oversharing of health data.

Conclusions

In this study we aimed to capture what principles and norms have been formulated by (international) collaborative working groups and organisations with respect to responsible data sharing in international health research. We believe that the four themes (societal benefits and value; distribution of risks, benefits and burdens; respect for individuals and groups; and public trust and engagement) under which relevant principles and norms can be grouped, reflect what authors, organisations and working groups consider aspects of importance to governing data sharing activities in a responsible manner. These insights provide helpful leads for further work on conceptualising a harmonised governance framework for data sharing in health research. At the same time, our findings indicate substantial variation in: (1) the phrasing and level of detail of principles and norms, (2) the number and content of norms considered necessary to protect a principle, and (3) the contextual approaches in which principles and norms are used. Key questions, in particular how to streamline terminology regarding data de-identification and how to harmonise the identified principles and norms into a coherent governance framework, will have to be part of the research agenda.

References

  1. 1.

    Hemingway H, Asselbergs FW, Danesh J, Dobson R, Maniadakis N, Maggioni A, van Thiel GJM, Cronin M, Brobert G, Vardas P, Anker SD, Grobbee DE, Denaxas S. Innovative medicines initiative 2nd programme, big data for better outcomes, BigData@heart consortium of 20 academic and industry partners including ESC. Big data from electronic health records for early and late translational cardiovascular research: challenges and potential. Eur Heart J. 2018;39:1481–95.

  2. 2.

    BD4BO – Big Data for Better Outcomes. http://bd4bo.eu/. Accessed 22 Mar 2019.

  3. 3.

    Dove ES, Thompson B, Knoppers BM. A step forward for data protection and biomedical research. Lancet (London, England) Elsevier. 2016;387:1374–5.

  4. 4.

    Carter P, Laurie GT, Dixon-Woods M. The social licence for research: why care.data ran into trouble. J Med Ethics Institute of Medical Ethics. 2015;41:404–9.

  5. 5.

    Laurie G. Reflexive governance in biobanking: on the value of policy led approaches and the need to recognise the limits of law. Hum Genet. 2011;130:347–56.

  6. 6.

    Knoppers BM, Thorogood AM. Ethics and big data in health. Curr Opin Syst Biol Elsevier. 2017;4:53–7.

  7. 7.

    Alfonso F. Data sharing: a new editorial initiative of the International Committee of Medical Journal Editors. Netherlands Hear J. 2017;25:297–303.

  8. 8.

    Kuehn BM. IOM Outlines Framework for Clinical Data Sharing, Solicits Input. JAMA American Medical Association. 2014;311:665.

  9. 9.

    EFPIA, PhRMA. Principles for responsible clinical trial data sharing: our commitment to patients and researchers. 2013.

  10. 10.

    Global Alliance for Genomics and Health (GA4GH). Framework for Responsible Sharing of Genomic and Health-Related Data. 2014.

  11. 11.

    Dyke SO, Hubbard TJ. Developing and implementing an institute-wide data sharing policy. Genome Med BioMed Central. 2011;3:60.

  12. 12.

    Funders of public health research. Joint statement of purpose—vision, principles, and goals. 2011.

  13. 13.

    Auffray C, Balling R, Barroso I, Bencze L, Benson M, Bergeron J, Bernal-Delgado E, Blomberg N, Bock C, Conesa A, Del Signore S, Delogne C, Devilee P, Di MA, Eijkemans M, Flicek P, Graf N, Grimm V, Guchelaar H-J, Guo Y-K, Gut IG, Hanbury A, Hanif S, Hilgers R-D, Honrado Á, Hose DR, Houwing-Duistermaat J, Hubbard T, Janacek SH, Karanikas H, et al. Making sense of big data in health research: Towards an EU action plan. Genome Med BioMed Central. 2016;8:71.

  14. 14.

    Lea NC, Nicholls J, Dobbs C, Sethi N, Cunningham J, Ainsworth J, Heaven M, Peacock T, Peacock A, Jones K, Laurie G, Kalra D. Data Safe Havens and Trust: Toward a Common Understanding of Trusted Research Platforms for Governing Secure and Ethical Health Research. JMIR Med informatics JMIR Medical Informatics. 2016;4:e22.

  15. 15.

    Baker DB, Kaye J, Terry SF. Privacy, Fairness, and Respect for Individuals. eGEMs (Generating Evid Methods to Improv patient outcomes). 2016;4:7.

  16. 16.

    The Nuffield Council on Bioethics. The collection, linking and use of data in biomedical research and health care: ethical issues. 2015.

  17. 17.

    Council for International Organizations of Medical Sciences (CIOMS). International Ethical Guidelines for Health-related Research Involving Humans. 2016.

  18. 18.

    Organisation for Economic Co-operation and Development (OECD). Principles and Guidelines for Access to Research Data from Public Funding. 2007.

  19. 19.

    Organisation for Economic Co-operation and Development (OECD). Recommendation of the Council on Human Biobanks and Genetic Research Databases. 2009.

  20. 20.

    Rodriguez H, Snyder M, Uhlén M, Andrews P, Beavis R, Borchers C, Chalkley RJ, Cho SY, Cottingham K, Dunn M, Dylag T, Edgar R, Hare P, Heck AJR, Hirsch RF, Kennedy K, Kolar P, Kraus H-J, Mallick P, Nesvizhskii A, Ping P, Pontén F, Yang L, Yates JR, Stein SE, Hermjakob H, Kinsinger CR, Apweiler R. Recommendations from the 2008 international summit on proteomics data release and sharing policy: the Amsterdam principles. J Proteome Res. 2009;8:3689–92.

  21. 21.

    Chan T, Di Iorio CT, De Lusignan S, Lo RD, Kuziemsky C, Liaw S-T. UK National Data Guardian for Health and Care’s Review of Data Security: Trust, better security and opt-outs. J Innov Heal Informatics. 2016;23:627.

  22. 22.

    Organisation for Economic Co-operation and Development (OECD). Recommendation of the Council on Health Data Governance. 2017.

  23. 23.

    Knoppers BM. Framework for responsible sharing of genomic and health-related data. Hugo J Springer. 2014;8:3.

  24. 24.

    Dove ES, Knoppers BM, Zawati MH. An ethics safe harbor for international genomics research? Genome Med BioMed Central. 2013;5:99.

  25. 25.

    Antman EM, Benjamin EJ, Harrington RA, Houser SR, Peterson ED, Bauman MA, Brown N, Bufalino V, Califf RM, Creager MA, Daugherty A, Demets DL, Dennis BP, Ebadollahi S, Jessup M, Lauer MS, Lo B, MacRae CA, McConnell MV, McCray AT, Mello MM, Mueller E, Newburger JW, Okun S, Packer M, Philippakis A, Ping P, Prasoon P, Roger VL, Singer S, et al. Acquisition, analysis, and sharing of data in 2015 and beyond: a survey of the landscape. J Am Heart Assoc. 2015;4:e002810.

  26. 26.

    Knoppers B, Harris JR, Tassé A, Budin-Ljøsne I, Kaye J, Deschênes M, Zawati MH. Towards a data sharing Code of Conduct for international genomic research. Genome Med. 2011;3:46.

  27. 27.

    Deverka PA, Majumder MA, Villanueva AG, Anderson M, Bakker AC, Bardill J, Boerwinkle E, Bubela T, Evans BJ, Garrison NA, Gibbs RA, Gentleman R, Glazer D, Goldstein MM, Greely H, Harris C, Knoppers BM, Koenig BA, Kohane IS, La RS, Mattison J, O’Donnell CJ, Rai AK, Rehm HL, Rodriguez LL, Shelton R, Simoncelli T, Terry SF, Watson MS, Wilbanks J, et al. Creating a data resource: what will it take to build a medical information commons? Genome Med BioMed Central. 2017;9:84.

  28. 28.

    Allen C, Des JTR, Heider A, Lyman KA, McWilliams L, Rein AL, Schachter AA, Singh R, Sorondo B, Topper J, Turske SA. Data governance and data sharing agreements for community-wide health information exchange: lessons from the beacon communities. EGEMS (Washington, DC). 2014;2:1057.

  29. 29.

    Bredenoord AL, Mostert M, Isasi R, Knoppers BM. Data sharing in stem cell translational science: policy statement by the international stem cell forum ethics working party. Regen Med. 2015;10:857–61.

  30. 30.

    Regulatory and Ethics Working Group, Global Alliance for Genomics & Health R and EW, Sugano S, Sugano S. International code of conduct for genomic and health-related data sharing. Hugo J Springer. 2014;8:1.

  31. 31.

    World Medical Association (WMA). Declaration of Taipei on Ethical Considerations Regarding Health Databases and Biobanks. 2016.

  32. 32.

    Laurie G, Sethi N. Towards Principles-Based Approaches to Governance of Health-related Research using Personal Data. Eur J risk Regul EJRR Europe PMC Funders. 2013;4:43–57.

  33. 33.

    Floridi L, Luetge C, Pagallo U, Schafer B, Valcke P, Vayena E, Addison J, Hughes N, Lea N, Sage C, Vannieuwenhuyse B, Kalra D. Key ethical challenges in the European medical information framework. Minds Mach Springer Netherlands. 2018:1–17.

  34. 34.

    Chokshi DA, Parker M, Kwiatkowski DP. Data sharing and intellectual property in a genomic epidemiology network: policies for large-scale research collaboration. Bull World Health Organ World Health Organization. 2006;84:382–7.

  35. 35.

    Mascalzoni D, Dove ES, Rubinstein Y, Dawkins HJS, Kole A, McCormack P, Woods S, Riess O, Schaefer F, Lochmüller H, Knoppers BM, Hansson M. International charter of principles for sharing bio-specimens and data. Eur J Hum Genet Nature Publishing Group. 2015;23:721–8.

  36. 36.

    Duchange N, Darquy S, d’Audiffret D, Callies I, Lapointe A-S, Loeve B, Boespflug-Tanguy O, Moutel G. Ethical management in the constitution of a European database for leukodystrophies rare diseases. Eur J Paediatr Neurol WB Saunders. 2014;18:597–603.

  37. 37.

    World Medical Association (WMA). Declaration of Helsinki - Ethical Principles for Medical Research Involving Human Subjects. 2013.

  38. 38.

    Andrew NE, Sundararajan V, Thrift AG, Kilkenny MF, Katzenellenbogen J, Flack F, Gattellari M, Boyd JH, Anderson P, Grabsch B, Lannin NA, Johnston T, Chen Y, Cadilhac DA. Addressing the challenges of cross-jurisdictional data linkage between a national clinical quality registry and government-held health data. Aust N Z J Public Health. 2016;40:436–42.

  39. 39.

    Shenkin SD, Pernet C, Nichols TE, Poline J-B, Matthews PM, van der Lugt A, Mackay C, Lanyon L, Mazoyer B, Boardman JP, Thompson PM, Fox N, Marcus DS, Sheikh A, Cox SR, Anblagan D, Job DE, Dickie DA, Rodriguez D, Wardlaw JM, Wardlaw JM. Improving data availability for brain image biobanking in healthy subjects: practice-based suggestions from an international multidisciplinary working group. Neuroimage. 2017;153:399–409.

  40. 40.

    Dyke SO, Dove ES, Knoppers BM. Sharing health-related data: a privacy test? npj Genomic Med Nature Publishing Group. 2016;1:16024.

  41. 41.

    Banzi R, Bertele’ V, Demotes-Mainard J, Garattini S, Gluud C, Kubiak C, Ohmann C. Fostering EMA’s transparency policy. Eur J Intern Med. 2014;25:681–4.

  42. 42.

    Tucker K, Branson J, Dilleen M, Hollis S, Loughlin P, Nixon MJ, Williams Z. Protecting patient privacy when sharing patient-level data from clinical trials. BMC Med Res Methodol. 2016;16:77.

  43. 43.

    Kostkova P, Brewer H, de Lusignan S, Fottrell E, Goldacre B, Hart G, Koczan P, Knight P, Marsolier C, McKendry RA, Ross E, Sasse A, Sullivan R, Chaytor S, Stevenson O, Velho R, Tooke J. Who Owns the Data? Open Data for Healthcare. Front public Heal Frontiers Media SA. 2016;4:7.

  44. 44.

    ACMG Board of Directors AB of. Laboratory and clinical genomic data sharing is crucial to improving genetic health care: a position statement of the American College of Medical Genetics and Genomics. Genet Med Nature Publishing Group. 2017;19:721–2.

  45. 45.

    Prainsack B, Buyx A. A solidarity-based approach to the governance of research biobanks. Med Law Rev Oxford University Press. 2013;21:71–91.

  46. 46.

    Paltoo DN, Rodriguez LL, Feolo M, Gillanders E, Ramos EM, Rutter JL, Sherry S, Wang VO, Bailey A, Baker R, Caulder M, Harris EL, Langlais K, Leeds H, Luetkemeier E, Paine T, Roomian T, Tryka K, Patterson A, Green ED, National Institutes of Health Genomic Data Sharing Governance Committees. Data use under the NIH GWAS data sharing policy and future directions. Nat Genet. 2014;46:934–8.

  47. 47.

    Global Alliance for Genomics and Health (GA4GH). Regulatory & Ethics Toolkit. https://www.ga4gh.org/genomic-data-toolkit/regulatory-ethics-toolkit/. Accessed 22 Mar 2019.

  48. 48.

    BBMRI-ERIC. A Code of Conduct for Health Research. http://code-of-conduct-for-health-research.eu/. Accessed 6 Aug 2018.

  49. 49.

    Phillips M, Knoppers BM. The discombobulation of de-identification. Nat Biotechnol. 2016;34(11):1102–3.

  50. 50.

    Childress JF, Faden RR, Gaare RD, Gostin LO, Kahn J, Bonnie RJ, Kass NE, Mastroianni AC, Moreno JD, Nieburg P. Public Health Ethics: Mapping the Terrain. J Law, Med Ethics Wiley/Blackwell (10.1111). 2002;30:170–8.

  51. 51.

    GSK. Data transparency | GSK. 2014. https://www.gsk.com/en-gb/research/our-approach/trials-in-people/data-transparency/. Accessed 22 Mar 2019.

  52. 52.

    European Medicines Agency (EMA). European Medicines Agency policy on publication of clinical data for medicinal products for human use. 2014.

  53. 53.

    Blasimme A, Fadda M, Schneider M, Vayena E. Data sharing for precision medicine: policy lessons and future directions. Health Aff (Millwood). 2018;37(5):702–9.

Download references

Acknowledgments

The results presented in this paper were part of Work Package 7 (ethics, legal and societal implications) of the IMI BigData@Heart consortium. We thank Diederick Grobbee, Susanne Løgstrup, Sofia Marchã, Evert-Ben van Veen and Marjolein Timmers for their valuable feedback during various stages in drafting the manuscript.

Funding

This work was supported by the Innovative Medicines Initiative 2 Joint Undertaking (IMI2) under [grant agreement No. 116055]. This Joint Undertaking receives support from the European Horizon 2020 research and innovation programme and European Federation of Pharmaceutical Industries and Associations (EFPIA).

Availability of data and materials

Files supporting the search strategy, record selection and data extraction for this study are available from the corresponding author on reasonable request.

Author information

JD and GT conceived the research question and established the methodology to be used, SK and MM further developed the study design and performed data acquisition, analysis and interpretation of the data. CG was involved in various stages of interpretation of the data. All authors contributed substantially to drafting of the work and critical revision of the manuscript. Final approval for publication and agreement to be accountable for all aspects of the work was obtained from all authors.

Correspondence to Shona Kalkman.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1

Table 7 Breakdown of search terms

Appendix 2

Table 8 Search queries (performed August 6, 2018)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Keywords

  • Big data
  • Data sharing
  • Secondary use
  • Research ethics
  • Ethical governance