  • Research article
  • Open access

Public responses to the sharing and linkage of health data for research purposes: a systematic review and thematic synthesis of qualitative studies



The past 10 years have witnessed significant growth in the sharing of health data for secondary uses. Alongside this there has been growing interest in the public acceptability of data sharing and data linkage practices. Public acceptance is recognised as crucial for ensuring the legitimacy of current practices and systems of governance. Given the growing international interest in this area, this systematic review and thematic synthesis represents a timely review of current evidence. It highlights the key factors influencing public responses as well as important areas for further research.


This paper reports a systematic review and thematic synthesis of qualitative studies examining public attitudes towards the sharing or linkage of health data for research purposes. Twenty-five studies were included in the review. The included studies were conducted primarily in the UK and North America, with one study set in Japan, another in Sweden and one in multiple countries. The included studies were conducted between 1999 and 2013 (eight studies selected for inclusion did not report data collection dates). The qualitative methods represented in the studies included focus groups, interviews, deliberative events, dialogue workshops and asynchronous online interviews.


Key themes identified across the corpus of studies related to the conditions necessary for public support/acceptability, areas of public concern and implications for future research. The results identify a growing body of evidence pointing towards widespread general (though conditional) support for data linkage and data sharing for research purposes. A variety of concerns were raised (e.g. relating to confidentiality, individuals’ control over their data, uses and abuses of data and potential harms arising). However, where participants perceived there to be actual or potential public benefits from research, and had trust in the individuals or organisations conducting and/or overseeing data linkage/sharing, they were generally supportive. The studies also find current low levels of awareness about existing practices and uses of data.


Whilst the results indicate widespread (conditional) public support for data sharing and linkage for research purposes, a range of concerns exist. In order to ensure public support for future research uses of data, greater awareness raising, combined with opportunities for public engagement and deliberation, is needed. This will be essential for ensuring the legitimacy of future health informatics research and avoiding further public controversy.



Since the publication of the World Medical Association’s Declaration on Ethical Considerations regarding Health Databases in 2002, which stated that “databases are valuable sources of information” for health research, quality assurance and risk management [1], there has been steady and significant growth in the sharing of health data for ‘secondary uses’. The Medical Research Council (MRC) and Wellcome Trust ([2], p.6) note that “recent years have brought many calls for the optimisation of data sharing for research, with the intention of deriving maximal societal benefit”.

Recently, this commitment to expanding research uses of data has led to growing interest in the public acceptability of data sharing and data linkage practices (e.g. [3]). This relates, in part, to the recognition of the importance of ensuring that data uses align with public interests or preferences. Recent highly publicised controversies (for example relating to in England) have drawn attention to the importance of ensuring public support for the ways that data are used. Thus, there is increasing attention to public acceptability of secondary uses of data and to ensuring that these uses are understood and supported by the wider public (from whom the data originate). This may be crucial for ensuring the legitimacy of current practices and systems of governance. As Bradwell and Gallagher [4] have suggested, “personal information use needs to be far more democratic, open and transparent” and this means “giving people the opportunity to negotiate how others use their personal information in the various and many contexts in which this happens” (pp. 18–19).

Previously it was noted that the literature in this area was dominated by practitioner perspectives and public views were underrepresented or underreported [5]. However, over the last decade there has been a steady increase in the number of studies exploring public attitudes or acceptability of secondary uses of data. Such studies have been conducted in a range of contexts and in relation to various research practices. Qualitative studies in the field of medical and healthcare research have, historically, tended to receive less attention than quantitative studies. However, despite qualitative studies usually being based on small sample sizes that prohibit claims to being statistically representative [6, 7], they can provide rich insights and a deeper understanding of the complexities or nuances of public opinions and experiences. They also allow for public views to be interpreted in a way that can effectively inform policy and practice issues [8]. Recently, reports discussing public views toward data sharing or data linkage for research purposes have principally used qualitative methods [3, 9, 10], exemplifying the value of such approaches for exploring the challenges and complexities of this topic.

Data-sharing and data-linkage refer to two distinct processes which are used in different ways. Data-sharing involves information moving from one organisation or department to another, whereas data-linkage is defined as: “the bringing together from two or more different sources, data that relate to the same individual, family, place or event” [11]. Increasing amounts of health research are conducted through data-linkage; for example, health-related records have been linked with population registries [12], alcohol and drugs services [11], genealogical registries [11], the census [13, 14], the education system [15] and the prison service [16]. Such linkages have enabled, among other things, examination of relationships between social factors and health or access to health services.

This paper reports the results of a systematic review and thematic synthesis of qualitative studies which have explored public attitudes to data-sharing or data-linkage for research purposes. The study aimed to address the following research question:

What are the key issues of public responses in data-sharing and data linkage for research, and how have these been characterised?

This paper reports key themes that have emerged through this thematic synthesis and discusses their relevance for current debates around secondary uses of data for health research. Given the growing international interest in this area, this represents a timely review of current evidence. It highlights the key factors influencing public responses and, in doing so, identifies particularly salient topics which it will be important to examine further.

Throughout this paper the terms ‘review’, ‘researcher(s)’, ‘participant(s)’ and ‘author(s)’ will be used to refer to this systematic review, the authors of the included studies, the research participants of each study and the authors of this paper, respectively.


Search strategy and inclusion criteria

A systematic literature search was conducted of five electronic databases (CINAHL Plus, EMBASE, Medline, Scopus and Web of Science) on 4 April 2014. Table 1 displays the key search terms that were tailored for all databases using both free-text terms and subject headings where possible (see Appendix 1 for an adapted search strategy for Medline). In addition, searches were conducted through Google Scholar and Open Grey as well as scanning references of included papers and contacting experts for a more inclusive result. There were no limitations on publication dates, languages or geographical locations.

Table 1 Key search terms

The initial database searches revealed 1502 papers. Two authors (M.A. and J.S.J.) separately screened titles and abstracts and read eligible full texts before reconvening to discuss their results and resolve any discrepancies. Figure 1 shows the search and selection outcomes for each stage of the process. An additional 19 papers were identified through other sources (hand-searching references, expert communications and grey-literature searches). Papers were included if they met all inclusion criteria (see Table 2).

Fig. 1 Selection process based on PRISMA flow diagram

Table 2 Inclusion and exclusion criteria

Quality appraisal and data extraction

Each included study was individually appraised and data were extracted by the same two authors. Table 3 displays the main characteristics of each study including the study aim, date of data collection, setting, sample characteristics, sampling and method of data collection. The Critical Appraisal Skills Programme (CASP) [17] checklist was used to critically assess the qualitative research. It was agreed that all studies were of sufficient quality to be included in the review. The CASP checklist represented a valuable tool for facilitating critical reflection on each of the studies.

Table 3 Study characteristics


A thematic synthesis approach was adopted using Thomas and Harden’s [18] three-step technique: free line-by-line coding of the included studies, the emergence of descriptive themes from the codes, and the development of analytical themes. M.A. and J.S.J. independently coded the included studies using an inductive approach without a priori codes.

All authors met to discuss the codes/themes and to resolve any discrepancies. Three authors (R.J., C.P. and S.C.B.) were each assigned three articles (totalling nine) from the included studies to validate the findings of M.A. and J.S.J. At this stage ten further studies were excluded from the synthesis for not reporting participants’ verbatim views or first-order constructs (Britten et al. 2002); not reporting detailed qualitative findings (Sandelowski and Barroso 2003); not reporting findings relevant to the research topic; or for not including public responses. A list of descriptive themes (referred to as ‘sub’ themes) was agreed and organised by analytical (‘key’) themes. The key themes were identified and interpreted in relation to the research question (see Table 4). From the included studies, M.A. and J.S.J. extracted first- and second-order constructs (the latter being the original researchers’ interpretations of the participants’ constructs) including any reciprocal or refutational translations (comparable or opposing views) [19].

Table 4 Key themes and sub-themes

While three authors (M.A., S.C.B. and C.P.) work directly in the field of public engagement regarding health informatics research, the remaining two authors (J.S.J. and R.J.) were not previously familiar with the literature or debates in this area. The involvement of authors without prior understandings or perspectives on the literature was valuable for ensuring an inductive approach. The authors discussed and deliberated the coding and analysis to ensure that the findings emerged from the included studies rather than being shaped by or confirming the expectations of authors who are actively engaged in this subject matter.


Included Studies

A total of 1521 studies were identified from the systematic searches. From these, 25 studies were included in the review. The research was conducted primarily in the UK (five studies in Scotland, four in England, one in Wales and two across the UK) and in North America (seven studies in the USA and three in Canada) with one study set in Japan, another in Sweden and one worldwide. Data were collected from 1999 to 2013, though eight studies did not report data collection dates. The research participants included patients, service users, carers, surrogate decision-makers, lay persons and the general public ranging from 18 years of age to over 75 years. Six studies reported expert opinions from healthcare professionals, managers, health service staff and diabetes specialists in addition to the views of members of the wider public or patient groups. The qualitative methods of data collection included focus groups, interviews, deliberative events, dialogue workshops and asynchronous online interviews. Six studies included mixed methods using surveys or structured questionnaires. Additionally, three studies reported both primary and secondary research including a literature or policy review or systematic review.
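The reported counts can be cross-checked with a short tally. The sketch below is illustrative only: the variable names are hypothetical and not part of the review, and the numbers are those stated in the text (1502 database records, 19 records from other sources, and the per-setting study counts).

```python
# Illustrative consistency check; names are hypothetical, figures are as reported in the text.
database_hits = 1502   # papers returned by the five database searches
other_sources = 19     # hand-searching, expert contacts, grey literature
total_identified = database_hits + other_sources

# Number of included studies by setting, as listed in the paragraph above.
studies_by_setting = {
    "Scotland": 5,
    "England": 4,
    "Wales": 1,
    "UK-wide": 2,
    "USA": 7,
    "Canada": 3,
    "Japan": 1,
    "Sweden": 1,
    "Worldwide": 1,
}
total_included = sum(studies_by_setting.values())

print(total_identified)  # 1521
print(total_included)    # 25
```

Both totals agree with the figures given in the text: 1521 studies identified and 25 included.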

Seven key themes were identified across the included studies: Widespread Conditional Support; Conditions for Support; Benefits; Control and Consent; Uses and Abuses of Data; Private Sector Involvement; and Trust and Transparency.

Key Themes

Widespread Conditional Support

The included studies point to a clear trend that there was generally widespread—albeit conditional—support for uses of data in health research.Footnote 1 This is typically expressed in relation to a view that health research—or research more broadly—is “in the public interest” or is expected to bring about benefits for “the greater good”.Footnote 2 For example, one participant in study number 25 stated:

“I think the medical research is going to be of general benefit to the general population and if my records can help; I think personally I would be quite willing to participate in any medical study that is of general benefit to the population. I just feel it is worthwhile to participate in these studies” (Patient 4, Willison et al. 2003: 2)

Uses of data for health or medical research were often conceptualised in relation to the potential for discovery of new cures or treatments, or the improvement of healthcare services.

In several studies participants were reported as being surprised that data are not already more widely used, with questions being asked such as: “Doesn’t this happen already?!”.Footnote 3 Many studies reported that participants considered research uses of data to be in the public interest and conversely that not using data was against the public interest since this was a resource which should be used, not wasted.Footnote 4

Despite broad agreement that using health data for medical research is generally a good thing, across the studies it is evident that support for these data uses was never unconditional. A number of factors were identified as being important conditions for public support or acceptance.

Conditions for Support

In a large number of studies assurances of individuals’ confidentiality were reported as crucial for public support.Footnote 5 Whilst confidentiality may be assured through various mechanisms, in the included studies this was largely associated with anonymisation of data. Public preferences for data to be anonymous were widely reported,Footnote 6 for example in one studyFootnote 7 a participant stated:

“[The public need] reassurance about anonymity because that’s what people worry about”

Some individuals expressed a view that if the data are anonymous “what does it matter?!”.Footnote 8 However, others noted that anonymisation is not an absolute guarantee of confidentialityFootnote 9 and in a number of studies participants recognised that the anonymisation process is imperfect and therefore does not fully or adequately protect individuals’ confidentiality.Footnote 10 For example, it was said:

“I think you’re right enough, it’s anonymised. But then if you’re dealing with particular areas, that again kind of cuts in to the anonymous factor, because if you’re looking at maybe, let’s say, a housing estate, so there’s only so many people, so it’s not…I don’t think there’s anything that’s truly anonymous; I think everything can be found out if you’ve got the wherewithal and the curiosity to find things out.” Footnote 11

In a number of studies participants made a distinction between “plain stats” and more detailed qualitative information, with the former largely considered not to be concerning while the latter raised greater issues relating to confidentiality and privacy.Footnote 12

Assurances of safeguards to protect against misuse or abuse of data were also widely considered important for ensuring public support/acceptability.Footnote 13 Similarly, members of the public often expressed a preference for strong accountability mechanisms to be in place.Footnote 14 However, there was generally found to be low public awareness of current research practicesFootnote 15 and in particular, of current governance or ethics processes.Footnote 16 As such, in a number of studies it was reported that public acceptance increased after participants were informed about existing safeguards and governance mechanisms.

Assurances of data security were also found to be important for public acceptance of the use of health data in researchFootnote 17 and across the studies concerns about data security were widely identified.Footnote 18 Such concerns related to the fallibility of IT systems to protect against breachesFootnote 19 as well as to human error. Media reports of “laptops left on trains” or misplaced data were widely called upon to illustrate this latter point.Footnote 20 However, in a number of studies it was reported that participants regarded breaches of security as always being possible, yet security risks were sometimes regarded as tolerable or acceptable where individuals valued the purpose and potential benefits of research.Footnote 21

A further condition for public support was that data would only be used for legitimate purposes. Whilst the term “legitimate” was not always referred to explicitly, the included studies often suggested or concluded that the extent to which members of the public perceived uses of data to be legitimate influenced their responses or preferences.Footnote 22 However, there were varying views on how, or by whom, legitimacy was to be defined.

Public Benefits

Another key condition for public support for research using individuals’ data was that such research must have public benefits.Footnote 23 Whilst in some cases perceived personal benefits, or personal relevance of research was reported to motivate participation in research,Footnote 24 benefits of research were largely conceptualised in terms of benefits to wider society, or “the greater good”. For example, study participants said:

“…We wouldn’t have the national health service, we wouldn’t have drugs, we wouldn’t have anything, if it hadn’t have been for people being allowed to try things out in the past. So, I suppose, when you look at it like that, it is almost as if you have a moral duty to say, we have benefited, so why shouldn’t we contribute for [future generations?]”Footnote 25

In many cases it was reported that concerns relating to personal privacy were balanced with recognition of the importance of societal benefits anticipated to come from research.Footnote 26 Moreover, in two studies it was reported that some participants prioritised societal benefits over personal privacy.Footnote 27

Assurances that research would bring about public benefits, or at least that it had the potential to bring about such benefits, were widely reported to be fundamental for ensuring public support or acceptance. If research is perceived to be focused primarily on benefitting individual researchers (e.g. through advancing their careers or raising their profile), as having no clear practical application or “real-world” value, or as being conducted solely for profit, this leads to concerns about, and opposition to (or at least less support for), research uses of data.Footnote 28

Control and Consent

Perceived autonomy, or individual control over how data are used, was found to be a key factor shaping public responses in a number of studies.Footnote 29 It was reported that members of the public valued having control over their own data.Footnote 30 Such control relates to what data are collected, who has access to them, how and with whom data are shared and for what purposes the data are used. In a number of studies participants explicitly referred to this control in terms of individual or human rights.Footnote 31

Whilst perceived individual control clearly emerged as a key factor shaping public attitudes or acceptance of research uses of data, there was no clear consensus (across or within the studies) regarding what this control implied or necessitated. In some studies there was a clear link between levels of trust in research organisations or data controllers and desired level of individual control.Footnote 32 This suggests that where individuals trust organisations handling their data they are less likely to favour more stringent forms of control. Conversely, when this trust is lacking individuals want to have greater control over their own data.

Preferences for control are also influenced by wider attitudes towards the value of research. In a number of studies it was found that, whilst individual control was highly valued, participants did not want this control to come at the cost of creating barriers to research. Thus it was often found that participants felt that individual control needs to be balanced with efficiency of research.Footnote 33

Across the included studies control is largely discussed in relation to consent. There is evidence that members of the public also made this association and recognised consent as a mechanism for facilitating individual control.Footnote 34 However, both between and within studies there were varied views on consent and what form this should take.Footnote 35 Some studies indicated public preferences for explicit opt-in consent models,Footnote 36 whilst an acceptance of opt-out models was also reported due to recognition of the challenges or practical limitations of opt-in.Footnote 37 In a significant number of studies there was a clear preference for varied or flexible consent models which would enable individuals to set limits on their consent or to indicate particular preferences or objections.Footnote 38 Similarly, some studies reported that participants objected to one-time consent models which would not allow individuals to review or change their consent preferences.Footnote 39 This relates to the fact that public opinions or preferences are not fixed but change and adapt in response to information, deliberation, events or circumstances.Footnote 40

Whilst consent was widely valued as a mechanism for facilitating individual control in many studies, it was also recognised to be problematic.Footnote 41 In particular, participants in the studies acknowledged the potential for selection bias or low participation rates if explicit opt-in consent is required. Such recognition led to some individuals becoming more inclined to support opt-out consent models or non-consented uses of data; however, this trend was certainly not universal and others maintained that consent was always important.

The included studies highlight a number of areas where consent was regarded as particularly important, for example in relation to named or identifying data,Footnote 42 qualitative information rather than “plain stats”,Footnote 43 research using genetic dataFootnote 44 or where a commercial entity is involved in research.Footnote 45

Where consent was acknowledged to be problematic and/or where individuals reported that they were largely unconcerned about research uses of data, consent was nevertheless widely viewed to be important. In a number of studies consent was in this regard represented as an act of courtesy with participants suggesting that they would be happy to allow their data to be used for research but that this should nonetheless not be used without their permission.Footnote 46

Uses and Abuses of Data

A key area of concern regarding research uses of data related to the potential for data to be misused or abused.Footnote 47 In some cases this related to concerns that individuals with access to data would use it maliciously or inappropriately, for example it was stated that:

“there are some people, [that] regardless of the consequences will defy rules and regulations to justify their existence or to prove they can do it…” (Damschroder et al. 2007: 231)

In other instances these concerns related to data being sold or passed on to third partiesFootnote 48 and used for commercial purposes, e.g.:

“What I don’t like is any information being passed on to a third party, for promotion purposes. Say you’ve got a particular problem then it goes to a drugs supplier or something like that, that I would object to.” (Participant 4, group 1, Hill et al. 2013: 6)

There was also concern about data being used for political purposes,Footnote 49 e.g.:

“If the Government are using the details for the benefit of society, I think that’s okay. But if the Government are using that data to then look at their next election campaign, or look at the independence campaign by looking at the demographics of a particular area, then I don’t know if that’s as acceptable. They’[d] simply be using our data for their own goals” (Female, aged 18–34, Glasgow, Davidson et al. 2013).

Some participants in the studies expressed concerns about potential future uses of data.Footnote 50 While current uses or research objectives may be regarded as acceptable, participants expressed scepticism that such uses would remain clearly defined and limited. Some study participants were worried about potential “slippery slopes” with more and more information becoming accessibleFootnote 51 or with data being used for purposes other than those which were originally described.Footnote 52

There were also concerns about the proliferation of data within modern societies and increasing surveillance through data collection. For some these concerns were expressed in relation to the creation of a “Big Brother Society”,Footnote 53 e.g.:

“You can’t move. You can’t do anything without somebody, somewhere knowing exactly what you’re up to” (Female, depth interview, MRC & Ipsos-MORI 2007: 25)

A significant area of concern related to the potential outcomes or implications of research. In particular, study participants were concerned about the potential for stigma or discriminatory treatment to result from research which would label or categorise groups within society,Footnote 54 e.g.:

“I think research maybe tends to lump everybody together, and there must be individuals that would be totally different […] so it could lump everybody together and maybe that’s not what we want.” (Tayside—Female4, Aitken 2011: 12)

“Some universities might feel: ‘we don’t want to involve people from areas of deprivation, because we know they are less likely to finish their course and that’s bad for us, for our figures’” (Male, oldest age group, Edinburgh) (Davidson et al. 2013: 70)

There were also concerns relating to potential indirect negative impacts on individuals from participating in research.Footnote 55 For example, a frequent concern related to the potential for insurance premiums to increase, or for insurance to be denied, as a result of information being accessible from medical records. Additionally, there was concern that employers may gain access to information which could be used to the detriment of individual employees. Participants were concerned that data which were shared could be accessed and used in ways which could be harmful for individuals, e.g.:

“Money’s money but health is how you feel as well and if you’re being persecuted in a way because of that, it’s just going to make you worse” (Female, depth interview, MRC & Ipsos-MORI 2007: 29).

“People can judge them, so if they find out something about you because of your health you could be picked on” (Female, depth interview, MRC & Ipsos-MORI 2007: 29).

Such concerns were particularly salient in relation to more sensitive forms of data. Across the studies it was reported that participants differentiated between types of data and regarded some as more sensitive—and concerning—than others.Footnote 56 Examples of particularly sensitive forms of data include data relating to mental health, sexual health, sexuality and religion.

Private Sector Involvement

Across the studies there was significant concern about private sector involvement in research using individuals’ data.Footnote 57 Such concerns largely related to two key factors: low levels of public trust in the private sectorFootnote 58 and a perception that private sector organisations are primarily—or solely—motivated by profit.Footnote 59 Across the studies participants often made distinctions between research which was perceived to be “for profit” and research perceived to be “for the greater good”.Footnote 60 Similarly, distinctions were made between “research purposes” and “commercial gain”Footnote 61 as if they were opposing motivations. As noted above, the creation of public benefits from research was widely regarded as an essential prerequisite for public support or acceptance. Therefore, where participants regarded research to be conducted for purposes other than creating public benefits this raised concerns.

However, such concerns did not necessarily mean outright opposition to private sector involvement in research. Profit-creation resulting from research was regarded as acceptable under certain conditions. Notably, the included studies indicated that participants wanted assurances that public benefits would be prioritised over profit,Footnote 62 that individuals’ privacy would be prioritised over profitFootnote 63 and that profits would be shared or reinvested so as to create public/societal benefits.Footnote 64 Additionally, while there were concerns about individuals’ data being sold, studies which explored private sector access to public sector data found that participants often felt it was appropriate that private sector organisations pay for access to these dataFootnote 65 and that this would be regarded as acceptable on the condition that revenue generated is appropriately re-invested in the public sector.Footnote 66

While there was widespread concern about private sector involvement in research this was often balanced by a recognition that private sector involvement in research can be important or valuable.Footnote 67 In some cases private sector involvement was represented as a “necessary evil”,Footnote 68 e.g.:

“… the drug companies are just trying to make money, and yes of course they are, it’s all about money in the end of the day but if they don’t find the research for some of these the less interesting or less topical things then they, there will not be research into those things…we need to get funding from drug companies anyway, if they’re the ones with the money.” (Female, patient focus group 3, PPG, Grant et al. 2013: 8).

Thus profit-creation was regarded by some study participants to act as an incentive for private sector organisations to conduct valuable research in the public interest.

Overall, the included studies demonstrate that members of the public hold nuanced and complex views regarding private sector involvement. It is noteworthy that the private sector was not regarded as a homogeneous entity, but rather distinctions were made between private sector organisations.Footnote 69 There was also acknowledgement of the different roles that private sector organisations can play in research. For example, it was reported in one study that private sector involvement was acceptable as long as commercial actors did not have access to data.Footnote 70 Other studies reported concerns about private sector organisations as funders of research and the implications this may have for the integrity or objectivity of the research.Footnote 71

Whilst low trust in private sector actors is frequently reported, the included studies also demonstrate complex or ambivalent relationships of trust in actors from other sectors. For example, several studies identified ambivalent views on government researchFootnote 72 and concern about government access to data.Footnote 73 Additionally, whilst some studies reported high levels of trust in universities and academic researchersFootnote 74 one reported a lack of trust in university researchers.Footnote 75 Thus relationships of trust are not straightforward and there does not appear to be a clear, or static hierarchy of trusted organisations/sectors.

Trust and Transparency

Trust is a key theme running through all of the included studies (both implicitly and explicitly). A number of studies indicated that the level of trust individuals place in research organisations, oversight bodies or government, informs their level of support for research uses of data.Footnote 76 The included studies indicate that trust is essential for ensuring public acceptance and/or participation in research.Footnote 77

As noted above, relationships of trust are nuanced and complex. However the included studies indicate generally higher levels of trust in the public sector compared with the private sector, largely related to greater confidence in accountability and data protection mechanisms within the public sector.Footnote 78 There is also evidence of particularly high levels of public trust in primary healthcare providers.Footnote 79 This reflects a trend of higher levels of trust in known or familiar individuals or organisations,Footnote 80 which was exemplified in study participants’ confidence in particular healthcare professionals to make good judgements on access to patients’ data:

“I know my physician well enough to have a good feel for the types of things he would be involved with” (Patient 12, Nair et al. 2004: 25).

“If you trust the doctor, I don’t think it would worry me how much [data] you needed, and I do trust the doctor” (Patient 15, Nair et al. 2004: 25).

It also leads to individuals preferring to be contacted only by healthcare professionals, or known individuals:

“I am happy to have personal contact with our hospital, GP or the health professionals who knows me, but I am not happy being contacted by a Pfizer company, or whatever” (MRC & Ipsos-MORI 2007: 19).

Participants in the included studies often expressed a preference that data-sharing and research uses of data be overseen within, and governed by, the public sector.Footnote 81 In some instances there was a preference for such processes to be overseen and controlled by healthcare professionals (e.g. known/familiar individuals).Footnote 82 However, some study participants acknowledged that this may be overly burdensome and take valuable time and resources away from the provision of healthcare.Footnote 83

The importance of relationships and familiarity to trust is indicative of a broader desire for greater transparency about research practices. The included studies overwhelmingly suggest an appetite among study participants for more information about current research practices and uses of data.Footnote 84 Transparency about how data is used in research is considered crucial for building public trust, and thereby securing public support.Footnote 85 Moreover, many of the included studies point towards the importance of awareness raising for building trust and public support.Footnote 86

However, the included studies highlight that the public should not be conceived of as simply subjects of information provision relating to research uses of data. Rather, several studies indicate public interest and enthusiasm for more meaningful forms of public engagement/involvement.Footnote 87 Such involvement was considered essential for ensuring accountability.Footnote 88

Differences between studies

It is not possible to make clear or consistent comparisons between the findings of the included studies due to different social and cultural contexts. For example, in a Japanese studyFootnote 89 participants were reported to describe “unequal relationships” between patients and doctors with patients belonging to a “lower rank”. This may reflect (actual or perceived) traditional doctor-patient relations in Japan that are more hierarchical and paternalistic [20]. Unequal relationships were not explicitly reported in other studies, though some study participants may have implicitly referred to them. Diverse study populations also limit the comparability of the findings. These smaller populations include U.S. veterans reporting higher levels of trust and greater support for research by Veteran AffairsFootnote 90; African Americans expressing lower willingness to engage in genetic/genomic research due to past abusesFootnote 91; and LGBT participants in the U.K. concerned about the misuse of data, particularly identifiable data, that could lead to discriminatory opinions and behaviour.Footnote 92 These findings build on previous research reporting concerns over the underrepresentation of minority populations in research, such as African Americans [21–23] and LGBT people [24]. While these views may not be comparable to other contexts, they are essential to understanding the needs of different social groups and so to informing a wide variety of policies and practices. Despite variations in opinion, the overall views of these study populations were consistent with the general findings of the thematic synthesis.

A further limitation to the review was the underrepresentation of young people across the studies. The few studies that compared age groups reported variations in opinion. Two studies reported that younger participants expressed greater concerns for privacy and a desire for control over research data.Footnote 93 Another noted that some felt “anxious” about their data being held while others believed they had little control over their own information.Footnote 94 In contrast, older participants were reported to favour less individual controlFootnote 95 or to be less worried about the possible loss of confidentiality.Footnote 96 Buckley et al. [25] likewise commented on the lack of participation of younger people in their study. The few who responded were more cautious about the use of their medical information compared to older participants. However, the researchers were wary of these results due to the unrepresentativeness of the sample. Additionally, there are some contradictory findings: for example, King et al. [26] found that younger participants and respondents over the age of 60 were less concerned about the privacy of their health information compared to participants in the mid age range. King et al. [26] suggested this may be due to the “carefree” nature of younger generations, who were perceived to be more willing to share their personal information (e.g. on social-networking sites), and to older respondents no longer being invested in their careers and therefore being under less scrutiny. More recently the Wellcome Trust [3] found a non-linear relationship between acceptance of commercial access to health data and age and noted that young people are not automatically more supportive/accepting. These varying and, at times, conflicting findings point to the need for further research to explore the variations in perceptions and opinions across age groups.

Finally, the authors conducted a broad search of public responses to data sharing and data linkage in research that included studies looking at genetic dataFootnote 97 and medical-records data.Footnote 98 These topics were considered together with other papers discussing health, personal or administrative data or information for statistical, health, social or other research purposes. Some studies suggest genetic data is particularly sensitiveFootnote 99 or personal/potentially identifying.Footnote 100 In one study, participants perceived genetic data to be potentially less sensitive than information from medical records (e.g. information relating to reproductive or mental health).Footnote 101 Participants from another study reported no real variation in attitudes toward the use of medical records and biological samples.Footnote 102 In some studies, linking medical records data to biological samples raised concerns.Footnote 103 However, overall opinions were largely consistent with the key themes of this review.


The included studies point towards widespread support for uses of data in research, including for practices of data-linkage and data-sharing. However, this support is never unconditional. Key conditions for public support or acceptance relate to the research being in “the public interest” or for “the greater good” and to public trust in researchers or organisations handling/accessing their data. The themes of public benefits and public trust run through all the studies (explicitly or implicitly) and underpin all other areas of concern or interest. As has been noted elsewhere [27] trust—or trustworthiness—is increasingly recognised as being central in shaping public responses. However, the included studies do not point to clear relationships or hierarchies between particular areas of concern or conditions for support and there is a lack of evidence relating to the ways in which trade offs might be made or how preferences would be formed in reality. This may represent a valuable area to explore further in future research.

As the literature in this area has frequently observed, confidentiality is a key area of public concern and assurances of confidentiality appear to be important for ensuring public support. However, in the wider literature relating to secondary uses of data in health research there has been much debate about the value and implications of anonymisation, which is frequently described as presenting significant challenges [28–30]. For example, it is argued that a certain amount of identifying information is needed in order to allow updating, linkage or validation of data [30, 31]. Ohm has argued that ‘data can either be useful or perfectly anonymous but never both’ [32] (p.1704). Despite these challenges relating to anonymisation, confidentiality is largely discussed and understood in terms of anonymisation. The included studies which explored public attitudes towards confidentiality typically focussed on attitudes towards anonymisation of data.

Anonymisation is generally understood as the process of removing key identifiers such as names and dates of birth from personal data, thus rendering the identification of subjects highly unlikely. However, anonymisation is not straightforward and, as the MRC & Wellcome Trust suggest: ‘Because identifiability runs a spectrum, anonymisation is relative’ [2] (p.10). The UK Information Commissioner’s Office (ICO) has stated that ‘[i]n reality it can be difficult to determine whether data has been anonymised or is still personal data’ [33] (p.16). This ambiguity around anonymisation has implications for understanding public responses in this area. As Haddow et al. note, where studies have explored public attitudes ‘it is often unclear whether the research into publics’ views relates to fully anonymised data, the use of weaker forms of anonymisation or indeed fully identifiable data’ [34] (p. 1141). Therefore, whilst studies have reported public attitudes towards anonymisation, it is not always clear what members of the public understand anonymisation to mean, or what they perceive it to require.

There is evidence within the included studies that assurances of anonymisation may be important for members of the public. However, those studies which enabled greater reflection on the implications or practicalities of anonymisation (e.g. through deliberative methods) typically uncovered more nuanced positions, with members of the public often acknowledging that anonymisation is imperfect as a mechanism for protecting confidentiality and/or problematic for facilitating valuable research. Thus, anonymisation is not regarded as a panacea for addressing public concerns, and it may be fruitful to explore further public attitudes towards confidentiality, and the ways that this might be ensured, beyond anonymisation of data.

Similarly, whilst the extant literature in this area has focussed heavily on the role and challenges of consent in relation to data-sharing or data-linkage for research purposes, the included studies highlight that this may not be a fundamental requirement for public acceptability. Rather, the studies indicate that whilst autonomy, or individual control over one’s data, is highly valued, consent is acknowledged to be problematic. As in discussions of anonymisation, where study participants have had opportunities to reflect on and discuss consent, views typically shift from an initial preference for explicit opt-in consent towards more flexible models of either opt-out or varied consent. In some cases where study participants have been convinced of the value of research and the potential for public benefits, consent has been regarded as non-essential. However, the degree of control individuals describe as necessary relates to the extent to which they trust the institutions, organisations or individuals involved in processing or accessing their data. A recent study conducted by Ipsos MORI on behalf of the Wellcome Trust found that whilst participants in their deliberative workshops initially tended to express preferences for opt-in consent models, through the deliberative process they shifted to a position where they “felt that if they knew more about the processes and safeguards in place they might feel more empowered, and hence more open and trusting in the decision-making process around data collection and sharing (and may not, therefore, need to opt-in)” [3] (p.13). Control may therefore be facilitated through transparency and public engagement rather than direct or specific opt-in consent.
As such, the findings reported in the included studies suggest that rather than focussing on which consent mechanisms are most favoured by members of the public, it may be more valuable to focus on how relationships of trust are built up (and conversely eroded) and how trust can be facilitated within research and data-sharing or data-linkage processes including through public/patient engagement or involvement.

This represents an important finding of this review. The literature has often suggested consent may be a requirement for public acceptability, whilst simultaneously arguing that requirements for consent present obstacles to effective and necessary health research and/or surveillance [29, 30, 35]. One alternative to consent which is currently used in the United Kingdom and elsewhere is authorisation. In England, for example, the Confidentiality Advisory Group (CAG) advises on requests to access data for research where neither consent nor anonymisation is deemed practicable. Similarly, the Public Benefit and Privacy Panel (PBPP) in Scotland is responsible for advising on data access requests involving personal data held by the Information Services Division (ISD) of NHS National Services Scotland (NSS) and National Records of Scotland (NRS). Authorisation is now a widely used governance mechanism and authorising bodies play a significant role within the data sharing landscape. However, this review has found that to date the literature has not engaged with the subject of authorisation and there is a lack of evidence on public awareness of, or responses to, authorisation as a governance mechanism. The findings that individual level consent may not be crucial for public acceptance, and that trust in organisations and institutions may be more important in shaping public responses, point to the salience of public engagement relating to authorisation approaches. Future research ought to explore public responses to authorisation.

As well as highlighting important conditions for public support, the included studies also indicate a number of areas of public concern about research uses of data. These relate largely to the purposes the research is perceived to serve, and the extent to which it is considered to be in the public interest or likely to yield public benefits. There is significant concern about potential misuse or abuse of data with negative implications for individuals; however, there are also concerns about the potential for wider negative impacts from the outcomes of research. These relate to: the potential for data-sharing or data-linkage to enable or perpetuate mass surveillance and a perceived “Big Brother Society”; the potential for individuals or groups within society to be labelled as a result of data-linkage research and for such labelling to result in stigma or discriminatory treatment; and the potential for research based on analysis of large data-sets to be used to inform policies or practices designed “for the masses” rather than reflecting individual circumstances and needs. What is apparent in relation to all these concerns is the underlying questioning of whether the research and its potential impacts/outcomes are perceived to be in the public interest or likely to bring about public benefits. The potential for research to lead to harm (directly or indirectly) is an area of significant concern.

The studies identified in this review reveal generally lower levels of trust in private sector actors compared with public sector actors, alongside concern about private sector involvement in research. These concerns are often related to profit creation from use of individuals’ data and/or perceptions that data is routinely sold or passed on within the private sector. However, the studies do not suggest widespread opposition to private sector involvement; indeed, many study participants acknowledged the important role of private sector actors in conducting or facilitating valuable research. Public support/acceptance of private sector involvement was largely conditional on the extent to which the research was perceived to be in the public interest or to lead to public benefits (as has recently been found by the Wellcome Trust [3]). Profit creation was largely not perceived as a problem so long as public benefits were prioritised over profits. The extent to which this was expected to be the case depended on the level of trust study participants had in the individuals or organisations handling/accessing data.

An important observation to emerge from this thematic synthesis is the public’s appetite for more information about current research and data-sharing or data-linkage practices. Many of the included studies reveal that there is generally very low public awareness of current research practices and governance systems or safeguards in place. There is evidence that those studies which used deliberative methods and provided participants with opportunities to learn more about current, or planned practices led to greater support/acceptance, or less concern about research uses of data. Additionally, almost all included studies reported that participants expressed a desire for more information and/or greater transparency about the ways in which data are used in research and the safeguards in place to protect against misuse/abuse or harms. This is significant and indicates not only that more awareness raising is needed but also that there may be significant enthusiasm amongst the public to engage more directly with and in these forms of research. Awareness raising should not be approached as a simple process of one-way information provision but rather requires a more engaged approach in order to ensure that it addresses public interests, concerns or uncertainties. The findings reported in the literature indicate that greater transparency may be needed, however, as we have previously noted, “research/researchers will be more likely to be perceived as trustworthy if transparency and public engagement involve open dialogue with members of the public and opportunities for deliberation, rather than controlled dissemination of information” [27] (p.9).

Within the included studies members of the public have been conceptualised in a number of ways. Some studies have suggested that the use of data in research, and particularly data-linkage, is a complex area which is difficult for members of the public to understand or meaningfully engage with. This leads to suggestions that awareness raising should be used to reassure members of the public through simple information provision and reflects a deficit model approach to public understanding of science [36].Footnote 104 However, those studies which involved deliberative methods have demonstrated that members of the public were able and enthusiastic to engage in discussions on this subject and were competent and valuable deliberators.Footnote 105 The nuanced positions described within the included studies highlight the value of qualitative methods for not only revealing but also informing and developing public attitudes. In this way qualitative methods themselves, as forms of public engagement, may have a role to play in building trust, which in turn may underpin greater support for secondary uses of data. Increased use of qualitative methods might therefore be a building block for support. Such public engagement and qualitative research are increasingly frequent components of large science projects and represent, in part, efforts to increase public trust and to ensure Responsible Research and Innovation (RRI) [27].

Overall this thematic synthesis has also revealed that there is great scope for qualitative methods to be used more fully or effectively in this area. This thematic synthesis has focussed only on qualitative studies—or qualitative findings reported within mixed methods studies—yet in some cases qualitative methods had been used primarily to inform the design of quantitative studies.Footnote 106 Moreover, ten studies were excluded at the final stage due to their limited reporting of qualitative findings or their narrow, structured approach (e.g. qualitative methods being used to examine public responses to narrowly defined questions/hypotheses). Therefore it appears there may be a tendency for qualitative methods to be used largely as a means for informing subsequent quantitative methods, which in turn suggests an under-appreciation of the value of qualitative methods. Indeed, there is some evidence that qualitative methods may at times not be recognised as research methods. The authors found that only just over half of the included studiesFootnote 107 explicitly referred to ethical review procedures relating to the qualitative research while researchers in one study specifically stated that ethical approval was not required.Footnote 108

Study Limitations

Qualitative studies are sometimes criticised for their limited generalisability due to small and/or unrepresentative samples, and such criticisms might be levelled at the included studies within this thematic synthesis. The sample sizes ranged from 14 to 217 participants, with an average of 54.84. Moreover, many of the included studies focussed on particular groups such as those with particular health conditions/susceptibilities,Footnote 109 particular socio-demographic groupsFootnote 110 or with previous experience with research and/or data-sharing.Footnote 111 Additionally, it is important to note that while random or quota sampling was often usedFootnote 112 the qualitative methods relied upon people volunteering to participate in the research, which often involved a significant time commitment. Thus it might be speculated that those individuals who participated in these studies were more likely to be supportive of, or at least interested in, research, and that individuals who are less supportive or more sceptical of research might have been less inclined to participate. Whilst these factors mean that the studies cannot be taken as being representative of the views of the wider public, they remain valuable as indicators of the range of views within the public and particularly as illustrating how opinions are expressed and how they may be informed or influenced. This synthesis of the included studies has addressed some of the criticisms directed at qualitative studies by giving increased breadth through synthesising findings from a large (total) number of study participants and in a variety of contexts.


With ever-growing interest in secondary uses of data for health research, including practices of data linkage and data sharing, there has increasingly been attention directed at public acceptability of these practices. Public acceptability is recognised as crucial for ensuring the legitimacy of current practices and systems of governance. This systematic review and thematic synthesis has highlighted a growing body of evidence pointing towards widespread general, though conditional, support for data linkage and data sharing for research purposes. It has found that whilst a variety of concerns are raised (e.g. relating to confidentiality, individuals’ control over their data, uses and abuses of data and potential harms arising), where members of the public perceive there to be actual or potential public benefits arising from research and where they have trust in the individuals or organisations conducting and/or overseeing data linkage/sharing they are generally supportive. However, the thematic synthesis has also highlighted current low levels of awareness about existing practices and uses of data, and it points towards the need for greater awareness raising combined with opportunities for public engagement and deliberation. This will be important for ensuring the legitimacy of future health informatics research and for avoiding further public controversy.


  1. As indicated in Studies: 3, 7, 8, 10, 19, 21, 22, 25

  2. Studies: 2, 3, 4, 7, 8, 9, 10, 12, 13, 14, 15, 18, 19, 20, 21, 22, 25

  3. Studies: 7, 10, 11

  4. Studies: 1, 2, 6, 19, 21, 22

  5. Studies: 2, 3, 4, 5, 6, 9, 11, 14, 17, 18, 21, 22, 24

  6. Studies: 1, 3, 5, 6, 9, 11, 14, 15, 17, 20, 21, 22

  7. Study 5

  8. e.g. Study 6

  9. Studies: 8, 17

  10. Studies: 1, 5, 8, 11, 14

  11. Study 1

  12. Studies: 1, 3, 11, 14

  13. Studies: 1, 7, 8, 10, 11, 14, 18, 19, 21, 22, 24

  14. Studies: 4, 5, 12, 14, 18, 22

  15. Studies: 8, 10, 11, 14, 15, 17, 19, 20, 21, 25

  16. Studies: 7, 10, 19, 21

  17. Studies: 7, 8, 10, 11, 24

  18. Studies: 1, 5, 6, 8, 11, 14, 20, 22, 24

  19. Studies: 5, 8, 11

  20. Studies: 5, 6, 8, 11, 23

  21. Studies: 8, 10, 22

  22. Studies: 8, 11, 19, 21, 22, 25

  23. Studies: 1, 2, 4, 5, 6, 10, 11, 18, 21, 23, 25

  24. Studies: 7, 11, 13, 19, 23

  25. Study 5

  26. Studies: 4, 5, 8, 9, 14, 15, 17, 21, 22, 24

  27. Studies: 21, 22

  28. Studies: 1, 2, 4, 5, 6, 10, 11, 18, 21, 23, 25

  29. Studies: 4, 12, 14, 18, 23, 25

  30. Studies: 2, 12, 14, 18, 23, 25

  31. Studies: 4, 14, 15, 17, 25

  32. Studies: 4, 25

  33. Studies: 5, 10, 12, 18, 25

  34. Studies: 12, 15, 23, 25

  35. Studies: 2, 11, 12, 14, 20, 22, 23, 24

  36. Studies: 14, 15, 19, 23

  37. Studies: 10, 19

  38. Studies: 4, 5, 12, 18, 22, 23, 24

  39. Studies: 16, 23, 24

  40. Studies: 11, 15, 16, 20, 21, 23, 25

  41. Studies: 8, 10, 12, 17, 21, 22, 23

  42. Studies: 5, 20, 21

  43. Study: 3

  44. Studies: 18, 19, 24

  45. Study: 18

  46. Studies: 8, 17, 20, 23, 25

  47. Studies: 4, 5, 8, 10, 11, 12, 13, 14, 21, 22

  48. Studies: 4, 8, 9, 11, 12, 14, 19, 21, 22, 23, 25

  49. Studies: 5, 11, 21, 24

  50. Studies: 5, 9, 11, 12, 13, 19, 21, 22, 23

  51. Study: 14

  52. Studies: 17, 21, 23

  53. Studies: 5, 11, 13, 14, 22

  54. Studies: 1, 5, 6, 8, 11, 14, 19

  55. Studies: 1, 5, 6, 9, 11, 12, 17, 24

  56. Studies: 6, 7, 11, 15, 16, 17, 20, 22, 23

  57. Studies: 5, 6, 7, 8, 9, 10, 11, 12, 14, 17, 20, 22, 24

  58. Studies: 4, 5, 10, 11, 12, 14, 18, 21, 22, 23, 25

  59. Studies: 6, 10, 11, 20, 22

  60. Studies: 6, 7, 10, 11, 21, 22, 25

  61. Studies: 3, 22

  62. Studies: 6, 10, 14, 18, 21, 25

  63. Study: 18

  64. Study: 6

  65. Studies: 6, 11, 22

  66. Studies: 17, 25

  67. Studies: 8, 11, 21, 22

  68. Studies: 6, 7, 8

  69. Studies: 6, 8

  70. Study: 15

  71. Studies: 5, 10, 15, 21, 25

  72. Studies: 6, 11, 15

  73. Studies: 9, 11, 12, 22, 24

  74. Studies: 7, 11, 12, 20, 22

  75. Study: 4

  76. Studies: 2, 4, 6, 11, 14, 22

  77. Studies: 2, 4, 6, 8, 9, 11, 13, 14, 21

  78. Studies: 6, 11, 21

  79. Studies: 5, 7, 8, 14, 15, 17, 25

  80. Studies: 4, 17, 20, 22, 24, 25

  81. Studies: 5, 6, 11

  82. Studies: 5, 14, 15

  83. Study: 17

  84. Studies: 2, 4, 5, 6, 7, 9, 10, 11, 12, 13, 14, 15, 19, 21, 23, 24

  85. Studies: 12, 14, 18, 21

  86. Studies: 6, 7, 11, 14, 15, 18, 19

  87. Studies: 5, 6, 11

  88. Studies: 5, 6, 15

  89. Study: 2

  90. Study: 4

  91. Study: 9

  92. Study: 6

  93. Studies: 22, 23

  94. Study: 14

  95. Study: 23

  96. Study: 22

  97. Studies: 2, 9, 13, 19, 22, 23

  98. Studies: 2, 4, 10, 15, 16, 17, 20, 24, 25

  99. Studies: 13, 14

  100. Studies: 2, 11, 12, 13, 18, 24

  101. Study: 22

  102. Study: 2

  103. Studies: 9, 18, 22

  104. Such a position is implicit in Studies: 7, 10, 11, 14, 15, 16, 17, 19

  105. Studies: 4, 5, 6, 18

  106. Studies: 4, 13, 24, 25

  107. Studies: 3, 4, 9, 10, 12, 13, 15, 17, 19, 20, 22, 23, 24, 25

  108. Study 17

  109. Studies: 3, 8, 12, 14, 16

  110. Studies: 2, 4, 6, 8, 9, 10, 17, 19, 24

  111. Studies: 12, 13, 22, 23, 24

  112. Studies: 4, 5, 6, 7, 10, 11, 14, 20, 22, 23


  1. WMA. WMA Declaration on Ethical Considerations Regarding Health Databases. WMA; 2002. Accessed 09 Jun 2016.

  2. MRC & Wellcome Trust. Access to Collections of Data and Materials for Health Research. Wellcome Trust; 2006. Accessed 03 Nov 2016.

  3. Wellcome Trust. The One-Way Mirror: Public attitudes to commercial access to health data. 2016.

  4. Bradwell & Gallagher. FYI: The new politics of personal information. 2007. Accessed 03 Nov 2016.

  5. Robling MR, Hood K, Houston H, Pill R, Fay J, Evans HM. Public attitudes towards the use of primary care patient record data in medical research without consent: a qualitative study. J Med Ethics. 2004;30(1):104–9.

  6. Barbour RS, Barbour M. Evaluating and synthesising qualitative research: the need to develop a distinctive approach. J Eval Clin Pract. 2003;9(2):179–86.

  7. Pope C, Ziebland S, Mays N. Qualitative research in health care: Analysing qualitative data. Br Med J. 2000;320:114–6.

  8. Popay J, Rogers A, Williams G. Standards for the systematic review of qualitative literature in health services research. Qual Health Res. 1998;8:341–51.

  9. Davidson S, McLean C, Treanor S, Aitken M, Cunningham-Burley S, Laurie G, Sethi N, Pagliari C. Public acceptability of data sharing between the public, private and third sectors for research purposes. (Social Research series). Edinburgh: Scottish Government; 2013.

  10. Wellcome Trust. Enabling data linkage to maximise the value of public health research data: full report. Wellcome Trust; 2015. Accessed 03 Nov 2016.

  11. Holman CDJ, Bass AJ, Rosman DL, Smith MB, Semmens JB, Glasson EJ, Brook EL, Trutwein B, Rouse IL, Watson CR, de Klerk NH, Stanley FJ. A decade of data linkage in Western Australia: strategic design, applications and benefits of the WA data linkage system. Aust Health Rev. 2008;32(4):766–77.

  12. Young TK, Kliewer E, Blanchard J, Mayer T. Monitoring disease burden and preventive behavior with data linkage: cervical cancer among aboriginal people in Manitoba, Canada. Am J Public Health. 2000;90(9):1466–8.

  13. Fischbacher CM, Bhopal R, Povey C, Steiner M, Chalmers J, Mueller G, Jamieson J, Knowles D. Record linked retrospective cohort study of 4.6 million people exploring ethnic variations in disease: myocardial infarction in South Asians. BMC Public Health. 2007;7:142.

  14. Veugelers PJ, Yip AM, Kephart G. Proximate and contextual socioeconomic determinants of mortality: multilevel approaches in a setting with universal health care coverage. Am J Epidemiol. 2001;154(8):725–32.

  15. Jutte DP, Roos LL, Brownell MD. Administrative record linkage as a tool for public health research. Annu Rev Public Health. 2011;32:91–108.

  16. Kariminia A, Butler TG, Corben SP, Levy MH, Grant L, Kaldor JM, Law MG. Extreme cause-specific mortality in a cohort of adult prisoners—1988 to 2002: a data-linkage study. Int J Epidemiol. 2007;36:310–6.

  17. CASP. CASP Qualitative Checklist. 2013.

  18. Thomas J, Harden A. Methods for the thematic synthesis of qualitative research in systematic reviews. BMC Med Res Methodol. 2008;8(45):1–10.

  19. Britten N, Campbell R, Pope C, Donovan J, Morgan M, Pill R. Using meta-ethnography to synthesise qualitative research: A worked example. J Health Serv Res Policy. 2002;7(4):209–15.

  20. Ishikawa H, Yamazaki Y. How applicable are western models of patient-physician relationship in Asia?: Changing patient-physician relationship in contemporary Japan. Int J Jpn Sociol. 2005;14:84–93.

  21. Pentz RD, Billot L, Wendler D. Research on stored biological samples: Views of African American and White American cancer patients. Am J Med Genet. 2006;140A:733–9.

  22. Mezuk B, Eaton WW, Zandi P. Participant characteristics that influence consent for genetic research in a population-based survey: the Baltimore epidemiological catchment area follow-up. Community Genet. 2008;11:171–8.

  23. Hartz SM, Johnson EO, Saccone NL, Hatsukami D, Breslau N, Bierut LJ. Inclusion of African Americans in genetic studies: what is the barrier? Am J Epidemiol. 2010;174(3):336–44.

  24. Devers K, Gray B, Ramos C, Shah A, Blavin F, Waidmann T. The feasibility of using electronic health records (EHRs) and other electronic health data for research on small populations. Washington: Urban Institute; 2013.

  25. Buckley BS, Murphy AW, MacFarlane AE. Public attitudes to the use in research of personal health information from general practitioners’ records: a survey of the Irish general public. J Med Ethics. 2011;37:50–5.

  26. King T, Brankovic L, Gillard P. Perspectives of Australian adults about protecting the privacy of their information in statistical databases. Int J Med Inform. 2012;81:279–89.

  27. Aitken M, Cunningham-Burley S, Pagliari C. Moving from trust to trustworthiness: Experiences of public engagement in the Scottish Health Informatics Programme. Sci Public Policy. 2016. doi:10.1093/scipol/scv075.

  28. Campbell B, Thomson H, Slater J, Coward C, Wyatt K, Sweeney K. Extracting information from hospital records: what patients think about consent. Qual Safety Health Care. 2007;16:404–8.

  29. Chalmers J, Muir R. Patient Privacy and Confidentiality: The debate goes on; the issues are complex but a consensus is emerging. Br Med J. 2003;326:725–6.

  30. Verity C, Nicoll A. Consent, confidentiality and the threat to public health surveillance. Br Med J. 2002;324:1210–3.

  31. Strobl J, Cave E, Walley T. Data Protection Legislation: Interpretation and barriers to research. Br Med J. 2000;321:890–2.

  32. Ohm P. Broken promises of privacy: responding to the surprising failure of anonymity. UCLA Law Rev. 2010;57:1701–77.

  33. ICO. Anonymisation: managing data protection risk code of practice. Cheshire: Information Commissioner’s Office (ICO); 2012.

  34. Haddow G, Bruce A, Sathanandam S, Wyatt JC. “Nothing is really safe”: a focus group study on the processes of anonymizing and sharing of health data for research purposes. J Eval Clin Pract. 2011;17(6):1140–6.

  35. Damschroder LJ, Pritts JL, Neblo MA, Kalarickal RJ, Creswell JW, Hayward RA. Patients, privacy and trust: Patients’ willingness to allow researchers to access their medical records. Soc Sci Med. 2007;64(1):223–35.

  36. Gross AG. The roles of rhetoric in the public understanding of science. Public Underst Sci. 1994;3:3–23.

  37. Booth A. Cochrane or cock-eyed? How should we conduct systematic reviews of qualitative research? 2001. Accessed 03 Nov 2016.

  38. Sandelowski M, Barroso J. Classifying the findings in qualitative studies. Qual Health Res. 2003;13(17):905–23.



Not Applicable.


We acknowledge the support from The Farr Institute of Health Informatics Research.

The UK HIRN is supported by a 10-funder consortium: Arthritis Research UK, the British Heart Foundation, Cancer Research UK, the Economic and Social Research Council, the Engineering and Physical Sciences Research Council, the Medical Research Council, the National Institute for Health Research, the National Institute for Social Care and Health Research (Welsh Assembly Government), the Chief Scientist Office (Scottish Government Health Directorates) and the Wellcome Trust (MRC Grant No: MR/M501633/2).

Availability of data and materials

The dataset supporting the conclusions of this article is outlined in Tables 3 and 4 of this article.

Authors’ contributions

MA and SCB contributed to the conception and design of the study. MA and JSJ developed the protocol for this study and conducted the systematic review. MA and JSJ coded all of the included studies. SCB, CP and RJ each coded a subset of the included studies. All authors collaborated in refining the coding frame and analysing the findings. MA and JSJ drafted the manuscript; SCB, CP and RJ critically revised it. All authors have read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Author information


Corresponding author

Correspondence to Mhairi Aitken.

Appendix 1

Adapted search strategy used in MEDLINE

  1. (Lay or patient* or public or citizen$1).mp. [mp = title, abstract, original title, name of substance word, subject heading word, keyword heading word, protocol supplementary concept word, rare disease supplementary concept word, unique identifier]

  2. exp Patients/

  3. 1 or 2

  4. (Attitude* or view$1 or perspective* or opinion$1).mp. [mp = title, abstract, original title, name of substance word, subject heading word, keyword heading word, protocol supplementary concept word, rare disease supplementary concept word, unique identifier]

  5. exp attitude to health/ or health knowledge, attitudes, practice/

  6. 4 or 5

  7. 3 and 6

  8. exp Public Opinion/

  9. 7 or 8

  10. (Record$1 or data).mp. [mp = title, abstract, original title, name of substance word, subject heading word, keyword heading word, protocol supplementary concept word, rare disease supplementary concept word, unique identifier]

  11. (Share* or sharing or link$2 or linkage).mp. [mp = title, abstract, original title, name of substance word, subject heading word, keyword heading word, protocol supplementary concept word, rare disease supplementary concept word, unique identifier]

  12. 10 and 11

  13. exp Medical Record Linkage/

  14. 12 or 13

  15. [mp = title, abstract, original title, name of substance word, subject heading word, keyword heading word, protocol supplementary concept word, rare disease supplementary concept word, unique identifier]

  16. *research/ or exp behavioral research/ or exp biomedical research/ or exp community-based participatory research/ or exp empirical research/ or exp human experimentation/ or exp operations research/ or exp parapsychology/ or exp peer review, research/ or exp research design/ or exp research report/ or exp social validity, research/

  17. 15 or 16

  18. (Access or purpose$1).mp. [mp = title, abstract, original title, name of substance word, subject heading word, keyword heading word, protocol supplementary concept word, rare disease supplementary concept word, unique identifier]

  19. 17 and 18

  20. 9 and 14 and 19

  21. (qualitative or ethnograph* or “grounded theory” or “in depth interview$1” or “*structured interview$1” or “focus group$1” or “case study” or “case studies” or “case series” or “citizen$2 jury” or “citizen$2 juries” or vignette* or observation*).mp. [mp = title, abstract, original title, name of substance word, subject heading word, keyword heading word, protocol supplementary concept word, rare disease supplementary concept word, unique identifier]

  22. exp qualitative research/

  23.

  24. exp Focus Groups/

  25. exp Observation/

  26. 21 or 22 or 23 or 24 or 25

  27. 20 and 26
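
The combination lines in the strategy above (for example line 3, "1 or 2", or line 20, "9 and 14 and 19") act as set operations on the records retrieved by earlier lines. The following is a minimal illustrative sketch in Python and is not part of the original search. It assumes that "or" denotes set union and "and" set intersection, as in standard Boolean database searching. The names `COMBINATIONS`, `evaluate` and `leaf_hits` are hypothetical, and the record IDs are invented purely to show how the final line narrows the result.

```python
# Combination lines copied from the strategy, keyed by line number.
# ("or", [...]) is the union of earlier lines; ("and", [...]) the intersection.
COMBINATIONS = {
    3: ("or", [1, 2]),
    6: ("or", [4, 5]),
    7: ("and", [3, 6]),
    9: ("or", [7, 8]),
    12: ("and", [10, 11]),
    14: ("or", [12, 13]),
    17: ("or", [15, 16]),
    19: ("and", [17, 18]),
    20: ("and", [9, 14, 19]),
    26: ("or", [21, 22, 23, 24, 25]),
    27: ("and", [20, 26]),
}


def evaluate(line, leaf_hits):
    """Return the set of record IDs retrieved by a given strategy line.

    Lines that are not combinations (free-text and subject-heading
    searches) are looked up in leaf_hits; a missing line yields no records.
    """
    if line not in COMBINATIONS:
        return set(leaf_hits.get(line, set()))
    op, parts = COMBINATIONS[line]
    sets = [evaluate(part, leaf_hits) for part in parts]
    return set.union(*sets) if op == "or" else set.intersection(*sets)


# Invented record IDs for the term and subject-heading lines (assumption).
leaf_hits = {
    1: {"a", "b", "c"},
    2: {"d"},
    4: {"a", "b", "d"},
    5: set(),
    8: {"e"},
    10: {"a", "b", "d", "e"},
    11: {"a", "d", "e"},
    13: {"b"},
    15: {"a", "b", "d", "e"},
    16: set(),
    18: {"a", "b", "e"},
    21: {"a", "c", "e"},
}

print(sorted(evaluate(27, leaf_hits)))  # prints ['a', 'e']
```

In this toy example, record "b" satisfies the public-attitudes, data-linkage and research-purpose blocks (line 20) but is dropped at line 27 because it matches none of the qualitative-methods lines (21 to 25).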

Rights and permissions

Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Aitken, M., de St. Jorre, J., Pagliari, C. et al. Public responses to the sharing and linkage of health data for research purposes: a systematic review and thematic synthesis of qualitative studies. BMC Med Ethics 17, 73 (2016).
