November 4, 2015 By Douglas Bonderud

What’s the most valuable piece of data owned by individuals? A look at recent data breaches suggested that financial information is sought by malicious actors to generate short-term gains, while health care information offers long-term opportunity for cybercriminals to wreak havoc.

According to Threatpost, however, a pair of Stanford researchers has now upped the ante by discovering a vulnerability in The Beacon Project, a genome-sharing network. With enough motivation, time and effort, it may be possible for cybercriminals to uncover critical genetic information about specific individuals.

Vulnerability in a Post-Privacy World

All users assume some risk by leveraging online services. In the case of sites like Facebook, for example, real names, addresses and even birthdates may be up for grabs if someone hacks the network, while compromised bank databases could mean thousands lost to fraud. The result is a kind of lowered privacy standard that has users exchanging a measure of their safety for access to certain services or products. But a recent Travelers survey suggested that privacy concerns aren’t so old-fashioned just yet: The report found that 6 out of every 10 Americans “worry about losing personal information or privacy,” with one-quarter of those “worrying a great deal.”

Consider recent issues surrounding Microsoft’s new OS, Windows 10. The company built in a number of data collection measures that it claims help improve system performance and health, CSO Online reported. Users can opt out of some collection but not all; according to Corporate Vice President Joe Belfiore, certain pieces of user data “are not personal information or are not related to privacy.” Unsurprisingly, consumers and watchdog groups do not agree.

Cracking the Code

If users are up in arms about data collected by their operating system and worried about the broader scope of identity theft, the Stanford team’s findings indicated a whole new landscape of potential threats. It goes like this: The Beacon Project allows anonymous pings to its genome servers, known as beacons, to check incoming genetic code against what’s already stored. Each beacon typically contains the genetic information of 1,000 individuals, with each record stripped of any identifying characteristics.

The use of anonymous queries, however, leads to a vulnerability: Attackers with access to an individual’s genome sequence along with The Beacon Project’s infrastructure could theoretically determine if the victim falls into a specialized group — such as people with heart disease or autism — by sending just 5,000 anonymous queries.
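To make the idea concrete, here is a minimal toy simulation in Python. It is only a sketch under loud assumptions: the `ToyBeacon` class, the `has_variant` query, the `random_genome` helper and the numbers chosen (100 members, 5,000 positions, a 1 percent rare-variant rate) are all hypothetical and are not taken from The Beacon Project or from the Stanford team's work, whose actual statistical method is not described here. The sketch simply asks the beacon about each rare variant the victim carries and looks at how often the answer is yes.

```python
import random


class ToyBeacon:
    """Hypothetical stand-in for a beacon: it only answers yes/no presence queries."""

    def __init__(self, genomes):
        # Pool every member's rare variants; individual records are never exposed.
        self._variants = set()
        for genome in genomes:
            self._variants |= genome

    def has_variant(self, position):
        # Anonymous query: "does anyone in this beacon carry a variant here?"
        return position in self._variants


def random_genome(rng, n_positions=5000, rare_freq=0.01):
    # Assumption: a genome is modeled as the set of positions with a rare variant.
    return {pos for pos in range(n_positions) if rng.random() < rare_freq}


def yes_fraction(beacon, genome, max_queries=5000):
    # Query the beacon about the victim's own rare variants, up to a query budget.
    queries = sorted(genome)[:max_queries]
    if not queries:
        return 0.0
    hits = sum(beacon.has_variant(pos) for pos in queries)
    return hits / len(queries)


rng = random.Random(0)  # fixed seed so the toy run is repeatable
members = [random_genome(rng) for _ in range(100)]
outsider = random_genome(rng)
beacon = ToyBeacon(members)

member_score = yes_fraction(beacon, members[0])
outsider_score = yes_fraction(beacon, outsider)
print("member yes-fraction:  ", member_score)
print("outsider yes-fraction:", outsider_score)
```

In this toy model, a person whose genome is in the beacon gets a yes for every one of their rare variants, while an outsider gets a yes only when some member happens to share the variant. A real attack would also have to account for sequencing errors and relatedness between individuals, which this sketch ignores.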

At first glance, this doesn’t look like a commonplace scenario since attackers need a way to obtain users’ genetic information before attempting to compromise the Beacon system. According to The Register, the Global Alliance for Genomics and Health (GA4GH) — which runs The Beacon Project — believes it has done enough to safeguard this stored information. It does acknowledge, however, that it may be possible to reidentify individuals if malicious actors already possess their genome sequence.

But here’s the bigger concern: As genome databases grow and access to this data becomes more commonplace, the distance between “unknown” and “maliciously compromised” begins to shrink. For cybercriminals, a bigger attack surface and the trickle-down of stored data from first parties to subcontracted third parties represent an ideal threat vector.

So what’s the real risk to genome data? Right now, fairly low: Attackers would need to sacrifice substantial amounts of time and effort for a relatively small return. Economies of scale, however, suggest that as the genome market broadens, this vulnerability may shift from mere research curiosity to a real-world issue.
