Skip to main navigation Skip to search Skip to main content

Improvements on a privacy-protection algorithm for DNA sequences with generalization lattices

  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

When developing personal DNA databases, there must be an appropriate guarantee of anonymity, which means that the data cannot be related back to individuals. DNA lattice anonymization (DNALA) is a successful method for making personal DNA sequences anonymous. However, it uses time-consuming multiple sequence alignment and a low-accuracy greedy clustering algorithm. Furthermore, DNALA is not an online algorithm, and so it cannot quickly return results when the database is updated. This study improves the DNALA method. Specifically, we replaced the multiple sequence alignment in DNALA with global pairwise sequence alignment to save time, and we designed a hybrid clustering algorithm comprised of a maximum weight matching (MWM)-based algorithm and an online algorithm. The MWM-based algorithm is more accurate than the greedy algorithm in DNALA and has the same time complexity. The online algorithm can process data quickly when the database is updated.

Original languageEnglish
Pages (from-to)1-9
Number of pages9
JournalComputer Methods and Programs in Biomedicine
Volume108
Issue number1
DOIs
StatePublished - Oct 2012
Externally publishedYes

Keywords

  • Personal DNA database
  • Privacy protection
  • k-Anonymity

Fingerprint

Dive into the research topics of 'Improvements on a privacy-protection algorithm for DNA sequences with generalization lattices'. Together they form a unique fingerprint.

Cite this