Thesis
Random matrices and applications to data filtering
Washington State University
Master of Science (MS), Washington State University
2003
Handle:
https://hdl.handle.net/2376/142
Abstract
Preserving privacy is becoming an important issue in data mining. Random perturbation is a widely used technique to protect privacy of sensitive data values. This technique hides the true data records by modifying the data values using additive random noise, but can still estimate the data distribution from the perturbed data set. Our question in this thesis is: does this method really preserve privacy? Large random matrices have many properties. Random matrix theory has been widely used in nuclear physics, the study of the zeros of Reimann zeta function, the study of chaotic systems, and signal processing. After investigating some asymptotic properties of eigenvalues of covariance matrices, this thesis presents a random matrix-based data filtering technique, which we call spectral filtering. This method analyzes the eigenstates (spectrum) of the sample covariance matrix of observed data, and identifies the noisy eigenstates by applying the known asymptotic properties of random matrices. Experiments in the thesis show that this technique can produce good estimate of the true data for a reasonable value of the signal-to-noise ratio (SNR). This questions the effectiveness of additive random perturbation technique. Although spectral filtering technique has the ability to breach privacy in additive random perturbation, there exist other data mining techniques that can preserve privacy. One example is multi-party secure sum computation. We examine the relation between the perturbed values using this method and the original ones, and prove that they are statistically independent. This means that it is not possible to estimate the true data values from the perturbed values. Hence multi-party secure sum computation preserves privacy.
Metrics
46 File views/ downloads
15 Record Views
Details
- Title
- Random matrices and applications to data filtering
- Creators
- Qi Wang
- Contributors
- Krishnamoorthy Sivakumar (Degree Supervisor)
- Awarding Institution
- Washington State University
- Academic Unit
- Electrical Engineering and Computer Science, School of
- Theses and Dissertations
- Master of Science (MS), Washington State University
- Publisher
- Washington State University; Pullman, Wash. :
- Identifiers
- 99900525083501842
- Language
- English
- Resource Type
- Thesis