Gaussian kernel with correlated variables for incomplete data

Jeongsub Choi, Youngdoo Son, Myong K. Jeong

Research output: Contribution to journal, Article, peer-review

Abstract

Missing components in incomplete instances prevent a kernel-based model from using the partially observed components of those instances and from computing kernels, including the Gaussian kernels that are used extensively in machine learning modeling and applications. Existing methods that handle incomplete data with Gaussian kernels, however, assume independence among variables. In this study, we propose a new method, the expected Gaussian kernel with correlated variables, that estimates the Gaussian kernel from incomplete data while accounting for the correlation among variables. In the proposed method, the squared distance between two instance vectors is modeled as the sum of the correlated squared unit-dimensional distances between the instances, and the Gaussian kernel with missing values is obtained by estimating the expected Gaussian kernel function under the probability distribution of the squared distance between the vectors. The proposed method is evaluated on synthetic data, on real-life benchmark data, and on a case from a multi-pattern photolithographic process for wafer fabrication in semiconductor manufacturing. The experimental results show that the proposed method improves the estimation of Gaussian kernels for incomplete data with correlated variables.
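The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; it assumes (a) the kernel is parameterized as exp(-gamma * D), where D is the squared distance, (b) the means and the covariance matrix of the per-dimension squared distances are already available, and (c) D is approximated by a gamma distribution with matched mean and variance, as the "Gamma approximation" keyword suggests. The function names `sum_moments` and `expected_gaussian_kernel` are hypothetical. Under these assumptions the expectation has the closed form (1 + gamma * scale) ** (-shape), which is the gamma moment generating function evaluated at -gamma.

```python
import math


def sum_moments(means, cov):
    """Mean and variance of a sum of (possibly correlated) terms.

    means: expected squared distance per dimension (assumed given).
    cov:   covariance matrix of the per-dimension squared distances.
    The variance of a sum is the sum of ALL covariance entries,
    so off-diagonal entries carry the correlation between dimensions.
    """
    mean = sum(means)
    var = sum(sum(row) for row in cov)
    return mean, var


def expected_gaussian_kernel(mean_d, var_d, gamma):
    """E[exp(-gamma * D)] with D approximated by a moment-matched gamma.

    Assumption: kernel form exp(-gamma * D); this parameterization is
    chosen for illustration and may differ from the paper's.
    """
    if mean_d < 0 or var_d < 0 or gamma < 0:
        raise ValueError("mean_d, var_d and gamma must be non-negative")
    if mean_d == 0 or var_d == 0:
        # Degenerate case: D is a known constant, so no expectation is needed.
        return math.exp(-gamma * mean_d)
    shape = mean_d ** 2 / var_d   # gamma shape from moment matching
    scale = var_d / mean_d        # gamma scale from moment matching
    return (1.0 + gamma * scale) ** (-shape)


# Illustrative numbers only (not taken from the paper).
means = [1.0, 2.0]
cov_correlated = [[1.0, 0.5], [0.5, 2.0]]
cov_independent = [[1.0, 0.0], [0.0, 2.0]]

k_corr = expected_gaussian_kernel(*sum_moments(means, cov_correlated), gamma=0.5)
k_ind = expected_gaussian_kernel(*sum_moments(means, cov_independent), gamma=0.5)
```

With these made-up inputs, dropping the off-diagonal covariances changes the variance of the squared distance and therefore the kernel estimate, which is the effect the independence assumption ignores.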

Original language: English
Pages (from-to): 223-244
Number of pages: 22
Journal: Annals of Operations Research
Volume: 341
Issue number: 1
State: Published - Oct 2024

Keywords

  • Gamma approximation
  • Gaussian kernel
  • Incomplete data
  • Semiconductor manufacturing
