# Sungkyu Jung

## Contact Details

NameSungkyu Jung |
||

Affiliation |
||

Location |
||

## Pubs By Year |
||

## Pub CategoriesStatistics - Methodology (5) Mathematics - Metric Geometry (3) Mathematics - Statistics (2) Statistics - Theory (2) Statistics - Machine Learning (2) Statistics - Applications (1) Computer Science - Learning (1) |

## Publications Authored By Sungkyu Jung

In modern biomedical research, it is ubiquitous to have multiple data sets measured on the same set of samples from different views (i.e., multi-view data). Read More

We investigate a geometric structure on ${\rm Sym}^+(p)$, the set of $p \times p$ symmetric positive-definite matrices, based on eigen-decomposition. Eigenstructure determines both a stratification of ${\rm Sym}^+(p)$, defined by eigenvalue multiplicities, and fibers of the "eigen-composition" map $F:M(p):=SO(p)\times{\rm Diag}^+(p)\to{\rm Sym}^+(p)$. The fiber structure leads to the notions of {\em scaling-rotation distance} between $X,Y\in {\rm Sym}^+(p)$, the distance in $M(p)$ between fibers $F^{-1}(X)$ and $F^{-1}(Y)$, and {\em minimal smooth scaling-rotation (MSSR) curves} [Jung et al. Read More

We present a method for individual and integrative analysis of high dimension, low sample size data that capitalizes on the recurring theme in multivariate analysis of projecting higher dimensional data onto a few meaningful directions that are solutions to a generalized eigenvalue problem. We propose a general framework, called SELP (Sparse Estimation with Linear Programming), with which one can obtain a sparse estimate for a solution vector of a generalized eigenvalue problem. We demonstrate the utility of SELP on canonical correlation analysis for an integrative analysis of methylation and gene expression profiles from a breast cancer study, and we identify some genes known to be associated with breast carcinogenesis, which indicates that the proposed method is capable of generating biologically meaningful insights. Read More

We consider dimension reduction of multivariate data under the existence of various types of auxiliary information. We propose a criterion that provides a series of orthogonal directional vectors, that form a basis for dimension reduction. The proposed method can be thought of as an extension from the continuum regression, and the resulting basis is called continuum directions. Read More

Modeling deformations of a real object is an important task in computer vision, biomedical engineering and biomechanics. In this paper, we focus on a situation where a three-dimensional object is rotationally deformed about a fixed axis, and assume that many independent observations are available. Such a problem is generalized to an estimation of concentric, co-dimension 1, subspheres of a polysphere. Read More

When functional data manifest amplitude and phase variations, a commonly-employed framework for analyzing them is to take away the phase variation through a function alignment so that standard statistical tools can be applied to the aligned functions. A downside of this approach is that the important variations contained in the phases are completely ignored. To combine both of amplitude and phase variations, we propose a principal component analysis (PCA) that aims to capture a non-linear principal component that simultaneously explains the amplitude, phase and their associations. Read More

We investigate a geometric computational framework, called the "scaling-rotation framework", on ${\rm Sym}^+(p)$, the set of $p \times p$ symmetric positive-definite (SPD) matrices. The purpose of our study is to lay geometric foundations for statistical analysis of SPD matrices, in situations in which eigenstructure is of fundamental importance, for example diffusion-tensor imaging (DTI). Eigen-decomposition, upon which the scaling-rotation framework is based, determines both a stratification of ${\rm Sym}^+(p)$, defined by eigenvalue multiplicities, and fibers of the "eigen-composition" map $SO(p)\times{\rm Diag}^+(p)\to{\rm Sym}^+(p)$. Read More

We introduce a new geometric framework for the set of symmetric positive-definite (SPD) matrices, aimed to characterize deformations of SPD matrices by individual scaling of eigenvalues and rotation of eigenvectors of the SPD matrices. To characterize the deformation, the eigenvalue-eigenvector decomposition is used to find alternative representations of SPD matrices, and to form a Riemannian manifold so that scaling and rotations of SPD matrices are captured by geodesics on this manifold. The problems of non-unique eigen-decompositions and eigenvalue multiplicities are addressed by finding minimal-length geodesics, which gives rise to a distance and an interpolation method for SPD matrices. Read More

Set classification problems arise when classification tasks are based on sets of observations as opposed to individual observations. In set classification, a classification rule is trained with $N$ sets of observations, where each set is labeled with class information, and the prediction of a class label is performed also with a set of observations. Data sets for set classification appear, for example, in diagnostics of disease based on multiple cell nucleus images from a single tissue. Read More

We propose a new approach to analyze data that naturally lie on manifolds. We focus on a special class of manifolds, called direct product manifolds, whose intrinsic dimension could be very high. Our method finds a low-dimensional representation of the manifold that can be used to find and visualize the principal modes of variation of the data, as Principal Component Analysis (PCA) does in linear spaces. Read More

Principal Component Analysis (PCA) is an important tool of dimension reduction especially when the dimension (or the number of variables) is very high. Asymptotic studies where the sample size is fixed, and the dimension grows [i.e. Read More