Journal Papers
Online tensor low-rank representation for streaming data clustering
T. Wu
IEEE Transactions on Circuits and Systems for Video Technology, vol. 33, no. 2, pp. 602-617, Feb. 2023.
[abs] [pdf]
Tensor low-rank representation (TLRR) has attracted increasing attention in recent years due to its effectiveness in capturing the intrinsic low-rank structure of tensor data. However, it is known that TLRR suffers from high computational complexity for large-scale data because it minimizes the tensor rank of the representation tensor whose size is proportional to the sample size. This paper develops a streaming algorithm—termed online low-rank tensor subspace clustering (OLRTSC)—for low-rank tensor data recovery and clustering based on the tensor singular value decomposition (t-SVD) algebraic framework. The key idea of OLRTSC is the non-convex reformulation of the objective function of TLRR by decomposing the tensor nuclear norm into an explicit tensor product of two low-rank tensors, which can be solved by leveraging the stochastic optimization technique. Compared to batch TLRR, OLRTSC is online by nature, can handle dynamic data, avoids computing t-SVD of the representation tensor, and reduces the storage cost significantly. This paper provides the theoretical guarantee that the sequence of the solutions produced by OLRTSC converges almost surely to a stationary point of the objective function. Moreover, an extension of OLRTSC is also proposed to handle the case when parts of the data are missing. Finally, experimental results on both synthetic and real data demonstrate the efficiency and robustness of the proposed approaches.
|
Graph regularized low-rank representation for submodule clustering
T. Wu
Pattern Recognition, vol. 100, 107145, Apr. 2020.
[abs] [pdf]
In this paper, a new submodule clustering method for imaging (2-D) data is proposed. Unlike most existing clustering methods that first convert such data into vectors as preprocessing, the proposed method arranges the data samples as lateral slices of a third-order tensor. Our algorithm is based on the union-of-free-submodules model and the samples are represented using t-product in the third-order tensor space. First, we impose a low-rank constraint on the representation tensor to capture the principle information of data. By incorporating manifold regularization into the tensor factorization, the proposed method explicitly exploits the local manifold structure of data. Meanwhile, a segmentation dependent term is employed to integrate the two pipeline steps of affinity learning and spectral clustering into a unified optimization framework. The proposed method can be efficiently solved based on the alternating direction method of multipliers and spectral clustering. Finally, a nonlinear extension is proposed to handle data drawn from a mixture of nonlinear manifolds. Extensive experimental results on five real-world image datasets confirm the effectiveness of the proposed methods.
|
A low tensor-rank representation approach for clustering of imaging data
T. Wu and W.U. Bajwa
IEEE Signal Processing Letters, vol. 25, no. 8, pp. 1196-1200, Aug. 2018.
[abs] [pdf] [code]
This letter proposes an algorithm for clustering of two-dimensional data. Instead of "flattening" data into vectors, the proposed algorithm keeps samples as matrices and stores them as lateral slices in a third-order tensor. It is then assumed that the samples lie near a union of free submodules and their representations under this model are obtained by imposing a low tensor-rank constraint and a structural constraint on the representation tensor. Clustering is carried out using an affinity matrix calculated from the representation tensor. Effectiveness of the proposed algorithm and its superiority over existing methods are demonstrated through experiments on two image datasets.
|
Learning the nonlinear geometry of high-dimensional data: Models and algorithms
T. Wu and W.U. Bajwa
IEEE Transactions on Signal Processing, vol. 63, no. 23, pp. 6229-6244, Dec. 2015.
[abs] [pdf] [published version] [code]
Modern information processing relies on the axiom that high-dimensional data lie near low-dimensional geometric structures. This paper revisits the problem of data-driven learning of these geometric structures and puts forth two new nonlinear geometric models for data describing “related” objects/phenomena. The first one of these models straddles the two extremes of the subspace model and the union-of-subspaces model, and is termed the metric-constrained union-of-subspaces (MC-UoS) model. The second one of these models—suited for data drawn from a mixture of nonlinear manifolds—generalizes the kernel subspace model, and is termed the metric-constrained kernel union-of-subspaces (MC-KUoS) model. The main contributions of this paper in this regard include the following. First, it motivates and formalizes the problems of MC-UoS and MC-KUoS learning. Second, it presents algorithms that efficiently learn an MC-UoS or an MC-KUoS underlying data of interest. Third, it extends these algorithms to the case when parts of the data are missing. Last, but not least, it reports the outcomes of a series of numerical experiments involving both synthetic and real data that demonstrate the superiority of the proposed geometric models and learning algorithms over existing approaches in the literature. These experiments also help clarify the connections between this work and the literature on (subspace and kernel k-means) clustering.
|
Conference Papers
Online tensor low-rank representation for streaming data
T. Wu
In Proc. IEEE 30th Int. Workshop on Machine Learning for Signal Processing, Espoo, Finland, 2020. (MLSP 2020)
[abs] [pdf]
This paper proposes a new streaming algorithm to learn low-rank structures of tensor data using the recently proposed tensor-tensor product (t-product) and tensor singular value decomposition (t-SVD) algebraic framework. It reformulates the tensor low-rank representation (TLRR) problem using the equivalent bifactor model of the tensor nuclear norm, where the tensor dictionary is updated based on the newly collected data and representations. Compared to TLRR, the proposed method processes tensor data in an online fashion and makes the memory cost independent of the data size. Experimental results on three benchmark datasets demonstrate the superior performance, efficiency and robustness of the proposed algorithm over state-of-the-art methods.
|
Clustering-aware structure-constrained low-rank submodule clustering
T. Wu
In Proc. 53rd Asilomar Conf. on Signals, Systems, and Computers, Pacific Grove, CA, Nov. 3-6, 2019, pp. 1852-1856. (Asilomar 2019)
[abs] [pdf]
In this paper, a new clustering algorithm for two-dimensional data is proposed. Unlike most conventional clustering methods which are derived for dealing with matrices, the proposed algorithm performs clustering in a third-order tensor space. The images are then assumed to be drawn from a mixture of low-dimensional free submodules. The proposed method integrates spectral clustering into the optimization problem, thereby overcomes the shortcomings of existing techniques by its ability to perform optimal clustering of the submodules. An efficient algorithm via a combination of an alternating direction method of multipliers and spectral clustering is developed to find the representation tensor and the segmentation simultaneously. The effectiveness of the proposed method is demonstrated through experiments on three image datasets.
|
Human action attribute learning using low-rank representations
T. Wu, P. Gurram, R.M. Rao and W.U. Bajwa
In Proc. Signal Processing with Adaptive Sparse Structured Representations workshop, Lisbon, Portugal, Jun. 5-8, 2017. (SPARS 2017)
[abs] [pdf]
This paper studies the problem of learning human action attributes based on union-of-subspaces model. It puts forth an extension of the low-rank representation (LRR) model, termed the hierarchical clustering-aware structure-constrained low-rank representation (HCS-LRR) model, for unsupervised learning of human action attributes from video data. The effectiveness of the proposed model is demonstrated through experiments on five human action datasets for action recognition.
|
Distributed learning of human mobility patterns from cellular network data
T. Wu, R.M. Rustamov and C. Goodall
In Proc. 51st Annu. Conf. on Information Sciences and Systems, Baltimore, MD, Mar. 22-24, 2017. (CISS 2017)
[abs] [pdf] [code]
The advent of ubiquitous mobile devices has provided us with an abundant spatio-temporal data source that helps us understand human mobility. The big data generated from mobile devices can be distributed at different locations and it is always infeasible to aggregate the data from multiple data collection centers into one location due to communication and privacy considerations. This paper studies human mobility patterns by learning data-adaptive representations for cellular network data that are distributed across a set of interconnected nodes. It proposes a distributed algorithm, termed cloud NN-K-SVD, for collaboratively learning a sparsifying dictionary (i.e., overcomplete basis) from the data without exchanging data samples between different nodes. The effectiveness of cloud NN-K-SVD is demonstrated through experiments on anonymized Call Detail Records from Columbus, OH.
|
Clustering-aware structure-constrained low-rank representation model for learning human action attributes
T. Wu, P. Gurram, R.M. Rao and W.U. Bajwa
In Proc. IEEE Image, Video, and Multidimensional Signal Processing Workshop, Bordeaux, France, Jul. 11-12, 2016. (IVMSP 2016) (Best Student Paper Award)
[abs] [pdf]
This paper addresses the problem of learning meaningful human action attributes from high-dimensional video sequences based on union-of-subspaces (UoS) model. The model hypothesizes that each action attribute is represented by a subspace. It puts forth an extension of existing low-rank representation (LRR), termed the clustering-aware structure-constrained low-rank representation (CS-LRR) model, for unsupervised learning of human action attributes. The proposed CS-LRR model overcomes the shortcomings of existing techniques by its ability to handle disjoint subspaces, and by performing optimal spectral clustering of the subspaces. An efficient linear alternating direction method (LADM) is developed to solve the CS-LRR optimization problem. A human action or activity is represented as a sequence of transitions from one action attribute to another and can be uniquely represented by a subspace transition vector. These subspace transition vectors are used for human action recognition. The effectiveness of the proposed model is demonstrated through experiments on two real-world datasets for action recognition.
|
Hierarchical union-of-subspaces model for human activity summarization
T. Wu, P. Gurram, R.M. Rao and W.U. Bajwa
In Proc. IEEE Int. Conf. on Computer Vision Workshop, Santiago, Chile, 2015, pp. 1053-1061. (ICCVW 2015)
[abs] [pdf]
A hierarchical union-of-subspaces model is proposed for performing semi-supervised human activity summarization in large streams of video data. The union of low-dimensional subspaces model is used to learn meaningful action attributes from a collection of high-dimensional video sequences of human activities. An approach called hierarchical sparse subspace clustering (HSSC) is developed to learn this model from the data in an unsupervised manner by capturing the variations or movements of each action in different subspaces, which allow the human actions to be represented as sequences of transitions from one subspace to another. These transition sequences can be used for human action recognition. The action attributes can also be represented at multiple resolutions using the subspaces at different levels of the hierarchical structure. By visualizing and labeling these action attributes, the hierarchical model can be used to semantically summarize long video sequences of human actions at different scales. The effectiveness of the proposed model is demonstrated through experiments on three real-world human action datasets for action recognition and semantic summarization of the actions using different resolutions of the action attributes.
|
Active dictionary learning for image representation
T. Wu, A.D. Sarwate and W.U. Bajwa
In Proc. SPIE Unmanned Systems Technology XVII, vol. 9468, Baltimore, MD, Apr. 21-23, 2015. (SPIE 2015)
[abs] [pdf]
Sparse representations of images in overcomplete bases (i.e., redundant dictionaries) have many applications in computer vision and image processing. Recent works have demonstrated improvements in image representations by learning a dictionary from training data instead of using a predefined one. But learning a sparsifying dictionary can be computationally expensive in the case of a massive training set. This paper proposes a new approach, termed active screening, to overcome this challenge. Active screening sequentially selects subsets of training samples using a simple heuristic and adds the selected samples to a "learning pool," which is then used to learn a newer dictionary for improved representation performance. The performance of the proposed active dictionary learning approach is evaluated through numerical experiments on real-world image data; the results of these experiments demonstrate the effectiveness of the proposed method.
|
Metric-constrained kernel union of subspaces
T. Wu and W.U. Bajwa
In Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Brisbane, Australia, Apr. 19-24, 2015, pp. 5778-5782. (ICASSP 2015)
[abs] [pdf]
This paper addresses the problem of learning a collection of nonlinear manifolds. Inspired by kernel methods, it puts forth a generalization of the kernel subspace model, termed the Metric-Constrained Kernel Union-of-Subspaces (MC-KUoS) model. It then develops an iterative method for learning of an MC-KUoS whose solution is based on the data representation capability of the manifolds and distances between subspaces in the kernel (feature) space. The proposed method (when using Gaussian and polynomial kernels) outperforms existing competitive state-of-the-art methods for real-world image denoising, which shows the benefits of the MC-KUoS model and the proposed denoising approach.
|
Subspace detection in a kernel space: The missing data case
T. Wu and W.U. Bajwa
In Proc. IEEE Workshop on Statistical Signal Processing, Gold Coast, Australia, Jun. 29-Jul. 2, 2014, pp. 93-96. (SSP 2014)
[abs] [pdf]
This paper studies the problem of matched subspace detection in high-dimensional feature space where the signal in the input space is partially observed. We present a test statistic for our detection problem using kernel functions and provide kernel function value estimators with missing data for different kernels. The test statistic can be calculated approximately with estimated kernel function values. We also give theoretical results regarding the kernel function value and test statistic estimation. Numerical experiments involving both Gaussian and polynomial kernels show the efficacy of the proposed kernel function value estimator and resulting subspace detector.
|
Revisiting robustness of the union-of-subspaces model for data-adaptive learning of nonlinear signal models
T. Wu and W.U. Bajwa
In Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Florence, Italy, May 4-9, 2014, pp. 3390-3394. (ICASSP 2014)
[abs] [pdf]
This paper revisits the problem of data-adaptive learning of geometric signal structures based on the Union-of-Subspaces (UoS) model. In contrast to prior work, it motivates and investigates an extension of the classical UoS model, termed the Metric-Constrained Union-of-Subspaces (MC-UoS) model. In this regard, it puts forth two iterative methods for data-adaptive learning of an MC-UoS in the presence of complete and missing data. The proposed methods outperform existing approaches to learning a UoS in numerical experiments involving both synthetic and real data, which demonstrates effectiveness of both an MC-UoS model and the proposed methods.
|
Painting analysis using wavelets and probabilistic topic models
T. Wu, G. Polatkan, D. Steel, W. Brown, I. Daubechies and R. Calderbank
In Proc. IEEE Int. Conf. on Image Processing, Melbourne, Australia, Sep. 15-18, 2013, pp. 3264-3268. (ICIP 2013)
[abs] [pdf]
In this paper, computer-based techniques for stylistic analysis of paintings are applied to the five panels of the 14th century Peruzzi Altarpiece by Giotto di Bondone. Features are extracted by combining a dual-tree complex wavelet transform with a hidden Markov tree (HMT) model. Hierarchical clustering is used to identify stylistic keywords in image patches, and keyword frequencies are calculated for sub-images that each contains many patches. A generative hierarchical Bayesian model learns stylistic patterns of keywords; these patterns are then used to characterize the styles of the sub-images; this in turn, permits to discriminate between paintings. Results suggest that such unsupervised probabilistic topic models can be useful to distill characteristic elements of style.
|