ECE Seminars

Latent Variable Identification using Identifiable Matrix Factorization Methods

Date:  Thu, March 22, 2018
Time:  10:00am
Location:  Holmes Hall 389
Speaker:  Dr. Kejun Huang

Abstract:

Latent variable identification is a unifying problem formulation for unsupervised machine learning and big data analytics. Interesting applications include topic modeling, community detection, and hyperspectral unmixing, to name just a few. Identifiability arises as a fundamental issue, since it amounts to answering whether the latent structure can truly be learned without the help of labeled data. Among the many approaches that have identifiability guarantees, this talk focuses on nonnegative matrix factorization (NMF)-type methods. NMF is widely and successfully used in many applications, but the theoretical understanding of why it is able to identify latent variables used to be very limited. The take-home point of this talk is that a latent variable can be uniquely identified if it is sufficiently scattered, an assumption inspired by convex geometry, using either the plain NMF model or NMF with an additional "volume" regularization. This principle is demonstrated on hidden Markov model (HMM) identification: an HMM can be uniquely identified from its pairwise co-occurrences, an approach that is particularly suitable for applications where the number of possible observation outcomes is relatively large, as in topic modeling. We show that we can learn topics of higher quality if documents are modeled as observations of HMMs sharing the same emission (topic) probability, compared to the simple but widely used bag-of-words model.
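
To make the phrase "pairwise co-occurrences" concrete, here is a minimal sketch of how such a statistic might be estimated from documents. It is not taken from the talk: the function name `pairwise_cooccurrence`, the toy corpus, and the choice to count pairs of consecutive words are all assumptions made for illustration. The sketch covers only the input statistic; the NMF-type factorization that would recover the HMM parameters from it is not shown.

```python
from typing import List, Sequence


def pairwise_cooccurrence(
    sequences: Sequence[Sequence[int]], num_symbols: int
) -> List[List[float]]:
    """Estimate the joint distribution of two consecutive observations.

    Entry [i][j] is the empirical probability that symbol i is
    immediately followed by symbol j, pooled over all sequences.
    (Hypothetical helper; counting consecutive pairs is an assumption.)
    """
    counts = [[0.0] * num_symbols for _ in range(num_symbols)]
    total = 0
    for seq in sequences:
        # Slide a window of length two over each sequence.
        for first, second in zip(seq, seq[1:]):
            counts[first][second] += 1.0
            total += 1
    if total == 0:
        raise ValueError("need at least one pair of consecutive observations")
    # Normalize the counts so that all entries sum to one.
    return [[c / total for c in row] for row in counts]


# Toy corpus: each document is a list of word indices from a 3-word vocabulary.
documents = [[0, 1, 1, 2], [2, 0, 1]]
omega = pairwise_cooccurrence(documents, num_symbols=3)
```

The result is a square matrix whose size depends only on the vocabulary, not on the document lengths, which is one reason a second-order statistic like this can be convenient when the number of possible observation outcomes is large.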

Bio:

Kejun Huang received the Ph.D. degree in Electrical Engineering from the University of Minnesota, Minneapolis, MN, USA in 2016. He is currently a Postdoctoral Associate at the Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN, USA. His research interests include signal processing, machine learning, big data analytics, and optimization, with special focus on identifiability analysis and non-convex algorithm design for latent variable models.
