Matrix dimensionality reduction for mining typical user profiles

Jianjiang Lu,Baowen Xu,Gangshi Huang,Yafei ZHANG
DOI: https://doi.org/10.3969/j.issn.1003-7985.2003.03.006
2003-01-01
Abstract:Recently clustering techniques have been used to automatically discover typical user profiles. In general, it is a challenging problem to design effective similarity measure between the session vectors which are usually high-dimensional and sparse. Two approaches for mining typical user profiles, based on matrix dimensionality reduction, are presented. In these approaches, non-negative matrix factorization is applied to reduce dimensionality of the session-URL matrix, and the projecting vectors of the user-session vectors are clustered into typical user-session profiles using the spherical k-means algorithm. The results show that two algorithms are successful in mining many typical user profiles in the user sessions.
What problem does this paper attempt to address?