SLIDE 1 Cross-Domain Recommendation via Clustering on Multi-Layer Graphs
Aug 8th, 2017
Al Aleksandr Fa Farseev, Ivan Samborskii, Andrey Filchenkov, Tat-Seng Chua
By AleksandrFarseev http://farseev.com
SLIDE 2
Collaborative Venue Category Recommendation β recommendation of venue categories (i.e. restaurant, cinema) to user using information about his/her profile (i.e. past visits) and/or information about users from the same domain. Venue categories: Clothing Store Hotel Ice Cream Shop Total 764 different categories
Venue Category Recommendation
Venue categories:
SLIDE 3
Idea 1: Utilization of Individual And Group Knowledge for Better Recommendation
SLIDE 4 We perform venue category recommendation based on both individual and group knowledge => naturally models the impact of society on an individual's behavior during the selection of a new place to go:
π ππ π£ = π‘ππ π’ πΏ * π€ππ, + π β π€ππ0
0β23
π·,
User Community-Based Collaborative Recommendation
+
SLIDE 5
+ Users from the same community (extracted from multi-source data) may have similar location preferences + Search within user community significantly reduces search space during the recommendation process
What do we need user communities for?
SLIDE 6
Example of User Communities (1) Community 1: Gingers Community K: Darker Hair
SLIDE 7
One way to find user communities is to model users' relationships in the form of a graph so that dense subgraphs are considered to be user communities.
User Relation and Community Representations
SLIDE 8 One of the commonly formulations is MinCut problem. For a given number k of subsets, the MinCut involves choosing a partition π·;,β¦, π·> such that it minimizes the expression: ππ£π’ π·;,β¦ ,π·> = ? π(π·B,π·Μ
B)
> BE;
Community Detection based on a single data source
*W is the sum of weights of edges attached to vertices in π·B
SLIDE 9 Approximation of MinCut as st standard tr trace mi minimi mization problem: m: min
HβIJΓL tr πOππ ,s.t. πOπ = π½
which can be solved by Sp Spectral Clu lusterin ing:
1. Calculates Laplacian matrix π β πUΓU 2. Builds matrix of the first π eigenvectors π β πUΓ> correspond to the smallest eigenvalues of π 3. Clusters data in a new space π using i.e. π-means algorithm
How to solve MinCut problem?
SLIDE 10
Idea 2: Utilization of Multi-Source Data
SLIDE 11 ~6 ~6 registered social network accounts per person*
Ac Accounts
People actively use ~3 ~3 social platforms simultaneously*
Ac Active Usage
1 2 3 4 5 10 9 8 7 6
* GlobalWebIndex. 2016. GWI Social report. http://www.globalwebindex.net/blog/internet-users-have-average-of-5-social-media-accounts
Most of user actively use β3 social networks
SLIDE 12
Multi-source data describe user from multiple views
SLIDE 13
Cr Cross Domain - Ve Venue ca category reco commendation β recommendation of venue categories (i.e. restaurant, cinema) using information about his/her profile (i.e. past visits) and/or information about users from other sources (i.e. images, texts, location types). Venue categories: Clothing Store Hotel Ice Cream Shop
Cross-Domain Venue Category Recommendation
Multi-Source Data:
SLIDE 14 Community Detection must performed in a Cross-Source Mannerβ¦
- Data source integration
- Community detection
Problems:
SLIDE 15
Mu Multi-la layer graph β graph π», where π» = π»B , π»B= π,πΉB
How to represent multi-source data?
SLIDE 16 Extending definition of spectral clustering
min
HβIJΓL ? tr πOπBπ [ BE;
, s.t.πOπ = π½ min
HβIJΓLtr πOπ\,]π , where π\,] = ? πB [ BE;
Such approximation could suffer from poor poor ge gene neralization
bility.
SLIDE 17 Regularized Clustering on Multi-layer Graph -1
Use Gr Grassman Ma Manifolds to keep final latent representation βcloseβ to all layers of multi-layer graph*. Where projected distance between two spaces π
; and π b:
πdefg
b
π
;,π b = 1
2 π
;π ; O β π bπ b O k b,where π΅ k is the Frobenius norm
πdefg
b
π, πB BE;
[
= ππ β ?tr(ππO β πBπB
O) [ BE;
* X. Dong, P. Frossard, P. Vandergheynst, and N. Nefedov. Clustering on multi-layer graphs via subspace analysis on grassmann manifolds. IEEE Transactions on Signal Processing, 2014.
SLIDE 18 Regularized Clustering on Multi-layer Graph -2
Extends the objective function to introduce the subspace analysis regularization min
HββJΓL ? tr [ BE;
πOπBπ + π½ ππ β ? tr
[ BE;
ππOπBπB
O
,s.t. πOπ = π½ min
HββJΓLtr πOπ]ftπ
π]ft = ?(πB β π½πBπB
O) [ BE;
SLIDE 19
Idea 4: Making use of Inter-Layer (Inter-Source) Relations
SLIDE 20 Incorporating inter-layer relationship (1)
By using distance on Grassman Manifolds, we present the new objective function for the πth layer: min
H vwββJΓLtr π
vB
OπBπ
vB + πΎB ππ β ? π₯B,gtr
[ gE;,gzB
π vBπ vB
Oπ gπ g O
min
H vwββJΓLtr π
vB
Oπ
{Bπ vB π {B = πB β πΎB ? π₯B,gtr
[ gE;,gzB
π
gπ g O
SLIDE 21 But how can we determine w|,} when computing i-th layer ?
min
H vwββJΓLtr π
vB
Oπ
{Bπ vB π {B = πB β πΎB ? π₯B,gtr
[ gE;,gzB
π
gπ g O
In Inter-la layer rela latio ionship ip graph πΊ(πΎ,π) β weighted graph which represents the similarity between layers. β π,π β πΉ, π₯B,g= β 1 β πB,> β π
g,>
π π β 1
β >Eb
πΏ β 1 where πB,> is clustering co-occurrence matrix of layer π, πβ‘,Λ = 1, if users π and π assigned to the same cluster, and 0 otherwise.
SLIDE 22 Final objective function
Letβs combine equations from previous slides to define the final objective function: min
H ββJΓL ?tr [ BE;
πOπ {Bπ + π½ ππ β ? tr
[ BE;
ππOπ vBπ vB
O
= = min
H ββJΓLtr πO ?(π
{B β π½π vBπ vB
O) [ BE;
π
SLIDE 23
- Community detection
- Data source integration
Problems
SLIDE 24 Recall: Community-Based Cross-Domain Recommendation
We perform venue category recommendation based on both individual and group knowledge, where group knowledge is obtained from multiple sources:
π ππ π£ = π‘ππ π’ πΏ * π€ππ, + π β π€ππ0
0β23
π·,
+
SLIDE 25 Twitter Instagram
NUS-MSS Dataset
Dataset* is presented as a set of features, extracted from user-generated data in three social networks:
- text based fromTwitter (LDA, LIWC, text
features)
- image based from Instagram (concepts)
- location based from Foursquare (LDA,
categories, Mobility Features) Foursquare categories is splited into two parts: 3 months data (train) and 2 months (test).
* A. Farseev, N. Liqiang, M. Akbari, and T.-S. Chua. Ha Harvesting multiple so sources s for use ser profile learning: a Big data st
Conference on Multimedia Retrieval (ICMR). China. June 23-26, 2015.
Foursquare
SLIDE 26 Linguistic features: LIWC; Latent Topics Heuristic features: Writing behavior
Text Features:
Data Sources
Location Semantics: Venue Category Distribution Mobility Features: Areas of Interest (AOI)
Location Features:
Image Concept Distribution (Image Net)
Image Features
Image Concepts Google Net LIWC LDA Mobility Location Type Preferences Images
SLIDE 27 Re Recommender Systems
Po Popular (PO POP) P) βrecommendation based on userβs past experience Popular Al All (POP Al All) ) βrecommendation based on experience of all users Mu Multi-So Source Re-Ra Ranking (MSRR) RR) β linearly combines recommendation results from all data modalities Ne Nearest Ne Neighbor Collaborative Filtering (CF) β recommendation based on top k most similar Foursquare users Ea Early Fusion (EF EF) β fuses multi-source data into a single feature vector SV SVD++ β makes use of the βimplicit feedbackβ information FM FMβ brings together the advantages of different factorization- based models via regularization. πππ β π
- π£ β CβR recommendation without inter-layer
regularization πππ β π
- π£ - π
- ππ©π β CβR recommendation without inter-layer
regularization and sub-space regularization πππβπ«πππ β CβR recommendation without user community extraction πππ (DB DBScan) ) β CβR recommendation, where user communities are detected by Density-Based clustering (DBScan) πππ (x (x-me means) β CβR recommendation, where user communities are detected by x-means clustering πππ (H (Hierarchical) β CβR recommendation, where user communities are detected by Hierarchical Clustering πππ β Our Ap Approach
Evaluation Baselines
Co Community Detection Approaches
SLIDE 28
Evaluation against other recommender systems
SLIDE 29
Evaluation against other community detection approaches
+ Incorporation of group knowledge is is important + Multi-modal clustering performs better than single-source clustering + Incorporation of Inter-Source relationshipis crucial.
SLIDE 30 Evaluation against source combinations
+ In different geo regions, different data sources are
+ Location data is more powerful than other data modalities
SLIDE 31
Examples of detected user communities
SLIDE 32 Future Work
Community Detection is more useful when it is Source-Dependent => Introduce Supervision Into Clustering How?
- Graph Construction Level β reweight edges according to prior
knowledge about existing user communities
- Model Level β introduce community-related constraints into
clustering
SLIDE 33
Summary
+ Multi-View Data is crucial for User Community Detection + For the task of venue category recommendation, both Group And Individual Knowledge are Important + Venue Category Recommendation is not a conventional recommendation task: users visit many venue types from the past. (items from the train set often occur in test set)
SLIDE 34 Our released large multi-source multi-modal datasets
34
NUS-MSS NUS-SENSE http://nusmss.azurewebsites.net http://nussense.azurewebsites.net
The Released Datasets
http://tutorial.farseev.com
Our Tutorial on Multi-View Learning @ WST WSSSβ17
SLIDE 35
Thank You
Questions? By AleksandrFarseev http://farseev.com
SLIDE 36