Theoretical Justification of Link Prediction Heuristics

a theoretical justification of link prediction l.w
1 / 23
Embed
Share

Explore the theoretical basis behind link prediction heuristics in network analysis, delving into strategies to predict connections between nodes based on common neighbors, shortest paths, and empirical studies. Discover how generative models aid in recommending connections in various scenarios.

  • Link Prediction
  • Network Analysis
  • Heuristics
  • Generative Models
  • Empirical Studies

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. A Theoretical Justification of Link Prediction Heuristics Deepayan Chakrabarti (deepay@cs.cmu.edu) Purnamrita Sarkar Andrew Moore 1

  2. Link Prediction Which pair of nodes {i,j} should be connected? Alice Bob Charlie Goal: Recommend a movie 2

  3. Link Prediction Which pair of nodes {i,j} should be connected? Goal: Suggest friends 3

  4. Link Prediction Heuristics Predict link between nodes Connected by the shortest path With the most common neighbors (length 2 paths) More weight to low-degree common nbrs (Adamic/Adar) Alice Prolific common friends 1000 followers Less evidence Bob Less prolific 3followers Much more evidence Charlie

  5. Link Prediction Heuristics Predict link between nodes Connected by the shortest path With the most common neighbors (length 2 paths) More weight to low-degree common nbrs (Adamic/Adar) With more short paths (e.g. length 3 paths ) exponentially decaying weights to longer paths (Katz measure)

  6. Previous Empirical Studies* Especially if the graph is sparse How do we justify these observations? Link prediction accuracy* Random Shortest Common Neighbors Adamic/Adar Ensemble of short paths Path *Liben-Nowell & Kleinberg, 2003; Brand, 2005; Sarkar & Moore, 2007 6

  7. Link Prediction Generative Model Unit volume universe Model: 1. Nodes are uniformly distributed points in a latent space 2. This space has a distance metric 3. Points close to each other are likely to be connected in the graph Logistic distance function (Raftery+/2002) 7

  8. Link Prediction Generative Model determines the steepness Higher probability of linking 1 radius r Model: 1. Nodes are uniformly distributed points in a latent space 2. This space has a distance metric 3. Points close to each other are likely to be connected in the graph Link prediction find nearest neighbor who is not currently linked to the node. Equivalent to inferring distances in the latent space 8

  9. Previous Empirical Studies* Especially if the graph is sparse Link prediction accuracy* Random Shortest Common Neighbors Adamic/Adar Ensemble of short paths Path *Liben-Nowell & Kleinberg, 2003; Brand, 2005; Sarkar & Moore, 2007 9

  10. Common Neighbors j i Pr2(i,j) = Pr(common neighbor|dij) = Pr (i, j) Pr( ~ d | ) Pr( ~ d | ) d ( P d , d | ) d d i k j k 2 ik jk ik jk ij ik jk Product of two logistic probabilities, integrated over a volume determined by dij As Logistic Step function Much easier to analyze! 10

  11. Common Neighbors Everyone has same radius r Unit volume universe j i =Number of common neighbors = Pr (i, j) A(r, r, d ) 2 ij + = P A(r, r, d ) 1 2 ij N N V(r)=volume of radius r in D dims # common nbrs gives a bound on distance / 1 / 2 D D + /N /N 2 1 d 2 1 r r ij V(r) V(r) 11

  12. Common Neighbors OPT = node closest to i MAX = node with max common neighbors with i Theorem: w.h.p dOPT dMAX dOPT + 2[ /V(1)]1/D Link prediction by common neighbors is asymptotically optimal 12

  13. Common Neighbors: Distinct Radii m i Node k has radius rk . k rk i k if dik rk (Directed graph) rk captures popularity of node k j Type 2: i k j Type 1: i k j k k j j i rj i rk rk ri A(ri , rj ,dij) A(rk , rk ,dij) 13

  14. Type 2 common neighbors Example graph: N1 nodes of radius r1 and N2 nodes of radius r2 r1 << r2 2 ~ Bin[N2 , A(r2, r2, dij)] 1 ~ Bin[N1 , A(r1, r1, dij)] k i j Pick d* to maximize Pr[ 1 , 2 | dij] w(r1)E[ 1|d*] + w(r2) E[ 2|d*] = w(r1) 1 + w(r2) 2 Weighted common neighbors Inversely related to d*

  15. Common Neighbors: Distinct Radii m i Node k has radius rk . k rk i k if dik rk (Directed graph) rk captures popularity of node k j Weighted common neighbors: Predict (i,j) pairs with highest w(r) (r) # common neighbors of radius r Weight for nodes of radius r 15

  16. Type 2 common neighbors i k rk j Adamic/Ad ar Presence of common neighbor is very informative const const w(r) Absence is very informative 1 r deg D 1/r r is close to max radius Real world graphs generally fall in this range 16

  17. Previous Empirical Studies* Especially if the graph is sparse Link prediction accuracy* Random Shortest Common Neighbors Adamic/Adar Ensemble of short paths Path *Liben-Nowell & Kleinberg, 2003; Brand, 2005; Sarkar & Moore, 2007 17

  18. l hop Paths Common neighbors = 2 hop paths Analysis of longer paths: two components 1. Bounding E( l | dij). [ l= # l hop paths] Bounds Prl (i,j) by using triangle inequality on a series of common neighbor probabilities. 2. l E( l | dij) Triangulation

  19. l hop Paths Common neighbors = 2 hop paths Analysis of longer paths: two components 1. Bounding E( l | dij). [ l= # l hop paths] Bounds Prl (i,j) by using triangle inequality on a series of common neighbor probabilities. 2. l E( l | dij) Bounded dependence of lon position of each node Can use McDiarmid s inequality to bound | l -E( l|dij)|

  20. -hop Paths Common neighbors = 2 hop paths For longer paths: - 1 ( ) + dij r ( 1)r g , N, Bounds are weaker For we need >> to obtain similar bounds justifies the exponentially decaying weight given to longer paths by the Katz measure 20

  21. Summary Three key ingredients Closer points are likelier to be linked. Small World Model- Watts, Strogatz, 1998, Kleinberg 2001 1. Triangle inequality holds necessary to extend to -hop paths 2. Points are spread uniformly at random Otherwise properties will depend on location as well as distance 3. 21

  22. Summary In sparse graphs, paths of length 3 or more help in prediction. Differentiating between different degrees is important Link prediction accuracy* For large dense graphs, common neighbors are enough The number of paths matters, not the length Random Shortest Common Neighbors Adamic/Adar Ensemble of short paths Path *Liben-Nowell & Kleinberg, 2003; Brand, 2005; Sarkar & Moore, 2007 22

  23. Sweep Estimators Number of common neighbors of a given radius r TR = Fraction of nodes with radius R which are common neighbors Qr = Fraction of nodes with radius r which are common neighbors Small TR large dij Large Qr small dij

More Related Content