TY - GEN
T1 - Scalable evaluation of k-NN queries on large uncertain graphs
AU - Li, Xiaodong
AU - Cheng, Reynold
AU - Fang, Yixiang
AU - Hu, Jiafeng
AU - Maniu, Silviu
N1 - Publisher Copyright:
© 2018 Copyright held by the owner/author(s)
PY - 2018
Y1 - 2018
N2 - Large graphs are prevalent in social networks, traffic networks, and biology. These graphs are often inexact. For example, in a friendship network, an edge between two nodes u and v indicates that users u and v have a close relationship. This edge may only exist with a probability. To model such information, the uncertain graph model has been proposed, in which each edge e is augmented with a probability that indicates the chance e exists. Given a node q in an uncertain graph G, we study the k-NN query of q, which looks for k nodes in G whose distances from q are the shortest. The k-NN query can be used in friend-search, data mining, and pattern-recognition. Despite the importance of this query, it has not been well studied. In this paper, we develop a tree-based structure called the U-tree. Given a k-NN query, the U-tree produces a compact representation of G, based on which the query can be executed efficiently. Our results on real and synthetic datasets show that our algorithm can scale to large graphs, and is 75% faster than state-of-the-art solutions.
AB - Large graphs are prevalent in social networks, traffic networks, and biology. These graphs are often inexact. For example, in a friendship network, an edge between two nodes u and v indicates that users u and v have a close relationship. This edge may only exist with a probability. To model such information, the uncertain graph model has been proposed, in which each edge e is augmented with a probability that indicates the chance e exists. Given a node q in an uncertain graph G, we study the k-NN query of q, which looks for k nodes in G whose distances from q are the shortest. The k-NN query can be used in friend-search, data mining, and pattern-recognition. Despite the importance of this query, it has not been well studied. In this paper, we develop a tree-based structure called the U-tree. Given a k-NN query, the U-tree produces a compact representation of G, based on which the query can be executed efficiently. Our results on real and synthetic datasets show that our algorithm can scale to large graphs, and is 75% faster than state-of-the-art solutions.
UR - https://www.scopus.com/pages/publications/85069468114
U2 - 10.5441/002/edbt.2018.17
DO - 10.5441/002/edbt.2018.17
M3 - Conference contribution
AN - SCOPUS:85069468114
T3 - Advances in Database Technology - EDBT
SP - 181
EP - 192
BT - Advances in Database Technology - EDBT 2018
A2 - Bohlen, Michael
A2 - Pichler, Reinhard
A2 - May, Norman
A2 - Rahm, Erhard
A2 - Wu, Shan-Hung
A2 - Hose, Katja
PB - OpenProceedings.org
T2 - 21st International Conference on Extending Database Technology, EDBT 2018
Y2 - 26 March 2018 through 29 March 2018
ER -