TY - GEN
T1 - MuSha
T2 - 41st IEEE International Conference on Data Engineering, ICDE 2025
AU - Cao, Hongtai
AU - Wang, Qihao
AU - Li, Xiaodong
AU - Najafi, Mohammad Matin
AU - Chang, Kevin Chen Chuan
AU - Cheng, Reynold
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Subgraph matching (SM) is a fundamental problem in graph data analysis. Real-world patterns used in graph analysis are often symmetric and contain isomorphic substructures, but existing SM algorithms fail to explore such properties. To fill this gap, we propose MuSha, a multi-objective optimization framework for SM, leveraging multilevel sharing of isomorphic substructure results to speed up SM and symmetry breaking to avoid directly computing symmetric results. To efficiently compute and cache intermediate results for sharing, MuSha applies worst-case optimal joins (WCOJs) and utilizes trie data structures to compress and index results. To enable multilevel sharing, MuSha solves a multi-objective optimization problem involving pattern decomposition, symmetry breaking, WCOJ orders, and trie structural orders. Experimental results demonstrate that MuSha outperforms the state of the art by up to two orders of magnitude on graphs of millions of vertices.
AB - Subgraph matching (SM) is a fundamental problem in graph data analysis. Real-world patterns used in graph analysis are often symmetric and contain isomorphic substructures, but existing SM algorithms fail to explore such properties. To fill this gap, we propose MuSha, a multi-objective optimization framework for SM, leveraging multilevel sharing of isomorphic substructure results to speed up SM and symmetry breaking to avoid directly computing symmetric results. To efficiently compute and cache intermediate results for sharing, MuSha applies worst-case optimal joins (WCOJs) and utilizes trie data structures to compress and index results. To enable multilevel sharing, MuSha solves a multi-objective optimization problem involving pattern decomposition, symmetry breaking, WCOJ orders, and trie structural orders. Experimental results demonstrate that MuSha outperforms the state of the art by up to two orders of magnitude on graphs of millions of vertices.
KW - heterogeneous graphs
KW - sharing
KW - subgraph matching
KW - symmetry breaking
KW - trie
KW - WCOJ
UR - https://www.scopus.com/pages/publications/105015558692
UR - https://www.mendeley.com/catalogue/8816f557-32cd-30f0-8aa8-ccf8173e1325/
U2 - 10.1109/ICDE65448.2025.00192
DO - 10.1109/ICDE65448.2025.00192
M3 - Conference contribution
AN - SCOPUS:105015558692
T3 - Proceedings - International Conference on Data Engineering
SP - 2548
EP - 2561
BT - Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025
PB - IEEE Computer Society
Y2 - 19 May 2025 through 23 May 2025
ER -