Resampling-based gap analysis for detecting nodes with high centrality on large social network
conference contribution
posted on 2023-05-23, 11:11authored byOhara, K, Saito, K, Kimura, M, Motoda, H
We address a problem of identifying nodes having a high centrality value in a large social network based on its approximation derived only from nodes sampled from the network. More specifically, we detect gaps between nodes with a given confidence level, assuming that we can say a gap exists between two adjacent nodes ordered in descending order of approximations of true centrality values if it can divide the ordered list of nodes into two groups so that any node in one group has a higher centrality value than any one in another group with a given confidence level. To this end, we incorporate confidence intervals of true centrality values, and apply the resampling-based framework to estimate the intervals as accurately as possible. Furthermore, we devise an algorithm that can efficiently detect gaps by making only two passes through the nodes, and empirically show, using three real world social networks, that the proposed method can successfully detect more gaps, compared to the one adopting a standard error estimation framework, using the same node coverage ratio, and that the resulting gaps enable us to correctly identify a set of nodes having a high centrality value.
History
Publication title
Proceedings of the Advances in Knowledge Discovery and Data Mining 19th Pacific-Asia Conference (PAKDD 2015)
Volume
LNAI 9077
Editors
T Cao, E-P Lim, Z-H Zhou, T-B Ho, D Cheung, H Motoda
Pagination
135-147
ISBN
978-3-319-18037-3
Department/School
School of Engineering
Publisher
Springer International
Place of publication
Switzerland
Event title
Advances in Knowledge Discovery and Data Mining 19th Pacific-Asia Conference, PAKDD 2015
Event Venue
Ho Chi Minh City, Vietnam
Date of Event (Start Date)
2015-05-19
Date of Event (End Date)
2015-05-22
Rights statement
Copyright 2015 Springer International Publishing
Repository Status
Restricted
Socio-economic Objectives
Expanding knowledge in the information and computing sciences