Degree-biased random walk for large-scale network embedding

Yunyi Zhang; Zhan Shi; Dan Feng; Xiuxiu Zhan

doi:10.1016/j.future.2019.05.033

Degree-biased random walk for large-scale network embedding

Yunyi Zhang, Zhan Shi^*, Dan Feng, Xiuxiu Zhan

^*Corresponding author for this work

Multimedia Computing

Research output: Contribution to journal › Article › Scientific › peer-review

18 Citations (Scopus)

48 Downloads (Pure)

Abstract

Network embedding aims at learning node representation by preserving the network topology. Previous embedding methods do not scale for large real-world networks which usually contain millions of nodes. They generally adopt a one-size-fits-all strategy to collect information, resulting in a large amount of redundancy. In this paper, we propose DiaRW, a scalable network embedding method based on a degree-biased random walk with variable length to sample context information for learning. Our walk strategy can well adapt to the scale-free feature of real-world networks and extract information from them with much less redundancy. In addition, our method can greatly reduce the size of context information, which is efficient for large-scale network embedding. Empirical experiments on node classification and link prediction prove not only the effectiveness but also the efficiency of DiaRW on a variety of real-world networks. Our algorithm is able to learn the network representations with millions of nodes and edges in hours on a single machine, which is tenfold faster than previous methods.

Original language	English
Pages (from-to)	198-209
Number of pages	12
Journal	Future Generation Computer Systems
Volume	100
DOIs	https://doi.org/10.1016/j.future.2019.05.033
Publication status	Published - 2019

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Network embedding
Random walks
Scale-free

Access to Document

10.1016/j.future.2019.05.033

1_s2.0_S0167739X19300378_main-1Final published version, 844 KB

Cite this

@article{affc7ad51b0e45708ceb1e0fe6fe1602,

title = "Degree-biased random walk for large-scale network embedding",

abstract = "Network embedding aims at learning node representation by preserving the network topology. Previous embedding methods do not scale for large real-world networks which usually contain millions of nodes. They generally adopt a one-size-fits-all strategy to collect information, resulting in a large amount of redundancy. In this paper, we propose DiaRW, a scalable network embedding method based on a degree-biased random walk with variable length to sample context information for learning. Our walk strategy can well adapt to the scale-free feature of real-world networks and extract information from them with much less redundancy. In addition, our method can greatly reduce the size of context information, which is efficient for large-scale network embedding. Empirical experiments on node classification and link prediction prove not only the effectiveness but also the efficiency of DiaRW on a variety of real-world networks. Our algorithm is able to learn the network representations with millions of nodes and edges in hours on a single machine, which is tenfold faster than previous methods.",

keywords = "Network embedding, Random walks, Scale-free",

author = "Yunyi Zhang and Zhan Shi and Dan Feng and Xiuxiu Zhan",

note = "Green Open Access added to TU Delft Institutional Repository {\textquoteleft}You share, we take care!{\textquoteright} – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2019",

doi = "10.1016/j.future.2019.05.033",

language = "English",

volume = "100",

pages = "198--209",

journal = "Future Generation Computer Systems",

issn = "0167-739X",

publisher = "Elsevier",

}

TY - JOUR

T1 - Degree-biased random walk for large-scale network embedding

AU - Zhang, Yunyi

AU - Shi, Zhan

AU - Feng, Dan

AU - Zhan, Xiuxiu

N1 - Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2019

Y1 - 2019

N2 - Network embedding aims at learning node representation by preserving the network topology. Previous embedding methods do not scale for large real-world networks which usually contain millions of nodes. They generally adopt a one-size-fits-all strategy to collect information, resulting in a large amount of redundancy. In this paper, we propose DiaRW, a scalable network embedding method based on a degree-biased random walk with variable length to sample context information for learning. Our walk strategy can well adapt to the scale-free feature of real-world networks and extract information from them with much less redundancy. In addition, our method can greatly reduce the size of context information, which is efficient for large-scale network embedding. Empirical experiments on node classification and link prediction prove not only the effectiveness but also the efficiency of DiaRW on a variety of real-world networks. Our algorithm is able to learn the network representations with millions of nodes and edges in hours on a single machine, which is tenfold faster than previous methods.

AB - Network embedding aims at learning node representation by preserving the network topology. Previous embedding methods do not scale for large real-world networks which usually contain millions of nodes. They generally adopt a one-size-fits-all strategy to collect information, resulting in a large amount of redundancy. In this paper, we propose DiaRW, a scalable network embedding method based on a degree-biased random walk with variable length to sample context information for learning. Our walk strategy can well adapt to the scale-free feature of real-world networks and extract information from them with much less redundancy. In addition, our method can greatly reduce the size of context information, which is efficient for large-scale network embedding. Empirical experiments on node classification and link prediction prove not only the effectiveness but also the efficiency of DiaRW on a variety of real-world networks. Our algorithm is able to learn the network representations with millions of nodes and edges in hours on a single machine, which is tenfold faster than previous methods.

KW - Network embedding

KW - Random walks

KW - Scale-free

UR - http://www.scopus.com/inward/record.url?scp=85065833362&partnerID=8YFLogxK

U2 - 10.1016/j.future.2019.05.033

DO - 10.1016/j.future.2019.05.033

M3 - Article

AN - SCOPUS:85065833362

SN - 0167-739X

VL - 100

SP - 198

EP - 209

JO - Future Generation Computer Systems

JF - Future Generation Computer Systems

ER -

Degree-biased random walk for large-scale network embedding

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this