On the theory of dynamic graph regression problem

Haghir Chehreghani, Mostafa

doi:10.1007/s40314-022-02071-2

On the theory of dynamic graph regression problem

Published: 23 October 2022

Volume 41, article number 362, (2022)
Cite this article

Computational and Applied Mathematics Aims and scope Submit manuscript

Mostafa Haghir Chehreghani ORCID: orcid.org/0000-0003-3436-0541¹

183 Accesses
Explore all metrics

Abstract

Most of real-world graphs are dynamic, i.e., they change over time by a sequence of update operations. While the regression problem has been studied for static graphs and temporal graphs, it is not investigated for general dynamic graphs. In this paper, we study the theory of regression over dynamic graphs. First, we present the notion of update-efficient matrix embedding, that defines conditions sufficient for a matrix embedding to be effectively used for dynamic graph regression (under $l_2$ norm). Then, we show that given a $n \times m$ update-efficient matrix embedding (e.g., the adjacency matrix) and after an update operation in the graph, the exact optimal solution of linear regression can be updated in O(nm) time for the revised graph. Moreover, we show that this also holds when the matrix embedding is the Laplacian matrix and the update operations are restricted to edge insertion/deletion. In the end, by conducting experiments over synthetic and real-world graphs, we show the high efficiency of updating the solution of graph regression.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Understanding the Benefits of Forgetting When Learning on Dynamic Graphs

Dynamic Heterogeneous Graph Representation

Algebraic and Dynamic Graph Algorithms

Notes

We note that this notion of embedding is different from the notion of embedding used in graph pattern mining (Chehreghani et al. 2011, 2020; Chehreghani and Bruynooghe 2016).
Note that the graph and the information used to find the solution of dynamic graph regression, i.e., the pseudoinverse of matrix ${\textbf{M}}$ and vector ${\textbf{b}},$ change over time. However, we only care about their values before and after an update operation, as we want to find their values after the update operation, based on their values before the update operation. To keep notations as simple as possible, we do not parameterize them by time, rather, we simply use the terms before and after the update operation to distinguish these two situations.
Note that when inserting a new node to a bounded-degree graph, at most a constant (bounded) number of edges are drawn between the new node and existing nodes.
https://snap.stanford.edu/data/wiki-Vote.html.
https://snap.stanford.edu/data/feather-lastfm-social.html.
https://snap.stanford.edu/data/soc-sign-bitcoin-otc.html.
https://charts.coinmetrics.io.

References

Borgwardt KM, Kriegel H, Wackersreuther P (2006) Pattern mining in frequent dynamic subgraphs. In: Proceedings of the 6th IEEE international conference on data mining (ICDM 2006), 18–22 December, Hong Kong, China. IEEE Computer Society, pp 818–822. https://doi.org/10.1109/ICDM.2006.124
Calissano A, Feragen A, Vantini S (2021) Graph-valued regression: prediction of unlabelled networks in a non-Euclidean graph-space. MOX-Report No. 02/2021, Dipartimento di Matematica, Politecnico di Milano, Via Bonardi 9, 20133 Milano, Italy
Calissano A, Feragen A, Vantini S (2022) Graph-valued regression: prediction of unlabelled networks in a non-Euclidean graph space. J Multivar Anal 190:104950. https://doi.org/10.1016/j.jmva.2022.104950. https://www.sciencedirect.com/science/article/pii/S0047259X22000021
Chehreghani M (2022) Half a decade of graph convolutional networks. Nat Mach Intell 4:1–2. https://doi.org/10.1038/s42256-022-00466-8
Article Google Scholar
Chehreghani MH, Bruynooghe M (2016) Mining rooted ordered trees under subtree homeomorphism. Data Min Knowl Discov 30(5):1249–1272. https://doi.org/10.1007/s10618-015-0439-5
Article MathSciNet MATH Google Scholar
Chehreghani MH, Rahgozar M, Lucas C, Chehreghani MH (2007) A heuristic algorithm for clustering rooted ordered trees. Intell Data Anal 11(4):355–376. http://content.iospress.com/articles/intelligent-data-analysis/ida00292
Chehreghani MH, Chehreghani MH, Lucas C, Rahgozar M, Ghadimi E (2009) Efficient rule based structural algorithms for classification of tree structured data. Intell Data Anal 13(1):165–188. https://doi.org/10.3233/IDA-2009-0361
Article Google Scholar
Chehreghani MH, Chehreghani MH, Lucas C, Rahgozar M (2011) Oinduced: an efficient algorithm for mining induced patterns from rooted ordered trees. IEEE Trans Syst Man Cybern Part A 41(5):1013–1025. https://doi.org/10.1109/TSMCA.2010.2096808
Article Google Scholar
Chehreghani MH, Abdessalem T, Bifet A, Bouzbila M (2020) Sampling informative patterns from large single networks. Future Gener Comput Syst 106:653–658. https://doi.org/10.1016/j.future.2020.01.042
Article Google Scholar
Chowdhury S, Memoli F (2019) The Gromov–Wasserstein distance between networks and stable network invariants. Inf Inference J IMA 8(4):757–787. https://doi.org/10.1093/imaiai/iaz026
Article MathSciNet MATH Google Scholar
Erdos P, Renyi A (1960) On the evolution of random graphs. Publ Math Inst Hung Acad Sci 5:17–61
MathSciNet MATH Google Scholar
Gall FL (2014) Powers of tensors and fast matrix multiplication. In: Nabeshima K, Nagasaka K, Winkler F, Szántó Á (eds) International symposium on symbolic and algebraic computation, ISSAC ’14, Kobe, Japan, July 23–25. ACM, pp 296–303. https://doi.org/10.1145/2608628.2608664
Goyal P, Kamra N, He X, Liu Y (2018) Dyngem: deep embedding method for dynamic graphs. CoRR. arXiv:1805.11273
Goyal P, Chhetri SR, Canedo A (2020) dyngraph2vec: capturing network dynamics using dynamic graph representation learning. Knowl Based Syst. https://doi.org/10.1016/j.knosys.2019.06.024
Article Google Scholar
Greville TNE (1960) Some applications of the pseudoinverse of a matrix. SIAM Rev 2:15–22. https://doi.org/10.1137/1002004
Article MathSciNet MATH Google Scholar
Grover A, Leskovec J (2016) node2vec: scalable feature learning for networks. In: Krishnapuram B, Shah M, Smola AJ, Aggarwal CC, Shen D, Rastogi R (eds) Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, CA, USA, August 13–17, 2016. ACM, pp 855–864. https://doi.org/10.1145/2939672.2939754
Haghir Chehreghani M (2021) Dynamical algorithms for data mining and machine learning over dynamic graphs. WIREs Data Min Knowl Discov. https://doi.org/10.1002/widm.1393
Article MATH Google Scholar
Han C, Zhang S, Ghalwash MF, Vucetic S, Obradovic Z (2016) Joint learning of representation and structure for sparse regression on graphs. In: Venkatasubramanian SC, Meira W Jr (eds) Proceedings of the 2016 SIAM international conference on data mining, Miami, Florida, USA, May 5–7, 2016. SIAM, pp 846–854. https://doi.org/10.1137/1.9781611974348.95
Han C, Cao X.H, Stanojevic M, Ghalwash M.F, Obradovic Z (2019) Temporal graph regression via structure-aware intrinsic representation learning. In: Berger-Wolf TY, Chawla NV (eds) Proceedings of the 2019 SIAM international conference on data mining, SDM 2019, Calgary, Alberta, Canada, May 2–4, 2019. SIAM, pp 360–368. https://doi.org/10.1137/1.9781611975673.41
Hayashi T, Akiba T, Yoshida Y(2015) Fully dynamic betweenness centrality maintenance on massive networks. In: Proceedings of the VLDB endowment (PVLDB), vol 9(2), pp 48–59. https://doi.org/10.14778/2850578.2850580. http://www.vldb.org/pvldb/vol9/p48-hayashi.pdf
Herbster M, Pontil M (2006) Prediction on a graph with a perceptron. In: Schölkopf B, Platt JC, Hofmann T (eds) Advances in neural information processing systems 19, Proceedings of the twentieth annual conference on neural information processing systems, Vancouver, British Columbia, Canada, December 4–7, 2006. MIT Press, Cambridge, pp 577–584. https://proceedings.neurips.cc/paper/2006/hash/cc3d69ed781b16bce06687822ae56e6d-Abstract.html
Herbster M, Pasteris S, Pontil M (2015) Predicting a switching sequence of graph labelings. J Mach Learn Res 16:2003–2022. http://dl.acm.org/citation.cfm?id=2886813
Kleinberg JM, Tardos É (2002) Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields. J ACM 49(5):616–639. https://doi.org/10.1145/585265.585268
Article MathSciNet MATH Google Scholar
Kovac A, Smith AD (2011) Nonparametric regression on a graph. J Comput Graph Stat 20(2):432–447. https://doi.org/10.1198/jcgs.2011.09203
Article MathSciNet Google Scholar
Kumar S, Spezzano F, Subrahmanian VS, Faloutsos C (2016) Edge weight prediction in weighted signed networks. In: IEEE 16th international conference on data mining, ICDM 2016, December 12-15, 2016, Barcelona, Spain, pp 221–230. https://doi.org/10.1109/ICDM.2016.0033
Lee JB, Rossi RA, Kong X (2018) Graph classification using structural attention. In: Guo Y, Farooq F (eds) Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining, KDD 2018, London, UK, August 19–23, 2018. ACM, pp 1666–1674 (2018). https://doi.org/10.1145/3219819.3219980
Leskovec J, Huttenlocher DP, Kleinberg JM (2010) Predicting positive and negative links in online social networks. In: Rappa M, Jones P, Freire J, Chakrabarti S (eds) Proceedings of the 19th international conference on world wide web, WWW 2010, Raleigh, North Carolina, USA, April 26–30, 2010. ACM, pp 641–650. https://doi.org/10.1145/1772690.1772756
Li J, Hu X, Jian L, Liu H (2016) Toward time-evolving feature selection on dynamic networks. In: IEEE 16th international conference on data mining, ICDM 2016, December 12–15, 2016, Barcelona, Spain, pp 1003–1008
Makkar D, Bader DA, Green O (2017) Exact and parallel triangle counting in dynamic graphs. In: 24th IEEE international conference on high performance computing, HiPC 2017, Jaipur, India, December 18–21, 2017. IEEE Computer Society, pp 2–12. https://doi.org/10.1109/HiPC.2017.00011
Meyer CD Jr (1973) Generalized inversion of modified matrices. SIAM J Appl Math 24(3):315–323
Article MathSciNet MATH Google Scholar
Niepert M, Ahmed M, Kutzkov K (2016) Learning convolutional neural networks for graphs. In: Proceedings of the 33nd international conference on machine learning, ICML 2016, New York City, NY, USA, June 19-24, 2016. JMLR workshop and conference proceedings, vol 48, pp 2014–2023. http://jmlr.org/proceedings/papers/v48/niepert16.html
Parsons T, Pisanski T (1989) Vector representations of graphs. Discret Math 78(1):143–154. Special Double Issue in Memory of Tory Parsons
Qin T, Liu T, Zhang X, Wang D, Li H (2008) Global ranking using continuous conditional random fields. In: Koller D, Schuurmans D, Bengio Y, Bottou L (eds) Advances in neural information processing systems 21, Proceedings of the twenty-second annual conference on neural information processing systems, Vancouver, British Columbia, Canada, December 8–11, 2008. Curran Associates, Inc., Red Hook, pp 1281–1288. https://proceedings.neurips.cc/paper/2008/hash/ebd9629fc3ae5e9f6611e2ee05a31cef-Abstract.html
Rao CR, Mitra SK (1971) Generalized inverse of matrices and its applications. Wiley, New York
MATH Google Scholar
Rozemberczki B, Sarkar R (2020) Characteristic functions on graphs: birds of a feather, from statistical descriptors to parametric models. In: Proceedings of the 29th ACM international conference on information and knowledge management (CIKM ’20). ACM, New York, pp 1325–1334
Saigo H, Nowozin S, Kadowaki T, Kudo T, Tsuda K (2009) gBoost: a mathematical programming approach to graph classification and regression. Mach Learn 75(1):69–89. https://doi.org/10.1007/s10994-008-5089-z
Article MATH Google Scholar
Sohn K, Kim S (2012) Joint estimation of structured sparsity and output structure in multiple-output regression via inverse-covariance regularization. In: Lawrence ND, Girolami MA (eds) Proceedings of the fifteenth international conference on artificial intelligence and statistics, AISTATS 2012, La Palma, Canary Islands, Spain, April 21–23, 2012, JMLR proceedings, vol 22. JMLR.org, pp 1081–1089. http://proceedings.mlr.press/v22/sohn12.html
Woodruff DP (2014) Sketching as a tool for numerical linear algebra. Found Trends Theor Comput Sci 10(1–2):1–157. https://doi.org/10.1561/0400000060
Article MathSciNet MATH Google Scholar
Wytock M, Kolter JZ (2013) Sparse Gaussian conditional random fields: algorithms, theory, and application to energy forecasting. In: Proceedings of the 30th international conference on machine learning, ICML 2013, Atlanta, GA, USA, 16–21 June 2013. JMLR workshop and conference proceedings, vol 28. JMLR.org, pp 1265–1273. http://proceedings.mlr.press/v28/wytock13.html
Yang Z, Cohen WW, Salakhutdinov R (2016) Revisiting semi-supervised learning with graph embeddings. In: Proceedings of the 33nd international conference on machine learning, ICML 2016, New York City, NY, USA, June 19-24, 2016. JMLR workshop and conference proceedings, vol 48, pp 40–48. http://jmlr.org/proceedings/papers/v48/yanga16.html

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Amirkabir University of Technology (Tehran Polytechnic), Tehran, Iran
Mostafa Haghir Chehreghani

Authors

Mostafa Haghir Chehreghani
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Mostafa Haghir Chehreghani.

Additional information

Communicated by Leonardo de Lima.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Haghir Chehreghani, M. On the theory of dynamic graph regression problem. Comp. Appl. Math. 41, 362 (2022). https://doi.org/10.1007/s40314-022-02071-2

Download citation

Received: 12 January 2022
Revised: 09 August 2022
Accepted: 06 October 2022
Published: 23 October 2022
DOI: https://doi.org/10.1007/s40314-022-02071-2

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On the theory of dynamic graph regression problem

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Understanding the Benefits of Forgetting When Learning on Dynamic Graphs

Dynamic Heterogeneous Graph Representation

Algebraic and Dynamic Graph Algorithms

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Subscribe and save

Buy Now