Abstract
Most of real-world graphs are dynamic, i.e., they change over time by a sequence of update operations. While the regression problem has been studied for static graphs and temporal graphs, it is not investigated for general dynamic graphs. In this paper, we study the theory of regression over dynamic graphs. First, we present the notion of update-efficient matrix embedding, that defines conditions sufficient for a matrix embedding to be effectively used for dynamic graph regression (under \(l_2\) norm). Then, we show that given a \(n \times m\) update-efficient matrix embedding (e.g., the adjacency matrix) and after an update operation in the graph, the exact optimal solution of linear regression can be updated in O(nm) time for the revised graph. Moreover, we show that this also holds when the matrix embedding is the Laplacian matrix and the update operations are restricted to edge insertion/deletion. In the end, by conducting experiments over synthetic and real-world graphs, we show the high efficiency of updating the solution of graph regression.



Similar content being viewed by others
Notes
Note that the graph and the information used to find the solution of dynamic graph regression, i.e., the pseudoinverse of matrix \({\textbf{M}}\) and vector \({\textbf{b}},\) change over time. However, we only care about their values before and after an update operation, as we want to find their values after the update operation, based on their values before the update operation. To keep notations as simple as possible, we do not parameterize them by time, rather, we simply use the terms before and after the update operation to distinguish these two situations.
Note that when inserting a new node to a bounded-degree graph, at most a constant (bounded) number of edges are drawn between the new node and existing nodes.
References
Borgwardt KM, Kriegel H, Wackersreuther P (2006) Pattern mining in frequent dynamic subgraphs. In: Proceedings of the 6th IEEE international conference on data mining (ICDM 2006), 18–22 December, Hong Kong, China. IEEE Computer Society, pp 818–822. https://doi.org/10.1109/ICDM.2006.124
Calissano A, Feragen A, Vantini S (2021) Graph-valued regression: prediction of unlabelled networks in a non-Euclidean graph-space. MOX-Report No. 02/2021, Dipartimento di Matematica, Politecnico di Milano, Via Bonardi 9, 20133 Milano, Italy
Calissano A, Feragen A, Vantini S (2022) Graph-valued regression: prediction of unlabelled networks in a non-Euclidean graph space. J Multivar Anal 190:104950. https://doi.org/10.1016/j.jmva.2022.104950. https://www.sciencedirect.com/science/article/pii/S0047259X22000021
Chehreghani M (2022) Half a decade of graph convolutional networks. Nat Mach Intell 4:1–2. https://doi.org/10.1038/s42256-022-00466-8
Chehreghani MH, Bruynooghe M (2016) Mining rooted ordered trees under subtree homeomorphism. Data Min Knowl Discov 30(5):1249–1272. https://doi.org/10.1007/s10618-015-0439-5
Chehreghani MH, Rahgozar M, Lucas C, Chehreghani MH (2007) A heuristic algorithm for clustering rooted ordered trees. Intell Data Anal 11(4):355–376. http://content.iospress.com/articles/intelligent-data-analysis/ida00292
Chehreghani MH, Chehreghani MH, Lucas C, Rahgozar M, Ghadimi E (2009) Efficient rule based structural algorithms for classification of tree structured data. Intell Data Anal 13(1):165–188. https://doi.org/10.3233/IDA-2009-0361
Chehreghani MH, Chehreghani MH, Lucas C, Rahgozar M (2011) Oinduced: an efficient algorithm for mining induced patterns from rooted ordered trees. IEEE Trans Syst Man Cybern Part A 41(5):1013–1025. https://doi.org/10.1109/TSMCA.2010.2096808
Chehreghani MH, Abdessalem T, Bifet A, Bouzbila M (2020) Sampling informative patterns from large single networks. Future Gener Comput Syst 106:653–658. https://doi.org/10.1016/j.future.2020.01.042
Chowdhury S, Memoli F (2019) The Gromov–Wasserstein distance between networks and stable network invariants. Inf Inference J IMA 8(4):757–787. https://doi.org/10.1093/imaiai/iaz026
Erdos P, Renyi A (1960) On the evolution of random graphs. Publ Math Inst Hung Acad Sci 5:17–61
Gall FL (2014) Powers of tensors and fast matrix multiplication. In: Nabeshima K, Nagasaka K, Winkler F, Szántó Á (eds) International symposium on symbolic and algebraic computation, ISSAC ’14, Kobe, Japan, July 23–25. ACM, pp 296–303. https://doi.org/10.1145/2608628.2608664
Goyal P, Kamra N, He X, Liu Y (2018) Dyngem: deep embedding method for dynamic graphs. CoRR. arXiv:1805.11273
Goyal P, Chhetri SR, Canedo A (2020) dyngraph2vec: capturing network dynamics using dynamic graph representation learning. Knowl Based Syst. https://doi.org/10.1016/j.knosys.2019.06.024
Greville TNE (1960) Some applications of the pseudoinverse of a matrix. SIAM Rev 2:15–22. https://doi.org/10.1137/1002004
Grover A, Leskovec J (2016) node2vec: scalable feature learning for networks. In: Krishnapuram B, Shah M, Smola AJ, Aggarwal CC, Shen D, Rastogi R (eds) Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, CA, USA, August 13–17, 2016. ACM, pp 855–864. https://doi.org/10.1145/2939672.2939754
Haghir Chehreghani M (2021) Dynamical algorithms for data mining and machine learning over dynamic graphs. WIREs Data Min Knowl Discov. https://doi.org/10.1002/widm.1393
Han C, Zhang S, Ghalwash MF, Vucetic S, Obradovic Z (2016) Joint learning of representation and structure for sparse regression on graphs. In: Venkatasubramanian SC, Meira W Jr (eds) Proceedings of the 2016 SIAM international conference on data mining, Miami, Florida, USA, May 5–7, 2016. SIAM, pp 846–854. https://doi.org/10.1137/1.9781611974348.95
Han C, Cao X.H, Stanojevic M, Ghalwash M.F, Obradovic Z (2019) Temporal graph regression via structure-aware intrinsic representation learning. In: Berger-Wolf TY, Chawla NV (eds) Proceedings of the 2019 SIAM international conference on data mining, SDM 2019, Calgary, Alberta, Canada, May 2–4, 2019. SIAM, pp 360–368. https://doi.org/10.1137/1.9781611975673.41
Hayashi T, Akiba T, Yoshida Y(2015) Fully dynamic betweenness centrality maintenance on massive networks. In: Proceedings of the VLDB endowment (PVLDB), vol 9(2), pp 48–59. https://doi.org/10.14778/2850578.2850580. http://www.vldb.org/pvldb/vol9/p48-hayashi.pdf
Herbster M, Pontil M (2006) Prediction on a graph with a perceptron. In: Schölkopf B, Platt JC, Hofmann T (eds) Advances in neural information processing systems 19, Proceedings of the twentieth annual conference on neural information processing systems, Vancouver, British Columbia, Canada, December 4–7, 2006. MIT Press, Cambridge, pp 577–584. https://proceedings.neurips.cc/paper/2006/hash/cc3d69ed781b16bce06687822ae56e6d-Abstract.html
Herbster M, Pasteris S, Pontil M (2015) Predicting a switching sequence of graph labelings. J Mach Learn Res 16:2003–2022. http://dl.acm.org/citation.cfm?id=2886813
Kleinberg JM, Tardos É (2002) Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields. J ACM 49(5):616–639. https://doi.org/10.1145/585265.585268
Kovac A, Smith AD (2011) Nonparametric regression on a graph. J Comput Graph Stat 20(2):432–447. https://doi.org/10.1198/jcgs.2011.09203
Kumar S, Spezzano F, Subrahmanian VS, Faloutsos C (2016) Edge weight prediction in weighted signed networks. In: IEEE 16th international conference on data mining, ICDM 2016, December 12-15, 2016, Barcelona, Spain, pp 221–230. https://doi.org/10.1109/ICDM.2016.0033
Lee JB, Rossi RA, Kong X (2018) Graph classification using structural attention. In: Guo Y, Farooq F (eds) Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining, KDD 2018, London, UK, August 19–23, 2018. ACM, pp 1666–1674 (2018). https://doi.org/10.1145/3219819.3219980
Leskovec J, Huttenlocher DP, Kleinberg JM (2010) Predicting positive and negative links in online social networks. In: Rappa M, Jones P, Freire J, Chakrabarti S (eds) Proceedings of the 19th international conference on world wide web, WWW 2010, Raleigh, North Carolina, USA, April 26–30, 2010. ACM, pp 641–650. https://doi.org/10.1145/1772690.1772756
Li J, Hu X, Jian L, Liu H (2016) Toward time-evolving feature selection on dynamic networks. In: IEEE 16th international conference on data mining, ICDM 2016, December 12–15, 2016, Barcelona, Spain, pp 1003–1008
Makkar D, Bader DA, Green O (2017) Exact and parallel triangle counting in dynamic graphs. In: 24th IEEE international conference on high performance computing, HiPC 2017, Jaipur, India, December 18–21, 2017. IEEE Computer Society, pp 2–12. https://doi.org/10.1109/HiPC.2017.00011
Meyer CD Jr (1973) Generalized inversion of modified matrices. SIAM J Appl Math 24(3):315–323
Niepert M, Ahmed M, Kutzkov K (2016) Learning convolutional neural networks for graphs. In: Proceedings of the 33nd international conference on machine learning, ICML 2016, New York City, NY, USA, June 19-24, 2016. JMLR workshop and conference proceedings, vol 48, pp 2014–2023. http://jmlr.org/proceedings/papers/v48/niepert16.html
Parsons T, Pisanski T (1989) Vector representations of graphs. Discret Math 78(1):143–154. Special Double Issue in Memory of Tory Parsons
Qin T, Liu T, Zhang X, Wang D, Li H (2008) Global ranking using continuous conditional random fields. In: Koller D, Schuurmans D, Bengio Y, Bottou L (eds) Advances in neural information processing systems 21, Proceedings of the twenty-second annual conference on neural information processing systems, Vancouver, British Columbia, Canada, December 8–11, 2008. Curran Associates, Inc., Red Hook, pp 1281–1288. https://proceedings.neurips.cc/paper/2008/hash/ebd9629fc3ae5e9f6611e2ee05a31cef-Abstract.html
Rao CR, Mitra SK (1971) Generalized inverse of matrices and its applications. Wiley, New York
Rozemberczki B, Sarkar R (2020) Characteristic functions on graphs: birds of a feather, from statistical descriptors to parametric models. In: Proceedings of the 29th ACM international conference on information and knowledge management (CIKM ’20). ACM, New York, pp 1325–1334
Saigo H, Nowozin S, Kadowaki T, Kudo T, Tsuda K (2009) gBoost: a mathematical programming approach to graph classification and regression. Mach Learn 75(1):69–89. https://doi.org/10.1007/s10994-008-5089-z
Sohn K, Kim S (2012) Joint estimation of structured sparsity and output structure in multiple-output regression via inverse-covariance regularization. In: Lawrence ND, Girolami MA (eds) Proceedings of the fifteenth international conference on artificial intelligence and statistics, AISTATS 2012, La Palma, Canary Islands, Spain, April 21–23, 2012, JMLR proceedings, vol 22. JMLR.org, pp 1081–1089. http://proceedings.mlr.press/v22/sohn12.html
Woodruff DP (2014) Sketching as a tool for numerical linear algebra. Found Trends Theor Comput Sci 10(1–2):1–157. https://doi.org/10.1561/0400000060
Wytock M, Kolter JZ (2013) Sparse Gaussian conditional random fields: algorithms, theory, and application to energy forecasting. In: Proceedings of the 30th international conference on machine learning, ICML 2013, Atlanta, GA, USA, 16–21 June 2013. JMLR workshop and conference proceedings, vol 28. JMLR.org, pp 1265–1273. http://proceedings.mlr.press/v28/wytock13.html
Yang Z, Cohen WW, Salakhutdinov R (2016) Revisiting semi-supervised learning with graph embeddings. In: Proceedings of the 33nd international conference on machine learning, ICML 2016, New York City, NY, USA, June 19-24, 2016. JMLR workshop and conference proceedings, vol 48, pp 40–48. http://jmlr.org/proceedings/papers/v48/yanga16.html
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Leonardo de Lima.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Haghir Chehreghani, M. On the theory of dynamic graph regression problem. Comp. Appl. Math. 41, 362 (2022). https://doi.org/10.1007/s40314-022-02071-2
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s40314-022-02071-2