Skip to main content
Log in

On the theory of dynamic graph regression problem

  • Published:
Computational and Applied Mathematics Aims and scope Submit manuscript

Abstract

Most of real-world graphs are dynamic, i.e., they change over time by a sequence of update operations. While the regression problem has been studied for static graphs and temporal graphs, it is not investigated for general dynamic graphs. In this paper, we study the theory of regression over dynamic graphs. First, we present the notion of update-efficient matrix embedding, that defines conditions sufficient for a matrix embedding to be effectively used for dynamic graph regression (under \(l_2\) norm). Then, we show that given a \(n \times m\) update-efficient matrix embedding (e.g., the adjacency matrix) and after an update operation in the graph, the exact optimal solution of linear regression can be updated in O(nm) time for the revised graph. Moreover, we show that this also holds when the matrix embedding is the Laplacian matrix and the update operations are restricted to edge insertion/deletion. In the end, by conducting experiments over synthetic and real-world graphs, we show the high efficiency of updating the solution of graph regression.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. We note that this notion of embedding is different from the notion of embedding used in graph pattern mining (Chehreghani et al. 2011, 2020; Chehreghani and Bruynooghe 2016).

  2. Note that the graph and the information used to find the solution of dynamic graph regression, i.e., the pseudoinverse of matrix \({\textbf{M}}\) and vector \({\textbf{b}},\) change over time. However, we only care about their values before and after an update operation, as we want to find their values after the update operation, based on their values before the update operation. To keep notations as simple as possible, we do not parameterize them by time, rather, we simply use the terms before and after the update operation to distinguish these two situations.

  3. Note that when inserting a new node to a bounded-degree graph, at most a constant (bounded) number of edges are drawn between the new node and existing nodes.

  4. https://snap.stanford.edu/data/wiki-Vote.html.

  5. https://snap.stanford.edu/data/feather-lastfm-social.html.

  6. https://snap.stanford.edu/data/soc-sign-bitcoin-otc.html.

  7. https://charts.coinmetrics.io.

References

  • Borgwardt KM, Kriegel H, Wackersreuther P (2006) Pattern mining in frequent dynamic subgraphs. In: Proceedings of the 6th IEEE international conference on data mining (ICDM 2006), 18–22 December, Hong Kong, China. IEEE Computer Society, pp 818–822. https://doi.org/10.1109/ICDM.2006.124

  • Calissano A, Feragen A, Vantini S (2021) Graph-valued regression: prediction of unlabelled networks in a non-Euclidean graph-space. MOX-Report No. 02/2021, Dipartimento di Matematica, Politecnico di Milano, Via Bonardi 9, 20133 Milano, Italy

  • Calissano A, Feragen A, Vantini S (2022) Graph-valued regression: prediction of unlabelled networks in a non-Euclidean graph space. J Multivar Anal 190:104950. https://doi.org/10.1016/j.jmva.2022.104950. https://www.sciencedirect.com/science/article/pii/S0047259X22000021

  • Chehreghani M (2022) Half a decade of graph convolutional networks. Nat Mach Intell 4:1–2. https://doi.org/10.1038/s42256-022-00466-8

    Article  Google Scholar 

  • Chehreghani MH, Bruynooghe M (2016) Mining rooted ordered trees under subtree homeomorphism. Data Min Knowl Discov 30(5):1249–1272. https://doi.org/10.1007/s10618-015-0439-5

    Article  MathSciNet  MATH  Google Scholar 

  • Chehreghani MH, Rahgozar M, Lucas C, Chehreghani MH (2007) A heuristic algorithm for clustering rooted ordered trees. Intell Data Anal 11(4):355–376. http://content.iospress.com/articles/intelligent-data-analysis/ida00292

  • Chehreghani MH, Chehreghani MH, Lucas C, Rahgozar M, Ghadimi E (2009) Efficient rule based structural algorithms for classification of tree structured data. Intell Data Anal 13(1):165–188. https://doi.org/10.3233/IDA-2009-0361

    Article  Google Scholar 

  • Chehreghani MH, Chehreghani MH, Lucas C, Rahgozar M (2011) Oinduced: an efficient algorithm for mining induced patterns from rooted ordered trees. IEEE Trans Syst Man Cybern Part A 41(5):1013–1025. https://doi.org/10.1109/TSMCA.2010.2096808

    Article  Google Scholar 

  • Chehreghani MH, Abdessalem T, Bifet A, Bouzbila M (2020) Sampling informative patterns from large single networks. Future Gener Comput Syst 106:653–658. https://doi.org/10.1016/j.future.2020.01.042

    Article  Google Scholar 

  • Chowdhury S, Memoli F (2019) The Gromov–Wasserstein distance between networks and stable network invariants. Inf Inference J IMA 8(4):757–787. https://doi.org/10.1093/imaiai/iaz026

    Article  MathSciNet  MATH  Google Scholar 

  • Erdos P, Renyi A (1960) On the evolution of random graphs. Publ Math Inst Hung Acad Sci 5:17–61

    MathSciNet  MATH  Google Scholar 

  • Gall FL (2014) Powers of tensors and fast matrix multiplication. In: Nabeshima K, Nagasaka K, Winkler F, Szántó Á (eds) International symposium on symbolic and algebraic computation, ISSAC ’14, Kobe, Japan, July 23–25. ACM, pp 296–303. https://doi.org/10.1145/2608628.2608664

  • Goyal P, Kamra N, He X, Liu Y (2018) Dyngem: deep embedding method for dynamic graphs. CoRR. arXiv:1805.11273

  • Goyal P, Chhetri SR, Canedo A (2020) dyngraph2vec: capturing network dynamics using dynamic graph representation learning. Knowl Based Syst. https://doi.org/10.1016/j.knosys.2019.06.024

    Article  Google Scholar 

  • Greville TNE (1960) Some applications of the pseudoinverse of a matrix. SIAM Rev 2:15–22. https://doi.org/10.1137/1002004

    Article  MathSciNet  MATH  Google Scholar 

  • Grover A, Leskovec J (2016) node2vec: scalable feature learning for networks. In: Krishnapuram B, Shah M, Smola AJ, Aggarwal CC, Shen D, Rastogi R (eds) Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, CA, USA, August 13–17, 2016. ACM, pp 855–864. https://doi.org/10.1145/2939672.2939754

  • Haghir Chehreghani M (2021) Dynamical algorithms for data mining and machine learning over dynamic graphs. WIREs Data Min Knowl Discov. https://doi.org/10.1002/widm.1393

    Article  MATH  Google Scholar 

  • Han C, Zhang S, Ghalwash MF, Vucetic S, Obradovic Z (2016) Joint learning of representation and structure for sparse regression on graphs. In: Venkatasubramanian SC, Meira W Jr (eds) Proceedings of the 2016 SIAM international conference on data mining, Miami, Florida, USA, May 5–7, 2016. SIAM, pp 846–854. https://doi.org/10.1137/1.9781611974348.95

  • Han C, Cao X.H, Stanojevic M, Ghalwash M.F, Obradovic Z (2019) Temporal graph regression via structure-aware intrinsic representation learning. In: Berger-Wolf TY, Chawla NV (eds) Proceedings of the 2019 SIAM international conference on data mining, SDM 2019, Calgary, Alberta, Canada, May 2–4, 2019. SIAM, pp 360–368. https://doi.org/10.1137/1.9781611975673.41

  • Hayashi T, Akiba T, Yoshida Y(2015) Fully dynamic betweenness centrality maintenance on massive networks. In: Proceedings of the VLDB endowment (PVLDB), vol 9(2), pp 48–59. https://doi.org/10.14778/2850578.2850580. http://www.vldb.org/pvldb/vol9/p48-hayashi.pdf

  • Herbster M, Pontil M (2006) Prediction on a graph with a perceptron. In: Schölkopf B, Platt JC, Hofmann T (eds) Advances in neural information processing systems 19, Proceedings of the twentieth annual conference on neural information processing systems, Vancouver, British Columbia, Canada, December 4–7, 2006. MIT Press, Cambridge, pp 577–584. https://proceedings.neurips.cc/paper/2006/hash/cc3d69ed781b16bce06687822ae56e6d-Abstract.html

  • Herbster M, Pasteris S, Pontil M (2015) Predicting a switching sequence of graph labelings. J Mach Learn Res 16:2003–2022. http://dl.acm.org/citation.cfm?id=2886813

  • Kleinberg JM, Tardos É (2002) Approximation algorithms for classification problems with pairwise relationships: metric labeling and Markov random fields. J ACM 49(5):616–639. https://doi.org/10.1145/585265.585268

    Article  MathSciNet  MATH  Google Scholar 

  • Kovac A, Smith AD (2011) Nonparametric regression on a graph. J Comput Graph Stat 20(2):432–447. https://doi.org/10.1198/jcgs.2011.09203

    Article  MathSciNet  Google Scholar 

  • Kumar S, Spezzano F, Subrahmanian VS, Faloutsos C (2016) Edge weight prediction in weighted signed networks. In: IEEE 16th international conference on data mining, ICDM 2016, December 12-15, 2016, Barcelona, Spain, pp 221–230. https://doi.org/10.1109/ICDM.2016.0033

  • Lee JB, Rossi RA, Kong X (2018) Graph classification using structural attention. In: Guo Y, Farooq F (eds) Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining, KDD 2018, London, UK, August 19–23, 2018. ACM, pp 1666–1674 (2018). https://doi.org/10.1145/3219819.3219980

  • Leskovec J, Huttenlocher DP, Kleinberg JM (2010) Predicting positive and negative links in online social networks. In: Rappa M, Jones P, Freire J, Chakrabarti S (eds) Proceedings of the 19th international conference on world wide web, WWW 2010, Raleigh, North Carolina, USA, April 26–30, 2010. ACM, pp 641–650. https://doi.org/10.1145/1772690.1772756

  • Li J, Hu X, Jian L, Liu H (2016) Toward time-evolving feature selection on dynamic networks. In: IEEE 16th international conference on data mining, ICDM 2016, December 12–15, 2016, Barcelona, Spain, pp 1003–1008

  • Makkar D, Bader DA, Green O (2017) Exact and parallel triangle counting in dynamic graphs. In: 24th IEEE international conference on high performance computing, HiPC 2017, Jaipur, India, December 18–21, 2017. IEEE Computer Society, pp 2–12. https://doi.org/10.1109/HiPC.2017.00011

  • Meyer CD Jr (1973) Generalized inversion of modified matrices. SIAM J Appl Math 24(3):315–323

    Article  MathSciNet  MATH  Google Scholar 

  • Niepert M, Ahmed M, Kutzkov K (2016) Learning convolutional neural networks for graphs. In: Proceedings of the 33nd international conference on machine learning, ICML 2016, New York City, NY, USA, June 19-24, 2016. JMLR workshop and conference proceedings, vol 48, pp 2014–2023. http://jmlr.org/proceedings/papers/v48/niepert16.html

  • Parsons T, Pisanski T (1989) Vector representations of graphs. Discret Math 78(1):143–154. Special Double Issue in Memory of Tory Parsons

  • Qin T, Liu T, Zhang X, Wang D, Li H (2008) Global ranking using continuous conditional random fields. In: Koller D, Schuurmans D, Bengio Y, Bottou L (eds) Advances in neural information processing systems 21, Proceedings of the twenty-second annual conference on neural information processing systems, Vancouver, British Columbia, Canada, December 8–11, 2008. Curran Associates, Inc., Red Hook, pp 1281–1288. https://proceedings.neurips.cc/paper/2008/hash/ebd9629fc3ae5e9f6611e2ee05a31cef-Abstract.html

  • Rao CR, Mitra SK (1971) Generalized inverse of matrices and its applications. Wiley, New York

    MATH  Google Scholar 

  • Rozemberczki B, Sarkar R (2020) Characteristic functions on graphs: birds of a feather, from statistical descriptors to parametric models. In: Proceedings of the 29th ACM international conference on information and knowledge management (CIKM ’20). ACM, New York, pp 1325–1334

  • Saigo H, Nowozin S, Kadowaki T, Kudo T, Tsuda K (2009) gBoost: a mathematical programming approach to graph classification and regression. Mach Learn 75(1):69–89. https://doi.org/10.1007/s10994-008-5089-z

    Article  MATH  Google Scholar 

  • Sohn K, Kim S (2012) Joint estimation of structured sparsity and output structure in multiple-output regression via inverse-covariance regularization. In: Lawrence ND, Girolami MA (eds) Proceedings of the fifteenth international conference on artificial intelligence and statistics, AISTATS 2012, La Palma, Canary Islands, Spain, April 21–23, 2012, JMLR proceedings, vol 22. JMLR.org, pp 1081–1089. http://proceedings.mlr.press/v22/sohn12.html

  • Woodruff DP (2014) Sketching as a tool for numerical linear algebra. Found Trends Theor Comput Sci 10(1–2):1–157. https://doi.org/10.1561/0400000060

    Article  MathSciNet  MATH  Google Scholar 

  • Wytock M, Kolter JZ (2013) Sparse Gaussian conditional random fields: algorithms, theory, and application to energy forecasting. In: Proceedings of the 30th international conference on machine learning, ICML 2013, Atlanta, GA, USA, 16–21 June 2013. JMLR workshop and conference proceedings, vol 28. JMLR.org, pp 1265–1273. http://proceedings.mlr.press/v28/wytock13.html

  • Yang Z, Cohen WW, Salakhutdinov R (2016) Revisiting semi-supervised learning with graph embeddings. In: Proceedings of the 33nd international conference on machine learning, ICML 2016, New York City, NY, USA, June 19-24, 2016. JMLR workshop and conference proceedings, vol 48, pp 40–48. http://jmlr.org/proceedings/papers/v48/yanga16.html

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mostafa Haghir Chehreghani.

Additional information

Communicated by Leonardo de Lima.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Haghir Chehreghani, M. On the theory of dynamic graph regression problem. Comp. Appl. Math. 41, 362 (2022). https://doi.org/10.1007/s40314-022-02071-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s40314-022-02071-2

Keywords

Mathematics Subject Classification