κNUMA: A Model for Clusters of SMP-Machines

Schmollinger, Martin; Kaufmann, Michael

doi:10.1007/3-540-48086-2_5

Martin Schmollinger⁸ &
Michael Kaufmann⁸

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2328))

Included in the following conference series:

International Conference on Parallel Processing and Applied Mathematics

536 Accesses

Abstract

The κnuma model is a new model of parallel computation, which should be used to develop and analyse algorithms for clusters of smp-blocks (symmetrical multiprocessing). smp-blocks are parallel computers with shared memory to which the few processors have uniform access (uma). The model implies modern directions like hierarchical interconnection, innernode communication (threads and shared memory) and internode communication (message-passing and remote data access). κnuma is developed on top of the widely accepted bsp (bulk-synchronous parallel) model. In this paper, we present an examplifying analysis of the personalized one-to-all broadcast. It will be shown that if we transfer optimal algorithms based on the bsp model directly, there will be a lack of information and so a loss of performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Parallelization Methods for Hierarchical SMP Systems

Analytical Estimation of the Scalability of Iterative Numerical Algorithms on Distributed Memory Multiprocessors

Article 25 May 2018

Workstealing and Nested Parallelism in SMP Systems

References

A. Alexandrov, M. Ionescu, K. Schauser, and C. Scheiman. LogGP: Incorporating long messages into the LogP model — one step closer towards a realistic model for parallel computation. In In Proceedings of the 7th Symposium on Parallel Algorithms and Architectures, Santa Barbara, CA, pages 95–105, Juli 1995.
Google Scholar
G. Bilardi, P. Codenotti, G. Del Corso, C. Pinotti, and G. Resta. EURO-PAR 97, volume 1300 of Lecture Notes in Computer Science, chapter Broadcast and Other Primitive Operations on Fat-Trees. Springer, August 1997.
Google Scholar
A. Bäumker, W. Dittrich, and F. M. auf der Heide. Truly efficent parallel algorithms: c-optimal multisearch for an extension of the BSP model. In Proceedings of the European Symposium on Algorithms, pages 17–30, 1995.
Google Scholar
D. Culler, R. Karp, D. Patterson, A. Sahay, K. Schauser, E. Santos, R. Subramonian, and T. von Eicken. LogP: Towards a realistic model of parallel computation. In Proceedings of the 4th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 1–12, Mai 1993.
Google Scholar
F. Dehne and A. Fabri. Scalable parallel computational geometry for coarse grained multicomputers. In Proceedings of ACM ninth Annual Computational Geometry, pages 298–307, 1993.
Google Scholar
P. B. Gibbons, Y. Matias, and V. Ramachandran. Can shared-memory model serve as a bridging model for parallel computation? In ACM Symposium on Parallel Algorithms and Architectures, pages 72–83, 1997.
Google Scholar
A. Grbic, S. Brown, S. Caranci, R. Grindley, M. Gusat, G. Lemieux, K. Loveless, N. Manjikian, S. Srbljic, M. Stumm, Z. Vranesic, and Z. Zilic. Design and implementation of the NUMAchine multiprocessor. In Proceedings of the 35th IEEE Design Automation Conference, San Francisco, CA, June 1998.
Google Scholar
R. Grindley. The NUMAschine multiprocessor. In Proceedings of the international conference on parallel processing, Toronto Canada, August 2000.
Google Scholar
D. B. Gustavson and Q. Li. The scalable coherant interface (SCI). IEEE Communications Magazine, 34(5):52–63, 1996.
Article Google Scholar
S. E. Hambrusch and A. Khokhar. C ³: A parallel model for coarse-grained machines. Journal on Parallel and Distributed Computing, 32(2):139–154, 1996.
Article Google Scholar
H. Hellwagner and A. Reinefeld, editors. SCI-Scalable Coherent Interface, volume 1734 of Lecture Notes in Computer Science. Springer Verlag, 1999.
Google Scholar
B. Juurlink and H. Wijshoff. EURO-PAR 96, Parallel Processing, volume 1124 of Lecture Notes in Computer Science, chapter The E-BSP Model: Incorporating Unbalanced Communication and General Locality into the BSP Model, pages 339–347. Springer, August 1996.
Book Google Scholar
K. G. Sevcik and S. Zhou. Performance benefits and limitations of large NUMA multiprocessors. In Performance, pages 183–204, Rome, Italy, September 1993.
Google Scholar
Y. Tanaka, M. Matsuda, M. Ando, K. Kazuto, and M. Sato. IPPS Workshop on Personal Computer Based Networks of Workstations, volume 1388 of Lecture Notes in Computer Science, chapter COMPaS: A Pentium Pro PC-based SMP Cluster and its Experience, pages 486–497. 1998.
Google Scholar
P. Torre and C. Kruskal. EURO-PAR 96, Parallel Processing, volume 1124 of Lecture Notes in Computer Science, chapter Submachine Locality in the Bulk Synchronous Setting, pages 352–358. Springer, August 1996.
Book Google Scholar
L. G. Valiant. A bridging model for parallel computation. Communications of the ACM, 33(8):103–111, 1990.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Wilhelm-Schickhard-Institute for Computer Science, Parallel Computing Group, University of Tübingen, Sand 13, 72076, Tübingen, Germany
Martin Schmollinger & Michael Kaufmann

Authors

Martin Schmollinger
View author publications
You can also search for this author in PubMed Google Scholar
Michael Kaufmann
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Mathematics and Computer Science, Technical University of Czestochowa, Dabrowskiego 73, 42-200, Czestochowa, Poland
Roman Wyrzykowski
Computer Science Department, University of Tennessee, 122 Volunteer Blvd, Knoxville, TN, 37996-3450, USA
Jack Dongarra
Computer Science Department, Oklahoma State University, 700 N. Greenwood Ave., Tulsa, OK, 74106, USA
Marcin Paprzycki
DTU, UNI-C, Danish Computing Centre for Research and Education, Bldg. 304, 2800, Lyngby, Denmark
Jerzy Waśniewski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schmollinger, M., Kaufmann, M. (2002). κNUMA: A Model for Clusters of SMP-Machines. In: Wyrzykowski, R., Dongarra, J., Paprzycki, M., Waśniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2001. Lecture Notes in Computer Science, vol 2328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48086-2_5

Download citation

DOI: https://doi.org/10.1007/3-540-48086-2_5
Published: 06 June 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43792-5
Online ISBN: 978-3-540-48086-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics