Compiling Parallel Loops for High Performance Computers: Partitioning, Data Assignment and Remapping / Edition 1

ISBN-10:
0792392833
ISBN-13:
9780792392835
Pub. Date:
10/31/1992
Publisher:
Springer US
Hardcover

$109.99

Overview

4.2 Code Segments
4.3 Determining Communication Parameters
4.4 Multicast Communication Overhead
4.5 Partitioning
4.6 Experimental Results
4.7 Conclusion
5 Collective Partitioning and Remapping for Multiple Loop Nests
5.1 Introduction
5.2 Program Enclosure Trees
5.3 The CPR Algorithm
5.4 Experimental Results
5.5 Conclusion
Bibliography
Index

List of Figures
1.1 The Butterfly Architecture
1.2 Example of an iterative data-parallel loop
1.3 Contiguous tiling and assignment of an iteration space
2.1 Communication along a line segment
2.2 Access pattern for the access offset (3,2)
2.3 Decomposing an access vector along an orthogonal basis set of vectors
2.4 An analysis of communication patterns
2.5 Decomposing a vector along two separate basis sets of vectors
2.6 Cache lines aligning with borders
2.7 Cache lines not aligned with borders
2.8 nh is the difference of nd and nb
2.9 nh is the sum of nd and nb
2.10 The ADAPT system
2.11 Code segment used in experiments
2.12 Execution rates for various partitions
2.13 Execution time of partitions on Multimax
2.14 Performance increase as processing power increases
2.15 Percentage miss ratios for various aspect ratios and line sizes

Product Details

ISBN-13: 9780792392835
Publisher: Springer US
Publication date: 10/31/1992
Series: The Springer International Series in Engineering and Computer Science, #200
Edition description: 1993
Pages: 159
Product dimensions: 6.10(w) x 9.25(h) x 0.36(d)

Table of Contents

1 Introduction
  1.1 Model Assumptions
  1.2 Related Work
  1.3 Overview
2 Contiguous Loop Partitions for Neighborhood Communication
  2.1 Introduction
  2.2 Quantifying Communication
  2.3 Compensation for Cache Line Size
  2.4 Partition Construction
  2.5 Experimental Evaluation of ADP
  2.6 Conclusion
3 Contiguous Data Assignments for Neighborhood Communication
  3.1 Introduction
  3.2 Data Assignments
  3.3 Exploiting Overlap
  3.4 Software Redundancy
  3.5 ADAPT
  3.6 Experimental Results
  3.7 Conclusion
4 Cyclic Loop Partitions for Linearly Varying Loops
  4.1 Introduction
  4.2 Code Segments
  4.3 Determining Communication Parameters
  4.4 Multicast Communication Overhead
  4.5 Partitioning
  4.6 Experimental Results
  4.7 Conclusion
5 Collective Partitioning and Remapping for Multiple Loop Nests
  5.1 Introduction
  5.2 Program Enclosure Trees
  5.3 The CPR Algorithm
  5.4 Experimental Results
  5.5 Conclusion