Publications

Computer Communication Network Routing

TR-08-95 [tr-08-95.ps.gz ]
Ted Nesson and S. Lennart Johnsson. 1995. ROMM Routing on Mesh and Torus Networks.
TR-21-94 [tr-21-94.ps.gz ]
Ted Nesson and Lennart Johnsson. 1994. ROMM Routing: A Class of Efficient Minimal Routing Algorithms.
TR-18-93 [tr-18-93.ps.gz ]
William George, Ralph G. Brickner, and S. Lennart Johnsson. 1993. POLYSHIFT Communications Software for the Connection Machine System CM-200.
TR-07-93 [tr-07-93.ps.gz ]
Kapil K. Mathur and S. Lennart Johnsson. 1993. All-to-All Communication Algorithms for Distributed BLAS.
TR-02-93 [tr-02-93.ps.gz ]
Kapil K. Mathur and S. Lennart Johnsson. 1993. All-to-All Communication on the Connection Machine CM-200.
TR-23-92 [tr-23-92.ps.gz ]
Kapil K. Mathur and S. Lennart Johnsson. 1992. Communication Primitives for Unstructured Finite Element Simulations on Data Parallel Architectures.
TR-20-92[tr-20-92.ps.gz ]
Ching-Tien Ho, M.T. Raghunath, and S. Lennart Johnsson. 1992. An Efficient Algorithm for Gray-to-Binary Permutation on Hypercubes.
TR-16-92 [tr-16-92.ps.gz ]
S. Lennart Johnsson and Ching-Tien Ho. 1992. Optimal Communication Channel Utilization for Matrix Transposition and Related Permutations on Binary Cubes.
TR-23-91 [tr-23-91.ps.gz ]
S. Lennart Johnsson. 1991. Minimizing the Communication Time for Matrix Multiplication on Multi-Processors.
TR-21-91 [tr-21-91.ps.gz ]
Jean-Philippe Brunet and S. Lennart Johnsson. 1991. All-to-All Broadcast and Applications on the Connection Machine.
TR-20-91 [tr-20-91.ps.gz ]
S. Lennart Johnsson and Ching-Tien Ho. 1991. On the Conversion between Binary Code and Binary-Reflected Gray Code on Boolean Cubes.
TR-18-91 [tr-18-91.ps.gz ]
S. Lennart Johnsson and Ching-Tien Ho. 1991. Optimal All-to-All Personalized Communication with Minimum Span on Boolean Cubes.
TR-04-91 [tr-04-91.ps.gz ]
S. Lennart Johnsson and Ching-Tien Ho. 1991. Generalized Shuffle Permutations on Boolean Cubes.
TR-03-91
S. Lennart Johnsson and Ching-Tien Ho. 1992. Optimal Communication Channel Utilization for Matrix Transposition and Related Permutations on Binary Cubes.
TR-16-92 SUPERCEDES TR-03-91.

Data Distribution

TR-08-94 [tr-08-94.ps.gz ]
Zdenek Johan, Kapil K. Mathur, S. Lennart Johnsson, and Thomas J.R. Hughes. 1994. Mesh Decomposition and Communication Procedures for Finite Element Applications on the Connection Machine CM-5 System.
TR-07-94 [tr-07-94.ps.gz ]
Zdenek Johan, Kapil K. Mathur, S. Lennart Johnsson, and Thomas J.R. Hughes. 1994. Parallel implementation of recursive spectral bisection on the Connection Machine CM-5 system.
TR-11-93 [tr-11-93.ps.gz ]
Zdene k Johan, Kapil K. Mathur, S. Lennart Johnsson, and Thomas J.R. Hughes. 1993. An Efficient Communication Strategy for Finite Element Methods on the Connection Machine CM-5 System.

Parallel Algorithms and Scientific Software Libraries

TR-27-94 [tr-27-94.ps.gz]
David Kramer, S. Lennart Johnsson, and Yu Hu. 1994. Local Basic Linear Algebra Subroutines (LBLAS) for the CM--5/5E.
TR-19-94 [tr-19-94.ps.gz ]
S. Lennart Johnsson and Kapil K. Mathur. 1994. Scientific Software Libraries for Scalable Architectures.
TR-23-93 [tr-23-93.ps.gz ]
S. Lennart Johnsson. 1993. CMSSL: A Scalable Scientific Software Library.
TR-19-93 [tr-19-93.ps.gz ]
S. Lennart Johnsson and Kapil K. Mathur. 1993. High Performance, Scalable Scientific Software Libraries.
TR-09-92 [tr-09-92.ps.gz ]
S. Lennart Johnsson and Luis F. Ortiz. 1992. Local Basic Linear Algebra Subroutines (LBLAS) for Distributed Memory Architectures and Languages with Array Syntax.
TR-04-92 [tr-04-92.ps.gz ]
Woody Lichtenstein and S. Lennart Johnsson. 1992. Block-Cyclic Dense Linear Algebra.
TR-01-92 [tr-01-92.ps.gz ]
Kapil K. Mathur and S. Lennart Johnsson. 1992. Multiplication of Matrices of Arbitrary Shape on a Data Parallel Computer.
TR-25-91 [tr-25-91.ps.gz ]
S. Lennart Johnsson, Michel Jacquemin, and Robert L. Krawitz. 1991. Communication Efficient Multi-processor FFT.
TR-24-91 [tr-24-91.ps.gz ]
S. Lennart Johnsson and Robert L. Krawitz. 1991. Cooley-Tukey FFT on the Connection Machine.
TR-19-91 [tr-19-91.ps.gz ]
Ching-Tien Ho, S. Lennart Johnsson, and Alan Edelman. 1991. Matrix Multiplication on Hypercubes Using Full Bandwidth and Constant Storage.
TR-02-91 [tr-02-91.ps.gz ]
S. Lennart Johnsson and Patrick Worley. 1991. Communication and I/O Libraries.

Scientific Applications

TR-16-96 [tr-16-96.ps.gz ]
Yu Charlie Hu and S. Lennart Johnsson. 1996. On the Accuracy of Anderson's Fast N-body Algorithm.
TR-15-96 [tr-15-96.ps.gz ]
Yu Charlie Hu, Shang-Hua Teng and S. Lennart Johnsson. 1996. A data-parallel implementation of the geometric partitioning algorithm.
TR-14-96 [tr-14-96.ps.gz ]
Yu Charlie Hu, S. Lennart Johnsson, and Shang-Hua Teng. 1996. A data-parallel adaptive N-body method.
TR-13-96 [ tr-13-96.ps.gz ]
Yu Charlie Hu, S. Lennart Johnsson, and Shang-Hua Teng. 1996. High Performance Fortran for Highly Irregular Problems.
TR-06-96 [ tr-06-96.ps.gz ]
Yu Hu and S. Lennart Johnsson. 1996. On the Accuracy of Poisson's Formula Based Fast N-Body Algorithms.
TR-05-96 [ tr-05-96.ps.gz ]
Yu Hu and S. Lennart Johnsson. 1996. A Data-Parallel Implementation of O(N) Hierarchical N-body Methods.
TR-26-94 [tr-26-94.ps.gz ]
Yu Hu and S. Lennart Johnsson. 1994. A Data Parallel Implementation of Hierarchical N-body Methods.
TR-24-94 [tr-24-94.ps.gz ]
Yu Hu and S. Lennart Johnsson. 1994. Implementing O(N) N-body Algorithms Efficiently in Data Parallel Languages (High Performance Fortran).
TR-18-94 [tr-18-94.ps.gz ]
Z. Johan, K.K. Mathur, S.L. Johnsson, and T.J.R. Hughes. 1994. Finite Element Techniques for Computational Fluid Dynamics on the Connection Machine CM-5 System.
TR-16-94 [tr-16-94.ps.gz ]
Zdenek Johan, Kapil K. Mathur, S. Lennart Johnsson, and Thomas J.R. Hughes. 1994. Scalability of Finite Element Applications on Distributed-Memory Parallel Computers.
TR-04-94 [tr-04-94.ps.gz ]
Z. Johan, K.K. Mathur, S.L. Johnsson, and T.J.R. Hughes. 1994. Data Parallel Finite Element Techniques for Compressible Flow Problems.
TR-08-93 [tr-08-93.ps.gz ]
Kapil K. Mathur, Zdene k Johan, S. Lennart Johnsson, and Thomas J.R. Hughes. 1993. Massively Parallel Computing: Unstructured Finite Element Simulations.
TR-06-93 [tr-06-93.ps.gz ]
Zdene k Johan, Kapil K. Mathur, S. Lennart Johnsson, and Thomas J.R. Hughes. 1993. An Efficient Communication Strategy for Finite Element Methods on the Connection Machine CM-5 System.
TR-02-92 [tr-02-92.ps.gz ]
Zdene k Johan, Thomas J.R. Hughes, Kapil K. Mathur, , and S. Lennart Johnsson. 1992. A Data Parallel Finite Element Method for Computational Fluid Dynamics on the Connection Machine System.
TR-01-91 [tr-01-91.ps.gz ]
Ralph G. Brickner, Clive F. Baillie, and S. Lennart Johnsson. 1991. QCD on the Connection Machine: Beyond *LISP.

Special-Purpose Compilers

TR-22-93 [tr-22-93.ps.gz ]
Ralph G. Brickner, William George, S. Lennart Johnsson, , and Alan Ruttenberg. 1993. A Stencil Compiler for the Connection Machine Models CM-2/200.

Parallel Benchmarking

TR-01-97 [tr-01-97.ps.gz ]
Yu Hu, S. Lennart Johnsson, Nadia Shalaby, and Dimeitrios Kehagias. 1996. . A Data parallel Fortran Benchmark Suite.
TR-36-95 [tr-36-95.ps.gz ]
Yu Hu, S. Lennart Johnsson, Nadia Shalaby, and Dimeitrios Kehagias. 1995. DPF: A Data parallel Fortran Benchmark Suite.

Parallel Computation -- General

TR-13-97 [tr-13-97.ps.gz ]
Y. Charlie Hu and S. Lennart Johnsson. 1997. Data Parallel Performance Optimizations Using Array Aliasing.
TR-35-95 [tr-35-95.ps.gz ]
S. Lennart Johnsson . 1995. Data Partitioning for Load--Balance and Communication Bandwidth Preservation.
TR-29-95 [tr-29-95.ps.gz ]
S. Lennart Johnsson. 1995. Network Related Performance Issues and Techniques for MPPs.
TR-22-94 [tr-22-94.ps.gz ]
S. Lennart Johnsson. 1994. Issues in High Performance Computer Networks.
TR-09-94 [tr-09-94.ps.gz ]
S. Lennart Johnsson. 1994. Data Motion and High Performance Computing.
TR-09-93 [tr-09-93.ps.gz ]
S. Lennart Johnsson. 1993. The Connection Machine Systems CM-5.
TR-01-93 [tr-01-93.ps.gz ]
S. Lennart Johnsson and Kapil K. Mathur. 1993. Massively Parallel Computing: Mathematics and communications libraries.
TR-29-92 [tr-29-92.ps.gz ]
S. Lennart Johnsson. 1992. Massively Parallel Computing: Data distribution and communication.
TR-18-92 [tr-18-92.ps.gz ]
S. Lennart Johnsson. 1992. Language and Compiler Issues in Scalable High Performance Scientific Libraries.
TR-07-92 [tr-07-92.ps.gz ]
Alan Edelman, Steve Heller, and S. Lennart Johnsson. 1992. Index Transformation Algorithms in a Linear Algebra Framework.
TR-10-91 [tr-10-91.ps.gz ]
S. Lennart Johnsson. 1991. Performance Modeling of Distributed Memory Architectures.