Accepted papers
The following papers were accepted by the Program Committee. They will be published and presented at the SBAC-PAD Symposium in the following sessions. Authors, please send the camera-ready copy by August 13, 2004.
Session 1: Cache and Memory Architectures
Cache Filtering Techniques to Reduce the Negative Impact of Useless Speculative Memory References on Processor Performance
Self-Monitored Adaptive Cache Warm Up for Microprocessor Simulation
The eDRAM based L3-Cache of the BlueGene/L Supercomputer Processor Node
Multi-Profile Instruction Based Compression
Session 2: Processor Architectures I
A Study of Errant Pipeline Flushes caused by Value Misspeculation
Design Space Exploration using T&D-Bench
Value Predictors for Reuse through Speculation on Traces
Session 3: Processor Architectures II
IATO: A Flexible EPIC Simulation Environment
ArchC: A SystemC-Based Architecture Description Language
Optimizations for compiled simulation using instruction type information
Session 4: Languages and Tools for Parallel and Distributed Programming
Improving Server Performance on Transaction Processing Workloads by Enhanced Data Placement
High Performance Communication System Based on Generic Programming
Performance Evaluation of a Prototype Distributed NFS Server
Session 5: Grid, Cluster and Pervasive
FlowCert: Probabilistic Certification for Peer-to-Peer Computations
A Performance Evaluation of a Quorum-Based State-Machine Replication Algorithm for Computing Grids
Scheduling in Bag-of-Task Grids: The PAUÁ Case
MEu: unifying application modeling and cluster exploitation
Session 6: High Performance Applications I
Parallel Implementation of a Lagrangian Stochastic Model for Pollution Dispersion
A Parallel Engine for Graphical Interactive Molecular Dynamics
Parallel Adaptive Mesh Coarsening for Seismic Tomography
Combining a Shared-Memory High Performance Computer and a Heterogeneous Cluster for the Simulation of Light Interaction with Human Skin
Session 7: Parallel and Distributed Algorithms
Revisiting a BSP/CGM Transitive Closure Algorithm
Improving Parallel Execution Time of Sorting on Heterogeneous Clusters
An Approach for Pre-Runtime Scheduling in Embedded Hard Real-Time Systems with Power Constraints
Session 8: Load Balancing and Scheduling
Graph Partitioning with the Party Library: Helpful-Sets in Practice
On the Combined Scheduling of Malleable and Rigid Jobs
A Cluster-based Strategy for Scheduling Task on Heterogeneous Processors
A New Migration Model based on the Evaluation of Processes Load and Lifetime on Heterogeneous Computing Environments
Session 9: Benchmarking, Performance Measurements and Analysis
Characterizing the Dynamic Behavior of Workload Execution in SVM Systems
A Performance Evaluation of ARM ISA Extensions for Elliptic Curve Cryptography over Binary Finite Fields
PEMPIs: A New Methodology for Modeling and Prediction of MPI Programs Performance
Performance Characterisation of Intra-Cluster Collective Communications