757-018
|
An Exploration of OpenCL on Multiple Hardware Platforms for a Numerical Relativity Application
Niket K. Choudhary, Sandeep Navada, Rakesh Ginjupalli, and Gaurav Khanna
doi:
10.2316/P.2011.757-018
|
Abstract
|
|
757-012
|
Simulating Species Interactions and Complex Emergence in Multiple Flocks of Boids with GPUS
Alwyn V. Husselmann and Ken A. Hawick
doi:
10.2316/P.2011.757-012
|
Abstract
|
|
757-077
|
Color Image Edge Detection based on Quantity of Color Information and its Implementation on the GPU
Jingxiu Zhao, Yonghong Xiang, Laurence Dawson, and Iain Stewart
doi:
10.2316/P.2011.757-077
|
Abstract
|
|
757-058
|
Calculating an Approximation to a Union of Balls using a Graphics Processing Unit
Christian Trefftz, Gregory Wolffe, Igor Majdandzic, and Joseph Szakas
doi:
10.2316/P.2011.757-058
|
Abstract
|
|
757-070
|
Acceleration of Ant Colony Optimization for the Traveling Salesman Problem on a GPU
Kazuta Kobashi, Akihiro Fujii, Teruo Tanaka, and Kazunori Miyoshi
doi:
10.2316/P.2011.757-070
|
Abstract
|
|
757-061
|
GPGPU-based Algebraic Multigrid Method
Kosuke Takahashi, Akihiro Fujii, and Teruo Tanaka
doi:
10.2316/P.2011.757-061
|
Abstract
|
|
757-014
|
GPGPU DFA Membership Tests
Beorn Facchini, Yousun Ko, Min-Young Jung, and Bernd Burgstaller
doi:
10.2316/P.2011.757-014
|
Abstract
|
|
757-029
|
Evaluation of Executing DGEMM Algorithms on Modern Multicore CPU
Pawel Gepner, Victor Gamayunov, and David L. Fraser
doi:
10.2316/P.2011.757-029
|
Abstract
|
|
757-066
|
Comparing Implementation Platforms for Real-Time Stream Processing Systems on Multi-Core Hardware
Oleg Danylenko, Welf Löwe, and Sara Rydström
doi:
10.2316/P.2011.757-066
|
Abstract
|
|
757-083
|
Exploiting Thread Level Parallelism by Loop Unrolling along Wavefronts
Johann Steinbrecher and Weijia Shang
doi:
10.2316/P.2011.757-083
|
Abstract
|
|
757-039
|
Dynamic Selection of Speculative Paths in Two-Path Limited Speculation Method
Hiroyoshi Jutori, Kanemitsu Ootsu, Takashi Yokota, and Takanobu Baba
doi:
10.2316/P.2011.757-039
|
Abstract
|
|
757-060
|
Parallelizing Sequential Programs using a Selection of Available Tools and Techniques
Karen Bradshaw and Waide Tristram
doi:
10.2316/P.2011.757-060
|
Abstract
|
|
757-057
|
Performance Estimation of Speculative Multithreading through Whole Program Path
Kanemitsu Ootsu, Takashi Yokota, and Takanobu Baba
doi:
10.2316/P.2011.757-057
|
Abstract
|
|
757-073
|
A Class of Queuing Network Models for Multithreaded Processors
Miao Ju, Hun Jung, and Hao Che
doi:
10.2316/P.2011.757-073
|
Abstract
|
|
757-115
|
Parallel EDMD Simulation on Multi-Core Architectures
Umut Demirtaş and Fatih E. Sevilgen
doi:
10.2316/P.2011.757-115
|
Abstract
|
|
757-067
|
Bottleneck Identification for Multithreaded Processors
Miao Ju, Hun Jung, and Hao Che
doi:
10.2316/P.2011.757-067
|
Abstract
|
|
757-068
|
Thread-Locking Work Stealing under Parallel Data List
Jorge Buenabad-Chávez, Edgar F. Hernández-Ventura, Miguel A. Castro-García, José L. Quiroz-Fabián, Graciela Román-Alonso, and Daniel M. Yellin
doi:
10.2316/P.2011.757-068
|
Abstract
|
|
757-110
|
Performance Improvement of Hot-Path based Thread Partitioning Technique by Unifying Loop Parallelization
Kanemitsu Ootsu, Takashi Yokota, and Takanobu Baba
doi:
10.2316/P.2011.757-110
|
Abstract
|
|
757-044
|
Multi-Objective Local Instruction Scheduling for GPGPU Applications
Constantin Timm, Frank Weichert, Peter Marwedel, and Heinrich Müller
doi:
10.2316/P.2011.757-044
|
Abstract
|
|
757-037
|
A Model based Approach for Computing Speedup on Parallel Machines using Static Code Analysis
Ioannis Zgeras, Jürgen Brehm, and Tobias Sprodowski
doi:
10.2316/P.2011.757-037
|
Abstract
|
|
757-097
|
A GPGPU Programming Framework based on a Shared-Memory Model
Kazuhiko Ohno, Dai Michiura, Masaki Matsumoto, Takahiro Sasaki, and Toshio Kondo
doi:
10.2316/P.2011.757-097
|
Abstract
|
|
757-050
|
HPC.NET: Enabling .NET Programs for GPU-based High Performance Computing
Hsuan-Hsiu (Senshaw) Ou, Spencer Davis, Chaochao Zhang, and Hai Jiang
doi:
10.2316/P.2011.757-050
|
Abstract
|
|
757-041
|
Implementation of Multiple-Precision Floating-Point Arithmetic Library for GPU Computing
Takatoshi Nakayama and Daisuke Takahashi
doi:
10.2316/P.2011.757-041
|
Abstract
|
|
757-021
|
A Non-Stop Distributed File System with I/O Replication on Proxy Servers
Akihiko Nishitani and Tomohiko Ogishi
doi:
10.2316/P.2011.757-021
|
Abstract
|
|
757-099
|
The Feasibility of Moving Terabyte Files between Campus and Cloud
Adam H. Villa and Elizabeth Varki
doi:
10.2316/P.2011.757-099
|
Abstract
|
|
757-108
|
Using Trade Wind to Sail in the Clouds
Charles Miers, Marcel de Barros, Marcos Simplício, Nelson Gonzalez, Pedro Evangelista, Walter Goya, Tereza Carvalho, Stefan Hellkvist, Joacim Halén, Jan-Erik Mångs, Bob Melander, and Victor Souza
doi:
10.2316/P.2011.757-108
|
Abstract
|
|
757-112
|
PAWS: A Toolkit for Analyzing Web Service Performance using Proxies
Michael D. Rogers, Sheikh Ghafoor, and Rob Dye
doi:
10.2316/P.2011.757-112
|
Abstract
|
|
757-059
|
Overlaying an Opportunistic Virtual Storage System on the UnaGrid Infrastructure
Mario Villamizar Cano, Arthur Oviedo, Harold Castro, and Juan Osorio
doi:
10.2316/P.2011.757-059
|
Abstract
|
|
757-080
|
Search on the Cloud File System
Rodrigo Savage, Dulce Tania Nava, Norma Elva Chávez, and Norma Saiph Savage
doi:
10.2316/P.2011.757-080
|
Abstract
|
|
757-022
|
An Efficient Topology Construction Algorithm for Mesh-Pull Peer-to-Peer Streaming Networks
Tomoyuki Ishii and Atsushi Inoie
doi:
10.2316/P.2011.757-022
|
Abstract
|
|
757-005
|
The Impact of Inter-Node Latency versus Intra-Node Latency on HPC Applications
Gilad Shainer, Pak Lui, Tong Liu, Todd Wilde, and Jeffrey Layton
doi:
10.2316/P.2011.757-005
|
Abstract
|
|
757-038
|
Execution Environment on FPGA for Smart PC Hetero-Cluster
Kiyoshi Hayakawa and Keita Ito
doi:
10.2316/P.2011.757-038
|
Abstract
|
|
757-081
|
Directoryless Shared Memory Coherence using Execution Migration
Mieszko Lis, Keun Sup Shim, Myong Hyon Cho, Omer Khan, and Srinivas Devadas
doi:
10.2316/P.2011.757-081
|
Abstract
|
|
757-101
|
Achieving High Throughput in High Radix Switches using Asymmetric Crossbar
Kefei Wang, Heyin Zhang, and Ming Fang
doi:
10.2316/P.2011.757-101
|
Abstract
|
|
757-052
|
Implementation of a Singular Value Decomposition Module on an FPGA
Masoud Hosseinimehr and Norma Montealegre
doi:
10.2316/P.2011.757-052
|
Abstract
|
|
757-009
|
Performance Analysis of Different Multiplication Strategies in Reconfigurable Hardware
Umer N. Misgar and Muhammad Hasan
doi:
10.2316/P.2011.757-009
|
Abstract
|
|
757-114
|
Power- and Cooling- Aware Parallel Performance Diagnosis
Rashawn L. Knapp, Karen L. Karavanic, Sriram Krishnamoorthy, and Andres Marquez
doi:
10.2316/P.2011.757-114
|
Abstract
|
|
What are Digital Object Identifers?