Loading…
Thursday, March 5 • 11:05 - 11:25
Fine-grained Seismic Algorithms: “Revisiting Kirchhoff Migration on GPUs,” Rajesh Gandham, Rice University; Thomas Cullison, Hess; and Scott Morton, Hess

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

DOWNLOAD PRESENTATION

WATCH VIDEO

Kirchhoff 3-D prestack depth migration is a widely used and computationally intensive seismic imaging approach that is highly parallelizable. There is parallelism both in the input seismic data and in the output image. At this workshop in 2008, we presented an algorithm with parallelism from the level of many serial Grid Engine tasks down to individual CUDA threads running on Nvidia GPUs. The performance of this algorithm gave clusters with GPUs a significant price-to- performance advantage over standard CPU clusters.

We recently revisited our Kirchhoff code with two computational goals, hardware portability and increased performance, as well as the geophysical goal of generalized gather creation. To address the possibility of hardware portability, we first tested the OCCA unified parallel programming model by re-coding our CUDA computational kernels in OCCA. This enabled our kernels to run on Nvidia or AMD GPUs and multi-core CPUs using CUDA, OpenCL or OpenMP. Because of the similarities between the OCCA and CUDA programming models, we were able to port and benchmark the code in less than a month without losing any production performance. In fact, the run-time compilation feature of OCCA generally resulted in better optimized kernels, since the compilers have knowledge of user parameters and the execution hardware, effectively tailoring the kernels for each production run.

With this portability success, we next explored two dimensions in the design of the algorithm to address the possibility of increasing the migration performance on newer GPU architecture. We explored (1) the fraction of the GPU memory used for input seismic data (as opposed to the output image samples) and (2) the trade-off of computation vs memory usage. We found significant im- provements in performance when we shifted the balance towards more input data and less output data in memory, and switched to a more compute intensive approach.

We have coded the new algorithm in OCCA and added the ability to generate generalized image gathers (e.g., offset gathers, offset vector gathers and reflection angle gathers). In this talk, we will discuss our porting, design experiences, and compare the advantages and performance of the old and new algorithms.

Speakers
avatar for Thomas Cullison

Thomas Cullison

Solutions Architect Oil & Gas, NVIDIA
Thomas is a Solutions Architect at NVIDIA with a focus on the Oil & Gas industry, and he has a background in computational geophysics, algorithms, and development of HPC and DSP applications for seismic imaging and seismic data processing. As an undergraduate at Colorado School of... Read More →
avatar for Rajesh Gandham

Rajesh Gandham

PhD Candidate, Rice University
Rajesh Gandham is a PhD candidate in the department of Computational and Applied Mathematics at Rice University. He is working with Prof. Tim Warburton on high performance high order numerical methods for ocean fluid flow problems. His research interests are in developing and implementing... Read More →
avatar for Scott Morton

Scott Morton

Manager and Global Geophysical Advisor, Hess Corporation
Scott Morton has 25 years of experience in computational and theoretical physics distributed between academia, the computer industry and the petroleum industry. Although originally trained as an astrophysicist, he switched to geophysics when he joined Shell in 1991 to do research... Read More →


Thursday March 5, 2015 11:05 - 11:25 CST
BioScience Research Collaborative 6500 Main Street, Houston, Tx 77005

Attendees (0)