Ian Briggs

University of Utah

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Trends

Author Order

Document Type

Co-Authors

Ganesh Gopalakrishnan

University of Utah

Pavel Panchekha

University of Utah

Zvonimir Rakamarić

University of Utah

Arnab Das

Himalayan University

Sriram Krishnamoorthy

University of California, Santa Barbara

Alexey Solovyev

Institut Paoli-Calmettes

Dong H. Ahn

Nvidia (United States)

Michael Bentley

University of Utah

Gregory L. Lee

Lawrence Livermore National Laboratory

Marek Baranowski

University of Utah

Cooperative Institutions

University of Utah

Lawrence Livermore National Laboratory

Texas A&M University

Leibniz Supercomputing Centre

NSF National Center for Atmospheric Research

Pacific Northwest National Laboratory

A.P. Krylov All-Russian Oil and Gas Research Institute (Russia)

Brigham Young University

The University of Texas at Austin

University of Colorado Boulder

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Field

FPD etect

ACM Transactions on Architecture and Code Optimization (2020)

Arnab Das Sriram Krishnamoorthy Ian Briggs Ganesh Gopalakrishnan Ramakrishna Tipireddy

We present FPD etect , a low-overhead approach for detecting logical errors and soft errors affecting stencil computations without generating false positives. We develop an offline analysis that tightly estimates the number of floating-point bits preserved across stencil applications. This estimate rigorously bounds the values expected in the data space of the computation. Violations of this bound can be attributed with certainty to errors. FPD etect helps synthesize error detectors customized for user-specified levels of accuracy and coverage. FPD etect also enables overhead reduction techniques based on deploying these detectors coarsely in space and time. Experimental evaluations demonstrate the practicality of our approach.

Stencil

10.1145/3402451

Cite

Citations (3)

Moving the Needle on Rigorous Floating-Point Precision Tuning.

Marek Baranowski Ian Briggs Wei-Fan Chiang Ganesh Gopalakrishnan Zvonimir Rakamarić

Source

Cite

Citations (0)

FailAmp

ACM Transactions on Architecture and Code Optimization (2019)

Ian Briggs Arnab Das Mark Baranowski Vishal Sharma Sriram Krishnamoorthy

We present FailAmp, a novel LLVM program transformation algorithm that makes programs employing structured index calculations more robust against soft errors. Without FailAmp, an offset error can go undetected; with FailAmp, all subsequent offsets are relativized, building on the faulty one. FailAmp can exploit ISAs such as ARM to further reduce overheads. We verify correctness properties of FailAMP using an SMT solver, and present a thorough evaluation using many high-performance computing benchmarks under a fault injection campaign. FailAmp provides full soft-error detection for address calculation while incurring an average overhead of around 5%.

Solver

Soft error

Fault injection

Satisfiability Modulo Theories

10.1145/3369381

Cite

Citations (1)

ArcherGear

Samuel Thayer Ganesh Gopalakrishnan Ian Briggs Michael Bentley Dong H. Ahn

There is growing uptake of shared memory parallelism in high performance computing, and this has increased the need for data race checking during the creation of new parallel codes or parallelizing existing sequential codes. While race checking concepts and implementations have been around for many concurrency models, including tasking models such as Cilk and PThreads (e.g., the Thread Sanitizer tool), practically usable race checkers for other APIs such as OpenMP have been lagging. For example, the OpenMP parallelization of an important library (namely Hypre) was initially unsuccessful due to inexplicable nondeterminism introduced when the code was optimized, and later root-caused to a race by the then recently developed OpenMP race checker Archer [2]. The open-source Archer now enjoys significant traction within several organizations.

POSIX Threads

USable

10.1145/3332466.3374504

Cite

Citations (4)

Choosing mathematical function implementations for speed and accuracy

Ian Briggs Pavel Panchekha

Standard implementations of functions like sin and exp optimize for accuracy, not speed, because they are intended for general-purpose use. But just like many applications tolerate inaccuracy from cancellation, rounding error, and singularities, many application could also tolerate less-accurate function implementations. This raises an intriguing possibility: speeding up numerical code by using different function implementations.

Implementation

Rounding

Code (set theory)

10.1145/3519939.3523452

Cite

Citations (3)

FLiT: Cross-platform floating-point result-consistency tester and workload

Geof Sawaya Michael Bentley Ian Briggs Ganesh Gopalakrishnan Dong H. Ahn

Understanding the extent to which computational results can change across platforms, compilers, and compiler flags can go a long way toward supporting reproducible experiments. In this work, we offer the first automated testing aid called FLiT (Floating-point Litmus Tester) that can show how much these results can vary for any user-given collection of computational kernels. Our approach is to take a collection of these kernels, disperse them across a collection of compute nodes (each with a different architecture), have them compiled and run, and bring the results to a central SQL database for deeper analysis. Properly conducting these activities requires a careful selection (or design) of these kernels, input generation methods for them, and the ability to interpret the results in meaningful ways. The results in this paper are meant to inform two different communities: (a) those interested in seeking higher performance by considering "IEEE unsafe" optimizations, but then want to understand how much result variability to expect, and (b) those interested in standardizing compiler flags and their meanings, so that one may safely port code across generations of compilers and architectures. By releasing FLiT, we have also opened up the possibility of all HPC developers using it as a common resource as well as contributing back interesting test kernels as well as best practices, thus extending the floating-point result-consistency workload we contribute. This is the first such workload and result-consistency tester underlying floating-point reproducibility of which we are aware.

10.1109/iiswc.2017.8167780

Cite

Citations (16)

Moving the Needle on Rigorous Floating-Point Precision Tuning

Kalpa publications in computing (2018)

Marek Baranowski Ian Briggs Wei-Fan Chiang Ganesh Gopalakrishnan Zvonimir Rakamarić

Virtually all real-valued computations are carried out using floating-point data types and operations. With increasing emphasis on overall computational efficiency, compilers are increasingly attempting to optimize floating-point expressions. Practical reasoning about the correctness of these optimizations requires error analysis procedures that are rigorous (ideally, they can generate proof certificates), can handle a wide variety of operators (e.g., transcendentals), and handle all normal programmatic constructs (e.g., conditionals and loops). Unfortunately, none of today’s approaches can achieve this combination. This position paper summarizes recent progress achieved in the community on this topic. It then showcases the component techniques present within our own rigorous floating-point precision tuning framework called FPTuner—essentially offering a collection of “grab and go” tools that others can benefit from. Finally, we present FPTuner’s limitations and describe how we can exploit contemporaneous research to improve it.

Component (thermodynamics)

10.29007/f4f3

Cite

Citations (2)

FPDetect: Efficient Reasoning About Stencil Programs Using Selective Direct Evaluation

arXiv (Cornell University) (2020)

Arnab Das Sriram Krishnamoorthy Ian Briggs Ganesh Gopalakrishnan Ramakrishna Tipireddy

We present FPDetect, a low overhead approach for detecting logical errors and soft errors affecting stencil computations without generating false positives. We develop an offline analysis that tightly estimates the number of floating-point bits preserved across stencil applications. This estimate rigorously bounds the values expected in the data space of the computation. Violations of this bound can be attributed with certainty to errors. FPDetect helps synthesize error detectors customized for user-specified levels of accuracy and coverage. FPDetect also enables overhead reduction techniques based on deploying these detectors coarsely in space and time. Experimental evaluations demonstrate the practicality of our approach.

Stencil

Limiting

10.48550/arxiv.2004.04359

Cite

Citations (1)

Faster Math Functions, Soundly

arXiv (Cornell University) (2021)

Ian Briggs Pavel Panchekha

Standard library implementations of functions like sin and exp optimize for accuracy, not speed, because they are intended for general-purpose use. But applications tolerate inaccuracy from cancellation, rounding error, and singularities-sometimes even very high error-and many application could tolerate error in function implementations as well. This raises an intriguing possibility: speeding up numerical code by tuning standard function implementations. This paper thus introduces OpTuner, an automatic method for selecting the best implementation of mathematical functions at each use site. OpTuner assembles dozens of implementations for the standard mathematical functions from across the speed-accuracy spectrum. OpTuner then uses error Taylor series and integer linear programming to compute optimal assignments of function implementation to use site and presents the user with a speed-accuracy Pareto curve they can use to speed up their code. In a case study on the POV-Ray ray tracer, OpTuner speeds up a critical computation, leading to a whole program speedup of 9% with no change in the program output (whereas human efforts result in slower code and lower-quality output). On a broader study of 37 standard benchmarks, OpTuner matches 216 implementations to 89 use sites and demonstrates speed-ups of 107% for negligible decreases in accuracy and of up to 438% for error-tolerant applications.

10.48550/arxiv.2107.05761

Cite

Citations (0)

Rigorous floating-point mixed-precision tuning

ACM SIGPLAN Notices (2017)

Wei‐Fan Chiang Mark Baranowski Ian Briggs Alexey Solovyev Ganesh Gopalakrishnan

Virtually all real-valued computations are carried out using floating-point data types and operations. The precision of these data types must be set with the goals of reducing the overall round-off error, but also emphasizing performance improvements. Often, a mixed-precision allocation achieves this optimum; unfortunately, there are no techniques available to compute such allocations and conservatively meet a given error target across all program inputs. In this work, we present a rigorous approach to precision allocation based on formal analysis via Symbolic Taylor Expansions, and error analysis based on interval functions. This approach is implemented in an automated tool called FPTuner that generates and solves a quadratically constrained quadratic program to obtain a precision-annotated version of the given expression. FPTuner automatically introduces all the requisite precision up and down casting operations. It also allows users to flexibly control precision allocation using constraints to cap the number of high precision operators as well as group operators to allocate the same precision to facilitate vectorization. We evaluate FPTuner by tuning several benchmarks and measuring the proportion of lower precision operators allocated as we increase the error threshold. We also measure the reduction in energy consumption resulting from executing mixed-precision tuned code on a real hardware platform. We observe significant energy savings in response to mixed-precision tuning, but also observe situations where unexpected compiler behaviors thwart intended optimizations.

Double-precision floating-point format

Single-precision floating-point format

Code (set theory)

Vectorization (mathematics)

Quadratic growth

Accuracy and precision

10.1145/3093333.3009846

Cite

Citations (27)