News & Analysis
Comment
docdivakar
I agree with the point made in the article: benchmarking needs to address both ...
dberninger
The issue is broader than just supercomputers. The lack of an industry ...
Sandia upgrades supercomputer benchmarks
R Colin Johnson
11/18/2010 7:38 PM EST
PORTLAND, Ore.—Exascale supercomputers running a thousand times faster than today's petaflop machines will require newer performance measures, according to Sandia National Laboratories, which announced a 30 member committee effort to define a new standard with Intel, IBM, AMD, NVIDIA, and Oracle. Called Graph 500, a preliminary specification has already been made available to supercomputer makers for testing.
Graph 500 differs from the traditional Linpack by testing a supercomputer's skill at using graph theory to analyze the output streams from simulations in biological, security, social and similar large-scale problems. Graph 500 not only measures the traditional number crunching ability of supercomputers, but also their ability to shuttle around the very-large data sets represented by future supercomputers like those being addressed by the U.S. Department of Energy's exascale supercomputer initiative.
"Today's Linpack benchmark focuses on compute-intensive work, but we are looking at data intensive things," said Sandia researcher Richard Murphy.
The major problem today with very-large data sets is automating their analysis, which can sometimes take months longer than the actual collection and processing of the data. The theory is that the analysis of very-large simulations requires relatively simple calculations performed on select elements found in sparsely populated arrays representing huge numbers of participants. A three-tiered architecture is being proposed to partition the benchmarking problem into parallel executing graphs structures—one kernel handles linking sets of related members while a second performs parallel searches for the most relevant results from these graphs. And a third kernel is in the works.

"The goal is to have some influence over how industry puts together supercomputers," said Murphy. "The shift to multicore processor architectures really is stressing the existing programming model. In ten years we will have a very different way of thinking about how the machine works," he said.
Typical problems amenable to analysis by graph theoretic means include cybersecurity, medical informatics, data enrichment, social- and symbolic-networks. For instance, cybersecurity routinely requires full scans of data sets with as many as 15 billion log entries per day. Medical informatics routinely analyses up to 50 million patient records requiring resulting in billions of items. Data enrichment applications, such as maritime domain awareness, can include hundreds of millions of individual transponders on tens of thousands of ships and tens of millions of individual cargo containers. Social and symbolic networks include petabyte sized data sets—for instance, the human cortex has 25 billion neurons each with approximately 7,000 connections.
Graph 500 differs from the traditional Linpack by testing a supercomputer's skill at using graph theory to analyze the output streams from simulations in biological, security, social and similar large-scale problems. Graph 500 not only measures the traditional number crunching ability of supercomputers, but also their ability to shuttle around the very-large data sets represented by future supercomputers like those being addressed by the U.S. Department of Energy's exascale supercomputer initiative.
"Today's Linpack benchmark focuses on compute-intensive work, but we are looking at data intensive things," said Sandia researcher Richard Murphy.
The major problem today with very-large data sets is automating their analysis, which can sometimes take months longer than the actual collection and processing of the data. The theory is that the analysis of very-large simulations requires relatively simple calculations performed on select elements found in sparsely populated arrays representing huge numbers of participants. A three-tiered architecture is being proposed to partition the benchmarking problem into parallel executing graphs structures—one kernel handles linking sets of related members while a second performs parallel searches for the most relevant results from these graphs. And a third kernel is in the works.

Synthetic graph was generated by a method called Kronecker multiplication. Larger versions of this generator, modeling real-world graphs, are used in the Graph500 benchmark. (Courtesy of Jeremiah Willcock, Indiana University)
"The goal is to have some influence over how industry puts together supercomputers," said Murphy. "The shift to multicore processor architectures really is stressing the existing programming model. In ten years we will have a very different way of thinking about how the machine works," he said.
Typical problems amenable to analysis by graph theoretic means include cybersecurity, medical informatics, data enrichment, social- and symbolic-networks. For instance, cybersecurity routinely requires full scans of data sets with as many as 15 billion log entries per day. Medical informatics routinely analyses up to 50 million patient records requiring resulting in billions of items. Data enrichment applications, such as maritime domain awareness, can include hundreds of millions of individual transponders on tens of thousands of ships and tens of millions of individual cargo containers. Social and symbolic networks include petabyte sized data sets—for instance, the human cortex has 25 billion neurons each with approximately 7,000 connections.
Navigate to related information


Dave.Dykstra
11/19/2010 2:19 PM EST
Very interesting. This is a much needed change to the way things are done. Of course, there will be detractors but that is always the case with anything intended to provide benchmarking because it either will not measure what someone wants measured, or it will do it in an way that someone feels is biased. In this case, I am sure that the "influence over how industry puts together supercomputers" will result in some bias towards certain design methodologies.
Sign in to Reply
dberninger
11/21/2010 6:45 AM EST
The issue is broader than just supercomputers. The lack of an industry consensus compute unit prevents the cloud infrastructure as a service category from obtaining traction. Amazon created an abstraction it calls the ECU and a check of 20 other providers will reflect 18 other metrics. The topic is addressed at http://cloudpricecalculator.com/blog
Sign in to Reply
docdivakar
11/22/2010 3:53 PM EST
I agree with the point made in the article: benchmarking needs to address both numerical computation as well as managing and moving large sets of data. This point was made well during the ICCAD held in the Silicon Valley about two weeks ago.
Dr. Mark Shephard (RPI) who spoke at ParCAD (after keynote by Dr. Arvind of MIT) spoke of his experiences with the Big Blue machine for Adaptive Simulations on Massively Parallel Computers. In an upcoming simulation that could reach solving a billion equations (yes, not a typo!!), he was mentioning that while Big Blue may crunch this out in a few hours, it may take a few days to write the data from simulations!. To that end, he though the use of Linpack as a bench mark may not be truly representative of the cluster. Graph 500 may be the answer to truly assess the capabilities of a super computer.
Dr. MP Divakar
Sign in to Reply