An international team of scientists working on the Planck observatory has made the closest reading yet of the most ancient story in our universe: the cosmic microwave background (CMB).
Using supercomputers at the U.S. Department of Energy’s (DOE) National Energy Research Scientific Computing Center (NERSC), Planck researchers have created the most detailed and accurate maps yet of the relic radiation from the big bang – revealing that the universe is about 100 million years older than previously thought, with more matter and less dark energy.
“These maps are proving to be a goldmine containing stunning confirmations and new puzzles,” explained Martin White, a Planck scientist and physicist at the University of California, Berkeley, and Lawrence Berkeley National Laboratory (Berkeley Lab).
“This data will form the cornerstone of our cosmological model for decades to come and spur new directions in research.”
Written in light shortly after the big bang, the CMB is a faint glow that permeates the cosmos. Studying it can help us understand how our universe was born, its nature, composition and eventual fate.
“Encoded in its fluctuations are the parameters of all cosmology, numbers that describe the universe in its entirety,” said Julian Borrill, a Planck collaborator and cosmologist in the Computational Research Division at Berkeley Lab.
“However, CMB surveys are complex and subtle undertakings. Even with the most sophisticated detectors, scientists still need supercomputing to sift the CMB’s faint signal out of a noisy universe and decode its meaning.”
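For a loose sense of what that decoding involves, the minimal sketch below uses the open-source healpy library (not the Planck pipeline itself, and with a purely placeholder input spectrum) to simulate a toy sky map and then measure its angular power spectrum, the summary statistic from which cosmological parameters are ultimately inferred:

```python
# Illustrative sketch only: simulate a toy CMB temperature map and measure
# its angular power spectrum with healpy. The real Planck analysis also has
# to handle instrument noise, beams, masks and foreground contamination.
import numpy as np
import healpy as hp

nside = 256                        # HEALPix map resolution
lmax = 3 * nside - 1               # highest multipole retained

# Placeholder input spectrum (a real analysis would start from a
# theoretical Lambda-CDM prediction, not a flat line).
cl_in = np.full(lmax + 1, 1e-5)

cmb_map = hp.synfast(cl_in, nside, lmax=lmax)   # draw one Gaussian sky realization
cl_out = hp.anafast(cmb_map, lmax=lmax)         # measure its power spectrum

print("recovered C_ell at the first few multipoles:", cl_out[2:7])
```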
As such, hundreds of scientists from around the world study the CMB using supercomputers at NERSC, a DOE user facility based at Berkeley Lab. “NERSC supports the entire international Planck effort,” says Borrill.
A co-founder of the Computational Cosmology Center (C3) at the lab, Borrill has been developing supercomputing tools for CMB experiments for over a decade. The Planck observatory, a mission of the European Space Agency with significant participation from NASA, is the most challenging of these experiments yet.
Parked in an artificial orbit about 800,000 miles away from Earth, Planck’s 72 detectors complete a full scan of the sky once every six months or so. Observing at nine different frequencies, Planck gathers about 10,000 samples every second, or a trillion samples in total for the 15 months of data included in this first release. In fact, Planck generates so much data that, unlike earlier CMB experiments, it’s impossible to analyze exactly, even with NERSC’s powerful supercomputers.
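A back-of-the-envelope calculation, assuming the sample rate and observing span quoted above, gives a sense of why an exact analysis is out of reach:

```python
# Rough scale of the Planck time-ordered data, from the figures quoted above.
samples_per_second = 10_000
seconds_of_observing = 15 * 30 * 24 * 3600        # roughly 15 months
total_samples = samples_per_second * seconds_of_observing
print(f"~{total_samples:.1e} time-ordered samples")   # a few times 10^11, of the same
                                                      # order as the trillion quoted above

# Even one 4-byte value per sample is over a terabyte of raw data, and an
# exact analysis must also track correlations between samples, which is what
# pushes the problem beyond any supercomputer's reach.
bytes_raw = total_samples * 4
print(f"~{bytes_raw / 1e12:.1f} TB of raw samples")
```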
Instead, CMB scientists are forced to employ workarounds. Using approximate methods, they can handle the Planck data volume, but they then need to understand the uncertainties and biases those approximations leave in the results.
One particularly challenging source of bias comes from the instrument itself. The position and orientation of the observatory in its orbit, the particular shapes and sizes of the detectors (which vary) and even the overlap in Planck’s scanning pattern all affect the data.
To account for such biases and uncertainties, researchers generate a thousand synthetic (or simulated) copies of the Planck data and apply the same analysis to these. Measuring how the approximations affect this simulated data allows the Planck team to account for their impact on the real data.
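In generic terms (and with made-up numbers rather than anything Planck-specific), the Monte Carlo strategy looks like this: simulate many data sets with a known input, run the same approximate analysis on each, and use the offset and scatter of the recovered values to correct the real measurement and set its error bars.

```python
import numpy as np

rng = np.random.default_rng(0)

def approximate_pipeline(data):
    """Stand-in for an approximate analysis step: a noisy average with a
    small deliberate bias, mimicking the effect of an imperfect approximation."""
    return data.mean() * 0.98           # hypothetical 2% multiplicative bias

true_value = 1.0                        # known input used for the simulations
n_sims = 1000                           # the article quotes ~1,000 synthetic copies

# Run the identical analysis on every synthetic data set...
estimates = np.array([
    approximate_pipeline(true_value + 0.1 * rng.standard_normal(10_000))
    for _ in range(n_sims)
])

bias = estimates.mean() - true_value    # systematic shift caused by the approximation
scatter = estimates.std()               # statistical uncertainty of the method

# ...then debias the real measurement and quote the Monte Carlo error bar.
real_measurement = 0.97                 # hypothetical value from the real data
corrected = real_measurement - bias
print(f"bias {bias:+.4f}, scatter {scatter:.4f}, corrected value {corrected:.4f}")
```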
With each generation of NERSC supercomputers, the Planck team has adapted its software to run on more and more processors, pushing the limits of successive systems while reducing the time it takes to run a greater number of complex calculations.
“By scaling up to tens of thousands of processors, we’ve reduced the time it takes to run these calculations from an impossible 1,000 years down to a few weeks,” says Ted Kisner, a C3 member at Berkeley Lab and Planck scientist. To be sure, the team’s codes are so demanding that they’re often called on to push the limits of new NERSC systems.
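The parallelization pattern itself is conceptually simple, even if the real codes are not: divide the detector-time data among many processes and combine their partial results. A minimal mpi4py illustration (not the actual Planck software) might look like this:

```python
# Minimal sketch of the scaling pattern: each MPI rank owns one slice of the
# time-ordered data, processes it independently, and the results are combined.
# Run with, e.g.:  mpirun -n 4 python scaling_sketch.py
from mpi4py import MPI
import numpy as np

comm = MPI.COMM_WORLD
rank, size = comm.Get_rank(), comm.Get_size()

total_samples = 1_000_000                 # stand-in for the ~10^11 real samples
chunk = total_samples // size             # slice of the data owned by this rank

# Each rank "analyzes" its own slice; here it just sums simulated noise.
local_result = np.random.default_rng(rank).standard_normal(chunk).sum()

# Combine the partial results from all ranks.
global_result = comm.allreduce(local_result, op=MPI.SUM)

if rank == 0:
    print(f"{size} ranks, {chunk} samples each, combined result {global_result:.2f}")
```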
Access to the NERSC Global Filesystem and vast online and offline storage has also been key.
“CMB data volumes have grown with Moore’s Law over the last 15 years, so we expect roughly a two-order-of-magnitude increase in data over the coming 15 years, too,” Borrill added.
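Assuming a doubling time of roughly two years, the arithmetic behind that projection is straightforward:

```python
# Rough arithmetic behind the projection, assuming data volumes double
# about every two years (one common reading of Moore's Law).
doubling_time_years = 2.0
years_ahead = 15
growth_factor = 2 ** (years_ahead / doubling_time_years)
print(f"growth over {years_ahead} years: ~{growth_factor:.0f}x")  # ~180x, i.e. roughly
                                                                  # two orders of magnitude
```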