UNIVERSITY PARK, Pa. — Destructive wildfires in California cause the loss of homes, businesses and human life each year. A Penn State statistics professor is working to help mitigate the effects of these natural disasters by applying complex statistical models to better understand the likeliness that extreme forest fires will develop.
Assistant Professor of Statistics Benjamin Shaby, a Penn State Institute for CyberScience co-hire, has been working with California Department of Forestry and Fire Protection (Cal Fire) since 2014. The collaboration began in 2009, shortly after strong winds knocked over telephone poles, leading to several destructive wildfires. Cal Fire needed to know where utility companies should install stronger poles that wouldn’t blow down as easily, and brought in a team of fire experts to pinpoint the locations.
“That’s where I came in,” said Shaby. “I am not a fire risk expert; I am a statistician. What I did was look at historical weather data to try to determine where the weather would be extreme and where fire risk was most likely, geographically speaking.”
To begin to understand and interpret all of the information surrounding fire risk, Shaby focused on risk due to weather, critical factors in wind speed and wind direction, dryness, relative humidity, precipitation and temperature. His primary data source was output from a high-resolution numerical weather model. He wanted to understand the weather variables over the entire state of California, so they compiled the data from specific points, obtained from wind gauges, thermometers and observation locations.
“The idea was to use the numerical weather model to assimilate all the observations from these point locations to get a fuller picture of the weather,” Shaby said. “This was done by replicating the weather from over an entire decade.”
Dave Sapsis, research program specialist at Cal Fire, said that Shaby presented novel inputs into assessing rare, significant fire and wind events that are crucial for understanding fire risks.
The nitty gritty
Rare events such as these large, destructive fires are hard to study, difficult to model, and behave in fundamentally different ways than normal events.
Shaby's work is very computationally intensive and cannot run on an average computer, so Shaby uses the Institute for CyberScience Advanced CyberInfrastructure (ICS-ACI), Penn State’s high-performance research cloud. Shaby and his students write a code, determine the code’s time complexity, debug it and then fit it to their data. On the ICS-ACI supercomputer, it takes a day or two to fit this model on a single data set, whereas a regular computer could take up to a month.