Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Prediction of air quality in Sydney, Australia as a function of forest fire load and weather using Bayesian statistics

  • Michael Anthony Storey ,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Writing – original draft

    mstorey@uow.edu.au

    Affiliations NSW Bushfire Risk Management Research Hub, Wollongong, NSW, Australia, Department of Earth, Atmospheric and Life Sciences, University of Wollongong, Wollongong, NSW, Australia

  • Owen F. Price

    Roles Conceptualization, Data curation, Funding acquisition, Methodology, Project administration, Supervision, Writing – review & editing

    Affiliations NSW Bushfire Risk Management Research Hub, Wollongong, NSW, Australia, Department of Earth, Atmospheric and Life Sciences, University of Wollongong, Wollongong, NSW, Australia

Abstract

Smoke from Hazard Reduction Burns (HRBs) and wildfires contains pollutants that are harmful to human health. This includes particulate matter less than 2.5 μm in diameter (PM2.5), which affects human cardiovascular and respiratory systems and can lead to increased hospitalisations and premature deaths. Better models are needed to predict PM2.5 levels associated with HRBs so that agencies can properly assess smoke pollution risk and balance smoke risk with the wildfire mitigation benefits of HRBs. Given this need, our aim was to develop a probabilistic model of daily PM2.5 using Bayesian regression. We focused on the region around Sydney, Australia, which regularly has hazard reduction burning, wildfires and associated smoke. We developed two regional models (mean daily and maximum daily) from observed PM2.5, weather reanalysis and satellite fire hotspot data. The models predict that the worst PM2.5 in Sydney occurs when PM2.5 was high the previous day, there is low ventilation index (i.e. the product of wind speed and planetary boundary layer height), low temperature, west to northwest winds in the Blue Mountains, an afternoon sea breeze and large areas of HRBs are being conducted, particularly to the west and north of Sydney. A major benefit of our approach is that models are fast to run, require simple inputs and Bayesian predictions convey both predicted PM2.5 and associated prediction uncertainty. Future research could include the application of similar methods to other regions, collecting more data to improve model precision and developing Bayesian PM2.5 models for wildfires.

Introduction

Airborne particulate matter less than 2.5 μm in diameter (PM2.5) is a pollutant small enough to cause cardiovascular and respiratory problems in humans, which can be estimated regionally through increased hospitalisations and premature deaths [1, 2]. PM2.5 levels can spike due to smoke from forest fires, both wildfires and hazard reduction burns (HRB), which has been shown to impose substantial costs on healthcare systems [3, 4]. Tools to estimate PM2.5 associated with forest fires have a critical role in fire planning, response and issuance of health advice.

HRBs are carried out as a wildfire mitigation measure. The aim is to reduce fuels in order to reduce ignition risk, slow future wildfire spread, reduce wildfire intensity and ultimately protect communities. However, a considerable side-effect of HRBs is that smoke pollution can affect large populated areas. For example, in Sydney, Australia, smoke from HRBs can flow into and sit over large areas of the city and suburbs. In one example, during May 2016 HRBs caused several very smoky days over Sydney, where mean daily PM2.5 of all monitors was 24 μgm-3 and there were an estimated 14 premature deaths and 29 and 50 cardiovascular and respiratory hospitalisations respectively [5]. Given the potential impacts, HRB programs must consider where smoke goes, the level of community exposure to pollutants and ways to reduce smoke risk, and weigh the smoke risks with the wildfire risk reduction benefits of HRBs.

Weather influences the timing, ignition and fire behaviour at an HRB. HRBs need to be scheduled during a limited range of weather conditions that allows fire to spread at a rate and intensity that is containable. Weather also influences smoke production by controlling fire intensity and fuel consumption rates [6, 7], with fuel consumption rates increasing as intensity increases [8]. Weather also determines smoke dispersal patterns: e.g. wind speeds, directions, boundary layer heights and broader synoptic patterns influence vertical and lateral smoke movement [9, 10]. Scheduling an HRB can be a complex decision for a fire agency, given they must consider how weather influences the success of an HRB and smoke dispersal, in addition to other factors such as fuel conditions, crew availability and risk to nearby houses. Tools to better assess the likelihood of negative smoke effects from HRBs under different weather conditions would substantially improve the ability of fire agencies to better understand pollution risks and safely complete HRB programs.

In different parts of the world, fire agencies currently use physics-based smoke dispersion models to predict community exposure to smoke, including PM2.5. An advanced example is the BlueSky system in the USA which links fire, emissions and dispersion models to predict regional smoke pollutant concentrations [11]. In NSW, CCAM-TAPM is used to model smoke dispersion and pollutant concentrations from HRBs. It is a combination of an advanced dispersion model TAPM (The Air Pollution Model, [12]) and regional climate model CCAM (Conformal Cubic and Atmospheric Model, [13]). Evaluations of such systems are rare in the published literature, as are studies of PM2.5 related to HRBs in general [14]. However, such models have been found to have a limited ability to predict PM2.5 levels from forest fires [15, 16]. Physics-based models also tend to require a large amount of computing power, can take a long time to run predictions and require specialists to run the models and interpret the outputs, all of which limit ease-of-use operationally.

An alternative to physics-based models are empirical models, which by comparison can be simple to run and have short computation times. Such models produce predictions of a single variable of interest based on observed data. These can be developed at different scales. However, producing highly accurate empirically based predictions from individual fires may not be realistic due to difficulties in collecting enough smoke-pollution observations and relating them to a particular fire: e.g. sparse monitoring networks (mostly clustered in cities) making detection from any particular fire unlikely and complex weather circulation patterns between a fire and a monitor. Some authors have produced empirical models of PM2.5 based on observations of monitors placed near HRBs, but these have been limited to within several kilometres of HRBs [17, 18]. Modelling at the individual-fire scale is complex, but predicting at a coarser regional-scale is likely to be more feasible and produce models applicable to the worst pollution days: i.e. when smoke is affecting a large area rather than just an individual station.

Here, we focused on developing a probabilistic model of daily PM2.5 across the Sydney area using Bayesian regression. The aim was to produce a model that provides a general, fast-to-run and operationally useful assessment of daily PM2.5 levels in the Sydney basin (Australia) as a function of regional fire and weather variables. The model is aimed as an operational tool that can complement existing air quality forecast tools to support decisions around the scheduling of HRBs near Sydney. For example, the model could help identify conditions where conducting HRBs requires further consideration of smoke effects before proceeding, which may be assessed in light of predictions from existing models (e.g. physics-based models). We used Bayesian modelling to produce probabilistic predictions that provide highly informative model outputs and provide a clear indication of prediction uncertainty.

Methods

Study area and fire data

We focused on the area around Sydney in New South Wales, Australia. Sydney has > 4 million residents and is situated in a low-lying basin between the coast and large tracts of eucalypt forest, including the Greater Blue Mountains World Heritage Area [19]. A combination of conducive geographic and weather characteristics make Sydney subject to long-lasting pollution events [20], with the worst events usually stemming from forest fires [21]. The forests around Sydney are wildfire-prone: many wildfires have burnt thousands of hectares and destroyed hundreds of homes, such as in the years 2002–03, 2013 and 2019–20 [22, 23].

We analysed fire activity within a 150 km buffer of the Chullora (a suburb of Sydney) air quality monitoring station (AQS). Chullora AQS is centrally located among five AQS in the area with longer-term PM2.5 records matching our study period (see below) and 150 km captures most of the Blue Mountains World Heritage Area (Fig 1). Our study period was 2012 to 2021 because this was the period for which VIIRS SNPP (Visible and Infrared Imager/Radiometer Suite, Suomi) satellite hotspots, our measure of fire activity, was available. As our interest was in smoke related to HRBs, our analysis was limited to March to September when most HRBs are usually conducted.

thumbnail
Fig 1. Study area map showing air quality monitoring stations used in analysis and example of VIIRS SNPP hotspots for one day and night between 2019-04-28 midday and 2019-04-29 midday.

150 km buffer around Chullora is black circle. West, South and North region were used for calculating separate fire area variables. Basemap: © OpenStreetMap contributors.

https://doi.org/10.1371/journal.pone.0272774.g001

For all March-September days from 2012 to 2021, we first identified days with active fires in the study area using VIIRS SNPP hotspots [24]. Active fire days were those with at least one cluster of three or more hotspots, with hotspots grouped into a cluster when within 5 km of each other (S2 Appendix). We used hotspot clusters to identify active days so as to exclude days where only one or two isolated hotspots were recorded, which may have been from very small non-HRBs such as wood heaps burning on farmland. We used VIIRS instead of MODIS hotspots because VIIRS is more fire-sensitive and has higher resolution (375 m vs. 1km at nadir), which allows more consistent detection of active fires [24, 25]. We did not use fire-agency fire-history data because it does not include a record of daily fire activity, only final burnt area, fire ignition date and fire containment date.

We generated fire area variables based on VIIRS hotspots as predictors for our statistical modelling. VIIRS captures hotspots during the day (~1 pm to ~3 pm) and night (~12 am to ~2 am). Often VIIRS captures hotspots once per location during the day and once at night, but two VIIRS swathes can overlap at the edges, meaning hotspots for one location are captured twice during the day and/or night. For each active fire day identified, we calculated the daily total hotspot (day + night) area inside the Chullora 150 km buffer which, due to potentially overlapping hotspots, was derived by conducting a GIS intersect of all hotspots for a 24-hour period with a 500 m x 500 m grid (25 ha) covering the study area. The total daily fire area was the total number of unique intersecting grid cells * 25 ha. Note that we counted night hotspots captured in the early morning (before 7 am) toward the prior day’s fire area. For statistical modelling predictors, we split fire area into sectors: fire area west (hotspots that were 270o ± 45o from Chullora), fire area north (360o ± 45o) and fire area south (180 o ± 45o) (Fig 1). We chose to include area using this method after we also explored simply using total fire area in the region, but found splitting fire area by direction produced better models. A limitation that should be noted is that hotspots may not be detected for fires burning under heavy cloud. This means very cloudy days with fires may have been wrongly determined as having no fire (thus not included in modelling), or fire in parts of the study area may have been missed.

PM2.5 and weather data

We downloaded all available PM2.5 data measured at air quality monitoring stations (AQS) from the NSW Department of Planning, Industry and Environment for the period 2012–2021, freely available at https://data.airquality.nsw.gov.au/docs/index.html. PM2.5 data is available before 2012, but we did not use this because VIIRS hotspots are only available from 2012. From the data, we determined that there were five AQS within the Sydney area that had records from 2012 to 2021: Chullora, Liverpool, Richmond, Camden, and Earlwood (Fig 1). The AQS data contained hourly averages of PM2.5, among other measurements that were not used here including ozone, PM10 and weather recorded at the AQS.

We calculated mean daily PM2.5 for each of the five AQS. We used daily means because there are reportable standards that currently exist for daily mean PM2.5. The Australian National Air Quality Standards state that mean daily PM2.5 > 25 μgm-3 at a single AQS is considered a reportable exceedance [26]. For our analysis, a daily period was from midday to midday to align with the cycle of fire activity and smoke output: in general, afternoon HRB ignition and smoke effects from the afternoon until the next morning.

In addition to VIIRS-based estimates of fire area, we sampled several variables as predictors for the modelling (Table 1). Weather was from ERA5, which is a gridded (30 km) hourly atmospheric reanalysis product with numerous weather variables available for surface and atmospheric levels [27, 28]. We chose variables that are used in fire behaviour and air quality prediction: U and V wind components (from which wind speed and direction are derived), temperature, planetary boundary layer height, total cloud cover and mean sea level pressure over the Tasman Sea and western Sydney [29, 30]. We sampled ERA5 variables from an ERA5 pixel located over western Sydney (Penrith). For the wind U and V components, we sampled from two additional locations to capture potentially differing wind effects along the coast and inland: 1) an ERA5 pixel on the Sydney coast (centroid near Sydney International Airport) to capture sea breezes, and 2) an ERA5 pixel inland in the Blue Mountains over Katoomba. A negative U component of wind (“U-wind”) indicates wind from the east (+ from west) and a negative V component (“V-wind”) indicates wind from the north (+ from south). We calculated wind speed for modelling from the wind component variables sampled over western Sydney only. However, during initial analysis we found a strong correlation between wind speed and planetary boundary layer height, so instead we used the product of wind speed and planetary boundary layer height, known as ventilation index [31].

We calculated daily mean values for all variables used in modelling, except for the coastal U-wind and V-wind where we calculated mean values for the afternoon (2 pm to 6 pm local time) to capture afternoon sea breeze occurrence. We initially tested using mean values for sub-daily periods (afternoon, night and morning) for all weather variables but found that: a) there was generally a high correlation between the values for different periods, and b) using daily values produced similar performing models to those with the sub-daily predictors. Thus, we used daily means for all variables, except for the afternoon sea breeze variables (U and V coastal wind). We included a lagged PM2.5 term in the models to account for temporal autocorrelation, i.e. so that the model considered the influence of the previous day’s PM2.5 on the subsequent day’s PM2.5. Daily PM2.5 is likely to be temporally auto-correlated as pollution can build up over several days over Sydney before it clears away [32].

Modelling approach

There were a total of 597 rows for active fire days between March and September in the years 2012–2021. 80% (477 days) were used for model training and 20% (120 days) for model testing. We derived one model of the mean of the daily (i.e. midday to midday) mean PM2.5 values from the five AQS (“mean model”), which indicates raised PM2.5 across the whole of the Sydney area (Fig 1). We derived a second model for the maximum from the five AQS daily mean PM2.5 values (“maximum model”), which indicates a high daily PM2.5 at any one of the five AQS. Note for model one, the mean of all AQS could be skewed by an extremely high value at one or two AQS, thus a high Sydney-wide mean does not necessarily affect all AQS. However, this approach would still provide a useful indication of more widespread pollution.

Our approach to fitting the models had two stages that involved fitting Generalised Additive Models (GAMs) followed by fitting Bayesian models. A GAM is a form of regression model that uses smoothing splines instead of simple linear effects [33]. Smoothing splines allow non-linear effects to be incorporated into a model, which has been found to be a good approach for air pollution research due to common complex non-linearity in the data [34, 35]. Bayesian models produce predictions in the form of a distribution of plausible values for a given set of predictor conditions based on the model training data, rather than a point estimate as in deterministic models. This provides flexibility in presenting and interpreting the predictions, as the predictive distribution can be summarised in a range of ways depending on user need (e.g. median, “credible intervals”, “highest posterior density interval”, chance that outcome will be > x) [36]. Our end goal was to fit Bayesian models using the “brms” package in R with smoothing splines for the predictors [37].

We initially tried running a model selection process in the “brms” package but found that fitting all models with smooths for all predictors was too time-consuming to be practical. Instead, we used the “mgcv” package in R [38] to fit GAMs (generalised additive models; Gamma family with log link) for a model selection process. We then fitted the best GAMs as Bayesian models using the “brms” package, i.e. the model formula from the best GAM was used to fit a Bayesian model via “brms”. The “brms” package uses the smooth functions from “mgcv” in model fitting.

For the GAM model selection process, we fitted all combinations of predictors as separate models and retained the Akaike Information Criterion (AIC) for each model [39]. The U-wind and V-wind variables were tested in this process as interaction terms, i.e. two-dimensional smooths with the “mgcv” syntax of “s(U coastal wind, V coastal wind)” and “s(U Katoomba wind, V Katoomba wind)”. We did this because both U and V components are required to understand wind speed and direction, including when making predictions.

We judged the best model to be that with the lowest AIC. To ensure there was no substantial drop in predictive accuracy from the model selection process, we compared the correlation-based coefficient of determination (i.e. R2) from test set predictions with the best model to R2 from test set predictions with the full model (all predictors included). Finally, we fitted the best model as a Bayesian model using the “brms” package in R. Bayesian models were fitted with four chains for 10000 iterations per chain, thinning of 10 and discarded warmup of 5000. We ensured model convergence using the Gelman-Rubin convergence diagnostic (all Rhat = 1) and inspection of MCMC plots.

In the results, we have reported on the GAM (deterministic) prediction accuracy of the best models and the models with all predictors included. We have provided plots of model effects and example probabilistic predictions from the Bayesian models.

Results

Descriptive analysis

Mean Sydney PM2.5 was situated mostly between 5 μgm-3 and 15 μgm-3 (mean 10 μgm-3), with a maximum of 47 μgm-3 and 90th percentile of 16 μgm-3 (Fig 2). Only 3% of days were > 25 μgm-3. Maximum Sydney PM2.5 was clustered between 5 μgm-3 and 25 μgm-3, with a maximum of 142 μgm-3 and 90th percentile of 24 μgm-3. There were 8% of days > 25 μgm-3 and 12 days > 50 μgm-3.

thumbnail
Fig 2. Histograms of daily mean PM2.5 of all five AQS and maximum daily PM2.5 from all five AQS (i.e. value from AQS with highest daily mean PM2.5).

“Daily” here means midday to midday.

https://doi.org/10.1371/journal.pone.0272774.g002

In initial analysis of individual predictors against mean Sydney PM2.5, some predictors appeared to have strong effects (Fig 3). Maximum Sydney PM2.5 had similar relationships with individual predictions, so we focus only on mean Sydney PM2.5 in this section. High mean PM2.5 only occurred when ventilation index was low. When daily temperature was > ~17 C, mean PM2.5 was generally low. Higher mean PM2.5 mostly occurred when coastal afternoon U-wind was negative, i.e. when an afternoon sea breeze was occurring. A positive U-wind at Katoomba indicates more westerly winds flowing towards Sydney. High mean PM2.5 occurred mostly when U-wind at Katoomba was positive, indicating winds were more from the west than from the east during high PM2.5 days. Lagged PM2.5 had a strong association with mean PM2.5. For the three fire area variables, there were positive associations with PM2.5 for all in the 0 to 3000 ha range, where most data was situated, and then differing effects at larger areas, although only a few observations were > 3000 ha. Although there were generally positive associations, there were still many days with low fire area that had high PM2.5.

thumbnail
Fig 3. Scatter plots of mean Sydney PM2.5 vs predictor variables, with single variable GAM smooths fit via the “mgcv” package in R.

https://doi.org/10.1371/journal.pone.0272774.g003

Model selection and predictor effects

For both mean and maximum Sydney PM2.5, the best models from the GAM-based selection contained all predictors except the MSLP variables and total cloud cover. The selected models were superior to the full models in terms of AIC and their predictive power was only marginally different (Table 2).

thumbnail
Table 2. GAM accuracy results for models with all variables and best models from the model selection process.

https://doi.org/10.1371/journal.pone.0272774.t002

When fit as Bayesian models, the predictors for both the mean and maximum models had generally similar shapes and directions of effects. Both are discussed here but plots for the mean PM2.5 model are included in the main text and plots for the maximum PM2.5 model are in S1A Fig in S1 Appendix. Lag mean PM2.5 had a strong positive effect between 0 μgm-3 and 20 μgm-3 in both models. Above a lag of 20 μgm-3, the effect became more uncertain as indicated by wider bounds (Fig 4A). In the mean model the effect flattened (Fig 4A) and in the maximum model the effect was weaker but still positive (S1Aa Fig in S1 Appendix). In both models, fire area variables had strong positive effects, although to the south and north > ~1500 ha there was no effect or a negative effect with low confidence (wide bounds) (Fig 4D–4F, S1Ad-S1Af Fig in S1 Appendix), which reflected the low number of days with such large fire areas. In both models, the afternoon coastal wind variables (U and V), when transformed to wind direction and speed, indicated that winds from the east (i.e. a sea breeze) increased PM2.5 and the effect increased with wind speed (Fig 4G, S1Ag Fig in S1 Appendix). The daily inland (Katoomba) wind variables indicated that the highest PM2.5 levels were associated with north-westerly winds, with the effect increasing with wind speed in both models (Fig 4H, S1Ah Fig in S1 Appendix). Easterly winds inland meant lower PM2.5, which decreased as wind speed increased. While the highest PM2.5 was associated with the lowest daily mean temperature for both models, temperature had a curved effect (Fig 4C, S1Ac Fig in S1 Appendix). The lowest PM2.5 was at ~19 C, with increases in PM2.5 associated with both lower temperatures (stronger effect) and higher temperatures. In both models, a low ventilation index over western Sydney was associated with high PM2.5, particularly < ~2000 m2 s−1. Above ~2000 m2 s−1, the effect levelled out and had very wide credible interval bounds (Fig 4B, S1Ab Fig in S1 Appendix), indicating a large deal of uncertainty due to a low number of observations at this level in the model data (Fig 3).

thumbnail
Fig 4. Bayesian mean effects plots for each variable for mean daily Sydney PM2.5 model.

The grey bands represent, from darkest to lightest grey, 0.5, 0.8 and 0.95 credible intervals. Effect of each fire area variable is shown with other two fire area variables held at zero. Other variables held at mean values. U and V wind components have been converted to wind speed and direction effects (bands show 0.5 credible interval). Wind directions are standard wind direction in degrees, i.e. 180 = southerly wind.

https://doi.org/10.1371/journal.pone.0272774.g004

Model predictions

Plots that demonstrate practical applications of the models are shown below. Predictions are shown as a) plots of the entire predictive distribution when levels of a single variable are varied in Fig 5, and b) a gridded or tiled output where the levels of multiple variables are varied, but predictive distributions are summarised based on the chance of PM2.5 exceeding a threshold, i.e. percent of predictive distribution greater than a threshold (Fig 6, S1B Fig in S1 Appendix).

thumbnail
Fig 5. Predictive distributions from the mean and maximum daily Sydney PM2.5 models.

Predictions for three levels of fire area west, with other variables at set levels: Lag PM2.5 = 10 μgm-3, ventilation index = 500 m2 s−1, temperature = 10 C, fire areas south and north = 100 ha, coast wind speed and direction = 10 km h-1 and easterly (sea breeze), inland wind speed and direction = 10 km h-1 and westerly. Dotted and dashed vertical lines are thresholds defined at 15 μgm-3 and 25 μgm-3 respectively, with percent of distribution > thresholds indicated in text within the plots.

https://doi.org/10.1371/journal.pone.0272774.g005

thumbnail
Fig 6. Tile plot of predictions for mean Sydney PM2.5 model (top) and maximum Sydney PM2.5 model (bottom).

Each coloured grid square shows the percent chance, under different predictor conditions, of exceedance above 25 μgm-3 (darker = higher chance, also in red text), i.e. percent of predictive distribution > 25 μgm-3. Other variables held at: daily temperature 14 C, fire areas north and south = 0 ha, inland (Katoomba) wind speed and direction = 15 km h-1 and westerly. Wind speeds and directions were calculated from the relevant U and V wind variables.

https://doi.org/10.1371/journal.pone.0272774.g006

Fig 5 demonstrates that as fire area in the west increases, mean and maximum Sydney PM2.5 are predicted to increase. For both models, the width of the predictive distribution also becomes wider as fire area increases, which reflects increasing uncertainty associated with the predictions. The example shows that under the specified set of conditions (Fig 5), when area in the west increases from 100 ha to 4000 ha, the chance of a mean Sydney PM2.5 exceeding 25 μgm-3 grows from 2% to 76%, and the chance of mean PM2.5 exceeding 15 μgm-3 grows from 42% to 98%. There is generally a much greater chance that maximum Sydney PM2.5 will exceed the thresholds under the conditions in Fig 5: a 31% to 100% chance of exceeding 25 μgm-3, and a 78% to 100% chance of exceeding 15 μgm-3. This reflects that it is more likely for one AQS within Sydney to record an exceedance (i.e. the maximum PM2.5 model) than it is for the mean of all AQS to exceed the same threshold (i.e. the mean PM2.5 model). Note that such plots could be reproduced with any exceedance threshold specified or with any variables set at different levels.

Amongst the columns in Fig 6A (i.e. levels of fire area west), the predicted chance of mean Sydney PM2.5 > 25 μgm-3 is generally higher on the right, indicating that increasing fire area increases the chance of a threshold exceedance. Amongst the rows, mean PM2.5 is predicted to be higher when a sea breeze (easterly) is occurring (top row). Of all six panels in Fig 6A, the highest chance of exceeding 25 μgm-3 is in the top right, with 3000 ha fire area in the west and a sea breeze. In this panel, there is predicted to be > 50% chance of mean PM2.5 > 25 μgm-3 when lag PM2.5 is high and when ventilation index is low. In each of the panels, the highest chance of exceedance is also the bottom right, with high lag PM2.5 and low ventilation index. When fire area in the west is low and there is no sea breeze (Fig 6A, bottom left), there is almost no predicted chance of mean Sydney PM2.5 > 25 μgm-3. Note that mean PM2.5 > 25 μgm-3 is rare in the observed data (Fig 2), so a plot of the chance of mean PM2.5 exceeding 15 μgm-3 is included in S1 Appendix for comparison.

The chance of maximum Sydney PM2.5 exceeding 25 μgm-3 is generally much higher, as this would require only one AQS to have a daily mean exceeding 25 μgm-3. However, the pattern is similar in that the greatest chance of a maximum PM2.5 exceedance is in the top right panel of Fig 6B (fire area west = 3000 ha, afternoon sea breeze occurring), particularly when ventilation index is low and lag PM2.5 is high. Under these conditions there is ~90% chance of an exceedance. There are also significant chances of exceedances in other conditions, with the maximums always being when ventilation index is low and lag PM2.5 is high: e.g. ~70% chance at 1500 ha fire area in the west and a sea breeze (easterlies) (top centre, Fig 6B).

Discussion

We developed and demonstrated Bayesian probabilistic models of mean and maximum daily (midday to midday) PM2.5 across Sydney based on fire and weather variables. We expect the models to be useful in the context of scheduling hazard reduction burns in the Sydney area, particularly identifying if a day has a high chance of exceeding hazardous PM2.5 thresholds. This would indicate that an HRB would require further consideration before ignition or the HRB should be postponed. Our use of Bayesian modelling means that predictions can be presented in a range of formats depending on user need: e.g. plots of predictive distributions (Fig 5) or calculating the chance of a PM2.5 threshold exceedance (Fig 6). The models were developed with March to September data, so are suitable for the main HRB period but not for summer wildfire-related predictions.

The lag PM2.5 variable (prior 24 hr mean PM2.5 for both models) had a strong effect on PM2.5 levels in both models. This suggests that smoke can build up over multiple days around Sydney, although the effect flattened out in the mean PM2.5 model above lag PM2.5 of 20 μgm-3. Conducting multiple days in a row of large HRBs, either multiple HRBs or one HRB split into sections may therefore lead to higher PM2.5 levels in Sydney. Further research would be required to test the effect of multi-day burning on PM2.5 levels.

We found that location of the HRBs was important. Fire area in the west had a strong positive effect, whereas north and south fire area had positive effects in the 0–1500 ha range, above which there was a great deal of uncertainty due to a low number of observations. Differences here were expected because the best conditions for HRBs in the Sydney area are with light west to northwesterly winds, which would push smoke from HRBs in the west or north-west over Sydney. An HRB in the south under westerly or northwesterly winds would carry smoke over the ocean away from Sydney. However, there are cases of multi-day burns and wildfires in the south affecting Sydney’s air quality.

The models are location specific to the Sydney area, but some of the effects found are likely generalizable and have been described in other studies. We confirmed that the ventilation index, which has been used as an indicator of air quality potential around fires in the USA [31], is useful in the context of PM2.5 modelling. A low value of ventilation index describes conditions of low wind speed and low boundary layer, which have been related to higher pollution levels in other studies [40, 41]. Di Virgilio (2018) also found low PBLH was a consistently important predictor of PM2.5 in relation to HRBs in the Sydney area. However, Duc, Rahman [42] found weak correlations between PBLH and PM2.5 for a short period in their study, which indicates that other factors are important. We also found that a lower daily temperature was indicative of worse PM2.5, which was also found in a study of PM2.5 near several HRBs in the Sydney area [18].

Jiang, Scorgie [9] described that the interaction between synoptic scale weather and regional weather determines local air quality in Sydney. We did not find a strong link between PM2.5 and MSLP at Sydney and over the Tasman, whereas other authors have linked high-pressure east of Sydney/over the Tasman to higher pollution in Sydney [10, 30]. It may be that by directly using local weather in our modelling, we accounted for any influence of MSLP and MSLP is a surrogate for local weather. We found a sea breeze to be an important determinant of PM2.5. There is support for this because several studies have suggested sea breezes can elevate pollution levels over coastal cities, either by trapping or recirculating pollutants [10, 40, 43]. We also found that a daily trend of westerly to north-westerly winds in the Blue Mountains west of Sydney was associated with higher PM2.5. Higher PM2.5 can therefore be expected when westerlies inland are pushing smoke towards Sydney and a sea breeze traps or recirculates smoke. The effect is likely to depend on how much and how long an HRB burns through the night [18]. In future studies, the length of night burning may be possible to determine with high temporal resolution satellite data, such as Himawari 8 which captures every 10 minutes [44]. Sea breezes may not have the same impact at more intense fires where high smoke plumes carry smoke above the surface level. It should be noted that there is some association between our predictors involving wind speed in our data, mainly that ventilation index (sampled over western Sydney) is correlated with daily wind speeds over Katoomba (calculated from U and V, R = 0.71). This means that when ventilation index is low, wind speed at Katoomba is also generally low. Our models are therefore likely to predict poorly for unusual conditions, e.g. ventilation index low but very high wind speeds at Katoomba. We expect this would not be an issue for HRB-related prediction, given that HRBs are only conducted on lower wind speed days.

Our use of Bayesian modelling has operational-use advantages: short compute times (a few seconds), they only require easily available/simple forecast weather variables to run predictions (wind, temperature and boundary layer height for ventilation index, plus yesterday’s mean PM2.5), and there is flexibility in the way predictions can be presented and communicated. This is advantageous over physics-based simulations that require high-performance computers, take hours to run and require expert interpretation. The models presented here could be used alongside existing methods to create multiple lines of evidence for fire planners to use when scheduling HRBs.

We have demonstrated two alternatives for presenting predictions (Figs 5 and 6), but the best way to communicate predictions would need to be operationally assessed. The particular model that is most useful would also depend on user need, with the best approach probably to consider both the minimum and maximum Sydney PM2.5 models in decision-making, as well as predictions via other models. The mean Sydney PM2.5 model would give a better picture of potential pollution affecting all or most of the Sydney area, although large outliers can also result in a high mean. The maximum Sydney PM2.5 is for understanding where at least one of the AQS may have high pollution levels. For what constitutes concerning PM2.5 levels, some consideration is required. The national standard for an exceedance is for daily mean PM2.5 > 25 μgm-3 at a single AQS [26], which is a suitable threshold for presenting predictions for our maximum PM2.5 model (e.g. Fig 6B). However, the mean PM2.5 model is for the daily mean of multiple AQS, which is much more rarely > 25 μgm-3: 3% of mean Sydney PM2.5 was > 25 μgm-3, but 8% of maximum Sydney PM2.5 was > 25 μgm-3 (Fig 2). This simply means that a lower threshold may also be useful when summarising mean PM2.5 model predictive distributions (Fig 6) to identify potentially hazardous days for HRBs, but this would be up to a model-user to determine (see examples in S1B Fig in S1 Appendix).

Conclusion

We have demonstrated two models that could be used in operational settings to assist in HRB scheduling in the Sydney area, with the aim that burning could be avoided on days at high risk of hazardous PM2.5 pollution. The models provide probabilistic predictions of mean and maximum Sydney daily PM2.5 based on weather, fire area in different locations and yesterday’s PM2.5. The conditions most likely to produce hazardous PM2.5 pollution are with low ventilation index in Sydney, low temperature, an afternoon sea breeze and westerly to northwesterly winds in the Blue Mountains, and large areas being burnt, particularly to the west and north of Sydney. Higher PM2.5 the previous day also substantially increases predicted PM2.5. The models were produced using Bayesian regression, which makes predictive outputs highly informative, including a clear communication of uncertainty, and flexible depending on user need. The models are fast-to-run and require easily accessible inputs, and could be used in conjunction with existing methods to improve information and decision-making around HRB scheduling.

Data of fires and PM2.5 continues to be collected, thus future work could use new data to improve the precision of the models presented here. We focused on HRBs near Sydney, but a similar approach could be applied to develop a model applicable to wildfire prediction and applied to PM2.5 and fire activity modelling in other regions of Australia and the world.

Supporting information

S1 Appendix. Effects plots for maximum Sydney PM2.5 model (Fig A) and additional tile plot of predictions for mean and maximum Sydney PM2.5 models (Fig B).

https://doi.org/10.1371/journal.pone.0272774.s001

(PDF)

S2 Appendix. Method used to create VIIRS SNPP hotspot clusters.

https://doi.org/10.1371/journal.pone.0272774.s002

(PDF)

Acknowledgments

The results contain modified Copernicus Climate Change Service information 2021. Neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains.

Basemap in Fig 1 was created by OpenStreetMap contributors and made available under the Open Database License: https://www.openstreetmap.org/copyright

References

  1. 1. Matz CJ, Egyed M, Xi G, Racine J, Pavlovic R, Rittmaster R, et al. Health impact analysis of PM2.5 from wildfire smoke in Canada (2013–2015, 2017–2018). Science of The Total Environment. 2020;725:138506. pmid:32302851
  2. 2. Johnston FH, Henderson SB, Chen Y, Randerson JT, Marlier M, DeFries RS, et al. Estimated Global Mortality Attributable to Smoke from Landscape Fires. Environmental Health Perspectives. 2012;120(5):695–701. WOS:000303546000033. pmid:22456494
  3. 3. Borchers-Arriagada N, Bowman DMJS, Price O, Palmer AJ, Samson S, Clarke H, et al. Smoke health costs and the calculus for wildfires fuel management: a modelling study. The Lancet Planetary Health. 2021;5(9):e608–e19. pmid:34508682
  4. 4. Johnston FH, Borchers-Arriagada N, Morgan GG, Jalaludin B, Palmer AJ, Williamson GJ, et al. Unprecedented health costs of smoke-related PM2.5 from the 2019–20 Australian megafires. Nature Sustainability. 2021;4(1):42–7.
  5. 5. Broome RA, Johnstone FH, Horsley J, Morgan GG. A rapid assessment of the impact of hazard reduction burning around Sydney, May 2016. Medical Journal of Australia. 2016;205(9):407–8. WOS:000390847700011. pmid:27809737
  6. 6. Rothermel RC. How to predict the spread and intensity of forest and range fires. General Technical Report. USA: Intermountain Forest and Range Experiment Station., 1983 INT-143.
  7. 7. Hollis JJ, Anderson WR, McCaw WL, Cruz MG, Burrows ND, Ward B, et al. The effect of fireline intensity on woody fuel consumption in southern Australian eucalypt forest fires. Australian Forestry. 2011;74(2):81–96.
  8. 8. Price OH, Nolan RH, Samson SA. Fuel consumption rates in resprouting eucalypt forest during hazard reduction burns, cultural burns and wildfires. Forest Ecology and Management. 2022;505:119894. https://doi.org/10.1016/j.foreco.2021.119894.
  9. 9. Jiang N, Scorgie Y, Hart M, Riley ML, Crawford J, Beggs PJ, et al. Visualising the relationships between synoptic circulation type and air quality in Sydney, a subtropical coastal-basin environment. 2017;37(3):1211–28. https://doi.org/10.1002/joc.4770.
  10. 10. Crawford J, Griffiths A, Cohen DD, Jiang N, Stelcer E. Particulate Pollution in the Sydney Region: Source Diagnostics and Synoptic Controls. Aerosol and Air Quality Research. 2016;16(4):1055–66.
  11. 11. Larkin NK O’Neill SM, Solomon R, Raffuse S, Strand T, Sullivan DC, et al. The BlueSky smoke modeling framework %J International Journal of Wildland Fire. 2009;18(8):906–20. https://doi.org/10.1071/WF07086.
  12. 12. Hurley PJ, Physick WL, Luhar AK. TAPM: a practical approach to prognostic meteorological and air pollution modelling. Environmental Modelling & Software. 2005;20(6):737–52. https://doi.org/10.1016/j.envsoft.2004.04.006.
  13. 13. McGregor JL. C-CAM: Geometric aspects and dynamical formulation. Canberra: CSIRO Atmospheric Research Dickson ACT; 2005.
  14. 14. Navarro KM, Schweizer D, Balmes JR, Cisneros R. A Review of Community Smoke Exposure from Wildfire Compared to Prescribed Fire in the United States. 2018;9(5):185.
  15. 15. Yao J, Brauer M, Henderson SB. Evaluation of a Wildfire Smoke Forecasting System as a Tool for Public Health Protection. Environmental Health Perspectives. 2014;121:1142–7.
  16. 16. Saide PE, Peterson DA, da Silva A, Anderson B, Ziemba LD, Diskin G, et al. Revealing important nocturnal and day-to-day variations in fire smoke emissions through a multiplatform inversion. Geophysical Research Letters. 2015;42(9):3609–18. WOS:000355878300069.
  17. 17. Pearce JL, Rathbun S, Achtemeier G, Naeher LP. Effect of distance, meteorology, and burn attributes on ground-level particulate matter emissions from prescribed fires. Atmospheric Environment. 2012;56:203–11.
  18. 18. Price OF, Forehead H. Smoke patterns around prescribed fires in Australian eucalypt forests, as measured by low-cost particulate monitors. Atmosphere. 2021;12:1389. https://doi.org/10.3390/atmos12111389.
  19. 19. UNESCO. Greater Blue Mountains Area Paris: United Nations; 2022 [cited 2022]. Available from: https://whc.unesco.org/en/list/917/.
  20. 20. Cohen DD, Stelcer E, Garton D, Crawford J. Fine particle characterisation, source apportionment and long-range dust transport into the Sydney Basin: a long term study between 1998 and 2009. Atmospheric Pollution Research. 2011;2(2):182–9. https://doi.org/10.5094/APR.2011.023.
  21. 21. Johnston FH, Hanigan I, Henderson S, Morgan G, Bowman D. Extreme air pollution events from bushfires and dust storms and their association with mortality in Sydney, Australia 1994–2007. Environmental Research. 2011;111(6):811–6. pmid:21601845
  22. 22. Filkov AI, Ngo T, Matthews S, Telfer S, Penman TD. Impact of Australia’s catastrophic 2019/20 bushfire season on communities and environment. Retrospective analysis and current trends. Journal of Safety Science and Resilience. 2020;1(1):44–56. https://doi.org/10.1016/j.jnlssr.2020.06.009.
  23. 23. Rea G, Paton-Walsh C, Turquety S, Cope M, Griffith D. Impact of the New South Wales fires during October 2013 on regional air quality in eastern Australia. Atmospheric Environment. 2016;131:150–63. https://doi.org/10.1016/j.atmosenv.2016.01.034.
  24. 24. Schroeder W, Oliva P, Giglio L, Csiszar IA. The New VIIRS 375m active fire detection data product: Algorithm description and initial assessment. Remote Sensing of Environment. 2014;143:85–96. https://doi.org/10.1016/j.rse.2013.12.008.
  25. 25. Oliva P, Schroeder W. Assessment of VIIRS 375m active fire detection product for direct burned area mapping. Remote Sensing of Environment. 2015;160:144–55.
  26. 26. Government Australian. National Environment Protection (Ambient Air Quality) Measure: F2021C00475. Canberra, ACT: Commonwealth of Australia, 2021 F2021C00475.
  27. 27. Hersbach H, Bell B., Berrisford P., Biavati G., Horányi A., Muñoz Sabater J., et al. ERA5 hourly data on single levels from 1979 to present: Copernicus Climate Change Service (C3S) Climate Data Store (CDS); 2021 [cited 2021 2021]. Available from: 10.24381/cds.adbb2d47.
  28. 28. Hersbach H, Bell B., Berrisford P., Biavati G., Horányi A., Muñoz Sabater J., et al. ERA5 hourly data on pressure levels from 1979 to present: Copernicus Climate Change Service (C3S) Climate Data Store (CDS); 2021 [cited 2021 2021]. Available from: 10.24381/cds.bd0915c6.
  29. 29. Di Virgilio G, Hart MA, Jiang N. Meteorological controls on atmospheric particulate pollution during hazard reduction burns. Atmos Chem Phys. 2018;18(9):6585–99.
  30. 30. Hart M, De Dear R, Hyde R. A synoptic climatology of tropospheric ozone episodes in Sydney, Australia. International Journal of Climatology. 2006;26(12):1635–49. WOS:000241478000006.
  31. 31. Hardy CC. Smoke management guide for prescribed and wildland fire. Boise, ID: National Wildfire Coordinating Group, 2001 PMS 420–2. NFES 1279.
  32. 32. Price OF, Williamson GJ, Henderson SB, Johnston F, Bowman DMJS. The Relationship between Particulate Pollution Levels in Australian Cities, Meteorology, and Landscape Fire Activity Detected from MODIS Hotspots. PLoS ONE. 2012;7(10):e47327. pmid:23071788
  33. 33. Hastie TJ, Tibshirani RJ. Generalized additive models (1st ed.). London, UK: Chapman and Hall; 1990.
  34. 34. Pearce JL, Beringer J, Nicholls N, Hyndman RJ, Tapper NJ. Quantifying the influence of local meteorology on air quality using generalized additive models. Atmospheric Environment. 2011;45(6):1328–36. https://doi.org/10.1016/j.atmosenv.2010.11.051.
  35. 35. Carslaw DC, Beevers SD, Tate JE. Modelling and assessing trends in traffic-related emissions using a generalised additive modelling approach. Atmospheric Environment. 2007;41(26):5289–99.
  36. 36. McElreath R. Statistical rethinking: a Bayesian course with examples in R and Stan. First edition. ed. Boca Raton, FL: CRC Press; 2018.
  37. 37. Bürkner P-C. Advanced Bayesian multilevel modeling with the R package brms. The R Journal. 2017;10(1):395–411.
  38. 38. Wood SN. Generalized additive models: an introduction with R. 2 ed. New York: Chapman and Hall/CRC; 2017.
  39. 39. Akaike H. Information theory and an extension of the maximum likelihood principle. Selected papers of hirotugu akaike. New York: Springer; 1998. p. 199–213.
  40. 40. Miao Y, Li J, Miao S, Che H, Wang Y, Zhang X, et al. Interaction Between Planetary Boundary Layer and PM2.5 Pollution in Megacities in China: a Review. Current Pollution Reports. 2019;5(4):261–71.
  41. 41. Li Q, Zhang H, Cai X, Song Y, Zhu T. The impacts of the atmospheric boundary layer on regional haze in North China. npj Climate and Atmospheric Science. 2021;4(1):9. pmid:34592300
  42. 42. Duc HN, Rahman MM, Trieu T, Azzi M, Riley M, Koh T, et al. Study of Planetary Boundary Layer, Air Pollution, Air Quality Models and Aerosol Transport Using Ceilometers in New South Wales (NSW), Australia. 2022;13(2):176.
  43. 43. Levy I, Mahrer Y, Dayan U. Coastal and synoptic recirculation affecting air pollutants dispersion: A numerical study. Atmospheric Environment. 2009;43(12):1991–9. https://doi.org/10.1016/j.atmosenv.2009.01.017.
  44. 44. Bessho K, Date K, Hayashi M, Ikeda A, Imai T, Inoue H, et al. An introduction to Himawari-8/9—Japan’s new-generation geostationary meteorological satellites. Journal of the Meteorological Society of Japan. 2016;94(2):151–83. https://doi.org/10.2151/jmsj.2016-009.