Disaggregation of Daily Precipitation to Half-hourly Data

High temporal of precipitation is required for design and simulation of building components and environments. Over the 1990s and into the early 2000s the Bureau of Meteorology (BoM), weather stations transitioned from read-daily rain gauges to recording precipitation data by using tipping bucket rain guagues with high temporal resolution.

Climate files composed by concatenating the 12 most typical (or most extreme) months selected from three decades often include months where that site did not have a tipping bucket rain guage and so only have daily precipitation data to 9AM local clock time.

For this siutation, we have resumed our quest to disaggregate precipitation data by using Long Short-Term Memory (LSTM) model applying associations between precipitation and the other weather elements which have been measured hourly or better over those same three decades. The diagram belows describes brief concept of how we implement the model.

We use an LSTM model, a type of recurrent neural network (RNN). In theory, LSTMs handle long-term dependencies and sequence disaggregation tasks using special memory cells, each with three gates: input, output, and forget gates. These gates regulate information, allowing the network to learn important details while discarding irrelevant information.

Additionally, our architecture includes hidden layers for processing information, fully connected layers to integrate learned features, and a SoftMax layer that outputs probabilities of rainfall for each half-hour period.

An interesting fact, despite the name ‘short-term memory’, LSTMs are used for learning long-term patterns. The ‘short-term’ in the name refers to the neural network’s ability to capture data dependencies and patterns over short sequences, which are then extended to longer sequences using the inbuilt mode-memory mechanism.

Below, we present a comprehensive approach to disaggregating half hourly precipitations using a LSTM-based neural network model. The ability of LSTM architecture to capture temporal dependencies with high accuracies has helped us achieve an overall average RMSE of 0.05. Along with the high accuracies, the training time needed has reduced significantly because of the simplicity of the model built by us.

The Approach

Step 1 – Pre-Processing.

  • Merge all RF data and produce half-hourly interpolations.
  • Scaling the columns [‘ERH’, ‘ERN’, ‘DNI’, ‘DIF’, ‘CLOUD’, ‘DBT’, ‘RH’, ‘PRESSURE’, ‘WD’, ‘WS’] using the QuantileTransformer to transform the features to follow a uniform distribution.
    • ERH: Extraterrestrial Radiation Horizontal
    • ERN: Extraterrestrial Radiation Normal
    • DNI: Direct Normal Irradiance
    • DIF: Diffuse Horizontal Irradiance
    • CLOUD: Cloud Cover
    • DBT: Dry Bulb Temp
    • RH: Relative Humidity
    • PRESSURE: Atmospheric Pressure
    • WD: Wind Direction
    • WS: Wind Speed
  • Drop duplicates, check missing values.

Step 2 – Modeling the LSTM architecture.

  • Date Embedding: Linear layer that transforms date information into a 32-dimensional embedding.
  • Climate Data Encoder: LSTM layer that encodes climate statistics with 10 features into hidden states.
  • Decoder: LSTM layer that decodes combined inputs (date embedding, hidden state from climate encoder, and total precipitation) into hidden states.
  • Output Layer: Linear layer that generates disaggregated precipitation proportions and uses SoftMax activation to output values.

Step 3 – Validation of Results.

  • The result is validated for the MSE loss at each epoch on the validation set.
  • After training the saved model is loaded for making disaggregated hour hourly precipitation values, which are further tested for RMSE loss using the Test data split.

Results

The Test results achieved are highlighted below:

Scatter plot for Actual vs Disaggregated Values
Box Plot highlighting the overall Average RMSE achieved.

The results achieved above highlights the model learning capacity. All half hourly disaggregated values are in 0.30 mm range, with the bulk of values averaging just below 0.1 for root mean squared error.

The highlighted results above, showcases the potential of using LSTMs for further analysis. We are now looking into implementing clustering techniques to further improve upon the current results. The updated result will be announced later.

Examining Canberra’s Climate Trends: Implications for PV System and Building Performance

As part of the ongoing investigation into our evolving climate, we routinely process and analyse meteorological data from successive years, conducting comparative assessments to reveal emerging trends and patterns.

Our previous temporal analyses only focused on examining variations in various weather elements. However, in this latest iteration, we have incorporated the results of EnergyPlus simulations, specifically targeting HVAC systems and heating and cooling dynamics within buildings. We have also added the results of System Advisor Model (SAM) photovoltaic (PV) system simulations to enhance the comprehensiveness of our investigation.

The most recent temporal analysis was carried out for all eight capital cities, though this issue of Exemplary Advances will focus on the city of Canberra. The findings for Adelaide were previously discussed in the March issue of Exemplary Advances. The analysis for other capital cities can be viewed here.

For the analysis of weather elements, we examined the temporal variations in dry bulb temperature, humidity, wind speed, global horizontal irradiation (GHI), direct normal irradiation (DNI), and total precipitation. The analysis involved averaging these elements over three 15-year periods—1990-2004, 2005-2019, and the latest 15-year period from 2009 to 2023—and then comparing the results. A comparison between data from the latest 15 years, the data corresponding to the years and months specified in Industry Standard Meteorological Year (ISMY) files, and the data exclusively from 2023 was also undertaken. ISMYs were originally developed for application in house energy rating software used in NatHERS and derive from historical Bureau of Meteorology (BOM) weather data spanning from 1990 to 2015. Over time, they have become the industry’s de facto standard. It is therefore important to compare against ISMY data, as it provides a reference to gauge alignment with established benchmarks and understand the significance of temporal variations in weather elements.

Comparing 1990-2004 with 2009-2023 showed an increase in Canberra’s mean temperature of 0.34°C, an increase to moisture of 4.48%, and a significant increase in wind speed of 22.42%. GHI had a small increase of 0.73%, while DNI had a decrease of 2.39%. Meanwhile, comparing 2005-2019 with 2009-2023 showed a decrease in the mean temperature of 0.20°C, an increase to moisture of 2.06%, an increase in wind speed of 0.79%, and a decrease in GHI and DNI of 1.01% and 3.28%, respectively. The decline in mean temperature, GHI, and DNI for 2005-2019 vs 2009-2023 is likely a result of 2020-2022 experiencing comparatively lower annual average dry bulb temperatures, GHI, and DNI when compared to other years.

Total precipitation in 2009-2023 averaged 8.28% higher than in 1990-2004. and 14.72% higher than the 2005-2019 period.

The annual total HVAC energy consumption reveals intriguing patterns across various archetypes. From 1990 to 2023, as well as in the periods of 1990-2004 and 2005-2019, there is a notable upward trend. However, during the period of 2009-2023, a decline in trends is observed. When examining cooling and heating energy consumption separately, for all archetypes, the cooling energy consumption trends are identical to the total HVAC energy consumption. Annual heating energy consumption shows similar trends that 3 storyes’ heating energy consumption is identical with total HVAC energy consumption as well. However 10 storyes’ heating energy consumption has downward from 1990 to 2023, while 1990-2004, 2005-2019, and 2009-2023 periods have identical trends to the total HVAC consumption. Regarding supermarket, only 2005-2019 period has upward trend on heating consumption, while the other periods have downward trends. As a result, we can see that all heating and cooling energy consumptions have downtrends on 2009-2023 but 1990-2023, 1990-2004, and 2005-2019 period has mostly upward trends. This indicates the notable differences in temperature, moisture and precipitation within the recent 2009-2023 period compared to older 15-year periods, and ISMY.

Investing in Waste to Energy: UAG Bio Nutrients

Following on from our report on the Brisbane tour of the Australian Association for Energy Productivity (A2EP) in our April “Exemplary Advances”, Exemplary has made a small investment in a bio-energy company nearby but not part of the A2EP event. That was a small short term loan to UAG Bio Nutrients to assist with cash flows to support the imminent move of their laboratories from Alstonville, NSW, to the Southern Cross University (SCU) campus nearby. This also facilitated advancing the commissioning of their modular demonstration plant in Hay, NSW, while developing a feasibility study for converting an old mine site in the Hunter Region for another waste-to-energy and nutrient capture centre.

Updates on these developments will be included in occasional future editions.

34 Years of Complete Weather Data 1990-2023

We are happy to announce that we are extending our weather data sets from 2022 to include 2023. We have had many inquiries about 2023 weather data, but our most trsuted solar data source, the Bureau of Meteorology (BoM)’s gridded solar data was undergoing its routine Quality Assurance (QA), so we were not able to fulfil those requests until now.

Finally, BoM has disseminated gridded solar data by daily, hourly and 10 minutely to the end of December 2023. The data is updated monthly with 3-4 month data latency. We used 2023 solar irradiance data from Solcast for the 8 capital cities for our Real Time Years (RTYs) and the Exemplary Weather and Index analysis based on it because of this latency. Comparison of the real-time data stream from Solcast with the BoM’s QAed equivalent will be an integral part of that work.

We’re currently processing gridded solar data from BoM to quality-assess the 2023 solar data for the eight capital cities. Following this, we’ll be updating data for other sites for the 34 years from 1990 to 2023. If you have interests in the 2023 weather data outside the 8 capital cities, please contact us at exemplary.energy@exemplary.com.au to change production priority. On a rolling as-completed basis, the updated data sets will be made available through our sales portal.

Image source: Bureau of Meteorology

Abstract of the Gridded Solar Data

The Bureau of Meteorology employed the Heliosat-4 radiation model (Qu et al, 2017) as implemented by Mines ParisTech (Gschwind et. al., 2020) to estimate downwelling solar radiation parameters. Heliosat-4 uses estimates of cloud properties derived from satellite observations, estimates of aerosol optical depth, and forecasts of atmospheric ozone and water vapour to estimate instantaneous surface global solar irradiance (known as global horizontal irradiance, GHI) in units of W m-2. Heliosat-4 also produces estimates of the instantaneous intensity of surface direct horizontal irradiance (BHI) in units of W m-2, from which the solar direct beam radiation falling on a surface normal to the beam (known as the direct normal irradiance, DNI, with units W m-2 is calculated. 

A bias correction scheme was developed and applied to remove systematic biases in GHI and BHI which can arise from biases and uncertainty in observations, auxiliary products used in the Heliosat-4 forward model, and/or the Heliosat-4 forward model itself. The surface diffuse horizonal irradiance (DIF) is calculated as the difference between GHI and BHI with units of W m-2.

GHI, DNI and DIF are calculated every 10 minutes between sunrise and sunset for each day for the Australian continent, bounded by 112-156.26°E and 10-44.5°S. These 10-minutely observations are integrated with respect to time to produce the daily integral of surface global irradiance (daily_integral_of_surface_global_irradiance, commonly referred to as “daily exposure”), daily integral of direct normal irradiance (daily_integral_of_direct_normal_irradiance) and the daily integral of surface diffuse irradiance (daily_integral_of_surface_diffuse_irradiance). The number of observations during daylight hours (number_of_observations), and the number of these observations classified as cloud (number_of_cloud_ observations) are also provided.