Figures
Abstract
The shape characteristics of flow hydrographs hold essential information for understanding, monitoring and assessing changes in flow and flood hydrology at reach and catchment scales. However, the analysis of individual hydrographs is time consuming, making the analysis of hundreds or thousands of them unachievable. A method or protocol is needed to ensure that the datasets being generated, and the metrics produced, have been consistently derived and validated. In this lab protocol, we present workflows in Python for extracting flow hydrographs with any available temporal resolution from any Open Access or publicly available gauging station records. The workflow identifies morphologically-defined flow and flood types (i.e. in-channel fresh, high flow and overbank flood) and uses them to classify hydrographs. It then calculates several at-a-station and upstream-to-downstream hydrograph shape metrics including kurtosis, skewness, peak hydrograph stage, peak arrival time, rate-of-rise, peak-to-peak travel time, flood wave celerity, flood peak attenuation, and flood wave attenuation index. Some metrics require GIS-derived data, such as catchment area and upstream-to-downstream channel distance between gauges. The output dataset provides quantified hydrograph shape metrics which can be used to track changes in flow and flood hydrographs over time, or to characterise the flow and flood hydrology of catchments and regions. The workflows are flexible enough to allow for additional hydrograph shape indicators to be added or swapped out, or to use a different hydrograph classification method that suits local conditions. The protocol could be considered a change detection tool to identify where changes in hydrology are occurring and where to target more sophisticated modelling exercises to explain the changes detected. We demonstrate the workflow using 117 Open Access gauging station records that are available for coastal rivers of New South Wales (NSW), Australia.
Citation: Arash AM, Fryirs K, Ralph TJ (2025) Protocol for extracting flow hydrograph shape metrics for use in time-series flood hydrology analysis. PLoS ONE 20(1): e0315796. https://doi.org/10.1371/journal.pone.0315796
Editor: Gabriela Ioana-Toroimac, University of Bucharest: Universitatea din Bucuresti, ROMANIA
Received: March 27, 2024; Accepted: November 26, 2024; Published: January 10, 2025
Copyright: © 2025 Arash et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The streamflow data was obtained from the Australian Bureau of Meteorology (BoM) and WaterNSW Open Access portals. Bureau of Meteorology (BoM). Water Data Online [Internet]. 2024. Available from: http://www.bom.gov.au/waterdata/. WaterNSW. Continuous Water Monitoring Network [Internet]. 2024. Available from: https://realtimedata.waternsw.com.au/.
Funding: This research was funded by an Australian Research Council Linkage grant (LP190100314) led by KF with industry partners Landcare Australia and Hunter-Central Rivers Local Land Services. AA is supported by an Australian Commonwealth Government International Research Training Program (iRTP) scholarship and Macquarie University Higher Degree Research Funds.
Competing interests: The authors have declared that no competing interests exist
1. Introduction
Streamflow hydrographs integrate spatial and temporal changes in water input, storage and transfer processes within a catchment. Therefore, they contain important quantifiable data that can be statistically analysed and used for a wide range of scientific and applied purposes [1–3]. For example, the shapes of hydrographs can be analysed to reveal the rise and fall, peak stage, travel time and celerity characteristics of flows and floods of different stage heights (water levels) and discharges [2, 4, 5]. When related to channel morphology, such analysis can be used to understand, monitor and assess hydrology during in-channel habitat forming flows or during overbank floods that create floodplains and inundate adjoining land [6].
Streamflow datasets can also be used for catchment and region hydrological classification, temporal and spatial variability analysis of extreme floods, flow peak stages, low flows and baseflows, uncertainty analysis and calibration and validation of hydrologic models, streamflow forecasting and time-series analysis to track changes in flow hydrology as a basis for understanding the impact of climate and anthropogenic disturbances on river systems [2, 7–17]. Across the world there exists tens of thousands of flow gauging stations, some of which contain records extending back decades. The benefits of using real data recorded at gauging stations include a lower level of uncertainty [18], whereas modelled data suffers from limitations associated with the use of a large number of assumptions, capacity to generate long-term data, validation, scalability, accessibility and coding [18–22]. For example, Stahl et al. [23] developed a streamflow records dataset for 441 small catchments in 15 countries across Europe that contained data from 1932 to 2004. This data was used to lab changes in hydrological regimes across these catchments. Parajka et al. [24] analysed 34 studies involving 3,874 catchments to characterise the similarities and differences in hydrology between catchments. This was then used to predict hydrographs in ungauged catchments and analyse underlying climate and landscape controls on their hydrology. Kuentz et al. [25] developed a dataset consisting of 16 flow indicators for 35,215 catchments and 1,366 river gauges across Europe to analyse physical landscape controls on hydrologic variability. Similarly, Brunner et al. [2] used hourly streamflow data from 163 structurally distinct Swiss catchments and 53 years of coverage to characterise flow hydrographs and identify regions of catchments with similar flood reactivity. Arash et al. [26] used over 7,000 hydrographs from rivers in 17 coastal catchments of New South Wales (NSW), Australia to undertake time-series analysis of changes in the hydrological characteristics of in-channel fresh, high flow (bankfull) and overbank hydrology. Gudmundsson et al. [16] investigated global changes in indicators of mean and extreme streamflow from 1951–2010 and 14 subcontinental regions using a data set of observations from over 30,000 sites around the world. Gnann et al. [27] used a MATLAB toolbox (TOSSH) to calculate and validate streamflow time series metrics (e.g. slope of the flow duration curve, mean half flow date, low flow, and runoff ratio etc.) U.S. catchments in that occur within the CAMELS dataset in the USA.
In all of these aforementioned studies, a large number of hydrographs from multiple gauges and spanning long periods of time (years and decades) are needed with which to undertake analyses [5, 28–30]. Traditionally, this was done by direct observation and even manual measurements [31]. However, with the advent of Big Data analysis techniques, scientists can now handle large data sets with wide spatial and temporal coverage and apply analyses in a more consistent and quality-controlled manner. Techniques for hydrograph extraction and analysis such as coding, graphical separating, and digital filtering are now much more widely used [32–35].
The workflow presented in this paper was successfully used in Arash et al. [26] to analyse decadal time‐series changes in flow hydrology to detect whether flood mitigation is occurring. The workflow can be used to extract hydrographs from streamflow time series gauge records and to calculate hydrograph shape metrics for each of them. The workflow has been developed in Python version 3.7. It is user-friendly, simple, quick and informative. Initial steps calculate hydrograph shape metrics at individual gauges, called at-a-station analysis. The metrics calculated include kurtosis (K), skewness (S), peak hydrograph stage (Hp), and peak arrival time (tp) for different flow and flood types. Then, the workflow uses Hp and tp to calculate rate-of-rise (RoR) with Excel. Then, ArcGIS Pro version 3.3 [36] is used to calculate contributing catchment area (Ac), and upstream-to-downstream channel distance between gauges (D). Ac and D metrics are then used to calculate upstream-to-downstream metrics including peak-to-peak travel time (tT), flood wave celerity (C), flood peak attenuation (Ha) and flood wave attenuation index (FWAI).
The aims of providing the workflows are to:
- Improve the utility, and realise the potential of using readily available, Open Access streamflow hydrographs for analysis of changes in flow and flood hydrology at-a-station and from upstream-to-downstream.
- Provide a method for undertaking time-series analysis of changes in at-a-station and upstream-to-downstream hydrology that can be used to detect where hydrology is changing and where 1D or 2D modelling could be undertaken to explain changes detected.
- Present a consistent set of workflows that simplifies the process of calculating hydrological characteristics across multiple stations, concurrently and consistently.
2. Materials and methods
The protocol described in this peer-reviewed article is published on protocols.io, https://doi.org/10.17504/protocols.io.rm7vzxqo8gx1/v1 and is included for printing as S1 File with this article. A worked example of the hydrograph shape metrics database that is produced is published as a S1 Data on figshare (https://doi.org/10.6084/m9.figshare.27187143).
2.1 Study area and gauge records
Arash et al. [26] presents the full set of results of hydrograph analysis and interpretation of flow and flood changes in 17 out of 20 coastal catchments of NSW, Australia (Fig 1). These coastal catchments receive approximately 1,000 mm to 1,600 mm mean annual rainfall, and the mean daily temperature varies between 18°C to 26°C in summer (December, January and February), and 8°C to 16°C in winter (June, July and August) [37]. Most of these streams flow year-round (perennial), while some streams are more intermittent and others only flow during rainfall events. There are three subregions: Northern Rivers, Central Rivers and Southern Rivers. These subregions experience a variable range of climates due to their vast geographical extent and varying topography [38]. Weather in the subtropical Northern Rivers subregion is wet and mild, and weather in the temperate Southern Rivers subregion is relatively drier and cooler [37]. Since the mid-20th century, there has been a decline of between 8% and 26% in mean annual rainfall and streamflow, especially on the Southern Rivers subregion (with up to 26% decline in mean annual precipitation), and a reduction in the frequency of floods, allowing for riparian vegetation growth in these subregions [30]. However, there will be an increase in intensity and frequency of rainfall and runoff in NSW and across Australia as a result of climate change [39].
Data source: BoM [42], WaterNSW [43], ESRI [36] and NSW River Styles [44].
Arash et al. [26] focused on understanding time-series changes in flow and flood hydrology for unregulated, rural rivers without large dams or reservoirs, and that have relatively long streamflow records that date back into the early to middle 20th Century. They developed and used the protocol presented here. In that study, 117 gauges were used in catchments that range in size from 513 km2 (Brunswick) to 22,333 km2 (Clarence). The elevation range of these catchments varies from 27 m to 1538 m, drainage density varies from 0.01 km km-2 to 57.07 km km-2, and elongation ratio varies from 0.27 (more elongate shape) to 0.53 (more round shape). Of 117 gauges, 27 gauges are located in confined rivers, 72 in partly confined rivers and 17 in laterally-unconfined rivers. Generally, rivers in confined settings are located in the upstream gorges, rivers in partly-confined setting are located in the upstream gorges, and rivers in laterally-unconfined setting are located in low-lying coastal areas [40, 41]. The analysis used Open Access streamflow data sourced from the Australian Bureau of Meteorology (BoM) [42], and WaterNSW [43]. The streamflow data in these databases is provided in various time intervals, from 15 minutes, to hourly, and daily.
2.2 Hydrograph analysis and quantification
Fig 2 shows the hierarchical structure of the hydrograph analysis protocol presented in this study. In this protocol, we quantify hydrograph shape metrics directly from streamflow time series data at individual gauges (at-a-station) and from upstream-to-downstream (Fig 2). Flow stage hydrographs are the key inputs required to run the protocol (Fig 2). The at-a-station output metrics are kurtosis–K (Eq 1), skewness–S (Eq 2), peak hydrograph stage–Hp, and peak arrival time–tp. Hp and tp are then used to calculate rate-of-rise–RoR (Eq 3) (Fig 2). The upstream-to-downstream analysis focuses on the moment of peak flow production. For upstream-to-downstream analysis, two gauges on the same streamline are required. The output metrics of peak-to-peak travel time (tT) (Eq 4), flood wave celerity (C) (Eq 5), flood peak attenuation (Ha) (Eq 6) and flood wave attenuation index (FWAI) (Eq 7) are calculated within Excel (Fig 2). For the upstream-to-downstream analysis, additional geospatial data is needed as an input (Fig 2), including contributing catchment area (Ac) at the gauge, and channel distance between the gauges (D).
Where μ refers to the arithmetic mean; μ3 corresponds to the third moment about the mean; μ4 corresponds to the fourth moment about the mean; Ϭ refers to the standard deviation; RoR refers to rate of rise (m hr-1); Hp refers to the peak hydrograph stage (m); tT refers to peak-to-peak travel time (hr); tp refers to the peak arrival time (hr); Ha refers to flood peak attenuation; FWAI refers to flood wave attenuation index; Ac refers to the contributing drainage area (km2); C refers to the peak-to-peak flood wave celerity (m s-1); and D refers to the upstream-to-downstream channel distance between gauges (km).
Data source: BoM [42], WaterNSW [43], ESRI [36] and NSW River Styles [44].
At-a-station analysis can be conducted on any single station data. However, for upstream-to-downstream analysis, two gauging stations need to be located along the same streamline. To synchronise the flow and flood dynamics along the channel or valley over time, we recommend that the peak arrival time (tp) values be based on the physical distance between the upstream and downstream gauges along the channel, and the calculated C values. Given that C values commonly range between 0.25 m s−1 and 10 m s−1 [45–47], any values outside this range can be considered outliers. This indicates that the flood event at the upstream gauge is independent of the recorded flood event at the downstream gauge, making them unsuitable for pairing in upstream-to-downstream analysis. However, these flood hydrographs can be separately used for at-a-station analysis.
Ac was calculated using a 30-m SRTM DEM using the flow direction and flow accumulation tools (ArcHydro Extension) within ArcGIS Pro. These tools indicate the flow direction in each streamline and accumulate all flows from upstream into gauges (pour points). The watershed tool was used to delineate subcatchments draining into each gauge, and the calculate geometry tool was used to calculate Ac for each gauge. D was calculated using the major and minor streamline and gauge location shapefiles in the NSW River Styles dataset [44], and calculate geometry tool in ArcGIS Pro, but any polyline stream network database can be used. In our study catchments, Ac varies from 26 km2 to 17,320 km2, and D ranges from 14 km to 118 km [26], demonstrating the range of catchment sizes that the workflow can be used on. FWAI is a dimensionless indicator of flood peak attenuation that is normalised by Ac at the upstream and downstream gauges. This indicator accounts for the lateral inflow from tributaries (a gaining reach) with negative FWAI and lateral outflow from tributaries (a losing reach) with positive FWAI [48].
2.3 Determine flow stage classes
Hydrologists and river experts often classify flow stage heights into categories based on water level at cross-sections. This protocol runs on stage height hydrographs instead of discharge for several reasons including, greater data availability, ease of comparison across different streams, ease of visualisation, and to match to stakeholders’ experience in the field and emergency warnings that are issued during floods [49]. Also, each river has a different morphology and therefore discharge or recurrence intervals for a bankfull flow are not consistent across the landscape and river morphology can change over time. McMahon and Peel [50] analysed 622 rating curves from 171 gauges and found that uncertainties in BoM dataset’s stage–discharge for daily streamflow ranges from −4.2% to 4.5%, therefore we use stage height hydrographs instead of discharge hydrographs.
This protocol runs on three morphologically-defined flow and flood stages which can then be used to undertake like-with-like comparisons of flow and flood behaviour across catchments and regions. The three flow stages are: in-channel fresh, high flow and overbank flood (Table 1 and Fig 3).
In-channel freshes are defined as flows that are larger than low flows (or baseflows). They may express as a noticeable rise in streamflow height that lasts for a few days, triggered by short bursts of rain [51]. These flows interact with submerged hydraulic units and low-elevation instream geomorphic units.
A high flow is defined as the bankfull water level (the top of the macrochannel bank). These flows interact with submerged hydraulic units and low-elevation instream geomorphic units.
Overbank floods are flows that completely fill the macrochannel and overflow onto floodplains. Any flow higher than bankfull flow level is considered an overbank flood. These flows connect channels to floodplains and interact with floodplain features, including riparian vegetation and wetlands.
To identify these flow and flood types using the workflow, morphological thresholds are first identified at each gauge station site. These thresholds and flow stage heights are manually calculated in Excel and use Python code. Table 1 contains definitions of these different flow stages and Fig 3 shows a schematic cross-section with stage height thresholds and flood flow types indicated.
The low flow absolute threshold is calculated in excel by using the flow frequency analysis and rating curves provided with gauge data (in our case from BoM [42] and WaterNSW [43]), and the US EPA 7Q10 approach [52]. 7Q10 is defined as the lowest 7-day average flow that is expected to occur once every 10 years, on average [52]. Here, low flows (or baseflows) are defined as sustained low-level flows that exceed low flow absolute threshold. The in-channel fresh absolute threshold is defined as half bankfull stage height. High flows are defined as flows that exceed the in-channel fresh level, but are restricted to bankfull level. Overbank floods are any flow that exceeds the bankfull flow absolute threshold by 5% or more. For example, based on BoM [42], Hunter River at Singleton (gauge ID: 210001) experienced an overbank flood in 09/03/2022 as a result of a 4-day precipitation event with accumulated rainfall of 125 mm (recorded at Glennies Creek rainfall gauging station), a high flow in 10/12/2021 as a result of a 3-day precipitation event with accumulated rainfall of 70 mm, and an in-channel fresh in 13/11/2021 as a result of a 2-day precipitation event with accumulated rainfall of 58 mm.
2.4 Extraction and quantification of flow hydrograph metrics from streamflow time series
The workflow to quantify flow hydrograph metrics both at-a-station and from upstream-to-downstream is shown in Fig 4. This workflow contains 10 main steps:
- Prepare input data including the hydrographs and geospatial data in Excel (Fig 2).
- Process gauging station cross-sections to identify the in-channel fresh, high flow and overbank flood stages at each of the gauges in Excel.
- Input the streamflow time series hydrographs from gauge records into Excel (in NSW these are sourced from the BoM [42] and WaterNSW [43]).
- Remove redundant data in the excel file of streamflow timeseries.
- Specify the start and end time of the flow or flood event using the hydrographs in Excel.
- Run the Python script to extract in-channel fresh, high flow and overbank flood hydrographs.
- Verify the hydrograph extraction process by visual observation and adjust the inputs (remove outliers and adjust start time and end time) if needed in the Python script.
- Calculate hydrograph metrics; kurtosis (Eq 1), skewness (Eq 2), peak flow stage, peak flow date and rate-of-rise (RoR) (Eq 3) for each hydrograph in Python.
- Calculate contributing catchment area (Ac), and upstream-to-downstream channel distance between gauges (D) in ArcGIS Pro and input them into the Python script (in NSW these metrics are calculated by using NSW River Styles streamlines layer [44] in ArcGIS Pro [36]).
- Calculate peak-to-peak travel time (tT) (Eq 4), flood wave celerity (C) (Eq 5), flood peak attenuation (Ha) (Eq 6) and flood wave attenuation index (FWAI) (Eq 7) in Excel.
Please note that Steps 8 and 10 produce a hydrograph metrics database as an output (See Fig 2).
The yellow boxes produce a metrics database as an output. The Python script indicated by a red star, is detailed in the protocols.io that accompanies this paper (https://doi.org/10.17504/protocols.io.rm7vzxqo8gx1/v1). Data source: BoM [42], WaterNSW [43], ESRI [36] and NSW River Styles [44]. Green boxes and white boxes denote parts of the protocol that require manual intervention by a user and semi-automatic parts, respectively.
In different places and for different gauges, stage height data may come to a user in different formats. Therefore, setting up the input data excel spreadsheet to a standard format is necessary. For example, the streamflow data download from BoM [42] and WaterNSW [43], portals for NSW streams needs to be adjusted and filtered before importing into the Python code. The label of the first column and row (cell A1) should be ‘date’ and the second column first row (B1) should be ‘value’. All other redundant information should be removed from the excel file. Then, this file should be saved in Microsoft Excel .xlsx format. Khaleghi et al. [53] showed 1-hr geomorphoclimatic unit hydrograph can be used in accurately simulating flow hydrograph and quantifying them using shape metrics (e.g. peak discharge and time-to-peak). Hence, the workflows use the 1-hour interval date data. The format of the date is “dd/mm/yyyy hh:mm:ss”, and the unit of stage height is meters. There is a possibility of using data with coarser (2-hour, 3-hour etc.) or finer date (30-minute, 15-minute etc.) intervals. The users can directly change the time interval in the Python code (See Protocols.io), or they can change their time-series to downscale or aggregate to hourly by using appropriate statistical methods, such as StreamFARM algorithm [54].
Preparation of other inputs include identification of the in-channel fresh, high flow and overbank flood stages at each gauge station and identification of the start time and end time of each flow or flood event of interest. The start time would typically be the time at which a precipitation event causes a rise in stage height. The end time normally indicates a return to base flow level (low flow level). To identify the start and end times, flow record variability, local flow conditions and the purpose of study should be considered. These are user decisions that need to take place at this stage of the protocol. Recognising that the duration of flows may vary in other landscapes, the start and end dates can be changed in the protocol. The only criteria are to ensure the full presentation of hydrograph is preserved in the output. For example, local-knowledge about flood dynamics can be used to define the duration of floods in the climatic and environmental settings where the protocol is being applied. It could be possible to use the time between the start of rainfall and onset of flooding as a reference for this purpose. Other users may wish to trial different durations by running the protocol multiple times to test any effect on the output. In our eastern NSW study, the identification of start and end dates for extracting hydrographs is undertaken using visual observation of the extracted hydrographs and knowledge of how floods of different magnitude function in this landscape. In the Arash et al. [26] study, 10 days was selected as the duration of flow and flood events. For ease, this is defined as 5 days back from the flow peak and 5 days forward from the flood peak (See protocols.io).
By running the Python code, the script extracts flow hydrographs and calculates kurtosis, skewness, peak value, peak date and some other statistics for each hydrograph. Before undertaking further analysis, the extracted hydrographs need to be checked to ensure that the delineated hydrographs and calculated metrics are correct and any inputs (streamflow time series, flow stages, and duration of hydrographs) adjusted where needed.
In Microsoft Excel, RoR is calculated using peak value and peak date in Eq 3. Total drainage area draining into each gauge and total area of each catchment is calculated using the watershed tool in ArcHyro extension in ArcGIS Pro [26]. Subcatchment area draining to each gauge is classified as small (e.g. < 2,000 km2), moderate (e.g. 2,000–10,000 km2) and large (e.g. > 10,000 km2). Also, gauges are classified as upstream (a gauge captured 10% of the total catchment area), midstream (11%– 50%), and downstream (> 51%).
Peak-to-peak travel time of each pair of hydrographs is calculated on the peak value at the upstream and downstream gauges. The distance between upstream and downstream gauge pairs is calculated in ArcGIS Pro by using any readily available, accurate, streamlines layer (in NSW this is the River Styles streamlines layer [44]). The distance between gauge pairs and peak-to-peak travel time are used to calculate flood wave celerity. FWAI is calculated using the peak stage height value and contributing drainage area at the upstream and downstream gauges.
The flow hydrographs, flow stage classes and Microsoft Excel file with descriptive statistics for each flow hydrograph are stored in corresponding folders (‘Overbank_flood_peaks’, ‘High_flow_peaks’, ‘In_channel_fresh_peaks’). Moreover, the computed summary statistics (count, mean, std, min, 25%, 50%, 75%, max, kurtosis, skewness, start_date, peak_date, end_date) for each flow stage class are saved as CSV files (In_Channel_fresh_peaks_sats.csv, High_flow_peaks_sats.csv and Overbank_flood_peaks_sats.csv). This enables users to undertake subsequent analysis, visualisation, and storage of data and hydrographs derived from the streamflow data, by organising it into separate folders and files.
3. Expected results and technical validation
In the Arash et al. [26] study, the hourly interval streamflow time series of 117 gauges across coastal catchments of NSW were used to extract hydrographs for different flow stage classes (Fig 1). The total number of in-channel fresh, high flow and overbank flood hydrographs is 4,600, 1,584 and 868, respectively. Overall, 3,371 hydrographs occur in the Northern Rivers region, 3,008 in the Central Rivers region and 673 in the Southern Rivers region (Fig 1). Fig 5 shows an example of the application of script to extract and quantify flow hydrographs for the in-channel fresh, high flow and overbank flood flow stage classes from streamflow time series data for the Hunter River at Singleton gauging station from 01/01/1913 to 21/11/2023.
The red circle labelled “a” is the 2007 flood. Data source: BoM [42], WaterNSW [43], ESRI [36] and NSW River Styles [44].
The script is executed iteratively, allowing for manual verification of the generated hydrographs. This ensures that the specified inputs consistently produce hydrographs that align with real-world expectations. Additionally, thorough checks are implemented to prevent the duplication of hydrographs, and mixed results in the output dataset. For our NSW study areas, the streamflow time series sourced from BoM [42] and WaterNSW [43] are of high quality and are run through various QA/QC checks within these agencies before release [50]. For example, if a gauge has been moved there is a calibration process put in place to ensure that the gauge sensors are recording the same stage height throughout the record. In other places, a user needs to undertake their own verification and QA/QC checks on the input data they are using before running the protocol. The protocol itself does not do QA/QC or uncertainty analysis for a user. Such analyses should be conducted ex-post or ex-ante the protocol being run.
4. Applications and limitations of the protocol
Hydrographs are valuable sources of information for understanding the spatial and temporal patterns of water flowing through the landscape and how this varies over time and in response to specific events at different locations in catchments. The data that underpins hydrographs can be used to calculate a range of meaningful hydrological metrics, and to undertake statistical analyses, to further understand the hydrological function of rivers and their catchments. Like elsewhere in the world, Australia has online data portals such as the Bureau of Meteorology (BoM) [42] and WaterNSW [43], that hold Open Access streamflow gauging station records, including extensive records of flood hydrographs that in some cases have a time series dating back many decades. The extraction and quantification of hydrographs using these gauging records can be used to better understand hydrological processes for a range of scientific and applied purposes [1, 2, 26, 31].
Applications of the semi-automated hydrograph data extraction and quantification methods presented in this lab protocol (and see protocols.io) and the workflows within it, can be broadly grouped based on their purpose: current understanding (i.e. present, contemporary focus), hindcasting (i.e. historical, past focus), or forecasting (i.e. predictive, future focus). For studies with a contemporary focus, users can run the protocol and workflows provided here to extract hydrographs from current and recent historical time-series gauge records and to calculate at-a-station and upstream-to-downstream hydrograph shape metrics. At present, these metrics include kurtosis, skewness, peak hydrograph stage, peak arrival time, rate-of-rise, peak-to-peak travel time, flood wave celerity, flood peak attenuation, and flood wave attenuation index. The method is adaptable and has potential for customisation by incorporating additional flow hydrograph metrics, thereby allowing for further improvements to suit user preferences and research objectives. It facilitates the uptake and use of existing flow hydrograph databases (both real-time flow data and historical flood records), while maintaining flexibility to add new flow time-series records as they become available.
The protocol uses statistical and machine learning methods to identify patterns, trends, and anomalies in data [55–57]. The outputs produced by the protocol can also be integrated and used in various GIS and hydrological model applications, to help build a more comprehensive understanding of hydrological processes in catchments [58–61]. The protocol may be employed to produce user-friendly summary data or databases without overcomplicated analysis [62–64]. Users can conduct quality checks on the data to make sure that it accurately reflects the current conditions [65–67] and matches on-ground observations and experiences of community, particularly during floods [68–70].
Use of the protocol enables implementation of standardised formats and methods for collecting, analysing, and sharing hydrograph data [71–73]. Using standardised methods to assess hydrographs across rivers, catchments, regions, and even internationally, allows robust comparison of key metrics that has previously been restricted to coarse-data or modelling approaches. The protocol provides hydrograph analysis tools and makes them accessible to authorities, researchers, and the public [74, 75]. Analysis of outputs derived from use of the protocol can facilitate research in hydrology and river management or be applied by managers in flood monitoring and management [65, 67, 76]. Simple hydrograph datasets can also be used to prepare visualised educational materials to raise public awareness about flood risks and flood hydrology [77–79].
The outputs produced using the protocol can be used to make better interpretations of past and emerging patterns and changes in the flow hydrology at-a-station, from upstream-to-downstream on key streamlines, or across different catchments and regions. In this regard, the protocol could be considered a change detection tool to identify where changes in hydrology are occurring and where to target more sophisticated modelling exercises to explain the changes detected. Hence, it facilitates hindcasting and forecasting of hydrological processes using an efficient and cost-effective method for generating streamflow metrics. Hindcasting efforts may use extracted hydrograph metrics from modern rivers to help reconstruct conditions in catchments prior to current hydrological gauge records. This may be achieved by identifying catchments with similar environmental setting and fluvial controls on hydrology, and then extrapolating the most likely hydrograph characteristics in the past for similar catchments. In terms of forecasting, the protocol may be used to help predict future hydrograph size and shape in catchments. This understanding can help aid river and flood modelling, planning and management, and for the detection of hydrological impacts of any river recovery or natural flood management rehabilitation activity [76–83]. Overall, the protocol can be universally applied to any streamflow gauging records, providing standardised processes to compare flow hydrograph characteristics at site, reach, catchment, and regional scales, allowing for many applications in natural sciences.
There are some considerations that users need to be aware of when using the workflow and protocol.
First, in the data preparation step, 1-hour interval stage height is used. To run data with different intervals, users can change the Python code, or downscale or aggregate the time-series from the default date interval to sub-daily or hourly by using other statistical methods (e.g. [54, 84]).
Second, given the diverse hydrological, meteorological, and geomorphic characteristics of rivers around the world, choosing the start and end dates for the extraction of hydrographs is a decision that needs to be based on visual observation of the extracted hydrographs and local knowledge, with the proviso that the full hydrograph is preserved in the output. Other approaches to identifying start and end dates have been provided above.
Third, the hydrograph shape metrics calculated by the protocol capture multiple flow characteristics (such as flood peak height, time-to-peak, the steepness of the rising and recession limbs, and flatness of the peak). However, the workflow can be tailored to include other hydrograph shape metrics that may be of interest to a user. This can be done manually in Excel at the data preparation stage. For upstream-to-downstream hydrograph shape metrics (e.g. C and tT) this is done by using the provided Python code, and for catchment morphometrics (e.g. Ac and D) in ArcGIS Pro.
Finally, while the incorporation of rainfall data into hydrograph characterisation analysis could aid understanding of the (de)synchroneity of flood wave propagation from upstream to downstream, the protocol does not quantify nor consider the timing of precipitation or the magnitude of precipitation events. Analysis of lag time between rainfall, runoff and the expression of flow in river channels, preconditioning of the landscape and the role of disparate contributions from tributaries or episodes of rainfall in different parts of catchments are not considered within the scope of the protocol presented here. Further work and expansion of the protocol would be needed to detect, quantify or account for these factors.
5. Conclusions
In this protocol, and associated protocols.io (https://doi.org/10.17504/protocols.io.rm7vzxqo8gx1/v1), we provide a set of workflows to extract hydrographs from time-series gauge records that can be used to calculate at-a-station and upstream-to-downstream hydrograph shape metrics. The protocol employs efficient coding, supported by relevant analytical libraries, and includes comprehensive, easy-to-follow comments. The applications of this protocol include as a change detection tool to identify where changes in hydrology are occurring and where to target more sophisticated modelling exercises to explain the changes detected, to visualise and analyse flow hydrograph data to enhance the understanding of data types, structures, missing values, and potential biases while uncovering relationships, trends, and patterns over space and time. This method is designed to be user-friendly, thereby facilitating wider application in various hydrological projects.
Supporting information
S1 Data. This spreadsheet is available on figshare (https://doi.org/10.6084/m9.figshare.27187143).
https://doi.org/10.1371/journal.pone.0315796.s001
(XLSX)
Acknowledgments
The streamflow data was obtained from the Australian Bureau of Meteorology (BoM) [42] and WaterNSW [43] Open Access portals.
Bureau of Meteorology (BoM). Water Data Online [Internet]. 2024. Available from: http://www.bom.gov.au/waterdata/
WaterNSW. Continuous Water Monitoring Network [Internet]. 2024. Available from: https://realtimedata.waternsw.com.au/water.stm.
References
- 1. Hannah DM, Smith BP, Gurnell AM, McGregor GR. An approach to hydrograph classification. Hydrological processes. 2000;14(2):317–38.
- 2. Brunner MI, Viviroli D, Furrer R, Seibert J, Favre AC. Identification of flood reactivity regions via the functional clustering of hydrographs. Water Resources Research. 2018;54(3):1852–67.
- 3. Fleischmann AS, Paiva RCD, Collischonn W, Sorribas MV, Pontes PRM. On river-floodplain interaction and hydrograph skewness. Water Resources Research. 2016;52(10):7615–30.
- 4. Anderson BG, Rutherfurd ID, Western AW. An analysis of the influence of riparian vegetation on the propagation of flood waves. Environmental Modelling & Software. 2006;21(9):1290–6.
- 5. Lininger KB, Latrubesse EM. Flooding hydrology and peak discharge attenuation along the middle Araguaia River in central Brazil. Catena. 2016;143:90–101.
- 6. Joo H, Lee J, Jun H, Kim K, Hong S, Kim J, et al. Optimal stream gauge network design using entropy theory and importance of stream gauge stations. Entropy. 2019;21(10):991.
- 7. Coxon G, Freer J, Wagener T, Odoni N, Clark M. Diagnostic evaluation of multiple hypotheses of hydrological behaviour in a limits‐of‐acceptability framework for 24 UK catchments. Hydrological Processes. 2014;28(25):6135–50.
- 8. Berghuijs WR, Aalbers EE, Larsen JR, Trancoso R, Woods RA. Recent changes in extreme floods across multiple continents. Environmental Research Letters. 2017;12(11):114035.
- 9. Coxon G, Freer J, Westerberg IK, Wagener T, Woods R, Smith P. A novel framework for discharge uncertainty quantification applied to 500 UK gauging stations. Water resources research. 2015;51(7):5531–46. pmid:26924859
- 10. McMillan H, Booker D, Cattoën C. Validation of a national hydrological model. Journal of Hydrology. 2016;541:800–15.
- 11. Beck HE, Vergopolan N, Pan M, Levizzani V, Van Dijk AI, Weedon GP, et al. Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling. Hydrology and Earth System Sciences. 2017;21(12):6201–17.
- 12. Addor N, Nearing G, Prieto C, Newman AJ, Le Vine N, Clark MP. A ranking of hydrological signatures based on their predictability in space. Water Resources Research. 2018;54(11):8792–812.
- 13. Alvarez-Garreton C, Mendoza PA, Boisier JP, Addor N, Galleguillos M, Zambrano-Bigiarini M, et al. The CAMELS-CL dataset: catchment attributes and meteorology for large sample studies–Chile dataset. Hydrology and Earth System Sciences. 2018;22(11):5817–46.
- 14. Slater LJ, Villarini G. Enhancing the predictability of seasonal streamflow with a statistical‐dynamical approach. Geophysical Research Letters. 2018;45(13):6504–13.
- 15. Melsen LA, Addor N, Mizukami N, Newman AJ, Torfs PJ, Clark MP, et al. Mapping (dis) agreement in hydrologic projections. Hydrology and Earth System Sciences. 2018;22(3):1775–91.
- 16. Gudmundsson L, Leonard M, Do HX, Westra S, Seneviratne SI. Observed trends in global indicators of mean and extreme streamflow. Geophysical Research Letters. 2019;46(2):756–66.
- 17. Sawicz K, Wagener T, Sivapalan M, Troch PA, Carrillo G. Catchment classification: empirical analysis of hydrologic similarity based on catchment function in the eastern USA. Hydrology and Earth System Sciences. 2011;15(9):2895–911.
- 18. Razavi T, Coulibaly P. Streamflow prediction in ungauged basins: review of regionalization methods. Journal of hydrologic engineering. 2013;18(8):958–75.
- 19. Jiang D, Wang K. The role of satellite-based remote sensing in improving simulated streamflow: A review. Water. 2019;11(8):1615.
- 20. Addor N, Do HX, Alvarez-Garreton C, Coxon G, Fowler K, Mendoza PA. Large-sample hydrology: recent progress, guidelines for new datasets and grand challenges. Hydrological Sciences Journal. 2020;65(5):712–25.
- 21. Andréassian V, Hall A, Chahinian N, Schaake J. Introduction and synthesis: Why should hydrologists work on a large number of basin data sets?: AISH; 2006.
- 22. Gupta H, Perrin C, Blöschl G, Montanari A, Kumar R, Clark M, et al. Large-sample hydrology: a need to balance depth with breadth. Hydrology and Earth System Sciences. 2014;18(2):463–77.
- 23. Stahl K, Hisdal H, Hannaford J, Tallaksen L, Van Lanen H, Sauquet E, et al. Streamflow trends in Europe: evidence from a dataset of near-natural catchments. Hydrology and Earth System Sciences. 2010;14(12):2367–82.
- 24. Parajka J, Viglione A, Rogger M, Salinas J, Sivapalan M, Blöschl G. Comparative assessment of predictions in ungauged basins–Part 1: Runoff-hydrograph studies. Hydrology and Earth System Sciences. 2013;17(5):1783–95.
- 25. Kuentz A, Arheimer B, Hundecha Y, Wagener T. Understanding hydrologic variability across Europe through catchment classification. Hydrology and Earth System Sciences. 2017;21(6):2863–79.
- 26. Arash AM, Fryirs K, Ralph TJ. Detection of decadal time‐series changes in flow hydrology in eastern Australia: Considerations for river recovery and flood management. Earth Surface Processes and Landforms. 2023.
- 27. Gnann SJ, Coxon G, Woods RA, Howden NJ, McMillan HK. TOSSH: A toolbox for streamflow signatures in hydrology. Environmental Modelling & Software. 2021;138:104983.
- 28. Willems P. A time series tool to support the multi-criteria performance evaluation of rainfall-runoff models. Environmental Modelling & Software. 2009;24(3):311–21.
- 29. Ternynck C, Alaya MAB, Chebana F, Dabo-Niang S, Ouarda TB. Streamflow hydrograph classification using functional data analysis. Journal of hydrometeorology. 2016;17(1):327–44.
- 30. Cohen TJ, Suesse T, Reinfelds I, Zhang N, Fryirs K, Chisholm L. The re-greening of east coast Australian rivers: An unprecedented riparian transformation. Science of The Total Environment. 2022;810:151309. pmid:34752861
- 31. Mei Y, Anagnostou EN. A hydrograph separation method based on information from rainfall and runoff records. Journal of Hydrology. 2015;523:636–49.
- 32. Tang W, Carey SK. Hyd R un: a MATLAB toolbox for rainfall–runoff analysis. Hydrological Processes. 2017;31(15):2670–82.
- 33. Oppel H, Mewes B. On the automation of flood event separation from continuous time series. Frontiers in Water. 2020;2:18.
- 34. Thomas BF. Hydrograph recession extraction algorithm (HYDRA): Minimizing influence of stage uncertainty in identification of recession events. Advances in Water Resources. 2021;152:103937.
- 35. Pozdniakov SP, Wang P, Grinevsky SO, Frolova NL. A Physically based model of a two‐pass digital filter for separating groundwater runoff from streamflow time series. Water Resources Research. 2022;58(3):e2021WR031333.
- 36.
ESRI. ArcGIS Pro: Release 3.2. Redlands, CA: Environmental Systems Research Institute. 2024.
- 37.
(BoM) BoM. Climate Data Online 2024 [Available from: http://www.bom.gov.au/climate/data/.
- 38. Rogers K, Woodroffe CD. Geomorphology as an indicator of the biophysical vulnerability of estuaries to coastal and flood hazards in a changing climate. Journal of Coastal Conservation. 2016;20(2):127–44.
- 39. O’Kane M, Fuller M. Report of the 2022 NSW Flood Inquiry. 2022.
- 40. Fryirs K, Hancock F, Healey M, Mould S, Dobbs L, Riches M, et al. Things we can do now that we could not do before: Developing and using a cross-scalar, state-wide database to support geomorphologically-informed river management. PloS one. 2021;16(1):e0244719. pmid:33481832
- 41. Fryirs K, Brierley GJ. Antecedent controls on river character and behaviour in partly confined valley settings: Upper Hunter catchment, NSW, Australia. Geomorphology. 2010;117(1):106–20.
- 42.
Bureau of Meteorology (BoM). Water Data Online [Internet]. 2024. Available from: http://www.bom.gov.au/waterdata/
- 43.
WaterNSW. Continuous Water Monitoring Network [Internet]. 2024. Available from: https://realtimedata.waternsw.com.au/
- 44.
NSW River Styles [Internet]. 2024. Available from: https://spatial.industry.nsw.gov.au/arcgis/rest/services/PUBLIC/NSW_RiverStyles/MapServer.
- 45. Turner-Gillespie DF, Smith JA, Bates PD. Attenuating reaches and the regional flood response of an urbanizing drainage basin. Advances in Water Resources. 2003;26(6):673–84.
- 46. Croke J, Reinfelds I, Thompson C, Roper E. Macrochannels and their significance for flood-risk minimisation: examples from southeast Queensland and New South Wales, Australia. Stochastic environmental research and risk assessment. 2014;28:99–112.
- 47. Allen GH, David CH, Andreadis KM, Hossain F, Famiglietti JS. Global estimates of river flow wave travel times and implications for low‐latency satellite data. Geophysical Research Letters. 2018;45(15):7551–60.
- 48. Lane SN. Natural flood management. Wiley Interdisciplinary Reviews: Water. 2017;4(3):e1211.
- 49. Barbetta S, Moramarco T, Perumal M. A Muskingum-based methodology for river discharge estimation and rating curve development under significant lateral inflow conditions. Journal of Hydrology. 2017;554:216–32.
- 50. McMahon TA, Peel MC. Uncertainty in stage–discharge rating curves: application to Australian Hydrologic Reference Stations data. Hydrological Sciences Journal. 2019;64(3):255–75.
- 51. Merz SK. Flows: A Method for Determining Environmental Water Requirements in Victoria: Department of Environment and Primary Industries; 2013.
- 52. Dyer J, Mercer A, Raczyński K. Identifying Spatial Patterns of Hydrologic Drought over the Southeast US Using Retrospective National Water Model Simulations. Water. 2022;14(10):1525.
- 53. Khaleghi M, Gholami V, Ghodusi J, Hosseini H. Efficiency of the geomorphologic instantaneous unit hydrograph method in flood hydrograph simulation. Catena. 2011;87(2):163–71.
- 54. Rebora N, Silvestro F, Rudari R, Herold C, Ferraris L. Downscaling stream flow time series from monthly to daily scales using an auto-regressive stochastic algorithm: StreamFARM. Journal of Hydrology. 2016;537:297–310.
- 55. Kulanuwat L, Chantrapornchai C, Maleewong M, Wongchaisuwat P, Wimala S, Sarinnapakorn K, et al. Anomaly detection using a sliding window technique and data imputation with machine learning for hydrological time series. Water. 2021;13(13):1862.
- 56. Kalu I, Ndehedehe CE, Okwuashi O, Eyoh AE, Ferreira VG. An assimilated deep learning approach to identify the influence of global climate on hydrological fluxes. Journal of Hydrology. 2022;614:128498.
- 57. Kumar V, Kedam N, Sharma KV, Mehta DJ, Caloiero T. Advanced machine learning techniques to improve hydrological prediction: a comparative analysis of streamflow prediction models. Water. 2023;15(14):2572.
- 58. Peng J, Liu T, Huang Y, Ling Y, Li Z, Bao A, et al. Satellite-based precipitation datasets evaluation using gauge observation and hydrological modeling in a typical arid land watershed of Central Asia. Remote Sensing. 2021;13(2):221.
- 59. Thakur JK, Singh SK, Ekanthalu VS. Integrating remote sensing, geographic information systems and global positioning system techniques with hydrological modeling. Applied Water Science. 2017;7(4):1595–608.
- 60. Merkuryeva G, Merkuryev Y, Sokolov BV, Potryasaev S, Zelentsov VA, Lektauers A. Advanced river flood monitoring, modelling and forecasting. Journal of computational science. 2015;10:77–85.
- 61. Boothroyd RJ, Williams RD, Hoey TB, MacDonell C, Tolentino PL, Quick L, et al. National-scale geodatabase of catchment characteristics in the Philippines for river management applications. Plos one. 2023;18(3):e0281933. pmid:36888610
- 62. Agnew D, Graves BP, Fryirs K. A GIS workflow for the identification of corridors of geomorphic river recovery across landscapes. PLoS One. 2022;17(12):e0278831. pmid:36512598
- 63. Chen D, Shams S, Carmona-Moreno C, Leone A. Assessment of open source GIS software for water resources management in developing countries. Journal of Hydro-environment Research. 2010;4(3):253–64.
- 64. Diehl RM, Gourevitch JD, Drago S, Wemple BC. Improving flood hazard datasets using a low-complexity, probabilistic floodplain mapping approach. Plos one. 2021;16(3):e0248683. pmid:33780467
- 65. Horsburgh JS, Reeder SL, Jones AS, Meline J. Open source software for visualization and quality control of continuous hydrologic and water quality sensor data. Environmental Modelling & Software. 2015;70:32–44.
- 66. Crochemore L, Isberg K, Pimentel R, Pineda L, Hasan A, Arheimer B. Lessons learnt from checking the quality of openly accessible river flow data worldwide. Hydrological Sciences Journal. 2020;65(5):699–711.
- 67. Pang J, Zhang H, Xu Q, Wang Y, Wang Y, Zhang O, et al. Hydrological evaluation of open-access precipitation data using SWAT at multiple temporal and spatial scales. Hydrology and Earth System Sciences. 2020;24(7):3603–26.
- 68. Njue N, Kroese JS, Gräf J, Jacobs S, Weeser B, Breuer L, et al. Citizen science in hydrological monitoring and ecosystem services management: State of the art and future prospects. Science of the Total Environment. 2019;693:133531. pmid:31635016
- 69. Starkey E, Parkin G, Birkinshaw S, Large A, Quinn P, Gibson C. Demonstrating the value of community-based (‘citizen science’) observations for catchment modelling and characterisation. Journal of Hydrology. 2017;548:801–17.
- 70. Nardi F, Cudennec C, Abrate T, Allouch C, Annis A, Assumpcao T, et al. Citizens AND HYdrology (CANDHY): conceptualizing a transdisciplinary framework for citizen science addressing hydrological challenges. Hydrological Sciences Journal. 2022;67(16):2534–51.
- 71. Atkinson R, Dornblut I, Smith D. An international standard conceptual model for sharing references to hydrologic features. Journal of Hydrology. 2012;424–425:24–36.
- 72. Ballio F, Molinari D, Minucci G, Mazuran M, Arias Munoz C, Menoni S, et al. The RISPOSTA procedure for the collection, storage and analysis of high quality, consistent and reliable damage data in the aftermath of floods. Journal of Flood Risk Management. 2015;11(S2).
- 73. De Martino M, Rosim S, Quarati A, editors. Hydrographic datasets in open government data portals: Mitigation of reusability issues through provenance documentation. Research Conference on Metadata and Semantics Research; 2019: Springer.
- 74. Johnson JM, Blodgett DL, Clarke KC, Pollak J. Restructuring and serving web-accessible streamflow data from the NOAA National Water Model historic simulations. Scientific Data. 2023;10(1):725. pmid:37863923
- 75. Lindersson S, Brandimarte L, Mård J, Di Baldassarre G. A review of freely accessible global datasets for the study of floods, droughts and their interactions with human societies. Wiley Interdisciplinary Reviews: Water. 2020;7(3):e1424.
- 76. Rossetto R, De Filippis G, Borsi I, Foglia L, Cannata M, Criollo R, et al. Integrating free and open source tools and distributed modelling codes in GIS environment for data-based groundwater management. Environmental Modelling & Software. 2018;107:210–30.
- 77. Stefanidis S, Alexandridis V, Theodoridou T. Flood exposure of residential areas and infrastructure in Greece. Hydrology. 2022;9(8):145.
- 78. Lane B, Garousi‐Nejad I, Gallagher MA, Tarboton DG, Habib E. An open web‐based module developed to advance data‐driven hydrologic process learning. Hydrological Processes. 2021;35(7):e14273.
- 79. Hall CA, Saia SM, Popp AL, Dogulu N, Schymanski SJ, Drost N, et al. A hydrologist’s guide to open science. Hydrology and Earth System Sciences. 2022;26(3):647–64.
- 80. Ekeu-wei IT, Blackburn GA. Applications of open-access remotely sensed data for flood modelling and mapping in developing regions. Hydrology. 2018;5(3):39.
- 81. Arash AM, Yasi M. The assessment for selection and correction of RS‐based DEMs and 1D and 2D HEC‐RAS models for flood mapping in different river types. Journal of Flood Risk Management. 2023;16(1):e12871.
- 82. Yaykiran S, Cuceloglu G, Ekdal A. Estimation of Water Budget Components of the Sakarya River Basin by Using the WEAP-PGM Model. Water. 2019;11(2).
- 83. Delorit J, Gonzalez Ortuya EC, Block P. Evaluation of model-based seasonal streamflow and water allocation forecasts for the Elqui Valley, Chile. Hydrology and Earth System Sciences. 2017;21(9):4711–25.
- 84. Hekmatzadeh AA, Torabi Haghighi A, Hosseini Guyomi K, Amiri SM, Kløve B. The effects of extremes and temporal scale on multifractal properties of river flow time series. River Research and Applications. 2020;36(1):171–82.