The NASA LIS system was used in the present study, as it supports DA
58 (link),59 (link), multiple LSMs, and streamflow routing through the HyMAP routing model
45 (link). The LIS LSM and DA system is well suited to resolve this and other extreme events, as it has been successfully demonstrated in several DA scenarios, specifically in the assimilation of surface soil moisture
39 (link),58 (link),59 (link), snow
24 (link)–26 (link),39 (link), leaf area index
20 (link),57 (link), and other surface variables
18 (link),57 (link),60 (link).
Version 4.0.1 of the Noah-MP LSM
44 (link) is used with dynamic vegetation, and the runoff scheme is TOPMODEL with groundwater
44 (link). Noah-MP is run at 10-km grid resolution for the full Mississippi River basin domain (28.55°–49.95° N, 77.95°–113.95° W). Noah-MP uses four soil levels with depths (from top to bottom) of 0.1, 0.3, 0.6 and 1.0 m. The OL simulation was spun up from 1 January 1980 and executed through 1 January 2021 with MERRA2 atmospheric forcing and IMERG precipitation (when available starting in 2002). HyMAP streamflow routing is added to the OL simulation starting 1 January 2000 and continued through 1 January 2021. As the SMAP dataset does not begin until early 2015, the AMSR-2 DA, SMAP DA, and full multi-variate DA simulations were all started on 1 April 2015 with initial conditions from the OL simulation. The MODIS-DA simulation was initialized from the OL simulation on 1 July 2002, at the start of the MODIS LAI dataset.
LIS uses a 1-dimensional Ensemble Kalman Filter
61 (link) (EnKF) for DA. This DA method is used for all three datasets and the multivariate DA. EnKF has been widely used in prior studies
61 (link)–66 , and it enables the DA to account for both model errors and non-linear tendencies within the model processes. EnKF additionally allows for the characterization of model errors with an ensemble, and it is capable of resolving non-linear model dynamics and discontinuities in temporal observations
61 (link). EnKF DA has also been demonstrated to add value to model variables in a theoretical case study, where forcing and model biases were introduced, with snow and soil moisture
65 (link). The EnKF process alternates between the model forecast timestep and the analysis update step. EnKF is executed starting with the forecast timestep (i.e., running the LSM forward). In Eq. 1, let the forecast timestep be (k) and
ƒsu the prior analysis timestep be (k−1). Subscripts (i) and (j) are the x and y grid points in the model domain, respectively. When the LSM is run, the model state (for any given variable) at a grid point is projected forward from (
) to the LSM state at k (
). The analysis step is then computed as follows:
Here, the posterior state (
) combines the observation state (
) and the a priori state (
). In this equation,
represents the observation operator that relates the model states to the observation.
is the Kalman gain, which weights the impact of forecast innovations
in the analysis update. Kalman gain is based on the model and observation error covariances.
The DA methods and settings used here follow previous work
57 (link), and the settings for forcing perturbations and model variable perturbations are shown in Supplementary Table
S1. The forcing perturbations are the same for the snow depth, soil moisture, LAI, and multivariate DA simulations, and these settings follow prior applications of EnKF DA
59 (link),67 (link). Note that the state vectors for snow depth, soil moisture, and LAI, are all independent and executed as separate instances of EnKF DA. All DA cases have 20 ensemble members. The 20-member ensemble configuration is sufficient to avoid sampling errors for snow DA, and has been demonstrated in prior work
26 (link). The forcing state perturbation is 1-h, and the model state perturbation is 3-h. Observation state perturbations depend on the data temporal resolution and are 3-h for soil moisture and snow depth, and 24-h for LAI. Perturbations to atmospheric forcing account for cross-correlation in space and time between variables, which can better simulate the effects of model uncertainty
25 (link).
For snow depth DA, AMSR-2 microwave snow depth retrievals are used, and this is constrained by MODIS snow cover. The Snow Depth and SWE variables of Noah-MP were updated for the AMSR-2 assimilation step. Both of these variables are also perturbed in the DA step (see Table
S1 in Supplementary Material). Previous work demonstrated that this method could improve simulated snow states across North America
24 (link). Furthermore, the Noah-MP snow scheme adds value over the earlier Noah LSM, as it uses a 3-layer snow model that accounts for the full energy balance of the snow pack
44 (link). This generally improves the performance of the Noah-MP LSM for resolving snowpack
68 (link)–72 (link).
SMAP soil moisture
20 (link),41 (link),57 (link) is used for soil moisture assimilation. SMAP soil moisture is downscaled to 1-km using the Thermal Hydraulic of Soil Moisture Disaggregation (THySM) algorithm
73 (link), which is also available online through the USDA Crop-CASMA system (
https://nassgeo.csiss.gmu.edu/CropCASMA/). When available, both ascending (6 PM) and descending (6 AM) paths of data are used for assimilation. As THySM SMAP product is based on surface soil moisture, the shallowest layer (0–10 cm) of Noah-MP is updated. The LIS “anomaly correction” algorithm is used to correct any biases in the SMAP product
74 (link).
The MCD15A2H Version 6 MODIS product
75 is used for LAI assimilation. While the MODIS MCD15A2H Version 6 product has an 8-day temporal resolution, a smoothing algorithm within the LIS system is used, so that assimilation may be performed daily.
For all the model experiments, means and standard deviations for the model variables (including soil moisture and LAI) are computed from 2016 through 2020. While we acknowledge this is a short analysis period, it is necessary because of the limits of the DA datasets (SMAP is only available from April 2015 to present). For analysis, SWE melt, soil moisture, and LAI are considered in 2019. SWE melt and soil moisture analysis use anomalies (analysis year—mean), and LAI analysis uses standardized anomalies (analysis year–mean)/standard deviation, respectively.
SSIM for SWE melt and LAI is computed using the SSIM function of the python scikit package (available online at:
https://scikit-image.org/docs/stable/auto_examples/transform/plot_ssim.html), and gaussian smoothing was performed using the ndimage.filters.gaussian_filter function of the python SciPy package (available online at:
https://docs.scipy.org/doc/scipy-0.14.0/reference/generated/scipy.ndimage.filters.gaussian_filter.html). Note that this additional step was necessary since EnKF does not include smoothing when DA is performed, and smoothing of model and observation data eliminates the influence local-scale discontinuities and errors in the DA and observation based datasets.
Lahmers T.M., Kumar S.V., Locke K.A., Wang S., Getirana A., Wrzesien M.L., Liu P.W, & Ahmad S.K. (2023). Interconnected hydrologic extreme drivers and impacts depicted by remote sensing data assimilation. Scientific Reports, 13, 3411.