Note

Go to the end to download the full example code.

Modelling multivariate extremes#

Extreme value modelling is a central topic in the design of offshore structures.

In this package, we used the Peaks Over Threshold (POT) approach: the data above a defined threshold is kept, independent exceedances are identified based on a time-separation criterion and a GPD distribution is fitted to the set of clusters maxima. The particular case of Exponential distribution is selected using an AIC criterion. This procedure is based on the excellent package pyextremes.

We also propose in this package to compute multivariate extremes using the methodology described in Raillard et al (2019).

Data extraction and univariate models#

We have selected a location next to Ushant island, node number 36855 and we compute the wind intensity using the resourcecode.utils.zmcomp2metconv() function of the package.

We also define here the tuning parameters for fitting the models to the data:

quantile above which the model is fitted;
de-clustering parameter, specified in hours.

Then, we fit the POT model to the data.

client = resourcecode.Client()
data = client.get_dataframe_from_url(
    "https://resourcecode.ifremer.fr/explore?pointId=136855",
    parameters=("hs", "uwnd", "vwnd"),
)
data["wspd"], data["wdir"] = resourcecode.utils.zmcomp2metconv(data.uwnd, data.vwnd)

# Tuning parameters for the POT model
quantile_fit = 0.95
r = "72H"
return_periods = np.array([1, 2, 5, 10, 25, 50, 100])
models = get_fitted_models(data[["hs", "wspd"]], quantile_fit, r)
gpd_parameters = get_gpd_parameters(models)

/opt/hostedtoolcache/Python/3.11.11/x64/lib/python3.11/site-packages/pyextremes/extremes/peaks_over_threshold.py:18: FutureWarning:

'H' is deprecated and will be removed in a future version. Please use 'h' instead of 'H'.

/opt/hostedtoolcache/Python/3.11.11/x64/lib/python3.11/site-packages/pyextremes/eva.py:525: FutureWarning:

'H' is deprecated and will be removed in a future version. Please use 'h' instead of 'H'.

/opt/hostedtoolcache/Python/3.11.11/x64/lib/python3.11/site-packages/pyextremes/extremes/peaks_over_threshold.py:18: FutureWarning:

'H' is deprecated and will be removed in a future version. Please use 'h' instead of 'H'.

/opt/hostedtoolcache/Python/3.11.11/x64/lib/python3.11/site-packages/pyextremes/eva.py:525: FutureWarning:

'H' is deprecated and will be removed in a future version. Please use 'h' instead of 'H'.

Fitted model for \(H_s\)#

We can plot the fitted model against the observations

fig = models[0].plot_diagnostic()
plot.show()

Return value plot, Probability density plot, Q-Q plot, P-P plot

Below, one can find the estimated return levels for \(H_s\)

return_levels_Hs = models[0].get_summary(return_period=return_periods, alpha=0.95)
return_levels_Hs

	return value	lower ci	upper ci
return period
1.0	9.558622	9.178443	9.927872
2.0	10.522578	9.945388	10.961319
5.0	11.661462	10.759296	12.367068
10.0	12.431003	11.280748	13.418608
25.0	13.340192	11.853686	14.638731
50.0	13.954528	12.213844	15.467969
100.0	14.512067	12.521210	16.244726

Fitted model for \(W_s\)#

The diagnostic plots are below

fig = models[1].plot_diagnostic()
plot.show()

Lastly, the estimated return levels for \(W_s\)

return_levels_Wspd = models[1].get_summary(return_period=return_periods, alpha=0.95)
return_levels_Wspd

	return value	lower ci	upper ci
return period
1.0	21.225659	20.771930	21.723203
2.0	22.654234	22.021782	23.396844
5.0	24.367800	23.357643	25.457977
10.0	25.543490	24.178229	27.073757
25.0	26.953722	25.092697	29.107160
50.0	27.921291	25.674009	30.590525
100.0	28.810989	26.153189	32.033587

Multivariate model: Gaussian copula#

We will fit the censored Gaussian copula to estimate the joint distribution of extremes. thanks to the resourcecode.eva.censgaussfit() function.

rho_nataf = censgaussfit(data[["hs", "wspd"]].values, quantile_fit).x

In the reference paper, they used the Huseby approach to compute environmental countours, which is a method based on monte-carlo simulation. The function func:resourcecode.eva.run_simulation is thus used to simulate from the fitted model, using the marginal distributions found earlier.

nataf = run_simulation(rho_nataf, quantile_fit, gpd_parameters, n_simulations=1000000)

Environmental contours#

We first have to define the appropriate level set for the Huseby method.# They are defined using the number of joint excess of \(H_s\) and \(W_s\).

npy = data.query(
    f"hs>{models[0].extremes_kwargs['threshold']} and wspd>{models[1].extremes_kwargs['threshold']}"
).shape[0] / (np.unique(data.index.year).size)
levels = 1 - 1 / (return_periods * npy)

We can then use the resourcecode.eva.huseby() function to effectively compute the contours. The value of 120 can be changed to obtain smoother contours, at the price of higher computationnal cost.

with warnings.catch_warnings():
    warnings.simplefilter("ignore")
    # on some cases, we may divide by zero, and numpy warns us
    # about that. Let's hide this warning to the users.
    X, Y, theta = huseby(nataf, levels, 120)

Lastly, we can plot the contours, here single lines.

fig = go.Figure()

fig.add_traces(
    [
        go.Scatter(
            x=X[:, i],
            y=Y[:, i],
            mode="lines",
            name=str(retlev),
        )
        for i, retlev in enumerate(return_periods)
    ]
)

fig.update_layout(
    scene=dict(
        xaxis_title=data.columns[0],
        yaxis_title=data.columns[1],
        zaxis_title=data.columns[2],
    ),
    height=1200,
)
fig

Total running time of the script: (0 minutes 24.366 seconds)

Gallery generated by Sphinx-Gallery

Modelling multivariate extremes

Contents

Modelling multivariate extremes#

Data extraction and univariate models#

Fitted model for \(H_s\)#

Fitted model for \(W_s\)#

Multivariate model: Gaussian copula#

Environmental contours#