Add sum model #364

MuellerSeb · 2024-08-11T15:05:52Z

This PR adds a SumModel class to represent sums of Covariance models.

Sum-Model

added SumModel class
- represents sum of covariance models
- behaves just as a normal covariance model with kriging and field generation
- covariance models can be added with overloaded + operator: model = m1 + m2
- class is subscriptable to access sub-models by index: m1 = model[0]
- included models will get a nugget of 0 and the nugget is stored separately in the sum-model
- model variance is the sum of the sub-model variances
- model length-scale is weighted sum of sub-model len-scales, where the weights are the ratios of the sub-models variance to the sum variance (motivated by the integral scale, which satisfies this relation)
- anisotropy and rotation need to be the same for all included sub-models
- parameters of the sub-models can be accessed by name with added index suffix: model[0].nu == model.nu_0
- fitting: if len_scale is fixed, none of the len_scale_<i> can be fixed since len_scale is calculated from variance ratios
added Nugget class (empty SumModel)
- allow len scale of 0 in CovModel to enable a pure nugget model
- added zero_var and model attributes to Generator ABC to shortcut field generation for pure nugget models

import gstools as gs
m1 = gs.Gaussian(dim=2, var=1.0, len_scale=2.0)
m2 = gs.Gaussian(dim=2, var=2.0, len_scale=20.0)
model = m1 + m2
model.plot()

Other changes

removed var_raw attribute from CovModel (was rarely used and only relevant for the truncated power law models)
- BREAKING CHANGE (but not to many should be affected)
- TPLCovModel now has a intensity attribute which calculates what var_raw was before
simplified variogram fitting (var_raw was a bad idea in the first place)
variogram plotting now handles a len-scale of 0 (to properly plot nugget models)
fitting: when sill is given and var and nugget are deselected from fitting, an error is raised if given var+nugget is not equal to sill (before, they were reset under the hood in a strange way)

…_integral_scale

…cale

…er repr

…operty to TPL models

… CovModel

…re derived from correlation not cor

…into add_sum_model

MuellerSeb · 2025-02-08T13:12:10Z

@LSchueler this is ready for review.

LSchueler · 2025-02-10T11:41:04Z

Awesome, I'll need some time to thoroughly go throw your changes.
But I already saw that you claimed example number 11 for the sum model after I did the same for the plurigaussian fields :-D Do we just keep adding new examples in an ascending order according to the time a PR is merged?

MuellerSeb · 2025-02-10T11:43:39Z

First come, first serve :-D

LSchueler

Sorry for taking so long to finish this review, but as you know, I had the best reasons ;-)

I'm really looking forward to sum models being integrated into GSTools, but I found quite a few things which should be addressed before merging.

LSchueler · 2025-02-11T12:45:57Z

examples/11_sum_model/00_simple_sum_model.py

+We'll combine a Spherical and a Gaussian covariance model to construct
+a sum model, visualize its variogram, and generate spatial random fields.
+
+Let's start with importing GSTools setting up the domain size.


"and" missing

LSchueler · 2025-02-11T12:47:38Z

examples/11_sum_model/00_simple_sum_model.py

+# First, we create two individual covariance models: a :any:`Spherical` model and a
+# :any:`Gaussian` model. The Spherical model will emphasize small-scale variability,
+# while the Gaussian model captures larger-scale patterns.
+


This formulation might give the impression that the small-scale and large-scale patterns are due to the covariance models. You should clarify that this is due to the chosen length scales.

LSchueler · 2025-02-11T12:53:23Z

examples/11_sum_model/00_simple_sum_model.py

+# As shown, the Spherical model controls the behavior at shorter distances,
+# while the Gaussian model dominates at longer distances.


I think you should also briefly mention how the variance of the covariance models influences the resulting fields.

LSchueler · 2025-02-11T12:57:30Z

examples/11_sum_model/01_fitting_sum_model.py

+
+###############################################################################
+# We fit the sum model to the empirical variogram using GSTools' built-in
+# fitting capabilities. As seen in the representation, the variances and length


I don't really understand the "As seen in the representation" part.

LSchueler · 2025-02-11T12:59:01Z

examples/11_sum_model/01_fitting_sum_model.py

+###############################################################################
+# We fit the sum model to the empirical variogram using GSTools' built-in
+# fitting capabilities. As seen in the representation, the variances and length
+# scales of the individual models can be accessed by the attributes


I would also generalise this from "the variances and length scales" to "the properties of the individual models, e.g. variance and length scales" or something similar.

LSchueler · 2025-03-23T20:28:18Z

src/gstools/covmodel/sum_tools.py

+        raise ValueError(msg)
+    ids = range(len(summod))
+    if fail := set(skip) - set(ids):
+        msg = f"SumModel.set_var_weights: skip ids not valid: {fail}"


"skip invalid ids"

fix this further down too, please.

LSchueler · 2025-03-23T20:29:17Z

src/gstools/covmodel/sum_tools.py

+    var_sum = sum(summod.models[i].var for i in skip)
+    var_diff = var - var_sum
+    if var_diff < 0:
+        msg = "SumModel.set_var_weights: skipped variances to big."


skipped too large variances

LSchueler · 2025-03-23T20:30:22Z

src/gstools/covmodel/sum_tools.py

+    len_sum = sum(summod[i].len_scale * summod.ratios[i] for i in skip)
+    len_diff = len_scale - len_sum
+    if len_diff < 0:
+        msg = "SumModel.set_len_weights: skipped length scales to big."


Please fix grammar

LSchueler · 2025-03-23T20:39:06Z

src/gstools/covmodel/fit.py

+    return para, sill, anis, sum_cfg
+
+
+def _check_sill(model, para_select, sill, bnd, sum_cfg):


Please add a short description of what exactly is checked.

LSchueler · 2025-03-23T20:44:59Z

src/gstools/covmodel/fit.py

+    if is_sum:
+        if sum_cfg["var_size"] > 0:
+            var_vals = popt[para_skip : para_skip + sum_cfg["var_size"]]
+            para_skip += sum_cfg["var_size"]
+            if sum_cfg["var_fix"]:
+                model.set_var_weights(
+                    stick_breaking_uniform(var_vals),
+                    sum_cfg["var_skip"],
+                    sum_cfg["fix"]["var"],
+                )
+            else:
+                for i, val in zip(sum_cfg["var_fit"], var_vals):
+                    setattr(model, f"var_{i}", val)
+        if sum_cfg["len_size"] > 0:
+            len_vals = popt[para_skip : para_skip + sum_cfg["len_size"]]
+            para_skip += sum_cfg["len_size"]
+            if sum_cfg["len_fix"]:
+                model.set_len_weights(
+                    stick_breaking_uniform(len_vals),
+                    sum_cfg["len_skip"],
+                    sum_cfg["fix"]["len_scale"],
+                )
+            else:
+                for i, val in zip(sum_cfg["len_fit"], len_vals):
+                    setattr(model, f"len_scale_{i}", val)
+        for i in range(model.size):
+            fit_para[f"var_{i}"] = model.vars[i]
+            fit_para[f"len_scale_{i}"] = model.len_scales[i]
+    # handle sill
+    if sill is not None and para["var"]:
+        nugget = sill - model.var
+        fit_para["nugget"] = nugget
+        model.nugget = nugget


There are quite a few pretty large if is_sum branches all over the place here. Maybe you can find a better structure for the code in this module?

MuellerSeb added 12 commits July 26, 2024 23:11

plot: handle len_scale of 0

94b6c29

covmodel: init len_scale with integral_scale if given; remove unused …

abbb062

…_integral_scale

bounds: allow interval with equal bounds if closed

c39866a

CovModel: add force_values logic

df9f2ba

generator: shortcut for 0 variance

a3f1ec6

CovModel: safer usage of privat attr; anis set bug fix; simpler int s…

8b3ad81

…cale

CovModel: check _init in bounds setter; also compare geo_scale; simpl…

a60b3d7

…er repr

CovModel: remove force arguments mechanic

92210ad

pylint fixes

8999bf9

TPL: remove var_raw and var_factor logic and only add intensity as pr…

d643b45

…operty to TPL models

tests: remove tests for var_raw

81e5421

CovModel: remove mechanism for fixed args again

669e9a2

MuellerSeb marked this pull request as draft August 11, 2024 15:05

MuellerSeb added 17 commits August 12, 2024 00:34

CovModel: simplify fitting routines

01d1d82

CovModel: fix setting integral scale as list of values

418038f

no need to set var last anymore

d8b037f

fit: add comment

0b63225

add ratio error class; better arg checking routines

21d0e4a

CovModel: better dim setting

e6374db

add sum_tools submodule

dcb003b

add SumModel class

057c888

add pure Nugget model

656125a

CovModel: let the sum magic happen

dbdfa27

pylint fixes

2858f06

fix doc-string of SumModel for docs

947619a

CovModel: add switch to add doc string

480fa32

Fourier: fix doc

171efba

SumModel: models need to either be all instances or all subclasses of…

e7838cf

… CovModel

SumModel: sum models have a constant rescale factor of 1 since they a…

0e35a22

…re derived from correlation not cor

CovModel: add 'needs_fourier_transform' attribute

2ded2e0

MuellerSeb added 11 commits January 6, 2025 17:26

lint

d03146f

Merge branch 'add_sum_model' of github.com:GeoStat-Framework/GSTools …

ee485c2

…into add_sum_model

pylint: set all config in pyproject

5dc6b16

Normalizer: implement simple derivative to remove scipy dependency

64b102e

fit: correctly check for parameters to fit

f475c1e

fit: fix init guess for var_i

8ea78f8

fix typo

3eee1dd

Doc: add mini-gallery to tutorials

e3b944e

typo

8a97d0d

Docs: add first examples for sum models

f0b113e

Merge branch 'main' into add_sum_model

361860a

MuellerSeb added this to the 1.7 milestone Feb 3, 2025

MuellerSeb added enhancement New feature or request Refactoring Code-Refactoring needed here labels Feb 3, 2025

MuellerSeb self-assigned this Feb 3, 2025

MuellerSeb added 9 commits February 7, 2025 22:15

SumModel: copy CovModel instances to prevent strange errors

af06e49

SumModel: also copy models of other summodels

fb0a01e

make sum-models comparable

e8ea93c

SumModel: remove RatioError, use ValueError instead

c64af49

SumModel: add tests

c885952

add test data to manifest

c66d722

Merge branch 'add_sum_model' of github.com:GeoStat-Framework/GSTools …

942bca7

…into add_sum_model

pylint fix

fedb294

SumModel: more tests

48ec88c

MuellerSeb marked this pull request as ready for review February 8, 2025 13:11

MuellerSeb requested a review from LSchueler February 8, 2025 13:11

LSchueler requested changes Mar 23, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sum model #364

Add sum model #364

MuellerSeb commented Aug 11, 2024 •

edited

Loading

MuellerSeb commented Feb 8, 2025

LSchueler commented Feb 10, 2025

MuellerSeb commented Feb 10, 2025

LSchueler left a comment

LSchueler Feb 11, 2025

LSchueler Feb 11, 2025

LSchueler Feb 11, 2025

LSchueler Feb 11, 2025

LSchueler Feb 11, 2025

LSchueler Mar 23, 2025

LSchueler Mar 23, 2025

LSchueler Mar 23, 2025

LSchueler Mar 23, 2025

LSchueler Mar 23, 2025

		# As shown, the Spherical model controls the behavior at shorter distances,
		# while the Gaussian model dominates at longer distances.

		return para, sill, anis, sum_cfg


		def _check_sill(model, para_select, sill, bnd, sum_cfg):

Add sum model #364

Are you sure you want to change the base?

Add sum model #364

Conversation

MuellerSeb commented Aug 11, 2024 • edited Loading

Sum-Model

Other changes

MuellerSeb commented Feb 8, 2025

LSchueler commented Feb 10, 2025

MuellerSeb commented Feb 10, 2025

LSchueler left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MuellerSeb commented Aug 11, 2024 •

edited

Loading