Replies: 12 comments
-
Another topic that often confuses me:
My understanding is that only the number of obs used/rejected and the spread (and mean) |
Beta Was this translation helpful? Give feedback.
-
Kevin raises a pretty complex issue here that we have definitely not discussed enough in the documentation. The preassim stage is the one that is most straightforward to interpret in a DA context. It is the actual prior for the Bayesian update done by the data assimilation and it is the one that should have an appropriate spread/RMSE relationship. It is also the one that is consistent with the priors in the observation space diagnostics which are the forward operators applied to the inflated prior ensemble. Kevin's comment about 'non-physical' is relevant, although primarily for a subset of the state. Obvious example is when the inflation leads to a quantity that is guaranteed to be positive definite by the model becoming negative. The question of comparing to the analysis stage is also subtle. If one is using only prior inflation, as is most common for us, the assumption is that the posterior spread is mostly okay (or we just don't care). We could use both prior and posterior inflation and then things would be more comparable in some sense. As we know, I am pretty strongly opposed to validating posteriors, even in OSSEs, so I am not sure if I'm that worried about this. Does Moha want to weigh in on this? I'm thinking that the use of the 'preassim' as the default is the best choice, but... |
Beta Was this translation helpful? Give feedback.
-
I think that my comment about "nonphysical" includes more than "unallowed values". I hadn't thought of comparing the inflated preassim with an inflated "output" stage. If we compare the (inflated) preassim and output stages, we'll be misled. |
Beta Was this translation helpful? Give feedback.
-
In the case of no prior inflation, the files from the preassim and forecast stages should be exactly the same. As far as I'm concerned, inflation is the only difference separating these two stages. Are there any differences? If the answer is no, then I agree with Jeff the preassim stage files are the ones we should at. In principle, these are the priors that describe the prior distribution of the state before the update. Keep in mind, the ensemble mean will always be the same before and after inflation. This also applies to the RMSE. If the user wants to assess the individual ensemble members, he/she/they should be aware that forecast is pure model and preassim includes inflation. We can stress this point on the webpage if it's not clear. With that said, comparing the analysis and preassim means and spreads (to get an idea of the DA increment) is totally fine. Again, the only place where one should be careful is when looking at the individual ensemble members. Remark: Just a clarification on comparing the RMSE with the prior spread. What we would like to match is the RMSE and the total spread (rather than only the spread of the ensemble) |
Beta Was this translation helpful? Give feedback.
-
Ah yes, I'd forgotten about the means being preserved. Thanks for that. |
Beta Was this translation helpful? Give feedback.
-
Means are preserved and RMSE is just a function of the mean. The original 1999 paper was predicated on the idea that multiplicative inflation preserved means and correlations, but not covariances. That goes flying out the window as soon as there's any localization or if the inflation is 'spatially-varying'. The bounded quantity issue is still relevant here, however. |
Beta Was this translation helpful? Give feedback.
-
Here's a possible misunderstanding. |
Beta Was this translation helpful? Give feedback.
-
I was also confused by the statement that the RMSE is just a function of
the mean.
The 'squaring' part makes the actual values important.
forecast = [-3 -2 -1 0 1 2 3];
preassim = [-4 -3 -2 0 2 3 4];
[mean(forecast) mean(preassim)]
ans = 0 0
[rms(forecast) rms(preassim)]
ans = 2.0000 2.8785
|
Beta Was this translation helpful? Give feedback.
-
The 'error' is the vector difference between the prior ensemble mean and the observation. The prior ensemble mean is unchanged by the inflation. Therefore, the error is unchanged by the inflation and so is the RMSE which is simply a function of the error. |
Beta Was this translation helpful? Give feedback.
-
Got it ... that is consistent with the way *obs_diag.f90* works since it is computing the stated quantities using the ensemble mean and observation values, not the individual values of the forward operators. Not sure this is clear for anyone creating their own post-processing routines (which means writing their own observation sequence file reader, and always putting out the full ensemble - something very few people actually do).
|
Beta Was this translation helpful? Give feedback.
-
Another possible misunderstanding: |
Beta Was this translation helpful? Give feedback.
-
Could also be in state space. Error is the difference between the prior ensemble mean and the truth. Again, has nothing to do with the prior ensemble spread. Vector here referes to the fact that there are 1 or more prior quantities being evaluated. Those could be prior estimates of state variables, or any function of state variables (including estimates of observations). The 'M' in RMSE is computing a mean over the elements of the squared errors. |
Beta Was this translation helpful? Give feedback.
-
The page recommends using the "preassim" stage for evaluation of an assimilation.
This is often not the best choice because it has the inflated fields in it,
which are non-physical, and can't be compared with the "analysis" stage.
The "forecast" stage is comparable to the analysis.
Beta Was this translation helpful? Give feedback.
All reactions