153-End to End build test. #266

charles-turner-1 · 2024-11-20T07:54:18Z

Closes #153.

This PR adds an end-to-end build test: a subset of the catalog is built, and queries are executed against it.

All queries on the subset catalog have been tested for correctness against the production catalog.
I've used CMIP5 & a single OM2 experiment to test against.
I've added a few minor refactors to the cli.py module to make testing easier.

@rbeucher can you link me to the Gadi SSH workflows you showed me the other day? I'll create a trigger for this based on those.
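As an illustration of why that kind of refactor helps testing: the commit log for this PR describes changing the cli.py functions to accept an optional argv sequence rather than no arguments. The snippet below is only a sketch of that general pattern, not the project's code. The function name `build_sketch` and both arguments are hypothetical.

```python
import argparse
from typing import Optional, Sequence


def build_sketch(argv: Optional[Sequence[str]] = None) -> argparse.Namespace:
    """Hypothetical entry point that parses argv if given, else sys.argv[1:]."""
    parser = argparse.ArgumentParser(description="Hypothetical build command")
    parser.add_argument("config_yaml", nargs="+", help="one or more config files")
    parser.add_argument("--out", default="./", help="hypothetical output directory")
    # parse_args(None) falls back to sys.argv[1:], so the command-line
    # behaviour is unchanged when no argv is passed in.
    return parser.parse_args(argv)


if __name__ == "__main__":
    # A test can call the function directly with a list of strings,
    # without patching sys.argv.
    args = build_sketch(["cmip5.yaml", "om2.yaml", "--out", "/tmp/catalog"])
    print(args.config_yaml, args.out)
```

With this shape, an end-to-end test can drive the same code path as the command line by passing a list of strings.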

codecov · 2024-11-20T07:58:49Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.61%. Comparing base (6092573) to head (6d543fd).
Report is 3 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #266      +/-   ##
==========================================
+ Coverage   97.90%   98.61%   +0.71%     
==========================================
  Files          11       11              
  Lines         811     1010     +199     
==========================================
+ Hits          794      996     +202     
+ Misses         17       14       -3

marc-white · 2024-11-21T00:27:14Z

src/access_nri_intake/cli.py

@@ -87,18 +94,19 @@ def _check_build_args(args_list):

    if len(names) != len(set(names)):
        seen = set()
-        dupes = [name for name in names if name in seen or seen.add(name)]
+        dupes = [name for name in names if name in seen or seen.add(name)]  # type: ignore
+        # seen.add(name) returns None & so is always Falsey - so what is it doing?


Please tell me Falsey is a British programming idiom :P

Seriously though, I think the or is being used as a sneaky way to avoid an if else statement to add things to the "I've now seen this name" set (seen).

It's going in order through the names list. The or is evaluated left to right, so:

If the name has been seen earlier in the list (i.e., name in seen == True), then it gets added to dupes. The seen.add(name) is never reached.

If name in seen == False, then it moves on to the second part of the or statement, which adds name to the seen set. You're right that it's always False, but that's the point - it's False on this pass, but now that it's been added to the seen set, if the same name recurs on a future pass, it is now known to be a duplicate.

(Interestingly, False or None evaluates to None, not False, but for this list comprehension I'm assuming anything not True is close enough to False to not care.)

    In [13]: (False or None) is None
    Out[13]: True

    In [14]: None or False
    Out[14]: False

Also, if a duplicated name appears N times in the list, then it will appear (N-1) times in the error message below - not sure if that's a bug or feature.

Funnily enough, I think Falsey is pretty common worldwide - just googled it and it looks like it mostly crops up in javascript land though.

More importantly though, I had completely forgotten that the or will get executed left -> right & that seen.add() will still evaluate, even though it returns None.

I think the False or None == None and None or False == False thing is the result of the same trick - None and False are both falsey, so we wind up getting the second value in either case. Makes perfect sense, but completely counterintuitive.

Either way, I'll remove the unnecessary comment & see if I can strip it down to 1 error message, not N-1.
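For reference, the idiom discussed in this thread can be reproduced in isolation. The snippet below is a minimal sketch, not the code in cli.py: the function names `find_dupes` and `find_dupes_once` and the sample list are made up for illustration. It shows the N-1 repetition noted above, plus one possible way (using `collections.Counter`) to report each duplicate only once.

```python
from collections import Counter


def find_dupes(names):
    """Return names seen more than once, using the `or seen.add()` idiom."""
    seen = set()
    # `name in seen` is checked first. If it is False, `seen.add(name)` runs,
    # returns None (falsey), and the name is filtered out on this pass.
    return [name for name in names if name in seen or seen.add(name)]


def find_dupes_once(names):
    """Alternative sketch: report each duplicated name exactly once."""
    return [name for name, count in Counter(names).items() if count > 1]


names = ["a", "b", "a", "c", "a", "b"]
print(find_dupes(names))       # ['a', 'a', 'b'] - "a" occurs 3 times, listed twice
print(find_dupes_once(names))  # ['a', 'b']
```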

marc-white · 2024-11-21T00:46:38Z

e2e/conftest.py

+    Get the XFAILS environment variable. We use a default of 1, indicating we expect
+    to add xfail marker to `test_parse_access_ncfile[AccessOm2Builder-access-om2/output000/ocean/ocean_grid.nc-expected0-True]`
+    unless specified.


Probably should be updated to reference which test a value of 1 will point to.

marc-white · 2024-11-21T00:47:23Z

e2e/conftest.py

+    """
+    This function is called by pytest to modify the items collected during test
+    collection. We use it here to mark the xfail tests in
+    test_builders::test_parse_access_ncfile when we check the file contents & to


marc-white · 2024-11-21T00:47:42Z

e2e/conftest.py

+        if (
+            item.name
+            in (
+                "test_parse_access_ncfile[AccessOm2Builder-access-om2/output000/ocean/ocean_grid.nc-expected0-True]",


marc-white · 2024-11-21T00:56:33Z

e2e/test_end_to_end.py

+    Return the current catalog as an intake catalog.
+    """
+    metacat = intake.cat.access_nri
+    yield metacat


What's the idea behind yielding here instead of returning? (Not an issue, just for my own edification.)

To be honest, I'm not entirely sure. pytest recommends using yield for fixtures, particularly if you might want some teardown code.

I originally wrote this using the tempfile module - I'd forgotten pytest contains its own temporary filesystem handlers - so I thought I might have needed to include some teardown code.

Given there is no teardown, I think it makes no practical difference.

e2e/test_end_to_end.py

marc-white

At the risk of causing the build time to blow out, is it worth making sure each Builder is represented in the E2E test?

marc-white · 2024-11-21T02:27:47Z

A couple of other things:

1. I presume the e2e directory is outside the tests directory so that it can have a separate conftest.py. However, I think that should still work as expected if e2e is a subdirectory of tests. If that's not the case, could e2e be renamed to something like tests_e2e so we have better visibility that it's a test suite, not code?
2. I think it would be nice if the standard pytest call doesn't try to execute the E2E tests. I think there are ways that a skipif can be used to make this happen, e.g.: https://stackoverflow.com/questions/33084190/default-skip-test-unless-command-line-parameter-present-in-py-test

rbeucher · 2024-11-21T02:31:12Z

Hi @charles-turner-1,

Regarding the Gadi deployment actions, the workflow file can be found here.

This workflow triggers the build.sh PBS script located in the admin/access-nri-intake-catalog/bin folder.

I’ve deployed the current main branch to the conda/access-med-0.10 environment. Fingers crossed, it works as expected!

rbeucher · 2024-11-21T02:40:22Z

Just tested with a subset that only includes CMIP5 and it worked fine:
https://github.com/ACCESS-NRI/access-nri-intake-catalog/actions/runs/11945531317
We should have a way to trigger a build test from GitHub.

charles-turner-1 · 2024-11-21T03:08:33Z

> A couple of other things:
>
> 1. I presume the e2e directory is outside the tests directory so that it can have a separate conftest.py. However, I think that should still work as expected if e2e is a subdirectory of tests. If that's not the case, could e2e be renamed to something like tests_e2e so we have better visibility that it's a test suite, not code?
> 2. I think it would be nice if the standard pytest call doesn't try to execute the E2E tests. I think there are ways that a skipif can be used to make this happen, e.g.: https://stackoverflow.com/questions/33084190/default-skip-test-unless-command-line-parameter-present-in-py-test

I virtually always run pytest using pytest tests - hence sticking it in a separate E2E folder & adding that to .github/workflows/ci.yml. Funnily enough, I originally called it e2e, not test_e2e or similar, to avoid pytest discovering & running it erroneously. Obviously, that didn't work (see 1.)

I think both your suggestions would make this much cleaner.

charles-turner-1 · 2024-11-22T01:52:14Z

I've done a bit more looking into this - frustratingly, it looks like there might be some complications with the pytest skipif decorator not working on fixtures. Hopefully this should have little/no impact, but it's gonna take a touch more digging to make sure we can merge the conftests & not cause any issues.
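One common pattern for "skip by default unless a command-line flag is present" is sketched below. It follows the approach in the pytest documentation and the Stack Overflow question linked earlier. The flag name `--e2e` and the marker name `e2e` are assumptions for illustration and may not match what this PR ended up using. Because the skip is applied to collected test items rather than to fixtures, it should not depend on skipif working on fixtures.

```python
# conftest.py (sketch)
import pytest


def pytest_addoption(parser):
    # Register a hypothetical --e2e flag; without it, e2e tests are skipped.
    parser.addoption(
        "--e2e", action="store_true", default=False, help="run end-to-end tests"
    )


def pytest_configure(config):
    # Declare the marker so that `pytest --strict-markers` accepts it.
    config.addinivalue_line("markers", "e2e: mark test as an end-to-end test")


def pytest_collection_modifyitems(config, items):
    if config.getoption("--e2e"):
        return  # flag given: leave the e2e tests enabled
    skip_e2e = pytest.mark.skip(reason="need --e2e option to run")
    for item in items:
        if "e2e" in item.keywords:
            item.add_marker(skip_e2e)
```

Under these assumptions, tests decorated with `@pytest.mark.e2e` would be skipped by a plain `pytest tests` and run with `pytest tests --e2e`.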

.github/workflows/e2e.yaml

marc-white

I think we're good to go now, merge when ready (and if you're satisfied the E2E testing is all set up correctly on the GitHub end).

rbeucher · 2024-11-26T05:44:55Z

.github/workflows/e2e.yaml

+    steps:
+      - name: Checkout repository
+        ### Latest at time of writing
+        uses: actions/checkout@v4.2.2


I would just assume we have a copy on Gadi for now. You can delete the checkout and the rsync.

charles-turner-1 and others added 14 commits November 18, 2024 14:00

Updated functions in cli.py to use argv[Optional[Sequence[str]] instead of no args - much easier to test

aeb978e

Skeleton of e2e test

9e785f4

Peelin apart argparse for e2e test

6206a7e

Pass build in using argparse

a4aab12

End to end build test working - now to add queries

c7efe95

Pre-commit

1f4feea

Updated workflow to only run tests in 'tests' dir

02fdd23

Lots of tests working - mostly just the content tests to finish

e194ead

Pre-commit

be30446

End to end test done & working. Now just needs a workflow trigger

37d3a68

formatting

436676a

Merge branch 'main' into 153-e2e

9be91f3

Removed unused build_subset.sh file

1d69c30

Merge branch '153-e2e' of https://github.com/ACCESS-NRI/access-nri-intake-catalog into 153-e2e

5071f0b

marc-white reviewed Nov 21, 2024

e2e/test_end_to_end.py (outdated)

marc-white reviewed Nov 21, 2024

charles-turner-1 and others added 3 commits November 25, 2024 08:36

Removed some redundant stuff (Marc's comments)

6511acc

Moved end to end test into tests dir (yet to see if we can get it all working smoothly from there

ab511ab

[pre-commit.ci] auto fixes from pre-commit.com hooks

efd3fcd

for more information, see https://pre-commit.ci

charles-turner-1 and others added 5 commits November 25, 2024 09:33

Added workflow & shell script to submit it to Gadi

7a56289

Clean up test file (remove unused string, etc etc)

b91d1df

Clean up test file (remove unused string, scope=session => module)

823613b

Merge branch '153-e2e' of https://github.com/ACCESS-NRI/access-nri-intake-catalog into 153-e2e

d7715be

Merge branch 'main' into 153-e2e

04d14c4

charles-turner-1 marked this pull request as ready for review November 24, 2024 22:57

charles-turner-1 requested review from rbeucher and marc-white November 24, 2024 22:58

marc-white reviewed Nov 25, 2024

tests/e2e/test_end_to_end.py

marc-white reviewed Nov 25, 2024

tests/e2e/test_end_to_end.py (outdated)

charles-turner-1 and others added 5 commits November 25, 2024 11:40

Added reference to e2e tests in docs

40918c0

Merge branch '153-e2e' of https://github.com/ACCESS-NRI/access-nri-intake-catalog into 153-e2e

96b88a6

Cleaned up fixture, removed redundant second conftest.py, fixed configs location

dd8c090

Cleaning up a few tests where mocks no longer necessary

268c98f

[pre-commit.ci] auto fixes from pre-commit.com hooks

a63bdab

for more information, see https://pre-commit.ci

charles-turner-1 commented Nov 25, 2024

.github/workflows/e2e.yaml

charles-turner-1 mentioned this pull request Nov 25, 2024

[Potential BUG] Teardown of temporary file structures in tests may not be complete #270

Closed

Merge remote-tracking branch 'origin/main' into 153-e2e

4629f27

marc-white approved these changes Nov 26, 2024

rbeucher approved these changes Nov 26, 2024

charles-turner-1 added 2 commits November 26, 2024 16:54

Fixed workflow trigger & type hint that disappeared

54abc39

Pre-commit

6d543fd

charles-turner-1 merged commit 39bea88 into main Nov 26, 2024
18 checks passed

charles-turner-1 mentioned this pull request Nov 26, 2024

End to end build test #238

Closed
