feat(backend): configurable log level for driver / launcher images #11278

CarterFendley · 2024-10-08T18:52:31Z

IMPORTANT

Design changes have been made since the original description, see this comment for updated usage

Description of your changes:

This PR adds the ability to change log level in the driver / launcher containers. This is implemented in a similar pattern as the overrides for driver / launcher images. Specifically, you can add the following environment variables to the ml-pipeline deployments:

    spec:
      containers:
      - env:
        - name: DRIVER_LOG_LEVEL
          value: "3"
        - name: LAUNCHER_LOG_LEVEL
          value: "3"

Note: A numerical value such as the literal 3 not "3" here will be invalid deployment spec and validation on the spec will fail causing kubectl edit to reject it with the message: error: deployments.apps "ml-pipeline" is invalid.

Other minor alterations

In this commit two locations were updated to use the workflowCompiler.driverImage and workflowCompiler.launcherImage attributes which are populated here. This is a very minor change but seemed better to invoke only once and match other such usages (in importer.go and dag.go). If there are reasons this should be re-invoked, please let me know.
The --copy flag were moved into the arguments block, to match other implementations. Again, lmk if this is not wanted.

Feedback wanted

The environment variable for this is similar to the V2_LAUNCHER_IMAGE and V2_DRIVER_IMAGE but without the V2_ prefix. If anyone has preferences here, I do not, so happy to take any path.

Checklist:

You have signed off your commits
The title for your pull request (PR) should follow our title convention. Learn more about the pull request title convention used in this repository.

google-oss-prow · 2024-10-08T18:52:42Z

Hi @CarterFendley. Thanks for your PR.

I'm waiting for a kubeflow member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

hbelmiro · 2024-10-08T19:00:21Z

/ok-to-test

droctothorpe

LGTM. Thank you for tackling this, @CarterFendley!

rimolive · 2024-10-09T15:33:04Z

Maybe worth adding just one unit test to verify if setting both env vars will generate the Workflow yaml with the new flags.

CarterFendley · 2024-10-10T18:31:32Z

Maybe worth adding just one unit test to verify if setting both env vars will generate the Workflow yaml with the new flags.

Will do :)

CarterFendley · 2024-10-15T21:59:49Z

Okay in this commit I have updated the compiler tests with logic to optional take in environment variables and set them:

if tt.envVars != nil {
    for _, envVar := range tt.envVars {
	    parts := strings.Split(strings.ReplaceAll(envVar, " ", ""), "=")
	    os.Setenv(parts[0], parts[1])
    
	    // Unset after test cases has ended
	    defer os.Unsetenv(parts[0])
    }
}

To test cases and golden files have been added to test the logic included in this PR.

{
	jobPath:          "../testdata/hello_world.json",
	platformSpecPath: "",
	argoYAMLPath:     "testdata/with_logging/hello_world.yaml",
	envVars:          []string{"DRIVER_LOG_LEVEL=5", "LAUNCHER_LOG_LEVEL=5"},
},
{
	jobPath:          "../testdata/importer.json",
	platformSpecPath: "",
	argoYAMLPath:     "testdata/with_logging/importer.yaml",
	envVars:          []string{"DRIVER_LOG_LEVEL=5", "LAUNCHER_LOG_LEVEL=5"},
},

rimolive · 2024-10-15T22:25:05Z

/lgtm

hbelmiro

Thank you @CarterFendley.
Looks good. I just left a few minor comments.

backend/src/v2/cmd/driver/main.go

backend/src/v2/cmd/launcher-v2/main.go

backend/src/v2/compiler/argocompiler/argo_test.go

hbelmiro · 2024-10-16T21:17:04Z

backend/src/v2/compiler/argocompiler/container.go

@@ -303,8 +318,9 @@ func (c *workflowCompiler) addContainerExecutorTemplate(refName string) string {
 		InitContainers: []wfapi.UserContainer{{
 			Container: k8score.Container{
 				Name:    "kfp-launcher",
-				Image:   GetLauncherImage(),
-				Command: []string{"launcher-v2", "--copy", component.KFPLauncherPath},
+				Image:   c.launcherImage,


Any reason for this change besides optimization?

No not really, just made it a bit concise to add flags and follows the pattern used in the other Container definitions for driver / launcher

hbelmiro

/lgtm

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

launcher Signed-off-by: carter.fendley <carter.fendley@gmail.com>

CarterFendley · 2025-02-20T00:38:15Z

Modifications have been made to address this issue found by @gregsheremeta, thanks for pointing that out! Since the instance was one where the driver sets the log level of the launcher a design change was made to have one unified PIPELINE_LOG_LEVEL env var to prevent the somewhat confusing implementations of passing the LAUNCHER_LOG_LEVEL as a command line argument to the driver (or similar implementations).

The new usage is to update the environment variable on the ml-pipelines deployment to the following.

    spec:
      containers:
      - env:
        - name: PIPELINE_LOG_LEVEL
          value: "3"

Importantly, as before, it is important that a numerical value such as the literal 3 not "3" here will be invalid deployment spec and validation on the spec will fail causing kubectl edit to reject it with the message: error: deployments.apps "ml-pipeline" is invalid.

After these updates, the main container now also runs the launcher with configured log level.

@hbelmiro @HumairAK, or any others: Please let me know if you have any additional feedback on this PR. Apologies for the delay in the patch!

backend/src/v2/compiler/argocompiler/container.go

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

HumairAK · 2025-02-20T22:57:19Z

/lgtm
/approve

google-oss-prow · 2025-02-20T22:57:25Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: droctothorpe, HumairAK

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~backend/OWNERS~~ [HumairAK]
~~manifests/kustomize/OWNERS~~ [HumairAK]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

google-oss-prow bot requested review from HumairAK and rimolive October 8, 2024 18:52

google-oss-prow bot added the size/M label Oct 8, 2024

google-oss-prow bot added the needs-ok-to-test label Oct 8, 2024

google-oss-prow bot added ok-to-test and removed needs-ok-to-test labels Oct 8, 2024

CarterFendley force-pushed the carter/log-level branch from d66664a to ce4527f Compare October 9, 2024 03:00

google-oss-prow bot added size/L and removed size/M labels Oct 9, 2024

HumairAK added this to the KFP 2.4.0 milestone Oct 9, 2024

droctothorpe approved these changes Oct 9, 2024

View reviewed changes

CarterFendley force-pushed the carter/log-level branch 2 times, most recently from 3f73d4c to 714901f Compare October 15, 2024 21:46

google-oss-prow bot added size/XL and removed size/L labels Oct 15, 2024

github-actions bot added ci-passed All CI tests on a pull request have passed and removed ci-passed All CI tests on a pull request have passed labels Oct 15, 2024

google-oss-prow bot assigned rimolive Oct 15, 2024

google-oss-prow bot added the lgtm label Oct 15, 2024

hbelmiro suggested changes Oct 16, 2024

View reviewed changes

CarterFendley force-pushed the carter/log-level branch from 714901f to 264d809 Compare October 17, 2024 17:24

google-oss-prow bot removed the lgtm label Oct 17, 2024

hbelmiro reviewed Oct 17, 2024

View reviewed changes

google-oss-prow bot assigned hbelmiro Oct 17, 2024

HumairAK modified the milestones: KFP 2.4.0, KFP 2.5.0 Jan 15, 2025

CarterFendley force-pushed the carter/log-level branch from 264d809 to 8d0113b Compare January 28, 2025 17:56

google-oss-prow bot added size/L and removed lgtm size/XL labels Jan 28, 2025

CarterFendley force-pushed the carter/log-level branch from 8d0113b to 302b9a2 Compare January 28, 2025 17:57

CarterFendley added 7 commits February 19, 2025 15:30

Do not invoke get image methods twice.

530f837

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

Add configurable driver / launcher log level

0ea49ab

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

Add configurable driver / launcher log level

43a90fc

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

Update argocompiler golden files

6c4c31b

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

Handle errors from flag setting and tests

173a237

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

Update gold files & image masking to use ghcr

0b75b59

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

Update tests with new images

5741ce9

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

CarterFendley force-pushed the carter/log-level branch from 302b9a2 to edb2172 Compare February 20, 2025 00:14

google-oss-prow bot added size/XL and removed size/L labels Feb 20, 2025

Use unified var for driver / launcher log level + patch user code

d4dacb3

launcher Signed-off-by: carter.fendley <carter.fendley@gmail.com>

CarterFendley force-pushed the carter/log-level branch from edb2172 to d4dacb3 Compare February 20, 2025 00:28

CarterFendley mentioned this pull request Feb 20, 2025

chore(ci): Flaky test in KFP SDK Execution Tests workflow #11598

Open

droctothorpe approved these changes Feb 20, 2025

View reviewed changes

HumairAK requested changes Feb 20, 2025

View reviewed changes

backend/src/v2/compiler/argocompiler/container.go Show resolved Hide resolved

Add PIPELINE_LOG_LEVEL to deployment for discoverability

e6b2d65

Signed-off-by: carter.fendley <carter.fendley@gmail.com>

google-oss-prow bot assigned HumairAK Feb 20, 2025

google-oss-prow bot added the lgtm label Feb 20, 2025

google-oss-prow bot added the approved label Feb 20, 2025

google-oss-prow bot merged commit d2c0376 into kubeflow:master Feb 21, 2025
34 of 36 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(backend): configurable log level for driver / launcher images #11278

feat(backend): configurable log level for driver / launcher images #11278

CarterFendley commented Oct 8, 2024 •

edited

Loading

google-oss-prow bot commented Oct 8, 2024

hbelmiro commented Oct 8, 2024

droctothorpe left a comment

rimolive commented Oct 9, 2024 •

edited

Loading

CarterFendley commented Oct 10, 2024

CarterFendley commented Oct 15, 2024 •

edited

Loading

rimolive commented Oct 15, 2024

hbelmiro left a comment

hbelmiro Oct 16, 2024

CarterFendley Oct 17, 2024

hbelmiro left a comment

CarterFendley commented Feb 20, 2025

HumairAK commented Feb 20, 2025

google-oss-prow bot commented Feb 20, 2025

feat(backend): configurable log level for driver / launcher images #11278

feat(backend): configurable log level for driver / launcher images #11278

Conversation

CarterFendley commented Oct 8, 2024 • edited Loading

IMPORTANT

Description of your changes:

Other minor alterations

Feedback wanted

Checklist:

google-oss-prow bot commented Oct 8, 2024

hbelmiro commented Oct 8, 2024

droctothorpe left a comment

Choose a reason for hiding this comment

rimolive commented Oct 9, 2024 • edited Loading

CarterFendley commented Oct 10, 2024

CarterFendley commented Oct 15, 2024 • edited Loading

rimolive commented Oct 15, 2024

hbelmiro left a comment

Choose a reason for hiding this comment

hbelmiro Oct 16, 2024

Choose a reason for hiding this comment

CarterFendley Oct 17, 2024

Choose a reason for hiding this comment

hbelmiro left a comment

Choose a reason for hiding this comment

CarterFendley commented Feb 20, 2025

HumairAK commented Feb 20, 2025

google-oss-prow bot commented Feb 20, 2025

CarterFendley commented Oct 8, 2024 •

edited

Loading

rimolive commented Oct 9, 2024 •

edited

Loading

CarterFendley commented Oct 15, 2024 •

edited

Loading