Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

adding IAM role authentication support for redshift #630

Closed
wants to merge 11 commits into from

Conversation

abbywh
Copy link
Contributor

@abbywh abbywh commented Oct 11, 2023

Committer: Abby Whittier abbywhittier309@gmail.com

resolves #623
docs dbt-labs/docs.getdbt.com/#

Problem

dbt-redshift currently does not natively support the redshift authentication method get-cluster-credentials-with-iam. A dbt customer using IAM roles as db users currently has no way to authenticate as those users.

Solution

There is now a new redshift connection method to authenticate as an IAM role, called IAMR. This strongly resembles the classic IAM authentication, except the user field is unnecessary.

Checklist

  • I have read the contributing guide and understand what's expected of me
  • I have run this code in development and it appears to resolve the stated issue
  • This PR includes tests, or tests are not required/relevant for this PR
  • This PR has no interface changes (e.g. macros, cli, logs, json artifacts, config files, adapter interface, etc) or this PR has already received feedback and approval from Product or DX

@abbywh abbywh requested a review from a team as a code owner October 11, 2023 14:01
@cla-bot
Copy link

cla-bot bot commented Oct 11, 2023

Thank you for your pull request and welcome to our community. We could not parse the GitHub identity of the following contributors: Whittier.
This is most likely caused by a git client misconfiguration; please make sure to:

  1. check if your git client is configured with an email to sign commits git config --list | grep email
  2. If not, set it up using git config --global user.email email@example.com
  3. Make sure that the git commit email is configured in your GitHub account settings, see https://github.com/settings/emails

@abbywh abbywh changed the title adding SSO support for redshift adding IAM role authentication support for redshift Oct 11, 2023
@abbywh abbywh force-pushed the Feature/iamr_auth branch from 26b6355 to 6075b92 Compare October 11, 2023 18:58
@cla-bot cla-bot bot added the cla:yes label Oct 11, 2023
@abbywh abbywh closed this Oct 11, 2023
@abbywh abbywh force-pushed the Feature/iamr_auth branch from 6075b92 to 5d4f3f5 Compare October 11, 2023 19:11
@emmyoop emmyoop temporarily deployed to PypiTest October 12, 2023 09:47 — with GitHub Actions Inactive
@abbywh
Copy link
Contributor Author

abbywh commented Oct 12, 2023

Is this being tested now? Any chance we can reopen for tracking purposes?

@abbywh abbywh reopened this Oct 12, 2023
@cla-bot
Copy link

cla-bot bot commented Oct 12, 2023

Thank you for your pull request and welcome to our community. We could not parse the GitHub identity of the following contributors: Whittier.
This is most likely caused by a git client misconfiguration; please make sure to:

  1. check if your git client is configured with an email to sign commits git config --list | grep email
  2. If not, set it up using git config --global user.email email@example.com
  3. Make sure that the git commit email is configured in your GitHub account settings, see https://github.com/settings/emails

@cla-bot cla-bot bot removed the cla:yes label Oct 12, 2023
@cla-bot
Copy link

cla-bot bot commented Oct 13, 2023

Thank you for your pull request and welcome to our community. We could not parse the GitHub identity of the following contributors: Whittier.
This is most likely caused by a git client misconfiguration; please make sure to:

  1. check if your git client is configured with an email to sign commits git config --list | grep email
  2. If not, set it up using git config --global user.email email@example.com
  3. Make sure that the git commit email is configured in your GitHub account settings, see https://github.com/settings/emails

@colin-rogers-dbt
Copy link
Contributor

@abbywh looks like there's something wrong with your git setup and is throwing off our CLA check

@abbywh
Copy link
Contributor Author

abbywh commented Oct 13, 2023

@colin-rogers-dbt do you have any documentation for your bot? I can explain what happened. I ammended the commit to remove my private email address in favor of my public, and you can see my email in the commit, but it's still blocking. I would happily parse through how to fix it (maybe a git filter repo?) if I can find some documentation around it.

abbywh and others added 3 commits October 16, 2023 10:48
Committer: Abby Whittier <abbywhittier309@gmail.com>
* The first element of the result is the PID
* Debug-level logging of high-level message + SQL
* Using redshift_connector `cursor.fetchone()`  returns `(<something>,)`
* Use cursor to call `select pg_terminate_backend({pid})` directly rather than using the `SQLConnectionManager`

---------

Co-authored-by: Mike Alfare <13974384+mikealfare@users.noreply.github.com>
@abbywh abbywh force-pushed the Feature/iamr_auth branch from 4a8cc19 to 48128e2 Compare October 16, 2023 14:51
@cla-bot cla-bot bot added the cla:yes label Oct 16, 2023
@abbywh
Copy link
Contributor Author

abbywh commented Oct 16, 2023

for anyone digging this up, make sure you update the config as given by the bot, then do git amend --reset-author as opposed to git amend --author="Author Name <email@address.com>".

@dataders
Copy link
Contributor

@jiezhen-chen @soksamnanglim @sathiish-kumar y'all wanna take a look here as well?

@jiezhen-chen
Copy link
Contributor

Instead of introducing a new IAMR auth method, would yall consider having it under IAM? Looks like the behavior of IAM and IAMR are the same except user is now optional, can we optionally pass in self.credential.user only when it is provided in the yml file?

@abbywh
Copy link
Contributor Author

abbywh commented Oct 16, 2023

@jiezhen-chen i also considered that option. would be happy to change it to that, but i was worried about muddling the error cases for the iam authentication.

for example, consider i'm a user that wants to use IAM auth and omits the user field. the error i would get would be something like: Error: IAM role xyz does not have permission GetClusterCredentialWithIam on cluster 123. This might incorrectly lead a user to add that permission. We also couldn't catch that error and throw a different message, because then we wouldn't cover the error case where a user actually intends to use iamr and had that permission misconfigured.

I think this is mostly a stylistic choice that should be left to the maintainers. But we should add a note about that error message to the documentation if we choose to combine the routes. Just let me know which you would prefer knowing this and I will implement it 😃

@jiezhen-chen
Copy link
Contributor

@abbywh Having a IAMR auth method makes sense. I think that if you want to use GetClusterCredentialsWithIAM v2 where IAM role is used, you'd also need to set group_federation to true, according to this issue. I don't see this added as a parameter passed into the connect() method. Is the current IAMR connect() method able to connect using GetClusterCredentialsWithIAM v2 without needing to set group_federation to true?

@abbywh
Copy link
Contributor Author

abbywh commented Oct 18, 2023

yes you are correct there was a mistake in my testing setup. I will make the changes to the tests that should catch this + that code change.

@misteliy
Copy link

Hi all, I would like to understand the progress of this issue since we would like to switch completely to IAM (not specifying user explicitly but rather through IAM) - thus IAMR would be great to have as soon as possible.

@abbywh
Copy link
Contributor Author

abbywh commented Dec 11, 2023

+1. This not being deployed is still causing big issues for us.

@joaobernardopa
Copy link

Hello everyone, this would actually be a game changer for us as in our current setup we still need to create all external schemas...

@saratvysyaraju
Copy link

Hey All,

Any updates on when this feature may get shipped? Our team is currently waiting for this to begin adopting the IAM Auth

@mikealfare
Copy link
Contributor

@abbywh I'll be looking at this PR over the next week or so, with the intent of merging it within the next two weeks. Given the duration for which this has been open, I wanted to check with you to see if you're available to make updates, if needed. I don't mind taking it over, and I would do so in a way that you maintain credit for your contributions either way. But we prefer to offer the contributor the opportunity to complete the PR if so desired.

@mikealfare
Copy link
Contributor

I pushed the merge conflict resolution so that the branch can pull updates from main. It's a large update because this PR pre-dates the core/adapter de-coupling work; I think my IDE said 94 files or so.

Copy link
Contributor

@mikealfare mikealfare left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I left a few comments, mostly around aesthetics and issues arising from the de-coupling work.

The one thing that we'll need to add is an integration test verifying that we can connect with an IAM role. I need to setup some infra for that test to run in CI. In the meantime, we need to do a few things:

  • add REDSHIFT_TEST_IAM_ROLE and REDSHIFT_TEST_IAM_PROFILE to ./test.env.example here
  • pull these new values into the test fixtures in ./tests/conftest.py here
  • create a test module for the new tests: ./tests/functional/adapter/test_auth_methods.py
  • create a test in the new module that connects via IAM role and runs a basic query to demonstrate access

The test should probably remove things like user for a positive test. And it's probably worth adding a negative test as well, by removing role.

dbt/adapters/redshift/connections.py Show resolved Hide resolved
dbt/adapters/redshift/connections.py Show resolved Hide resolved
dbt/adapters/redshift/connections.py Show resolved Hide resolved
dbt/adapters/redshift/connections.py Show resolved Hide resolved
dbt/adapters/redshift/connections.py Show resolved Hide resolved
dbt/adapters/redshift/connections.py Show resolved Hide resolved
dbt/adapters/redshift/connections.py Show resolved Hide resolved
dbt/include/redshift/profile_template.yml Show resolved Hide resolved
@mikealfare mikealfare mentioned this pull request Apr 23, 2024
4 tasks
@mikealfare
Copy link
Contributor

In the interest of time, I will take over this PR in #781 and close this one.

@mikealfare mikealfare closed this Apr 23, 2024
mikealfare added a commit that referenced this pull request Apr 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[ADAP-926] [Feature] IAM authentication - make 'user' an optional property
10 participants