Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

redo pvar indexing logic & lazily load pvar file to save memory #8

Merged
merged 3 commits into from
Nov 19, 2024

Conversation

kscott-1
Copy link
Contributor

This fixes #7 and also adds some lazy functionality from polars. I did scrap this together rather quickly, so I suggest pulling down, running tests, and building any additional fixes on top of this.

 * root -> `row_nr()` not a recognized pl function

Signed-off-by: Kyle Scott <kms309@miami.edu>
* in the case where the user decides to load an extremely large pvar
  file, it should not be read into memory immediately if the variants
  will be subset
* has no negative impact on the case where the user loads all variants

Signed-off-by: Kyle Scott <kms309@miami.edu>
Signed-off-by: Kyle Scott <kms309@miami.edu>
@kscott-1
Copy link
Contributor Author

I see the issue - should be fixed in the third commit

@salcc salcc self-requested a review November 19, 2024 22:37
@salcc
Copy link
Member

salcc commented Nov 19, 2024

Thanks!

@salcc salcc merged commit 9426fde into AI-sandbox:main Nov 19, 2024
4 checks passed
@kscott-1 kscott-1 deleted the dev branch November 20, 2024 00:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

bug: pgen & bed readers fail due to row_nr()
2 participants