memory issue with new summary approach #364

macartan · 2024-10-11T09:00:30Z

the new summary approach generates a lot of large objects, like the parameter matrix and the ambiguity matrix; on the fly. other code avoids the generation of these, making them on a need to know basis

this will be unmanageable with larger models:

> model <- make_model("A -> Y <- B; C-> Y", add_causal_types = FALSE)
> summary <- summary(model)
> object.size(model)
82256 bytes
> object.size(summary)
79637008 bytes

it seems that if a summary is called for then everything gets piled into this objects including all posterior distributions, stan objects and so on

can we revert to generating these only when they are explicitly requested?

The text was updated successfully, but these errors were encountered:

gerasy1987 · 2024-10-11T16:31:03Z

@macartan, thanks for pointing this out. I can work on this next week. Do you have a preference for what objects should be in the summary by default beyond what is in the causal_model it is called on?

macartan · 2024-10-14T17:53:24Z

Thanks Gosha My instinct would be to keep the minimum as default There are a lot of big objects Ambiguities, parameter matrix, the distributions I imagine two approaches 1. Have a small summary and shift code to the grab.inspect function. Feels like going backeards a little 2. Have an include argument in summary that gets passed to print? So summary o my includes extra objects as needed

…

On Fri 11. Oct 2024 at 18:31, Gosha Syunyaev ***@***.***> wrote: @macartan <https://github.com/macartan>, thanks for pointing this out. I can work on this next week. Do you have a preference for what objects should be in the summary by default beyond what is in the causal_model it is called on? — Reply to this email directly, view it on GitHub <#364 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ADBE57N53GZ2QJ22I6UGB3LZ274OFAVCNFSM6AAAAABPYPK3COVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIMBXG42TMMZYGM> . You are receiving this because you were mentioned.Message ID: ***@***.***>

gerasy1987 · 2024-10-20T23:33:01Z

@macartan proposed fix is in #366 and ready for review

gerasy1987 · 2024-10-21T15:26:31Z

addressed by #366

macartan · 2024-11-11T10:47:00Z

Sorry to reopen -- we cannot run all the code in the paper because of the memory requirements of the summary approach

we want to to do this:

make_model("A -> E <- B; C-> E <- D", add_causal_types = FALSE) |>
  grab("parameters") |> 
  length()

but grabbing requires creating causal types and other very large objects

seems we are back to this issue of why generate all these things on the fly when they are not needed or saved

note this is still fast:

make_model("A -> E <- B; C-> E <- D", add_causal_types = FALSE) |>
  CausalQueries:::get_parameters() |>
  length()

grab and inspect really target particular objects and seems wise to create the objects targeted and nothing else.

macartan assigned gerasy1987 Oct 11, 2024

gerasy1987 mentioned this issue Oct 20, 2024

Methods optimization #366

Merged

gerasy1987 closed this as completed Oct 21, 2024

macartan reopened this Nov 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

memory issue with new summary approach #364

memory issue with new summary approach #364

macartan commented Oct 11, 2024 •

edited

Loading

gerasy1987 commented Oct 11, 2024

macartan commented Oct 14, 2024 via email

gerasy1987 commented Oct 20, 2024

gerasy1987 commented Oct 21, 2024

macartan commented Nov 11, 2024

memory issue with new summary approach #364

memory issue with new summary approach #364

Comments

macartan commented Oct 11, 2024 • edited Loading

gerasy1987 commented Oct 11, 2024

macartan commented Oct 14, 2024 via email

gerasy1987 commented Oct 20, 2024

gerasy1987 commented Oct 21, 2024

macartan commented Nov 11, 2024

macartan commented Oct 11, 2024 •

edited

Loading