Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Component and dataset upgrades #524

Merged
merged 5 commits into from
Jun 14, 2024
Merged

Component and dataset upgrades #524

merged 5 commits into from
Jun 14, 2024

Conversation

jgalan
Copy link
Member

@jgalan jgalan commented Jun 12, 2024

jgalan Medium: 100 Powered by Pull Request Badge

  • Added new methods Range and ApplyRange to TRestDataSet they allow to define a sample subset range from the dataset. When we use ApplyRange the internal data frame will be updated too. If we only invoke Range a RDF::Node with the specified range will be returned, but no internal modification will happen.

  • Solving an issue in ExtractParametricNodes appearing when the dataset is too large, of the order of 1500M entries. The, fSplitEntries=600,000 divides the operation of node extraction into several steps. See also ROOT-forum entry: https://root-forum.cern.ch/t/problem-with-large-number-of-entries-inside-rdataframe/59632

  • TRestComponentDataSet::fDFRange data member added. It allows to control the range of the dataset entries that will be used to generate the component.

  • In a component we can re-scale the distribution using weights, for the moment the weights were a column from the dataFrame, but now it is allowed also to write down a constant.

In this example, the distribution will be built with 2*Ngamma, where Ngamma is the rate contribution from each event.

<parameter name="weights" value="{NGamma,2}"/>

@jgalan jgalan self-assigned this Jun 12, 2024
@jgalan jgalan marked this pull request as ready for review June 13, 2024 16:16
@jgalan jgalan requested a review from nkx111 as a code owner June 13, 2024 16:16
@jgalan jgalan requested a review from a team June 13, 2024 16:16
@jgalan jgalan merged commit 9591b5a into master Jun 14, 2024
64 checks passed
@jgalan jgalan deleted the jgalan_dataset_updates branch June 14, 2024 06:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants