[ENH] Get reproducible results with PCGA (set the initial vector used by scipy.sparse.linalg.eigsh ...) #18

antoinecollet5 · 2022-12-04T21:07:05Z

Up to now, when running pyPCGA twice or more with the same parameters, the results are slightly different each time.

This is what I attribute to the covariance matrix low-rank approximation relying on scipy.sparse.linalg.eigsh beauce the initial vector v0 is not provided and consequently chosen randomly.

The solution would be to let the possibility for the user set a seed (aka random_state) to generate a reproducible v0.

See: https://stackoverflow.com/a/52403508

The text was updated successfully, but these errors were encountered:

jonghyunharrylee · 2022-12-24T09:03:10Z

Thank you Antoine and you are right that low-rank approx will not give users unique vectors. I guess I implemented it with oversampling parameters (let's say the number of eigenvectors computed to k + p where p is an oversampling parameter so that later we keep only "k" eigenmodes - this technique commonly used in randomized low-rank approximation) so that users expect less variability in results but not very sure. User-specified random seed would be a great option for reproducible results. I will take a look at it and will merge your PR. Happy holidays!

Best,
Harry

antoinecollet5 · 2023-01-05T12:54:29Z

Hi Harry,

Happy new year and best wishes for 2023.

I just corrected a last bug in the changes I've made this morning. I tested and everything seems to work fine now.

Cheers
Antoine

antoinecollet5 · 2023-07-31T15:43:16Z

Hi @jonghyunharrylee,

Any chance you will have time to look at it ?

Best regards

Antoine

antoinecollet5 mentioned this issue Dec 4, 2022

[ENH] Get reproducible results with PCGA (set the initial vector used by scipy.sparse.linalg.eigsh ...) #19

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] Get reproducible results with PCGA (set the initial vector used by scipy.sparse.linalg.eigsh ...) #18

[ENH] Get reproducible results with PCGA (set the initial vector used by scipy.sparse.linalg.eigsh ...) #18

antoinecollet5 commented Dec 4, 2022

jonghyunharrylee commented Dec 24, 2022

antoinecollet5 commented Jan 5, 2023

antoinecollet5 commented Jul 31, 2023

[ENH] Get reproducible results with PCGA (set the initial vector used by scipy.sparse.linalg.eigsh ...) #18

[ENH] Get reproducible results with PCGA (set the initial vector used by scipy.sparse.linalg.eigsh ...) #18

Comments

antoinecollet5 commented Dec 4, 2022

jonghyunharrylee commented Dec 24, 2022

antoinecollet5 commented Jan 5, 2023

antoinecollet5 commented Jul 31, 2023