`PluginIndexes.Unindex.query_index` seems to be overly complex and should be refactored #56

d-maurer · 2019-03-07T11:43:11Z

- Its docstring is misleading.
- There is duplicate code for cache and not_param handling.
- Why is caching handled differently for the operator "or" and for operators other than "or" (likely "and")? We could cache a "setlist" in both cases, or a set in both cases (in the "and" case, the intersection of the current "setlist"s).
- Do we need this caching at all? How often do we have the same index query (with identical parameters) in the same request?

I propose the following new structure:

1. We determine the cache key and obtain a cached result: either None or a sequence of sets.
2. If the cache has no result, we determine from record a sequence of keys to be looked up. For "and", the intermediate result is the sequence of looked-up document sets, ordered by len. (Note that determining the size of an IITreeSet loads all buckets from the ZODB; if this requires real storage accesses, this length determination can be even more expensive than performing the intersection directly.) For "or", the result is the one-element sequence containing the "multiunion" of the looked-up document sets. We cache the intermediate result.
3. We intersect with resultset and return the final result.
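
To make the three steps concrete, here is a minimal sketch of the proposed structure. It is not the actual `UnIndex` code: plain Python sets stand in for `IITreeSet`, `frozenset().union(...)` stands in for "multiunion", and the names `query_index_sketch`, `index`, `keys` and `cache` are hypothetical. The real method derives the keys and the cache key from `record`, which is left out here.

```python
def query_index_sketch(index, keys, operator, cache, resultset=None):
    """Illustrate the proposed structure with plain sets (assumption:
    `index` is a dict mapping an index key to a set of document ids)."""
    if operator not in ("and", "or"):
        raise ValueError("operator must be 'and' or 'or'")

    # Step 1: determine the cache key and fetch the cached result,
    # which is either None or a sequence of sets.
    cache_key = (operator, frozenset(keys))
    setlist = cache.get(cache_key)

    if setlist is None:
        # Step 2: look up the document set for every key.
        looked_up = [frozenset(index.get(key, ())) for key in keys]
        if operator == "or":
            # One-element sequence holding the union of all looked-up sets.
            setlist = [frozenset().union(*looked_up)]
        else:
            # Smallest sets first, so intersections shrink quickly.
            # With real IITreeSets, len() may itself be expensive.
            setlist = sorted(looked_up, key=len)
        # The same kind of value is cached for both operators.
        cache[cache_key] = setlist

    # Step 3: intersect with resultset and return the final result.
    result = None if resultset is None else set(resultset)
    for docs in setlist:
        result = set(docs) if result is None else result & docs
        if not result:
            break  # an empty intersection cannot grow again
    return set() if result is None else result


# Hypothetical example data.
index = {"red": {1, 2, 3}, "round": {2, 3, 4}}
cache = {}
print(sorted(query_index_sketch(index, ["red", "round"], "and", cache)))
print(sorted(query_index_sketch(index, ["red", "round"], "or", cache, resultset={4, 5})))
```

Because the cache stores a sequence of sets for both operators, the final intersection with `resultset` is a single code path, which is the point of the proposal.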


d-maurer added enhancement question labels Mar 7, 2019

d-maurer added a commit that referenced this issue Mar 8, 2019

refactor PluginIndexes.unindex.UnIndex.query_index fixing #55 and #56

c8580d1

d-maurer added a commit that referenced this issue Mar 10, 2019

#56: improve interface documentation, docstrings; add tests

7387f24

d-maurer mentioned this issue Mar 12, 2019

refactor PluginIndexes.unindex.UnIndex.query_index fixing #55 and #56 #57

Closed

hannosch pushed a commit that referenced this issue Mar 16, 2019

#56: improve interface documentation, docstrings; add tests

dd30d44
