Extreme score from Query-Likelihood Quantized Index #572

J9rryGou · 2024-01-22T16:33:24Z

I created a quantized index by following:

cd /home/jg6226/code/raw_pisa/build
./bin/create_wand_data -c /hdd1/data/ssd2_data_backup/ssd2/data/index/cw09b/CW09B.url.inv -o /ssd2/data/index/cw09b_ql_index/CW09B.ql.quantized.wand --quantize --scorer qld -b 128

./bin/compress_inverted_index -c /hdd1/data/ssd2_data_backup/ssd2/data/index/cw09b/CW09B.url.inv -o /ssd2/data/index/cw09b_ql_index/CW09B.ql.quantized.index.opt -e block_simdbp --quantize --scorer qld --wand /ssd2/data/index/cw09b_ql_index/CW09B.ql.quantized.wand --check

Then I use my edited evaluate_queries to run on a query dataset selected from TREC05

cd /home/jg6226/code/20230101_pisa_termscore_small_size/pisa/build
./bin/evaluate_queries_didordered -e block_simdbp -a ranked_or -i /ssd2/data/index/cw09b_quantized_index/CW09B.quantized.index.opt -q /home/jg6226/data/Hit_Ratio_Project/TREC0506_query/cleaned_query/trec05_testing_queries.txt -k 1000 --scorer quantized --wand /ssd2/data/index/cw09b_quantized_index/CW09B.quantized.wand  --documents /home/jg6226/data/index/cw09b/CW09B.url.fwd.doclex --terms /home/jg6226/data/index/cw09b/CW09B.fwd.termlex -f /home/jg6226/data/Hit_Ratio_Project/TREC0506_query/evaluate_result/trec05_testing_quantized_output.txt -d

I found there are some extreme high score for a document, is there anything wrong with my code?

The text was updated successfully, but these errors were encountered:

elshize · 2024-01-27T14:47:03Z

@J9rryGou I fixed a bug with quantization: #573 can you check if you're still getting this issue?

elshize · 2024-02-05T02:39:59Z

I just realized that --check has no effect when compressing with quantization. I will see if this can be implemented.

J9rryGou · 2024-02-05T03:00:41Z

I just realized that --check has no effect when compressing with quantization. I will see if this can be implemented.

Sounds good.

I also tried compress_inverted_index by not passing --check, the index still has the issue I mentioned above.

elshize · 2024-02-05T03:06:55Z

Yeah, not passing --check will have no effect, it's just being ignored. I'll work on implementing the check for quantized, then maybe that can reveal something...

elshize · 2024-02-06T01:49:27Z

I haven't figured this one out yet, but I definitely see something is broken.

For one, quantized index using any of the non-blocked encoding is fundamentally broken -- but I think I have an idea why and how to fix it.

Second, I see that at compression time, there is a score that wants to be written: 4294967295, which happens to be a 32-bit int with all 1s, or 2^32 - 1. Not sure yet why but it's a lead.

elshize · 2024-02-06T02:02:59Z

Also, BM25 doesn't seem to be affected.

J9rryGou · 2024-02-06T03:47:28Z

I haven't figured this one out yet, but I definitely see something is broken.

For one, quantized index using any of the non-blocked encoding is fundamentally broken -- but I think I have an idea why and how to fix it.

Second, I see that at compression time, there is a score that wants to be written: 4294967295, which happens to be a 32-bit int with all 1s, or 2^32 - 1. Not sure yet why but it's a lead.

Sounds good, thank you so much! Seems like we are very close to the bug when using qld as the ranking function. Yeah, all outputs by using bm25 are all good, according to the results from previous runs.

J9rryGou · 2024-02-06T03:54:05Z

I haven't figured this one out yet, but I definitely see something is broken.

For one, quantized index using any of the non-blocked encoding is fundamentally broken -- but I think I have an idea why and how to fix it.

Second, I see that at compression time, there is a score that wants to be written: 4294967295, which happens to be a 32-bit int with all 1s, or 2^32 - 1. Not sure yet why but it's a lead.

But I have a question about this:
Since the quantized score has range 0 to 255 (256 is very rare). I did see 256 occur in quantized bm25 score, maybe the way pisa store the quantized score is like this: if it is in range 0 to 255, use one byte, if it is 256, use 2 bytes. That's why before you did that modification of quantizer, it worked well before. For the quantized index of qld, there are some extremely large scores, the PISA will store them with more bytes (maybe up to 8 bytes? I see some score that is even larger than 2^32 -1, but I am not 100% sure.). This can explain why the size of quantized qld index is about 47GB, whereas the size of quantized bm25 index is about 25GB.

So, the way that PISA storing quantized score is not fixing it to 1 byte, but will use more byte if the score is very large?

J9rryGou · 2024-02-06T13:07:08Z

For one, quantized index using any of the non-blocked encoding is fundamentally broken -- but I think I have an idea why and how to fix it.

BTW, when you say this, is quantized index of bm25 using elias_fano encoding also broken?

elshize · 2024-02-06T13:52:14Z

For one, quantized index using any of the non-blocked encoding is fundamentally broken -- but I think I have an idea why and how to fix it.

BTW, when you say this, is quantized index of bm25 using elias_fano encoding also broken?

Yeah, I believe so, but I would have to confirm that. Some tests I wrote fail for those indexes, so there's clearly something wrong.

elshize · 2024-02-07T01:18:24Z

So, the way that PISA storing quantized score is not fixing it to 1 byte, but will use more byte if the score is very large?

Writing is done the same way as frequencies, so depends on the encoding used. Quantization is really just done when computing the score, if that score is 256, then the codec will write it.

elshize · 2024-02-08T00:11:39Z

@J9rryGou Actually nvm about what I said about non-blocked encodings. They also seem to work for BM25 after all.

Due to quantization, some scores can be 0, but our frequency encoding (which is used for scores) assumes positive values. To fix it, we quantize into a range starting at 1 instead. Fixes: #572

elshize · 2024-02-08T03:32:13Z

@J9rryGou the culprit is how we encode frequencies: we always encode frequency - 1 (because they are all positive). When some scores are quantized to 0, it breaks down, because we end up with 2^32-1 after that subtraction (underflow).

Could you please try the fix branch #575 and report back if it fixes the issue?

Note that I've discovered different issue with PL2 & DPH scorers but both QLD and BM25 should work fine.

Due to quantization, some scores can be 0, but our frequency encoding (which is used for scores) assumes positive values. To fix it, we quantize into a range starting at 1 instead. Fixes: #572

Due to quantization, some scores can be 0, but our frequency encoding (which is used for scores) assumes positive values. To fix it, we quantize into a range starting at 1 instead. Changelog-changed: Scores are quantized starting at 1 instead of 0 Fixes: #572 Signed-off-by: Michal Siedlaczek <michal@siedlaczek.me>

elshize · 2024-02-13T00:23:58Z

@J9rryGou I closed it with the fix in #575 If you encounter this issue again on the new version, feel free to reopen or open a new one.

J9rryGou added the bug Something isn't working label Jan 22, 2024

elshize added a commit that referenced this issue Feb 8, 2024

Quantize in range [1, 2^b)

4c790af

Due to quantization, some scores can be 0, but our frequency encoding (which is used for scores) assumes positive values. To fix it, we quantize into a range starting at 1 instead. Fixes: #572

elshize mentioned this issue Feb 8, 2024

Quantize in range [1, 2^b) #575

Merged

elshize added a commit that referenced this issue Feb 8, 2024

Quantize in range [1, 2^b)

a821618

Due to quantization, some scores can be 0, but our frequency encoding (which is used for scores) assumes positive values. To fix it, we quantize into a range starting at 1 instead. Fixes: #572

elshize added a commit that referenced this issue Feb 11, 2024

Quantize in range [1, 2^b)

c56d323

Due to quantization, some scores can be 0, but our frequency encoding (which is used for scores) assumes positive values. To fix it, we quantize into a range starting at 1 instead. Fixes: #572

elshize added a commit that referenced this issue Feb 12, 2024

Quantize in range [1, 2^b)

b2a0439

Due to quantization, some scores can be 0, but our frequency encoding (which is used for scores) assumes positive values. To fix it, we quantize into a range starting at 1 instead. Fixes: #572

elshize closed this as completed in #575 Feb 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extreme score from Query-Likelihood Quantized Index #572

Extreme score from Query-Likelihood Quantized Index #572

J9rryGou commented Jan 22, 2024 •

edited

Loading

elshize commented Jan 27, 2024

elshize commented Feb 5, 2024

J9rryGou commented Feb 5, 2024

elshize commented Feb 5, 2024

elshize commented Feb 6, 2024

elshize commented Feb 6, 2024

J9rryGou commented Feb 6, 2024

J9rryGou commented Feb 6, 2024

J9rryGou commented Feb 6, 2024

elshize commented Feb 6, 2024

elshize commented Feb 7, 2024

elshize commented Feb 8, 2024

elshize commented Feb 8, 2024

elshize commented Feb 13, 2024

Extreme score from Query-Likelihood Quantized Index #572

Extreme score from Query-Likelihood Quantized Index #572

Comments

J9rryGou commented Jan 22, 2024 • edited Loading

elshize commented Jan 27, 2024

elshize commented Feb 5, 2024

J9rryGou commented Feb 5, 2024

elshize commented Feb 5, 2024

elshize commented Feb 6, 2024

elshize commented Feb 6, 2024

J9rryGou commented Feb 6, 2024

J9rryGou commented Feb 6, 2024

J9rryGou commented Feb 6, 2024

elshize commented Feb 6, 2024

elshize commented Feb 7, 2024

elshize commented Feb 8, 2024

elshize commented Feb 8, 2024

elshize commented Feb 13, 2024

J9rryGou commented Jan 22, 2024 •

edited

Loading