Commit
Basically a rewrite: use custom Lucene queries, drop Protobuf dependency for API (#46)

- Remove the usage of Protobufs at the API level. Instead, implement a more idiomatic Elasticsearch API. The API now uses custom case classes in Scala and data classes in Python, which is more tedious but worth it for a more intuitive API.
- Remove the pipelines in favor of processing/indexing vectors in the custom mapping. The model parameters are defined in the mapping and applied to any document field with type `elastiknn_sparse_bool_vector` or `elastiknn_dense_float_vector`. This eliminates the need for a pipeline/processor and the need to maintain custom mappings for the indexed vectors.
- Implement all queries using custom Lucene queries. This is tightly coupled to the custom mappings, since the mappings determine how vector hashes are stored and can be queried. For now, very simple Lucene Term and Boolean queries have been sufficient.
- Add a "sparse indexed" mapping for Jaccard and Hamming similarities. This stores the indices of sparse boolean vectors as Lucene terms, allowing you to run a term query to get the intersection of the query vector against all stored vectors.
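The "sparse indexed" idea in the last bullet can be modeled outside of Lucene. The following is a minimal Python sketch, not the plugin's implementation: it assumes each true index of a sparse boolean vector is stored as a term in an inverted index, so looking up the query's indices yields the intersection size with every stored vector. The names `SparseBoolVector`, `SparseIndexedStore`, and `jaccard_search` are hypothetical and chosen only for illustration; only Jaccard similarity is shown.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple


@dataclass(frozen=True)
class SparseBoolVector:
    """Hypothetical stand-in for a sparse boolean vector: only true indices are kept."""
    true_indices: Tuple[int, ...]
    total_indices: int


class SparseIndexedStore:
    """Toy inverted index: each true index acts like a term that points to doc ids."""

    def __init__(self) -> None:
        self._postings: Dict[int, Set[str]] = defaultdict(set)
        self._num_true: Dict[str, int] = {}

    def index(self, doc_id: str, vec: SparseBoolVector) -> None:
        unique = set(vec.true_indices)
        for i in unique:
            self._postings[i].add(doc_id)
        # Keep the number of true indices so the union can be derived at query time.
        self._num_true[doc_id] = len(unique)

    def jaccard_search(self, query: SparseBoolVector, k: int) -> List[Tuple[str, float]]:
        q = set(query.true_indices)
        # One "term lookup" per query index; counting hits gives the intersection size.
        intersections: Dict[str, int] = defaultdict(int)
        for i in q:
            for doc_id in self._postings.get(i, ()):
                intersections[doc_id] += 1
        scored = []
        for doc_id, inter in intersections.items():
            union = len(q) + self._num_true[doc_id] - inter
            scored.append((doc_id, inter / union))
        # Highest similarity first; ties broken by doc id for determinism.
        scored.sort(key=lambda t: (-t[1], t[0]))
        return scored[:k]


store = SparseIndexedStore()
store.index("a", SparseBoolVector((0, 2, 5), 10))
store.index("b", SparseBoolVector((1, 2, 3), 10))
store.index("c", SparseBoolVector((7, 8), 10))
results = store.jaccard_search(SparseBoolVector((0, 2, 4), 10), k=2)
```

Documents that share no index with the query never appear in the postings that are visited, so they are skipped without being scored. That is the property a term query over stored indices provides.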
1 parent 679b199, commit fbda811
Showing 122 changed files with 2,741 additions and 2,856 deletions.
client-elastic4s/src/main/scala/com/klibisz/elastiknn/client/ElastiKnnClient.scala: file deleted (0 additions, 133 deletions)
client-elastic4s/src/main/scala/com/klibisz/elastiknn/client/Elastic4sUtils.scala: file deleted (0 additions, 78 deletions)