vulkan support for typescript bindings, gguf support #1390

jacoobes · 2023-09-02T06:39:15Z

Describe your changes

Issue ticket number and link

Checklist before requesting a review

I have performed a self-review of my code.
If it is a core feature, I have added thorough tests.
I have added thorough documentation for my code.
I have tagged PR with relevant project labels. I acknowledge that a PR without labels may be dismissed.
If this PR addresses a bug, I have provided both a screenshot/video of the original bug and the working solution.

Demo

Steps to Reproduce

Notes

iimez · 2023-09-16T13:51:41Z

Did some testing today.

Found some typing issues:

model.llm.availableGpus is not a function - but its defined in types. probably should be listGpu?
DEFAULT_PROMPT_CONTEXT is typed as DEFAULT_PROMT_CONTEXT (note missing P in PROMT)
loadModel takes EmbeddingOptions | InferenceOptions, should be EmbeddingModelOptions | InferenceModelOptions

On GPU support:
Its working and I'm able to do inference on 13B models with about double to triple the speed. Amazing!
Interestingly its consistently not utilizing GPU for the first model I try to load with device=gpu, but it does for the second. For example:

const firstModel = await loadModel('llama-2-7b-chat.ggmlv3.q4_0', {
	type: 'inference',
	device: 'gpu',
})

// this will work, but using cpu
const firstCompletion = await firstModel.generate('Hello, my name is')

const secondModel = await loadModel('llama-2-7b-chat.ggmlv3.q4_0', {
	type: 'inference',
	device: 'gpu',
})

// this will use gpu
const secondCompletion = await secondModel.generate('Hello, my name is')

Not sure if this is an issue in bindings code, gonna try again after we have synced main.

Whats the best way to check if a model uses GPU (in code?) I noted that I get a llama.cpp: using Vulkan on AMD Radeon RX 6750 XT (RADV NAVI22) when GPU is being utilized but can't find a way to check for it in code. hasGpuDevice only reports whether my machine has a GPU available.

… GPU's otherwise fallback to CPU.

fixes issues w/ multiple of the same gpu

… vulkan recognizes.

…loading a model.

iimez · 2023-10-31T01:18:55Z

gpu-2023-10-31-01-16-03.md

Switching models works flawlessly now. Looking great, lets get this out!

gpt4all-bindings/typescript/index.cc

gpt4all-bindings/typescript/src/util.js

gpt4all-bindings/typescript/src/gpt4all.d.ts

gpt4all-bindings/typescript/src/gpt4all.js

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com>

* adding some native methods to cpp wrapper * gpu seems to work * typings and add availibleGpus method * fix spelling * fix syntax * more * normalize methods to conform to py * remove extra dynamic linker deps when building with vulkan * bump python version (library linking fix) * Don't link against libvulkan. * vulkan python bindings on windows fixes * Bring the vulkan backend to the GUI. * When device is Auto (the default) then we will only consider discrete GPU's otherwise fallback to CPU. * Show the device we're currently using. * Fix up the name and formatting. * init at most one vulkan device, submodule update fixes issues w/ multiple of the same gpu * Update the submodule. * Add version 2.4.15 and bump the version number. * Fix a bug where we're not properly falling back to CPU. * Sync to a newer version of llama.cpp with bugfix for vulkan. * Report the actual device we're using. * Only show GPU when we're actually using it. * Bump to new llama with new bugfix. * Release notes for v2.4.16 and bump the version. * Fallback to CPU more robustly. * Release notes for v2.4.17 and bump the version. * Bump the Python version to python-v1.0.12 to restrict the quants that vulkan recognizes. * Link against ggml in bin so we can get the available devices without loading a model. * Send actual and requested device info for those who have opt-in. * Actually bump the version. * Release notes for v2.4.18 and bump the version. * Fix for crashes on systems where vulkan is not installed properly. * Release notes for v2.4.19 and bump the version. * fix typings and vulkan build works on win * Add flatpak manifest * Remove unnecessary stuffs from manifest * Update to 2.4.19 * appdata: update software description * Latest rebase on llama.cpp with gguf support. * macos build fixes * llamamodel: metal supports all quantization types now * gpt4all.py: GGUF * pyllmodel: print specific error message * backend: port BERT to GGUF * backend: port MPT to GGUF * backend: port Replit to GGUF * backend: use gguf branch of llama.cpp-mainline * backend: use llamamodel.cpp for StarCoder * conversion scripts: cleanup * convert scripts: load model as late as possible * convert_mpt_hf_to_gguf.py: better tokenizer decoding * backend: use llamamodel.cpp for Falcon * convert scripts: make them directly executable * fix references to removed model types * modellist: fix the system prompt * backend: port GPT-J to GGUF * gpt-j: update inference to match latest llama.cpp insights - Use F16 KV cache - Store transposed V in the cache - Avoid unnecessary Q copy Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> ggml upstream commit 0265f0813492602fec0e1159fe61de1bf0ccaf78 * chatllm: grammar fix * convert scripts: use bytes_to_unicode from transformers * convert scripts: make gptj script executable * convert scripts: add feed-forward length for better compatiblilty This GGUF key is used by all llama.cpp models with upstream support. * gptj: remove unused variables * Refactor for subgroups on mat * vec kernel. * Add q6_k kernels for vulkan. * python binding: print debug message to stderr * Fix regenerate button to be deterministic and bump the llama version to latest we have for gguf. * Bump to the latest fixes for vulkan in llama. * llamamodel: fix static vector in LLamaModel::endTokens * Switch to new models2.json for new gguf release and bump our version to 2.5.0. * Bump to latest llama/gguf branch. * chat: report reason for fallback to CPU * chat: make sure to clear fallback reason on success * more accurate fallback descriptions * differentiate between init failure and unsupported models * backend: do not use Vulkan with non-LLaMA models * Add q8_0 kernels to kompute shaders and bump to latest llama/gguf. * backend: fix build with Visual Studio generator Use the $<CONFIG> generator expression instead of CMAKE_BUILD_TYPE. This is needed because Visual Studio is a multi-configuration generator, so we do not know what the build type will be until `cmake --build` is called. Fixes nomic-ai#1470 * remove old llama.cpp submodules * Reorder and refresh our models2.json. * rebase on newer llama.cpp * python/embed4all: use gguf model, allow passing kwargs/overriding model * Add starcoder, rift and sbert to our models2.json. * Push a new version number for llmodel backend now that it is based on gguf. * fix stray comma in models2.json Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * Speculative fix for build on mac. * chat: clearer CPU fallback messages * Fix crasher with an empty string for prompt template. * Update the language here to avoid misunderstanding. * added EM German Mistral Model * make codespell happy * issue template: remove "Related Components" section * cmake: install the GPT-J plugin (nomic-ai#1487) * Do not delete saved chats if we fail to serialize properly. * Restore state from text if necessary. * Another codespell attempted fix. * llmodel: do not call magic_match unless build variant is correct (nomic-ai#1488) * chatllm: do not write uninitialized data to stream (nomic-ai#1486) * mat*mat for q4_0, q8_0 * do not process prompts on gpu yet * python: support Path in GPT4All.__init__ (nomic-ai#1462) * llmodel: print an error if the CPU does not support AVX (nomic-ai#1499) * python bindings should be quiet by default * disable llama.cpp logging unless GPT4ALL_VERBOSE_LLAMACPP envvar is nonempty * make verbose flag for retrieve_model default false (but also be overridable via gpt4all constructor) should be able to run a basic test: ```python import gpt4all model = gpt4all.GPT4All('/Users/aaron/Downloads/rift-coder-v0-7b-q4_0.gguf') print(model.generate('def fib(n):')) ``` and see no non-model output when successful * python: always check status code of HTTP responses (nomic-ai#1502) * Always save chats to disk, but save them as text by default. This also changes the UI behavior to always open a 'New Chat' and setting it as current instead of setting a restored chat as current. This improves usability by not requiring the user to wait if they want to immediately start chatting. * Update README.md Signed-off-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> * fix embed4all filename https://discordapp.com/channels/1076964370942267462/1093558720690143283/1161778216462192692 Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * Improves Java API signatures maintaining back compatibility * python: replace deprecated pkg_resources with importlib (nomic-ai#1505) * Updated chat wishlist (nomic-ai#1351) * q6k, q4_1 mat*mat * update mini-orca 3b to gguf2, license Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * convert scripts: fix AutoConfig typo (nomic-ai#1512) * publish config https://docs.npmjs.com/cli/v9/configuring-npm/package-json#publishconfig (nomic-ai#1375) merge into my branch * fix appendBin * fix gpu not initializing first * sync up * progress, still wip on destructor * some detection work * untested dispose method * add js side of dispose * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/gpt4all.d.ts Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/gpt4all.js Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/util.js Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * fix tests * fix circleci for nodejs * bump version --------- Signed-off-by: Aaron Miller <apage43@ninjawhale.com> Signed-off-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> Co-authored-by: Aaron Miller <apage43@ninjawhale.com> Co-authored-by: Adam Treat <treat.adam@gmail.com> Co-authored-by: Akarshan Biswas <akarshan.biswas@gmail.com> Co-authored-by: Cebtenzzre <cebtenzzre@gmail.com> Co-authored-by: Jan Philipp Harries <jpdus@users.noreply.github.com> Co-authored-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> Co-authored-by: Alex Soto <asotobu@gmail.com> Co-authored-by: niansa/tuxifan <tuxifan@posteo.de>

Update to llama.cpp python: bump bindings version for AMD fixes update llama.cpp-mainline vulkan support for typescript bindings, gguf support (nomic-ai#1390) * adding some native methods to cpp wrapper * gpu seems to work * typings and add availibleGpus method * fix spelling * fix syntax * more * normalize methods to conform to py * remove extra dynamic linker deps when building with vulkan * bump python version (library linking fix) * Don't link against libvulkan. * vulkan python bindings on windows fixes * Bring the vulkan backend to the GUI. * When device is Auto (the default) then we will only consider discrete GPU's otherwise fallback to CPU. * Show the device we're currently using. * Fix up the name and formatting. * init at most one vulkan device, submodule update fixes issues w/ multiple of the same gpu * Update the submodule. * Add version 2.4.15 and bump the version number. * Fix a bug where we're not properly falling back to CPU. * Sync to a newer version of llama.cpp with bugfix for vulkan. * Report the actual device we're using. * Only show GPU when we're actually using it. * Bump to new llama with new bugfix. * Release notes for v2.4.16 and bump the version. * Fallback to CPU more robustly. * Release notes for v2.4.17 and bump the version. * Bump the Python version to python-v1.0.12 to restrict the quants that vulkan recognizes. * Link against ggml in bin so we can get the available devices without loading a model. * Send actual and requested device info for those who have opt-in. * Actually bump the version. * Release notes for v2.4.18 and bump the version. * Fix for crashes on systems where vulkan is not installed properly. * Release notes for v2.4.19 and bump the version. * fix typings and vulkan build works on win * Add flatpak manifest * Remove unnecessary stuffs from manifest * Update to 2.4.19 * appdata: update software description * Latest rebase on llama.cpp with gguf support. * macos build fixes * llamamodel: metal supports all quantization types now * gpt4all.py: GGUF * pyllmodel: print specific error message * backend: port BERT to GGUF * backend: port MPT to GGUF * backend: port Replit to GGUF * backend: use gguf branch of llama.cpp-mainline * backend: use llamamodel.cpp for StarCoder * conversion scripts: cleanup * convert scripts: load model as late as possible * convert_mpt_hf_to_gguf.py: better tokenizer decoding * backend: use llamamodel.cpp for Falcon * convert scripts: make them directly executable * fix references to removed model types * modellist: fix the system prompt * backend: port GPT-J to GGUF * gpt-j: update inference to match latest llama.cpp insights - Use F16 KV cache - Store transposed V in the cache - Avoid unnecessary Q copy Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> ggml upstream commit 0265f0813492602fec0e1159fe61de1bf0ccaf78 * chatllm: grammar fix * convert scripts: use bytes_to_unicode from transformers * convert scripts: make gptj script executable * convert scripts: add feed-forward length for better compatiblilty This GGUF key is used by all llama.cpp models with upstream support. * gptj: remove unused variables * Refactor for subgroups on mat * vec kernel. * Add q6_k kernels for vulkan. * python binding: print debug message to stderr * Fix regenerate button to be deterministic and bump the llama version to latest we have for gguf. * Bump to the latest fixes for vulkan in llama. * llamamodel: fix static vector in LLamaModel::endTokens * Switch to new models2.json for new gguf release and bump our version to 2.5.0. * Bump to latest llama/gguf branch. * chat: report reason for fallback to CPU * chat: make sure to clear fallback reason on success * more accurate fallback descriptions * differentiate between init failure and unsupported models * backend: do not use Vulkan with non-LLaMA models * Add q8_0 kernels to kompute shaders and bump to latest llama/gguf. * backend: fix build with Visual Studio generator Use the $<CONFIG> generator expression instead of CMAKE_BUILD_TYPE. This is needed because Visual Studio is a multi-configuration generator, so we do not know what the build type will be until `cmake --build` is called. Fixes nomic-ai#1470 * remove old llama.cpp submodules * Reorder and refresh our models2.json. * rebase on newer llama.cpp * python/embed4all: use gguf model, allow passing kwargs/overriding model * Add starcoder, rift and sbert to our models2.json. * Push a new version number for llmodel backend now that it is based on gguf. * fix stray comma in models2.json Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * Speculative fix for build on mac. * chat: clearer CPU fallback messages * Fix crasher with an empty string for prompt template. * Update the language here to avoid misunderstanding. * added EM German Mistral Model * make codespell happy * issue template: remove "Related Components" section * cmake: install the GPT-J plugin (nomic-ai#1487) * Do not delete saved chats if we fail to serialize properly. * Restore state from text if necessary. * Another codespell attempted fix. * llmodel: do not call magic_match unless build variant is correct (nomic-ai#1488) * chatllm: do not write uninitialized data to stream (nomic-ai#1486) * mat*mat for q4_0, q8_0 * do not process prompts on gpu yet * python: support Path in GPT4All.__init__ (nomic-ai#1462) * llmodel: print an error if the CPU does not support AVX (nomic-ai#1499) * python bindings should be quiet by default * disable llama.cpp logging unless GPT4ALL_VERBOSE_LLAMACPP envvar is nonempty * make verbose flag for retrieve_model default false (but also be overridable via gpt4all constructor) should be able to run a basic test: ```python import gpt4all model = gpt4all.GPT4All('/Users/aaron/Downloads/rift-coder-v0-7b-q4_0.gguf') print(model.generate('def fib(n):')) ``` and see no non-model output when successful * python: always check status code of HTTP responses (nomic-ai#1502) * Always save chats to disk, but save them as text by default. This also changes the UI behavior to always open a 'New Chat' and setting it as current instead of setting a restored chat as current. This improves usability by not requiring the user to wait if they want to immediately start chatting. * Update README.md Signed-off-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> * fix embed4all filename https://discordapp.com/channels/1076964370942267462/1093558720690143283/1161778216462192692 Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * Improves Java API signatures maintaining back compatibility * python: replace deprecated pkg_resources with importlib (nomic-ai#1505) * Updated chat wishlist (nomic-ai#1351) * q6k, q4_1 mat*mat * update mini-orca 3b to gguf2, license Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * convert scripts: fix AutoConfig typo (nomic-ai#1512) * publish config https://docs.npmjs.com/cli/v9/configuring-npm/package-json#publishconfig (nomic-ai#1375) merge into my branch * fix appendBin * fix gpu not initializing first * sync up * progress, still wip on destructor * some detection work * untested dispose method * add js side of dispose * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/gpt4all.d.ts Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/gpt4all.js Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/util.js Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * fix tests * fix circleci for nodejs * bump version --------- Signed-off-by: Aaron Miller <apage43@ninjawhale.com> Signed-off-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> Co-authored-by: Aaron Miller <apage43@ninjawhale.com> Co-authored-by: Adam Treat <treat.adam@gmail.com> Co-authored-by: Akarshan Biswas <akarshan.biswas@gmail.com> Co-authored-by: Cebtenzzre <cebtenzzre@gmail.com> Co-authored-by: Jan Philipp Harries <jpdus@users.noreply.github.com> Co-authored-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> Co-authored-by: Alex Soto <asotobu@gmail.com> Co-authored-by: niansa/tuxifan <tuxifan@posteo.de> ts/tooling (nomic-ai#1602) Updated readme for correct install instructions (nomic-ai#1607) Co-authored-by: aj-gameon <aj@gameontechnology.com> llmodel_c: improve quality of error messages (nomic-ai#1625) Add .gguf files to .gitignore and remove unused Dockerfile argument and app/__init__.py file Delete gpt4all-api/gpt4all_api/app/api_v1/routes/__init__.py Signed-off-by: Daniel Salvatierra <dsalvat1@gmail.com> Delete gpt4all-api/test.py Signed-off-by: Daniel Salvatierra <dsalvat1@gmail.com> Delete gpt4all-api/completiontest.py Signed-off-by: Daniel Salvatierra <dsalvat1@gmail.com> Revert "Delete gpt4all-api/completiontest.py" This reverts commit 08e8eea. Revert "Delete gpt4all-api/test.py" This reverts commit 7de26be. Delete test files for local LLM development Refactor code for improved readability and performance. Delete gpt4all-api/completiontest.py Signed-off-by: Daniel Salvatierra <dsalvat1@gmail.com> Delete gpt4all-api/test.py Signed-off-by: Daniel Salvatierra <dsalvat1@gmail.com> Refactor code for improved readability and performance. Resolve Delete test batched completion function with OpenAI API.

jacoobes added 7 commits September 1, 2023 23:07

adding some native methods to cpp wrapper

8585ff4

gpu seems to work

dc4b704

typings and add availibleGpus method

94c217c

fix spelling

0487394

fix syntax

ab72ef2

more

57cc5c3

normalize methods to conform to py

efde701

apage43 and others added 22 commits September 16, 2023 13:44

remove extra dynamic linker deps when building with vulkan

463d9cb

bump python version (library linking fix)

73ff1c4

Don't link against libvulkan.

71e2000

vulkan python bindings on windows fixes

e5b0d2d

Bring the vulkan backend to the GUI.

be83035

When device is Auto (the default) then we will only consider discrete…

246ba22

… GPU's otherwise fallback to CPU.

Show the device we're currently using.

74b4800

Fix up the name and formatting.

ea41e60

init at most one vulkan device, submodule update

299dabe

fixes issues w/ multiple of the same gpu

Update the submodule.

2a913ca

Add version 2.4.15 and bump the version number.

ce9f64e

Fix a bug where we're not properly falling back to CPU.

0a19cef

Sync to a newer version of llama.cpp with bugfix for vulkan.

11e459e

Report the actual device we're using.

3c5b5f0

Only show GPU when we're actually using it.

6eb6f23

Bump to new llama with new bugfix.

780da62

Release notes for v2.4.16 and bump the version.

b63c162

Fallback to CPU more robustly.

635b40d

Release notes for v2.4.17 and bump the version.

81bdcc7

Bump the Python version to python-v1.0.12 to restrict the quants that…

4570660

… vulkan recognizes.

Link against ggml in bin so we can get the available devices without …

3c9acad

…loading a model.

Send actual and requested device info for those who have opt-in.

d713c4c

jacoobes marked this pull request as ready for review October 20, 2023 03:42

jacoobes and others added 3 commits October 20, 2023 08:40

sync up

90c6c27

progress, still wip on destructor

ce6fb7c

some detection work

e129be4

cebtenzzre marked this pull request as draft October 21, 2023 14:31

jacoobes and others added 5 commits October 25, 2023 13:58

merge

177172f

untested dispose method

1721cdb

add js side of dispose

2fda274

Merge branch 'main' into feat(ts)/gpu

c94fb9f

Merge branch 'main' into feat(ts)/gpu

b878bea

iimez mentioned this pull request Oct 31, 2023

typescript: "Do you have runtime libraries installed?", null llmodel #1497

Closed

jacoobes marked this pull request as ready for review October 31, 2023 02:32

cebtenzzre reviewed Oct 31, 2023

View reviewed changes

jacoobes and others added 8 commits October 31, 2023 00:11

Update gpt4all-bindings/typescript/index.cc

150095b

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com>

Update gpt4all-bindings/typescript/index.cc

b2679cb

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com>

Update gpt4all-bindings/typescript/index.cc

3b8a17f

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com>

Update gpt4all-bindings/typescript/src/gpt4all.d.ts

354455c

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com>

Update gpt4all-bindings/typescript/src/gpt4all.js

a6a29b4

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com>

Update gpt4all-bindings/typescript/src/util.js

88d8e53

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com>

fix tests

68e83d2

Merge branch 'main' into feat(ts)/gpu

a206269

jacoobes requested a review from cebtenzzre November 1, 2023 17:53

cebtenzzre approved these changes Nov 1, 2023

View reviewed changes

jacoobes added 2 commits November 1, 2023 13:43

fix circleci for nodejs

74e242b

bump version

661b522

jacoobes merged commit da95bcf into main Nov 1, 2023
3 of 4 checks passed

jacoobes deleted the feat(ts)/gpu branch November 1, 2023 19:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vulkan support for typescript bindings, gguf support #1390

vulkan support for typescript bindings, gguf support #1390

jacoobes commented Sep 2, 2023

iimez commented Sep 16, 2023

iimez commented Oct 31, 2023

vulkan support for typescript bindings, gguf support #1390

vulkan support for typescript bindings, gguf support #1390

Conversation

jacoobes commented Sep 2, 2023

Describe your changes

Issue ticket number and link

Checklist before requesting a review

Demo

Steps to Reproduce

Notes

iimez commented Sep 16, 2023

iimez commented Oct 31, 2023