Ampere AI

All

41 repositories

sentence-transformers
Public
State-of-the-Art Text Embeddings
Python
•
Apache License 2.0
•2.5k•0•0•0•Updated Dec 13, 2024Dec 13, 2024
ampere-ai-llama-chat
Public
Python
•0•1•2•0•Updated Dec 12, 2024Dec 12, 2024
transformers
Public
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
•
Apache License 2.0
•27k•0•0•0•Updated Dec 5, 2024Dec 5, 2024
ampere_model_library
Public
AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :)
machine-learning natural-language-processing computer-vision model-zoo tensorflow inference pytorch artificial-intelligence arm64 aarch64
Python
•
Apache License 2.0
•7•21•7•6•Updated Dec 4, 2024Dec 4, 2024
llama.cpp
Public
Ampere optimized llama.cpp
meta ai llama arm64 ampere llm llamacpp
Python
•0•8•4•1•Updated Nov 7, 2024Nov 7, 2024
releases_meta
Public
0•0•0•0•Updated Oct 16, 2024Oct 16, 2024
llama-scaling
Public
Llama3-8B scale out scripts
Python
•0•0•0•0•Updated Sep 7, 2024Sep 7, 2024
ampere-ai-serge-chat
Public
Shell
•1•1•1•0•Updated Aug 28, 2024Aug 28, 2024
reviewers_day
Public
Scripts to reproduce AI results on AmpereOne platform.
Jupyter Notebook
•0•1•0•0•Updated Aug 19, 2024Aug 19, 2024
tensorflow-serving
Public
Fork of tensorflow serving for ARM64 build
C++
•
Apache License 2.0
•2•2•0•0•Updated Jul 31, 2024Jul 31, 2024
local-rag
Public
Python
•1•0•0•0•Updated May 22, 2024May 22, 2024
llm_app_frameworks
Public
Integrating Ampere's high performance LLM inference with popular application building frameworks in the industry
Python
•
Apache License 2.0
•1•0•3•0•Updated May 22, 2024May 22, 2024
ampere-ai-ref-apps
Public
Shell
•
Apache License 2.0
•1•0•0•0•Updated May 16, 2024May 16, 2024
whisper
Public
Robust Speech Recognition via Large-Scale Weak Supervision
Python
•
MIT License
•8.8k•0•0•1•Updated Apr 25, 2024Apr 25, 2024
llama-cpp-python
Public
Python
•
MIT License
•0•1•0•1•Updated Apr 16, 2024Apr 16, 2024
AutoGPTQ
Public
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Python
•
MIT License
•491•1•1•0•Updated Mar 13, 2024Mar 13, 2024
cloud-ai-sdk
Public
Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high throughput and low latency across Computer Vision, Object Detection, Natural Language Processing and Generative AI models.
Jupyter Notebook
•
Other
•7•0•0•0•Updated Mar 11, 2024Mar 11, 2024
aio-examples
Public
Python
•3•5•2•0•Updated Mar 5, 2024Mar 5, 2024
llama_index
Public
LlamaIndex is a data framework for your LLM applications
Python
•
MIT License
•5.4k•0•0•0•Updated Mar 4, 2024Mar 4, 2024
transformers-deprecated
Public
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Python
•
Apache License 2.0
•27k•0•0•0•Updated Dec 16, 2023Dec 16, 2023
stable-diffusion-webui
Public
Stable Diffusion web UI
Python
•
GNU Affero General Public License v3.0
•27k•0•0•0•Updated Nov 24, 2023Nov 24, 2023
stablediffusion
Public
High-Resolution Image Synthesis with Latent Diffusion Models
Python
•
MIT License
•5.1k•0•0•0•Updated Nov 24, 2023Nov 24, 2023
images
Public
0•0•0•0•Updated Jun 20, 2023Jun 20, 2023
yolov5-demo
Public
Shell
•
Apache License 2.0
•2•3•0•0•Updated May 24, 2023May 24, 2023
ai-poc-benchmarks
Public
Shell
•0•0•0•0•Updated Apr 26, 2023Apr 26, 2023
inference_results_v2.0
Public
Python
•12•0•0•0•Updated Mar 13, 2023Mar 13, 2023
Paddle
Public
Fork of PaddlePaddle framework
C++
•
Apache License 2.0
•0•0•0•0•Updated Mar 2, 2023Mar 2, 2023
server
Public
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Python
•
BSD 3-Clause "New" or "Revised" License
•1.5k•0•0•0•Updated Mar 2, 2023Mar 2, 2023
tensorrt_backend
Public
The Triton backend for TensorRT.
C++
•
BSD 3-Clause "New" or "Revised" License
•30•0•0•0•Updated Mar 2, 2023Mar 2, 2023
oneDNN
Public
Fork of oneDNN
C++
•
Apache License 2.0
•0•0•1•0•Updated Mar 1, 2023Mar 1, 2023