InftyAI/PUMA

PUMA

Puma aims to be a lightweight, high-performance inference engine for heterogeneous devices. It is currently under active development.

How to Run

Build

Run make build to build the puma binary.

Run

Run ./puma help to see all available commands.

For example, you can run ./puma version to see the binary version.
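The build and run steps above can be chained into a quick smoke check. This is a minimal sketch, not part of the repository: the function name `build_and_check` is hypothetical, and it assumes you are in the root of a PUMA checkout with `make` installed.

```shell
# Hypothetical helper (not shipped with Puma): build the binary, then
# confirm it runs. Assumes the current directory is the repository root.
build_and_check() {
    make build || return 1     # build the puma binary
    ./puma help || return 1    # list all available commands
    ./puma version             # print the binary version
}
```

Call `build_and_check` from the repository root. It returns a non-zero status if the build or any command fails.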

Supported Backends

Puma uses llama.cpp as the default backend for quick prototyping. We plan to implement our own backend in the future.
