A Cython-based inference backend for Keras ( ... in the making ).
In the future I will prepare a single setup.py file that compiles everything. You will then be able to add a cython_backend.py to the Keras code and specify it as the backend. For now, usage is somewhat manual:
- Compile the relevant .pyx file by editing setup.py and running `python setup.py build_ext --inplace`.
- Extract the weights of each layer in your model.
- Write the model as a class, like you would in PyTorch, using the Cython functions and the weights you extracted from the model ( see the sketches below ).
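For reference, here is what a minimal setup.py for the compile step could look like ( the .pyx file name is an assumption, point it at the layer file you actually need ):

```python
# setup.py -- minimal sketch; adjust "convolutions.pyx" to the layer file you want to build.
import numpy as np
from setuptools import setup
from Cython.Build import cythonize

setup(
    ext_modules=cythonize("convolutions.pyx"),
    include_dirs=[np.get_include()],  # the layers work on numpy arrays
)
```

And a rough sketch of the last two steps, extracting the weights and writing the model as a class. The module name and the call convention of the Cython functions are assumptions here; check the signatures in the .pyx file you compiled:

```python
import numpy as np
from keras.models import load_model
from convolutions import conv2d_with_padding  # the compiled .so; module name assumed

keras_model = load_model("my_model.h5")

# Extract the weights ( kernel, bias ) of each layer as plain numpy arrays.
weights = [w for w in (layer.get_weights() for layer in keras_model.layers) if w]

class CynfrenceModel:
    """Hand-written inference model, pytorch-style, built from the cython layers."""

    def __init__(self, weights):
        # Assumes two conv layers, each with a kernel and a bias.
        (self.w0, self.b0), (self.w1, self.b1) = weights[0], weights[1]

    def forward(self, x):
        # Assumed call convention: ( input, kernel, bias, strides, dilations ).
        x = conv2d_with_padding(x, self.w0, self.b0, (1, 1), (1, 1))
        x = conv2d_with_padding(x, self.w1, self.b1, (1, 1), (1, 1))
        return x

model = CynfrenceModel(weights)
```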
Keras is great for training. Keras backends ( other than Theano, which is no longer maintained ) are hard to install on target computers ( especially ARM computers ), and they are not optimized out of the box.
cynfrence is meant to be a super lightweight set of code that can automatically turn Keras models into Cython-compiled .so ( or .pyd on Windows ) files that run the network with almost no overhead.
Since cynfrence will be pure Cython, it will compile easily on any OS with optimizations appropriate for the target machine.
The vision: load a Keras model, have it automatically parsed and reconstructed as a .pyx + setup.py pair, and compile it into an .so. The weights will be hard-coded and the layers will be highly specialized, so they will have very little overhead. This could allow for possibly the fastest inference system available on CPU-only computers ( especially ARM computers ( IoT ) ).
Current state:
- There are currently 4 layers available that allow inference of fully 2D convolutional networks:
- conv2d_no_padding
- conv2d_with_padding
- seperable_conv2d_no_padding
- seperable_conv2d_with_padding

Strides and dilations are supported ( somewhat tested ). All conventions follow TensorFlow for compatibility; a rough correctness check against Keras is sketched below.
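One way to check TensorFlow compatibility is to compare a layer's output against Keras directly. This is only a sketch: the module name and the argument order of conv2d_with_padding are assumptions, and whether it expects a batch dimension needs to be checked against the actual .pyx code.

```python
import numpy as np
from keras.layers import Conv2D, Input
from keras.models import Model
from convolutions import conv2d_with_padding  # compiled .so; module name assumed

# Reference Keras layer with "same" padding ( TensorFlow convention ).
inp = Input(shape=(32, 32, 3))
ref = Model(inp, Conv2D(8, (3, 3), strides=(1, 1), padding="same")(inp))
kernel, bias = ref.layers[-1].get_weights()

x = np.random.rand(1, 32, 32, 3).astype(np.float32)
keras_out = ref.predict(x)[0]

# Assumed call convention: ( input, kernel, bias, strides, dilations ).
cython_out = conv2d_with_padding(x[0], kernel, bias, (1, 1), (1, 1))

print(np.allclose(keras_out, cython_out, atol=1e-5))
```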
Here is where I would very much appreciate your help... ;)

Short term:
- Create thorough tests.
- Create a simple translator from Keras models to Cython using the available layers ( a rough sketch of one possible approach is shown below ).
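A very rough sketch of what such a translator could look like: walk the Keras model and emit one call per layer. The function names on the right are the existing cython layers; the emitted call convention and everything else here is an assumption.

```python
from keras.models import load_model

# Map ( Keras class name, padding mode ) to the existing cython layer functions.
LAYER_MAP = {
    ("Conv2D", "valid"): "conv2d_no_padding",
    ("Conv2D", "same"): "conv2d_with_padding",
    ("SeparableConv2D", "valid"): "seperable_conv2d_no_padding",
    ("SeparableConv2D", "same"): "seperable_conv2d_with_padding",
}

def translate(model_path):
    """Emit the body of a forward() pass for the given Keras model ( sketch only )."""
    model = load_model(model_path)
    lines = []
    for i, layer in enumerate(model.layers):
        cfg = layer.get_config()
        key = (type(layer).__name__, cfg.get("padding", "valid"))
        if key in LAYER_MAP:
            lines.append(
                f"x = {LAYER_MAP[key]}(x, *weights[{i}], "
                f"{cfg['strides']}, {cfg['dilation_rate']})"
            )
    return "\n".join(lines)
```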
Longer term:
- Make the installation super simple and pain-free while keeping very high performance.
Long long term:
- Add all Keras layers with full feature support:
- 1D, 3D, transpose convolutions
- Recurrent layers ( GRU, LSTM, SimpleRNN... )
- Add specialized optimizations, such as FFT-based convolutions when the kernel is larger than 7x7 ( or even run a set of tests on the target machine to evaluate which optimizations are best to use ).
- Fork Keras and add cynfrence as a backend with all the learning capabilities turned off.
I'm slowly adding all generic Keras layers. If you need a specific layer, feel free to PM me and I'll work on it in advance. You can also just try to create it yourself - it's super easy given the layers that already exist.
Only the layers which are selected for development are listed:
- Core layers
- Dense
- Activation
- Flatten
- Reshape
- Convolutional layers
- Conv1D
- Conv2D
- SeparableConv1D
- SeparableConv2D
- Conv2DTranspose
- Conv3D
- Cropping1D
- Cropping2D
- Cropping3D
- UpSampling1D
- UpSampling2D
- UpSampling3D
- ZeroPadding1D
- ZeroPadding2D
- ZeroPadding3D
- Pooling layers
- MaxPooling1D
- MaxPooling2D
- MaxPooling3D
- AveragePooling1D
- AveragePooling2D
- AveragePooling3D
- GlobalMaxPooling1D
- GlobalMaxPooling2D
- GlobalMaxPooling3D
- GlobalAveragePooling1D
- GlobalAveragePooling2D
- GlobalAveragePooling3D
- Recurrent Layers
- RNN
- SimpleRNN
- GRU
- LSTM
- Merge Layers
- Add
- Subtract
- Multiply
- Average
- Maximum
- Concatenate
- Normalization Layers
- BatchNormalization
- Activations
- softmax
- relu
- tanh
- sigmoid
- hard_sigmoid
- None / linear