I have a trained PyTorch model that I would now like to export to Caffe2 using ONNX. This part seems fairly simple and well documented. However, I now want to "load" that model into a Java program in order to perform predictions within my program (a Flink streaming application). What is the best way to do this? I haven't been able to find any documentation on the website describing how to do this.
Currently it's a bit tricky, but there is a way. You will need JavaCPP together with its ONNX and nGraph presets (the org.bytedeco onnx and ngraph preset artifacts):
I will use single_relu.onnx as an example:
// Imports for the JavaCPP presets for ONNX and nGraph
// (package names follow the org.bytedeco preset convention)
import java.nio.file.Files;
import java.nio.file.Paths;
import org.bytedeco.javacpp.BytePointer;
import org.bytedeco.javacpp.FloatPointer;
import org.bytedeco.ngraph.Backend;
import org.bytedeco.ngraph.Executable;
import org.bytedeco.ngraph.Function;
import org.bytedeco.ngraph.NgraphTensorVector;
import org.bytedeco.ngraph.Shape;
import org.bytedeco.ngraph.SizeTVector;
import org.bytedeco.ngraph.Tensor;
import org.bytedeco.onnx.ModelProto;
import org.bytedeco.onnx.StringVector;
import static org.bytedeco.ngraph.global.ngraph.*;
import static org.bytedeco.onnx.global.onnx.*;

public class SingleReluExample {
    public static void main(String[] args) throws Exception {
        // read ONNX
        byte[] bytes = Files.readAllBytes(Paths.get("single_relu.onnx"));
        ModelProto model = new ModelProto();
        ParseProtoFromBytes(model, new BytePointer(bytes), bytes.length); // parse ONNX -> protobuf model

        // preprocess model in any way you like (you can skip this step)
        check_model(model);
        InferShapes(model);
        StringVector passes = new StringVector("eliminate_nop_transpose", "eliminate_nop_pad",
                "fuse_consecutive_transposes", "fuse_transpose_into_gemm");
        Optimize(model, passes);
        check_model(model);
        ConvertVersion(model, 8); // convert to opset version 8
        BytePointer serialized = model.SerializeAsString();
        System.out.println("model=" + serialized.getString());

        // prepare nGraph backend and tensors of shape [1, 2]
        Backend backend = Backend.create("CPU");
        Shape shape = new Shape(new SizeTVector(1, 2));
        Tensor input = backend.create_tensor(f32(), shape);
        Tensor output = backend.create_tensor(f32(), shape);

        // fill the input tensor (example values; a ReLU should map {-1, 2} to {0, 2})
        float[] in = {-1.0f, 2.0f};
        input.write(new FloatPointer(in), 0, in.length * 4); // length in bytes

        Function ngFunction = import_onnx_model(serialized); // convert ONNX -> nGraph
        Executable exec = backend.compile(ngFunction);
        exec.call(new NgraphTensorVector(output), new NgraphTensorVector(input)); // outputs first, then inputs

        // collect result into an array
        float[] r = new float[2];
        FloatPointer p = new FloatPointer(r);
        output.read(p, 0, r.length * 4); // length in bytes
        p.get(r);

        // print result
        System.out.println("[");
        for (int i = 0; i < shape.get(0); i++) {
            System.out.print(" [");
            for (int j = 0; j < shape.get(1); j++) {
                System.out.print(r[i * (int) shape.get(1) + j] + " ");
            }
            System.out.println("]");
        }
        System.out.println("]");
    }
}