onnxer

Pure Go bindings for ONNX Runtime using ebitengine/purego.

This library provides a pure Go interface to ONNX Runtime without requiring cgo, enabling cross-platform machine learning inference in Go applications.

Why onnxer?

Pure Go — no CGO required. Cross-compiles everywhere Go does.
GenAI support — text generation and multimodal inference via ONNX Runtime GenAI.
Multi-version API — supports ORT 1.23.x and 1.24.x simultaneously.
Generics tensor API — type-safe NewTensorValue[T] / GetTensorData[T] with compile-time checks.
Context cancellation — context.Context wired through to ORT RunOptions for real cancellation.
Session pooling — goroutine-safe SessionPool with built-in metrics and observability hooks.
Profiling — built-in ORT profiling support for diagnosing per-operator latency.
LoRA adapters — hot-swap fine-tuned adapters per inference run without reloading the base model.
Comprehensive — string tensors, IO binding, model metadata, type introspection, Float16/BFloat16, sequence/map outputs.

Feature Comparison

Feature	onnxer	onnxruntime_go
Pure Go (no CGO)	Yes	No
GenAI support	Yes	No
Multi-version API (v23+v24)	Yes	No
Generics tensor API	Yes	No
String tensors	Yes	Yes
Session options (graph opt, threading, memory)	Yes	Yes
Model metadata	Yes	Yes
Context cancellation (wired to ORT)	Yes	No
Session pooling with metrics	Yes	No
Inference hooks (observability)	Yes	No
IO binding	Yes	Yes
Type introspection	Yes	Yes
Sequence/Map outputs	Yes	Yes
Float16/BFloat16	Yes	Yes
Profiling (per-operator timing)	Yes	No
LoRA adapter hot-swap	Yes	No
Optimized model caching	Yes	No
Zero-copy tensor access	Yes	No
Symbolic dimension introspection	Yes	No
Dynamic dimension overrides	Yes	Yes
Deterministic compute mode	Yes	No
Run tagging (log correlation)	Yes	No
IO binding synchronization	Yes	No
Prepacked weights sharing (pool)	Yes	No
Global thread pools	Yes	No
Race-tested concurrent pool	Yes	No

Supported Versions

Library	Supported Version
ONNX Runtime	1.23.x, 1.24.x
ONNX Runtime GenAI	0.11.x

Prerequisites

You need to have the ONNX Runtime shared library installed on your system:

macOS: libonnxruntime.dylib
Linux: libonnxruntime.so
Windows: onnxruntime.dll

Download the appropriate library from the ONNX Runtime releases.

The library will be automatically discovered if placed in standard system locations:

macOS: /usr/local/lib, /opt/homebrew/lib, /usr/lib
Linux: /usr/local/lib, /usr/lib, /lib
Windows: Standard DLL search paths

Alternatively, you can specify a custom path when creating the runtime.

Installation

go get github.com/benedoc-inc/onnxer

Quick Start

package main

import (
	"context"
	"fmt"
	"os"

	ort "github.com/benedoc-inc/onnxer/onnxruntime"
)

func main() {
	runtime, _ := ort.NewRuntime("", 23)
	defer runtime.Close()

	env, _ := runtime.NewEnv("example", ort.LoggingLevelWarning)
	defer env.Close()

	f, _ := os.Open("model.onnx")
	defer f.Close()

	session, _ := runtime.NewSessionFromReader(env, f, &ort.SessionOptions{
		IntraOpNumThreads: 4,
		GraphOptimization: ort.GraphOptimizationAll,
	})
	defer session.Close()

	input, _ := ort.NewTensorValue(runtime, []float32{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}, []int64{1, 10})
	defer input.Close()

	outputs, _ := session.Run(context.Background(), map[string]*ort.Value{
		session.InputNames()[0]: input,
	})

	data, shape, _ := ort.GetTensorData[float32](outputs[session.OutputNames()[0]])
	fmt.Printf("Output shape: %v, data: %v\n", shape, data)
}

One-Line Model Loading

For simple use cases, Model wraps Runtime + Env + Session into a single object:

model, _ := ort.LoadModelFromFile("model.onnx", &ort.ModelConfig{
    SessionOptions: &ort.SessionOptions{
        IntraOpNumThreads: 4,
        GraphOptimization: ort.GraphOptimizationAll,
    },
})
defer model.Close()

outputs, _ := model.Run(ctx, map[string]*ort.Value{"input": tensor})

Session Pooling

SessionPool manages multiple sessions for safe concurrent inference from many goroutines:

pool, _ := ort.NewSessionPool(runtime, env, modelBytes, 8, &ort.PoolConfig{
    Hooks: []ort.Hook{
        ort.AfterRunHook(func(info *ort.RunInfo) {
            log.Printf("inference took %v", info.Duration)
        }),
    },
})
defer pool.Close()

// Safe to call from many goroutines concurrently:
outputs, _ := pool.Run(ctx, map[string]*ort.Value{"input": tensor})

// Built-in metrics:
stats := pool.Stats()
fmt.Printf("runs=%d avg=%v errors=%d\n", stats.TotalRuns, stats.AvgLatency(), stats.TotalErrors)

Examples

See the examples/ directory for complete usage examples:

resnet — Image classification
roberta-sentiment — Sentiment analysis
yolov10 — Object detection
string-tensor — String tensor inputs for NLP
metadata — Model introspection
pool — Concurrent inference with session pooling, hooks, warm-up
profiling — Per-operator profiling and latency analysis
lora — LoRA adapter hot-swap for fine-tuned models
io-binding — IO binding for optimized repeated inference
global-threads — Global thread pools with prepacked weights
cancellation — Context-based cancellation
genai/phi3 — Text generation with Phi-3
genai/phi3.5-vision — Multimodal vision-language

ONNX Runtime GenAI Support

This library also includes experimental support for ONNX Runtime GenAI, enabling text generation with large language models. See the GenAI examples for details.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.githooks		.githooks
.github/workflows		.github/workflows
examples		examples
genai		genai
internal/cstrings		internal/cstrings
onnxruntime		onnxruntime
tools/codegen		tools/codegen
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile.dev		Dockerfile.dev
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
compose.yml		compose.yml
download.sh		download.sh
download_genai.sh		download_genai.sh
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

onnxer

Why onnxer?

Feature Comparison

Supported Versions

Prerequisites

Installation

Quick Start

One-Line Model Loading

Session Pooling

Examples

ONNX Runtime GenAI Support

About

Uh oh!

Releases 8

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

onnxer

Why onnxer?

Feature Comparison

Supported Versions

Prerequisites

Installation

Quick Start

One-Line Model Loading

Session Pooling

Examples

ONNX Runtime GenAI Support

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages