Blue3 port by bathan1 · Pull Request #12 · JHU-PL-Lab/caprice-lang

bathan1 · 2026-04-23T21:05:20Z

Features added

Dedicated solver loop for handling easy formulas:

solve.ml

Integer Difference Logic solver:

integer.ml

Boolean text parser

boolean.ml

Overview

I want to merge in my toy SMT solver ("Blue3") into the concolic evaluator so it can attempt to fast-solve certain formulas. The IDL solver and Boolean text parser are more-or-less ported over 1-to-1. The primary change from the port is writing out a basic DPLL (T) solver loop.

Benchmarks

Here are the benchmark results you can find by inputting analysis.sql into an SQLite database
after running the benchmark script:

What were the average runtimes from both solvers?

avg_blue3	avg_z3
292.0μs	304.0μs

Rows with the MIN times from both solvers

trial_num	formula_id	formula	time_us_blue3	time_us_z3	which_min
0	64	(0 < a) ^ (a < 0)	3.814697	112.056732	blue3_min
2	30	(6 <= a) ^ (a < 0)	3.814697	122.070312	blue3_min
3	36	(2 < a) ^ (a < 0)	3.814697	133.037567	blue3_min
3	89	(1 <= a) ^ (0 < a) ^ (not ((a % 1) = 0))	113.010406	69.856644	z3_min
4	30	(6 <= a) ^ (a < 0)	3.814697	111.103058	blue3_min

Rows with the MAX times from both solvers

trial_num	formula_id	formula	time_us_blue3	time_us_z3	which_min
0	108	(c <= (b % a)) ^ (c <= a) ^ (0 < c) ^ (0 < a) ^ (0 < b) ^ (n	8136.034012	3366.947174	blue3_min
		ot ((b % a) = 0)) ^ (not (c = 0)) ^ (not (a = 0)) ^ (((b * a
		) / c) < b)

How much faster were the fast cases on average?

num_fast_cases	avg_faster_by	avg_percent_faster
345	110.83μs	49.6%

How much slower were the slow cases on average?

num_slow_cases	avg_slower_by	avg_percent_slower
548	50.25μs	9.81%

What was the max time difference blue3 beat z3 by?

max_diff
836.0μs

What was the max time difference z3 beat blue3 by?

max_diff
4769.0μs

brandonzstride

My comments are getting higher and higher level. I responded to an existing comment on to_propositional that will be especially impactful, I believe. To save energy in expectation of somewhat drastic changes in the wake of that comment, my review of integer.ml and solve.ml was rather quick.

I can really see this coming along now. Good work in handling so many of the comments from last time!

brandonzstride · 2026-04-28T14:40:41Z

+let rec contains_binop : type a k. _ Binop.t -> (a, k) t -> bool =
+ fun target -> function
+  | Binop (op, l, r) ->
+      Binop.poly_equal op target || contains_binop target l
+      || contains_binop target r
+  | _ -> false


The change only covers not, which was strictly an example. The point is that this is not comprehensive, for example, And is not explored.

brandonzstride · 2026-04-28T14:45:55Z

+  | Binop
+    ( ((Less_than_eq | Less_than | Equal) as binop),
+    Const_int c,
+    Binop (((Plus | Minus) as op), Key (I a), Key (I b)) ) -> (
+      match op with
+      | Plus ->
+        (* c <= a + b  ==>  c - b <= a *)
+        Formula.binop binop
+          (Formula.binop Minus (Formula.const_int c) (int_symbol b))
+          (int_symbol a)
+      | Minus ->
+        (* c <= a - b  ==>  c + b <= a *)
+        Formula.binop binop
+          (Formula.binop Plus (Formula.const_int c) (int_symbol b))
+          (int_symbol a)
+      | _ -> failwith "unreachable")


This suggestion applies to many cases above where convenient. The current code is reconstructing formulas when not necessary.

Suggested change

| Binop

( ((Less_than_eq | Less_than | Equal) as binop),

Const_int c,

Binop (((Plus | Minus) as op), Key (I a), Key (I b)) ) -> (

match op with

| Plus ->

(* c <= a + b ==> c - b <= a *)

Formula.binop binop

(Formula.binop Minus (Formula.const_int c) (int_symbol b))

(int_symbol a)

| Minus ->

(* c <= a - b ==> c + b <= a *)

Formula.binop binop

(Formula.binop Plus (Formula.const_int c) (int_symbol b))

(int_symbol a)

| _ -> failwith "unreachable")

| Binop

( ((Less_than_eq | Less_than | Equal) as binop),

Const_int c,

Binop (((Plus | Minus) as op), (Key _ as a), (Key _ as b)) ) -> (

match op with

| Plus ->

(* c <= a + b ==> c - b <= a *)

Formula.binop binop

(Formula.binop Minus (Formula.const_int c) b)

a

| Minus ->

(* c <= a - b ==> c + b <= a *)

Formula.binop binop

(Formula.binop Plus (Formula.const_int c) b)

a

| _ -> failwith "unreachable")

So that was what I was trying to do initially as well, but for cases where I explicitly reconstruct, it's because I come across an error like this:

The value b has type ($a1, 'a) Formula.t but an expression was expected of type (int, 'b) Formula.t Type $a1 is not compatible with type int Hint: $a1 is an existential type bound by the constructor Binop.

I agree with your approach since that also reduces the number of total function calls that need to be made using my solver. How can we make this work?

brandonzstride · 2026-04-28T14:54:29Z

+let to_propositional
+    ?(to_symbol : int -> (bool, 'k) Symbol.t =
+      fun uid -> uid |> Uid.of_int |> fun uid -> Symbol.B uid)
+    (formula : (bool, 'k) Formula.t) =


The output is a pure SAT boolean formula whose symbols are associated with SMT formulas, then you should have a pure SAT boolean formula type to represent this. That way there is no mixing. This also would mean we can take out Or from the "occurs" binary operations, as we discussed offline at one point. So please make a boolean formula type instead of reusing Formula.t.

The current assumptions you make do not hold. For example, it seems like if k0 is the way I write a key with uid 0, then the following formula has a conflict. (k0) ^ (k1 = k2). The first formula falls into the | expr -> expr case and is untouched, and the second formula can be called k0 via to_symbol. If you argue that this Char.code 'p' thing changes anything, then just consider if k0 was replaced by whatever to_symbol 0 returns. Hence to_propositional (k0 ^ (k1 = k2)) returns (k0 ^ k0), which is bad.

brandonzstride · 2026-04-28T15:22:56Z

+let rec is_idl_solvable : type k. (bool, k) Formula.t -> bool =
+  fun formula ->
+    match formula with
+    | Formula.And clauses ->
+        List.for_all is_idl_solvable clauses
+    | clause ->
+        is_idl_clause clause


This isn't especially important, but it is worth noting that this level of polymorphism on the key type k is not needed. The type k. syntax is necessary for polymorphic recursion, which means the type that k takes on may change within recursive calls. Here though, k just needs to be a standard polymorphic type variable. A locally abstract type (with (type k)) suffices to enforce the polymorphism necessary.

It may be helpful to some readers (including myself) to use exactly the required amount of polymorphic annotations.

Suggested change

let rec is_idl_solvable : type k. (bool, k) Formula.t -> bool =

fun formula ->

match formula with

| Formula.And clauses ->

List.for_all is_idl_solvable clauses

| clause ->

is_idl_clause clause

let rec is_idl_solvable (type k) (formula : (bool, k) Formula.t) : bool =

match formula with

| Formula.And clauses -> List.for_all is_idl_solvable clauses

| clause -> is_idl_clause clause

P.S. a simple 'k here (without the full (type k)) may even be best, but because it doesn't actually enforce any polymorphism, it technically is not the minimally required annotation. Some would say (type k) should be reserved for GADTs, and 'k. (bool, 'k) Formula.t -> bool here is the most appropriate if you annotate types at all.

Ended up having to use the : type k ... = fun syntax

Huh. Yeah I think that relates a certain OCaml issue that I don't want to link for sake of avoiding noise. But this works:

let rec is_idl_solvable (e : (bool, 'k) Formula.t) : bool = match e with | Formula.And clauses -> List.for_all is_idl_solvable clauses | clause -> is_idl_clause clause

Sorry for the originally wrong suggestion.

brandonzstride · 2026-04-28T15:48:02Z

+let to_string
+  (type a k)
+  (model : k t)
+  ~(uid : Uid.t -> (a, k) Symbol.t * string)
+  : string =


This function can only print either the int or bool mappings from the model. Did you mean to have a polymorphic uid function? Like this?

let to_string (model : 'k t) ~(uid : 'a. Uid.t -> ('a, 'k) Symbol.t * string) : string =

Alternatively, you may just make an assumption that the uid is packed into a key as-is, quite like is done in Grammar.Input_env.of_model. I think that makes usage easiest.

I moved the packing logic from Grammar to be owned by Model:

type 'k key = | Bool_key : (bool, 'k) Symbol.t -> 'k key | Int_key : (int, 'k) Symbol.t -> 'k key type 'k t = { value : 'a. ('a, 'k) Symbol.t -> 'a option ; domain : 'k key list }

Now the to_string looks like this:

let to_string (type k) (model : k t) ~(key : k key -> string) : string = let indent = " " in

Let me know if I did that wrong.

bathan1 and others added 25 commits April 18, 2026 13:30

Initial commit for port

bfc0632

Placeholder simplifier

799f1c1

Stubs

e17e5ba

Port over rewrite

3a01e2d

Finish rewrite_int_bounds

cf356b2

Add bellman ford solver

c53938d

Rebase

0b32699

Add boolean module

c3d4401

Almost finish solve loop

f58930b

Finish main solve loop from jaylang

699f1f2

Fix bellman_ford

c642a3b

Fix int rewriter

90923ea

Almost finish solve loop

f3dd3fc

Turn dpll into solver

253863c

Finish dpll theory solver

1a55847

Get workings solver

8043188

Finish port draft

4393197

add yojson as dependency

5a0aa61

configure test directory from makefile

bfd9a94

bump ocaml version

5dacfd8

add types to binary operators that occur in formulas

29460bb

Log inconsistent SATs

4f90943

Add clause checking

37160c2

Fix pipeline to not use rewrite double

99dd0a0

Add pipeline changes

cf9d9ab

bathan1 added the draft label Apr 23, 2026

Add analysis

500822a

bathan1 assigned bathan1 and brandonzstride and unassigned bathan1 Apr 23, 2026

brandonzstride reviewed Apr 28, 2026

View reviewed changes

bathan1 added 29 commits April 30, 2026 01:33

Pack symbol type into Model

812780d

Fix formatting in Integer and add extend function to Solve

308439e

Fix more formatting issues

98bb51e

Fix neq eq bug

5f63b51

wip: cdcl conflict analysis done

7f003e8

Simplify

9feb20a

wip2: finish cdcl

07b1d0f

Finish cdcl t

579a568

Got something to work

93159dc

Fix predecessor bug

2887006

Remove offsetting

3bce793

Fix some formatting

0f74bb9

Connect to cdcl

6fbadd5

Get cdcl to work with idl

153ebc0

Almost done

7fd6223

Debug

274de0c

Debug linearize

8618768

Debug prints

a9c4196

Make implied_concretization

8d9e9cf

Fix rebuild

d64638e

Fix pipeline

e753875

wip: one case left

5d65652

check out with euf

c5dd291

Done

a3a39bf

Fix cnf bug

487159d

Cleanup bellman_ford

a4c1d9f

Cleanup connector

c0ead28

Fix UF

a7f5399

Just use no

84dfa9b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Blue3 port#12

Blue3 port#12
bathan1 wants to merge 75 commits intomainfrom
blue3-port

bathan1 commented Apr 23, 2026

Uh oh!

brandonzstride left a comment

Uh oh!

Uh oh!

Uh oh!

brandonzstride Apr 28, 2026

Uh oh!

brandonzstride Apr 28, 2026

Uh oh!

bathan1 Apr 30, 2026

Uh oh!

brandonzstride Apr 28, 2026

Uh oh!

Uh oh!

brandonzstride Apr 28, 2026

Uh oh!

bathan1 Apr 30, 2026

Uh oh!

brandonzstride Apr 30, 2026

Uh oh!

Uh oh!

Uh oh!

brandonzstride Apr 28, 2026

Uh oh!

bathan1 Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bathan1 commented Apr 23, 2026

Features added

Overview

Benchmarks

What were the average runtimes from both solvers?

Rows with the MIN times from both solvers

Rows with the MAX times from both solvers

How much faster were the fast cases on average?

How much slower were the slow cases on average?

What was the max time difference blue3 beat z3 by?

What was the max time difference z3 beat blue3 by?

Uh oh!

brandonzstride left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants