Support calculated attribute values #82

mikee47 · 2026-01-15T11:53:46Z

This PR is a suggestion for how #75 might be approached. The idea is to interpret a @ prefix in an attribute name as a calculated value. The @ prefix was chosen arbitrarily.

An example is included in the test application:

"@default": "os.environ.get('SIMPLE_STRING', 'donkey')"

During loading, the attribute value is evaluated in python and the result stored in default.

Note: "@default" and "default" (for example) are considered the same attribute and using both in the same property list will result in an error.

To test this, build and run as follows:

make clean
SIMPLE_STRING="donkey2" make -j run

The test application now fails as the value has changed - "donkey" is expected.

integer expressions

JSON does not support extended number formats, such as 0x12, so this mechanism enables calculated values for numeric properties by adding the @ prefix and using a string value:

"@default": "8 + 27",
"minimum": 0,
"@maximum": "0xffff"

Applies to default, minimum, maximum properties

mikee47 · 2026-01-16T09:27:47Z

A primary use case for calculated defaults is using different values for debug/release builds. It would therefore be appropriate to move the generated code from out/ConfigDB into the appropriate build sub-folder, e.g. out/Esp8266/debug/ConfigDB.

However, even with this the build system cannot track such dependencies without rebuilding it every time so ultimately it's going to be up to the developer to determine if the schema needs rebuilding.

mikee47 · 2026-01-16T09:38:49Z

I'm not particularly keen on using eval() for expressions, it was just the easiest way to get something working.

Security: eval() allows execution of arbitrary python code which may be undesirable. Having said that, it's probably no different to executing shell script or python code from within makefiles so it's down to a matter of trust.

A simplified syntax would probably be better, so instead of os.environ['SIMPLE_INT'] we could write $SIMPLE_INT. The developer then provides the appropriate information in environment variables.

pljakobs · 2026-01-16T09:40:12Z

let me see if I understand this correctly:

@default will have a calculated value and the code to do so is basically the python environment available to dbgen.py

so for my initial issue, could I use

        "logEnabled": {
          "type":"boolean",
          "@default":"\"true\" if os.environ.get(\"SMING_RELEASE\") !=\"1\" else \"false\"",
          "ctype":"telemetryLog",
      },

pljakobs · 2026-01-16T09:48:55Z

regarding the security vs. flexibility thoughts: there's a ton of arbitrary code already being run anyway, the code will run with the user's privileges, so I don't necessarily see hightened security concerns unless a developer pushes malicious code to the repo and a user blindly builds this.
if we wanted, I guess we could have a white list of os functions that can be executed.
I think for an initial release, os.environment.get is plenty powerful as the makefile is the natural place for a dev to set environment conditions.
but that would then mean that you'd have to parse the python code and fail for any function call that is not whitelisted.
To me, that feels like too much hassle for the threat level I can see.

mikee47 · 2026-01-16T10:01:56Z

let me see if I understand this correctly:

@default will have a calculated value and the code to do so is basically the python environment available to dbgen.py

so for my initial issue, could I use
        "logEnabled": {
          "type":"boolean",
          "@default":"\"true\" if os.environ.get(\"SMING_RELEASE\") !=\"1\" else \"false\"",
          "ctype":"telemetryLog",
      },

Yep, or simplified:

      "@default": "os.environ['SMING_RELEASE'] != '1'"

But I'm thinking this kind of syntax shouldn't be too hard to implement and would be more intuitive and meet most use cases:

      "@default": "$SMING_RELEASE != '1'"

dbgen just needs to perform text substitution of $SMING_RELEASE into os.environ['SMING_RELEASE'] before running it through eval().

pljakobs · 2026-01-16T10:01:57Z

thinking again about the use cases, it really comes down to "doing something with one or more environment variables" doesn't it, and "something" could be as simple as conditionals and basic arithmetic. Maybe having a very simple language that just exposes
IF / ELSE, +-*/ maybe power(base, exponent) or even power2 and log2 (might be helpful if one wants to default values based on bit width for a platform) and access to the dictionary of environment variables might be enough

pljakobs · 2026-01-16T10:10:50Z

for readability, I would prefer

"@default":"'FALSE' IF $SMING_RELEASE=1 ELSE 'TRUE'"

passing the true/false return implicitly is efficient but a bit unintuitive (for me) to read

mikee47 · 2026-01-16T10:29:53Z

So I've done a very simple regex substitution so we can write stuff like "@default": "'false' if '$SMING_RELEASE' == '1' else 'true'". Thus $SMING_RELEASE is substituted directly for the environment value so requires quotes around it.

mikee47 · 2026-01-16T10:31:34Z

Actually, expression must be "@default": "False if '$SMING_RELEASE' == '1' else True" to generate valid JSON. That is, python True or False which then gets emitted to JSON true or false.

pljakobs · 2026-01-16T11:19:24Z

I'm slightly ashamed to admit I used Gemini for this, but this seems to be rather userful:

#!/usr/bin/env python3
import ast
import operator as op
import math
import os
import sys
import re

class ConfigDBEvaluator:
    def __init__(self):
        self.functions = {
            'pow2': lambda x: 2**x,
            'log2': lambda x: math.log2(x),
            'align_up': lambda v, a: (v + a - 1) & ~(a - 1),
            'max': max,
            'min': min
        }
        
        self.operators = {
            ast.Add: op.add, ast.Sub: op.sub, ast.Mult: op.mul, ast.Div: op.truediv,
            ast.Eq: op.eq, ast.NotEq: op.ne, ast.Gt: op.gt, ast.Lt: op.lt,
            ast.GtE: op.ge, ast.LtE: op.le, ast.LShift: op.lshift, ast.RShift: op.rshift,
            ast.BitAnd: op.and_, ast.BitOr: op.or_, ast.Invert: op.invert,
            ast.And: lambda a, b: a and b, ast.Or: lambda a, b: a or b,
            ast.Not: lambda x: not self._is_truthy(x),
            ast.USub: op.neg,
            ast.In: lambda a, b: a in b,
            ast.NotIn: lambda a, b: a not in b
        }

    def _is_truthy(self, val):
        """Factually converts common C/Env strings to booleans."""
        if isinstance(val, bool): return val
        if isinstance(val, (int, float)): return val != 0
        if isinstance(val, str):
            if val.lower() in ('false', '0', 'no', 'off', ''): return False
            return True
        return bool(val)

    def _eval(self, node):
        if isinstance(node, ast.Constant):
            return node.value
        
        elif isinstance(node, ast.Name):
            if node.id == 'True': return True
            if node.id == 'False': return False
            val = os.environ.get(node.id)
            if val is None:
                raise NameError(f"Environment variable '{node.id}' is not defined.")
            try:
                if val.lower().startswith('0x'): return int(val, 16)
                return float(val) if '.' in val else int(val)
            except ValueError:
                return val

        elif isinstance(node, ast.BinOp):
            return self.operators[type(node.op)](self._eval(node.left), self._eval(node.right))

        elif isinstance(node, ast.UnaryOp):
            return self.operators[type(node.op)](self._eval(node.operand))

        elif isinstance(node, ast.BoolOp):
            values = [self._eval(v) for v in node.values]
            result = values[0]
            for next_val in values[1:]:
                result = self.operators[type(node.op)](result, next_val)
            return result

        elif isinstance(node, ast.Compare):
            left_val = self._eval(node.left)
            right_val = self._eval(node.comparators[0])
            op_type = type(node.ops[0])
            if op_type not in self.operators:
                raise TypeError(f"Unsupported comparison operator: {op_type.__name__}")
            return self.operators[op_type](left_val, right_val)

        elif isinstance(node, ast.IfExp):
            # Use truthy helper for the condition
            return self._eval(node.body) if self._is_truthy(self._eval(node.test)) else self._eval(node.orelse)

        elif isinstance(node, ast.Call):
            if not isinstance(node.func, ast.Name):
                raise TypeError(f"Unsupported function call type: {type(node.func).__name__}")
            
            func_name = node.func.id
            if func_name in self.functions:
                func = self.functions[func_name]
                args = [self._eval(arg) for arg in node.args]
                
                # Factual check for argument counts
                import inspect
                sig = inspect.signature(func)
                try:
                    sig.bind(*args)
                except TypeError:
                    raise ValueError(f"Function '{func_name}' expects {len(sig.parameters)} arguments (got {len(args)})")
                
                return func(*args)
            raise ValueError(f"Forbidden function call: {func_name}")

        raise TypeError(f"Unsupported syntax: {type(node).__name__}")

    def run(self, expr):
        if not expr or not expr.strip(): return "Error: Empty expression"
        processed = expr.replace("&&", " and ").replace("||", " or ")
        processed = re.sub(r'!(?!=)', ' not ', processed)
        try:
            tree = ast.parse(processed.strip(), mode='eval')
            return self._eval(tree.body)
        except Exception as e:
            return f"Error: {e}"

if __name__ == "__main__":
    if len(sys.argv) < 2: sys.exit("Usage: ./ConfigDBDSL.py 'expression'")
    print(ConfigDBEvaluator().run(sys.argv[1]))

basically this implements a pretty flexible DSL interpreter that can evaluate conditions like

"True if CPU_COUNT == 4 && ARCH == 'xtensa' else False"

or

"'esp' in SMING_SOC"

it also implements basic arithmetic, bit shifts, hex numbers, bit operations etc.
I have a test script that checks a number of expressions:

#!/usr/bin/env python3
import os
from ConfigDBDSL import ConfigDBEvaluator

def run_full_suite():
    evaluator = ConfigDBEvaluator()
    
    # Environment Setup
    os.environ.update({
        "CPU_COUNT": "4",
        "ARCH": "xtensa",
        "CHIP": "esp32s3",
        "RAM_SIZE": "0x80000", # 512KB
        "DEBUG_LEVEL": "2",
        "IS_RELEASE": "False",
        "FEATURES": "wifi,ble,mesh"
    })

    # Format: (Expression, Expected, Category)
    test_cases = [
        # --- Arithmetic & Bitwise ---
        ("CPU_COUNT * 2", 8, "Basic Arithmetic"),
        ("RAM_SIZE / 1024", 512.0, "Hex Handling"),
        ("1 << CPU_COUNT", 16, "Bit Shift"),
        ("~0 & 0xFF", 255, "Bitwise Invert & AND"),
        ("align_up(15, 8)", 16, "Custom Function"),

        # --- C-Style Logical Aliases ---
        ("True if CPU_COUNT == 4 && ARCH == 'xtensa' else False", True, "AND alias (&&)"),
        ("True if ARCH == 'arm' || ARCH == 'xtensa' else False", True, "OR alias (||)"),
        ("True if !(ARCH == 'arm') else False", True, "NOT alias (!)"),
        ("True if ARCH != 'arm' else False", True, "NOT EQUAL (!=)"),

        # --- Substring & Membership ---
        ("'esp' in CHIP", True, "Substring match (in)"),
        ("'mesh' in FEATURES", True, "Env string search"),
        ("True if '32s' in CHIP && 'wifi' in FEATURES else False", True, "Nested Logic+Member"),
        ("'riscv' not in ARCH", True, "Negative Member"),

        # --- Nested Ternary & Math ---
        ("pow2(CPU_COUNT) if CPU_COUNT > 2 else 0", 16, "Function in Ternary"),
        ("8 if CHIP == 'esp32' else (16 if CHIP == 'esp32s3' else 32)", 16, "Nested Else-If"),
        ("(1 if !IS_RELEASE else 0)", 1, "Bool Env + NOT (!)"),

        # --- Error Handling ---
        ("UNDEFINED_VAR + 1", "Error: Environment variable 'UNDEFINED_VAR' is not defined.", "Missing Var"),
        ("pow2(2, 4, 6)", "Error: Function 'pow2' expects 1 arguments (got 3)", "Param Mismatch"),

        # --- Code Injection (Security Audit) ---
        ("__import__('os').system('ls')", "Error: Unsupported function call type: Attribute", "Injection: RCE"),
        ("os.popen('whoami')", "Error: Unsupported function call type: Attribute", "Injection: Attribute"),
        ("open('/etc/passwd')", "Error: Forbidden function call: open", "Injection: File Access"),
        ("().__class__.__mro__", "Error: Unsupported syntax: Attribute", "Injection: Jailbreak"),
        ("1; import os", "Error: invalid syntax", "Injection: Multi-stmt"),
    ]

    print(f"{'Category':<20} | {'Expression':<55} | {'Result'}")
    print("-" * 110)

    passed = 0
    for expr, expected, cat in test_cases:
        actual = evaluator.run(expr)
        
        # Logic: If 'expected' is an error string, check if it's contained in the 'actual' result
        if isinstance(expected, str) and expected.startswith("Error"):
            success = expected in str(actual)
        else:
            success = (actual == expected)
        
        if success:
            passed += 1
            status = "\033[92mPASS\033[0m"
        else:
            status = f"\033[91mFAIL\033[0m (Got: {actual})"
        
        print(f"{cat:<20} | {expr:<55} | {status}")

    print("-" * 110)
    print(f"Final Score: {passed}/{len(test_cases)}")

if __name__ == "__main__":
    run_full_suite()

This should eliminate the code injection risk and still be extremely powerful.

mikee47 · 2026-01-16T11:53:10Z

Regardless of how it was done, that is a nice solution. I can see immediately there are other tools in Sming which would benefit from this (IFS fsbuild, Storage hwconfig, Graphics resource compiler). I propose adding this to Sming so it's generally available. There is the Tools/Python directory which is intended for this. Do you fancy doing a PR for this? Could just call it Evaluator.

pljakobs · 2026-01-16T11:56:28Z

certainly, I wasn't sure what you think of using AI code so I didn't just want to sneak it in.
I have found it to be a useful tool to rapidly generate prototypes, especially in Python and JavaScript

mikee47 · 2026-01-16T12:00:48Z

certainly, I wasn't sure what you think of using AI code so I didn't just want to sneak it in. I have found it to be a useful tool to rapidly generate prototypes, especially in Python and JavaScript

It's fine, though a bit weird it's been written as a class. Nothing a bit of refactoring and tidying wouldn't sort!

mikee47 · 2026-01-16T12:16:31Z

...although now I think about it, a class can be more easily extended for specialised use, such as resolving variables from other places.

mikee47 · 2026-01-16T12:29:36Z

Maybe let's check for existing python packages which might do most of the work for us. For example https://pypi.org/project/expression-parser/

pljakobs · 2026-01-16T12:49:04Z

why not re-invent the wheel if we can ;-)

but more seriously, would we stand to gain much from this? at a quick look, we'd probably need a bit of glue code (reading in the environment, setting the functions) to implement expression-parser - which would probably be quite similar to the AST parsing Evaluator does - and given that the current code is rather short already, it might be a toss in the end.
But expression-parser could certainly be advantageous if we need different environment or function sets for different use cases.

Also, if I understand expression-parser correctly, it has to be called with a variable dictionary, which we'd need to populate ahead of time - Evaluator fetches variables as it walks the tree.

pljakobs · 2026-01-16T13:03:06Z

I have created PR #2986 but will be happy any way this goes

pljakobs · 2026-01-16T13:13:44Z

btw, thinking about other uses:
having this available in .hw files would be nice as it would remove the need of having platform specific files (for the price of having one that might become rather cluttered) and also it could allow automatic stacking of partitions, making it easier to calculate a layout without having to use the graphical editor.

mikee47 added 2 commits January 15, 2026 11:44

Support hex/calculated expressions for integers in schema

00c668e

Applies to default, minimum, maximum properties

Support "@default" calculated strings

cc8644c

mikee47 marked this pull request as draft January 15, 2026 11:57

mikee47 added 2 commits January 15, 2026 12:06

Revert 00c668

8be59fd

Standardise on "@" prefix

dea7b0f

mikee47 marked this pull request as ready for review January 15, 2026 12:31

Accept any attribute starting with '@' as calculated

9613e5a

mikee47 changed the title ~~Feature/calculated values~~ Support calculated attribute values Jan 15, 2026

Interpret $VAR as environment variable

d9f9d8f

Only evaluate strings, leave numeric values unchanged

d023dc3

Save copies of actual schema for reference purposes

b4fc984

Support calculated attribute values #82

Are you sure you want to change the base?

Support calculated attribute values #82

Conversation

mikee47 commented Jan 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikee47 commented Jan 16, 2026

Uh oh!

mikee47 commented Jan 16, 2026

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

mikee47 commented Jan 16, 2026

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

mikee47 commented Jan 16, 2026

Uh oh!

mikee47 commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

mikee47 commented Jan 16, 2026

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

mikee47 commented Jan 16, 2026

Uh oh!

mikee47 commented Jan 16, 2026

Uh oh!

mikee47 commented Jan 16, 2026

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

pljakobs commented Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mikee47 commented Jan 15, 2026 •

edited

Loading

mikee47 commented Jan 16, 2026 •

edited

Loading