for estimating the inference time instead of running it once measure variance across multiple runs to have better stats
for estimating the inference time instead of running it once measure variance across multiple runs to have better stats