I’m wondering whether the engine is intended to be used on bare-metal targets such as the STM32H7. These are fairly resource-constrained chips, e.g. 1 MB of RAM and 2 MB of flash. Typically we do not free heap memory after allocation, to avoid fragmentation. Is it possible to use the engine in this type of environment?
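
For context, the allocation pattern I mean looks roughly like the sketch below. It is only an illustration of an allocate-only arena, not code from the engine or from our firmware; `arena_alloc`, `arena_buf`, and `ARENA_SIZE` are hypothetical names, and the 64 KB size is an arbitrary placeholder.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical allocate-only arena: memory is handed out once and never freed,
 * so the heap cannot fragment. The size is illustrative, not a measured need. */
#define ARENA_SIZE (64u * 1024u)

static uint8_t arena_buf[ARENA_SIZE];
static size_t arena_used = 0;

/* Returns `size` bytes aligned to `align` (must be a power of two),
 * or NULL if the arena does not have enough room left. */
static void *arena_alloc(size_t size, size_t align)
{
    assert(align != 0 && (align & (align - 1)) == 0);

    /* Align the actual address, not just the offset into the buffer. */
    uintptr_t base = (uintptr_t)arena_buf;
    uintptr_t aligned = (base + arena_used + (align - 1)) & ~((uintptr_t)align - 1);
    size_t start = (size_t)(aligned - base);

    if (start > ARENA_SIZE || size > ARENA_SIZE - start) {
        return NULL; /* out of arena space; nothing is ever reclaimed */
    }

    arena_used = start + size;
    return &arena_buf[start];
}
```

There is deliberately no matching free function, so anything that assumes it can release and re-acquire memory at runtime would not fit this model without changes.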