yes that’s what i am working on these days but there is a need for a generally available neural chip (see google’s coral as one attempt). in my tests, esp32s3 is very very slow for any model with conv2d involved.
i just want a tiiiny gpu for $10 so i can run smaller models at higher speed than possible with xtensa/rp2040 having limited simd support etc.
Are you utilizing the SIMD and acceleration instructions in the S3? What kind of performance are you seeing?
Neural accelerators are coming into MCUs. The just released STM32N6 is probably among the best. Alif with the U55/U85 has been out for a little while. Maxim MAX78000 has a CNN accelerator out for a couple of years. More will come in the next few years - though not from Nvidia any time soon.
i just want a tiiiny gpu for $10 so i can run smaller models at higher speed than possible with xtensa/rp2040 having limited simd support etc.