Hmm, push, just editing does not do anything to make this visible.
V5 is somewhat of a pre-release, I originally planned to release it following the release of BT817/BT818.
But things went very well and my optimisations had more effect than I anticipated.
I only removed everything I had implemented for BT817/BT818.
This is what it looks like to build the same small display list on a Cortex-M0+ with 48MHz:
V4
TFT_display() with DMA: 129µs
prog-size: 15436
TFT_display() without DMA: 360µs
prog-size: 15276
V5
TFT_display() with DMA: 51µs
prog-size: 12524
TFT_display() without DMA: 324µs
prog-size: 13420
And this is my example code running on an Arduino UNO clone with Mega328 and 16MHz.
V4:
TFT_display(): 664µs
prog-size: 12272
V5:
TFT_display(): 520µs
prog-size: 10322