
A fast C++ implementation of TensorFlow Lite Unet on a bare Raspberry Pi 4.
Once overclocked to 1850 MHz, the app runs at 7.2 FPS!
Special made for a bare Raspberry Pi 4 see Q-engineering deep learning examples
Papers: https://arxiv.org/abs/1606.00915 
Training set: VOC2017 
Size: 257x257 
Frame rate Unet Lite : 4.0 FPS (RPi 4 @ 1900 MHz - 32 bits OS) 
Frame rate Unet Lite : 7.2 FPS (RPi 4 @ 1875 MHz - 64 bits OS) 
To run the application, you have to:
- A raspberry Pi 4 with a 64-bit operating system. It can be the Raspberry 64-bit OS, or Ubuntu 18.04 / 20.04. Install 64-bit OS 
- TensorFlow Lite framework installed. Install TensorFlow Lite 
- OpenCV 64 bit installed. Install OpenCV 4.5 
- Code::Blocks installed. ($ sudo apt-get install codeblocks)
To extract and run the network in Code::Blocks 
$ mkdir MyDir 
$ cd MyDir 
$ wget https://github.com/Qengineering/TensorFlow_Lite_Segmentation_RPi_64-bit/archive/refs/heads/master.zip 
$ unzip -j master.zip 
Remove master.zip and README.md as they are no longer needed. 
$ rm master.zip 
$ rm README.md 
 
Your MyDir folder must now look like this: 
cat.jpg.mp4 
deeplabv3_257_mv_gpu.tflite 
TestUnet.cpb 
Unet.cpp
Run TestUnet.cpb withCode::Blocks. More info or
if you want to connect a camera to the app, follow the instructions at Hands-On.
I fact you can run this example on any aarch64 Linux system. 
See the movie at: https://www.youtube.com/watch?v=Kh9DLMgCIIE

