It's a different type of software, meant to become a competitive alternative to Google Speech Recognition & Synthesis. It has a lot of room for improvement. One simple example is newlines: it currently reads them instead of skipping them, which can waste a lot of time if the text passed to it hasn't had its whitespace stripped.
It doesn't use hardware acceleration yet, and it was trained on an AMD RX 6600 prior to obtaining an RTX 5090. It can't be expected to keep up with Google's yet, but it will get better.
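
Until newlines are skipped by the software itself, one possible workaround for the whitespace issue is to collapse whitespace in the text before passing it in. Here is a minimal sketch in Python, assuming the caller controls the input string. The helper name `collapse_whitespace` is hypothetical and not part of the software.

```python
def collapse_whitespace(text: str) -> str:
    """Replace every run of whitespace (spaces, tabs, newlines) with one space.

    Hypothetical pre-processing helper; not part of the software itself.
    """
    # str.split() with no arguments splits on any run of whitespace and
    # discards leading/trailing whitespace, so joining with a single space
    # leaves exactly one space between words.
    return " ".join(text.split())


raw = "Hello,\n\n  world!\r\nThis is\ta test.\n"
print(collapse_whitespace(raw))  # prints: Hello, world! This is a test.
```

This also removes paragraph breaks, so it only fits cases where pauses between paragraphs don't matter.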