Hi Gerald and co,
We last conversed about a year ago (following your talk at ESC '15). It's good to see the MOVI project doing well. Which means it's a good time to revisit my earlier question...
If I understand correctly, the main thing keeping MOVI-style speech recognition and synthesis out of reach of most hobbyists is some combination of
(1) a packaging issue--the software is [mostly?] free but many packages must be integrated properly, and
(2) an IP issue--the right models, dictionaries, etc. come with a price tag.
What I would really like to see is MOVI (or its equivalent) as a software-only distribution that folks could run on an existing computer for integration into a home-automation system, VOIP, and so on. I understand that MOVI on RPi, for example, could be integrated into my existing network, but I already have a fast ARM machine (NVidia Jetson board) for mailserver/webserver/NAS/wifi-AP/openhab/freeswitch/etc duties. The speech UI belongs there too; besides, it means one less Linux machine to administer/update/worry about, always a good thing.
All of this just to make clear that I'm not trying to weasel out of paying $$ for a working system.
What do you see standing in the way of a SW-only solution for Linux users? Does it boil down to IP licensing? Can someone figure out a way to deliver the open-source stuff separately, then sell the magic binary blob for a reasonable price?
Even reasonable-quality speech synthesis (let alone speech recognition) is still effectively out of reach for 99+% of Linux users on the desktop, in the year 2016. This seems incredible to me, and I wonder what we could do to improve the situation. Obviously, MOVI is a great start, but it is not the whole answer either.
In the short term, is it feasible to "teleport" the executable environment from the MOVI onto another armhf architecture machine, and run it there? Does my purchase of MOVI give me a license to transfer the IP onto another machine?
Login below to reply: