Hi, I have an AI backend ready to go, so I’m looking for an Android app that will be the implementation of the Android voice assistant and be able to convert my speech to text and send that text via HTTP request to a specific URL (my backend). Then read the response text, convert it to speech and say it.

I know that something like it is possible with the Tasker app, but it is paid and closed-source.

Thank you all in advance for your responses :)

  • MalReynolds@slrpnk.net
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    7 months ago

    I found Tasker speech to be pretty clunky, inaccurate and minimally supported. If it’s for personal use I have google recorder on a Pixel 7 with GrapheneOS and it’s STT is outstanding. It’ll download the offline engine and then you can kill it’s network and rip out google serices and it works fine (for now). Might be something to explore… If you find something good please reply, I’m thinking alon the same lines, but was just going to send the audio directly and do stt on the server. I know home assistant is planning voice command next year, might be something helpful there. You could also try using the Termux environment to spin up whatever linux engine you like.

  • 𝒍𝒆𝒎𝒂𝒏𝒏@lemmy.one
    link
    fedilink
    arrow-up
    1
    ·
    7 months ago

    If MIT AppInventor is still kicking around, you should be able to use it for this… although sadly you won’t have access to the source code since it’s a Scratch-like way to create apps.

    By default the Android voice assistant uses Google tech AFAIK, if you’re after a truly source-available solution then there’s ”Futo voice input" to handle STT, and “RHVoice” to handle TTS - though these would still need a HTTP API bridge to do what you want