App Inventor Speech Recognizer1 mp3 or WAV input option

thomas1 · January 13, 2023, 3:44pm

Hi, All

When using the Speech Recognizer block is it possible to have a wave or mp3 file loaded and transcribed into text or text file.
I dont want to use the microphone on the android device.

Basically, I have a remote device that can send audio wave files to the mobile phone device and need it translated to text and either sent back to the mobile device or saved in a file on the device.

Thank You

Thomas

SteveJG · January 13, 2023, 4:07pm

Welcome Thomas.

I don't think so Thomas. I tried using the microphone (which you do not want to do) and I found it virtually impossible to get the SpeechRecognizer to recognize the song's lyrics when playing a song using another device. Probably an issue with the background music making it impossible for the SR to capture the spoken work (lyrics).

I do not think it is possible to capture lyrics from a song unless the background music is very soft. Capturing lyrics from the mp3 directly is probably impossible. Capturing your speech is only possible if you speak in a clear voice with little background interference.

SteveJG · January 13, 2023, 4:20pm

mp3 to text non App Inventor possible alternatives to transcribing text from mp3.

and

https://www.google.com/search?q=transcripe+mp3+lyrics+android&rlz=1C1CHBF_enUS887US887&ei=K4TBY9_kG66fptQPoNKeqAs&ved=0ahUKEwifpoCY-cT8AhWuj4kEHSCpB7UQ4dUDCBA&uact=5&oq=transcripe+mp3+lyrics+android&gs_lcp=Cgxnd3Mtd2l6LXNlcnAQAzIHCCEQoAEQCjIHCCEQoAEQCjIHCCEQoAEQCjIICCEQFhAeEB06CggAEEcQ1gQQsAM6BggAEBYQHjoFCAAQhgM6BQghEKsCSgQIQRgASgQIRhgAUNYVWM8nYK0raAFwAXgAgAGSAYgB6QaSAQMxLjeYAQCgAQHIAQjAAQE&sclient=gws-wiz-serp

thomas1 · January 13, 2023, 6:43pm

Hi, thanks for the reply.
Just to explain in more detail, Im basically trying to do a Speech to Text convertor for a project of mine using an ESP32 Microcontroller.

My ESP32 device has Dual MEM microphones and records speech when its spoken too.
It then sends the speech as a WAV or MP3 Audio file to my mobile device APP where I was hoping APP Inventor Speechrecognizer would convert the audio file to text and send it back to my ESP32 device.

I can use AWS Polly and other online API calls but the complexity and latencey going through certificates and other issues is just hair pulling lol.

Is APP Inventor looking at adding MP3 or WAV file module to its Speechrecognizer1 function.

Thanks

Thomas.
P.S (I've been an engineer for 20 years and APP Inventor is the best interface Ive ever had the pleasure to use. Hats off to MIT students.)

thomas1 · January 13, 2023, 6:44pm

Sorry
My typo, Speech Recognition to Text Convertor.

Thanks

Thomas

SteveJG · January 13, 2023, 6:50pm

Not using your microcontroller but might be something you find useful

Juan_Antonio · January 13, 2023, 7:58pm

Offtopic.

I know this topic is about converting speech to text, but I would like to share this link, we upload a song and get a file with the music (Instrumental) and another with the sound of the lyrics (Vocal). Try.