Add a server-side app for the audio captions feature and record proto-events for this data. As it is, only behaves as a pass-through module. The idea is to include all the business intelligence in this app.
Use the user's talking state to trigger a speech API recovery after long periods of silence.
Hardcoded pt-BR prototype for closed captions generated by the browser's WebSpeech API.