Speech Recognition Polyfill Autor: apersongithub
Allows speech recognition in websites such as Google Translate, Duolingo, etc. using OpenAI's Whisper API locally.
EksperymentalneEksperymentalne
Metadane rozszerzenia
Zrzuty ekranu
O tym rozszerzeniu
On first install this extension will open the options page, the default model language is English but this is changeable. This extension allows per-site customization and a multitude of different models to decipher language. Keep in mind that this is not a complete solution and the API doesn't have full support. Speech Detection is not as instantaneous like in Google Chrome's API which is sent to a server but it is manageable. The extension icon color/indicator changes depending on the process so pin it to your menu to verify the extension is working as intended. A red mic/error icon does not necessarily mean your mic isn't working but rather the speech may have been cancelled by user input or that it is unintelligible (usually its the latter).
Make sure you are using the correct mic and speak loud, slow, and clear otherwise your voice may not be detected or unintelligble. Change the default model to the larger ones if you experience problems with voice recognition (this WILL impair performance). The extension does not support continuous recording.
If you're using Duolingo or similar and are trying to do the speaking practice of the language that you are learning, set the language in the extension to the one you are learning (navigate to the site -> click extension icon -> set language then click "save for site"). This will help with speech detection since the model now knows what language you are trying to speak. (This isn't necessary for every site, one example is google translate which has the auto-language implementation) Look at the images for more help.
The extension will take ~1GB of ram on normal use. Remember it is using a local AI Model so, I've implemented decent memory management to compensate.
~~~~~~~~~~~~~~~~~~~~
❗ General Recommendations:
• 8GB of RAM is a minimum requirement since it could easily take up to 3GB when utilizing larger models.
• A modern CPU is recommended.
• An internet connection. Even though the model runs locally, the extension re-downloads it either when idle or after closing the tab/opening a new one that utilizes the extension (for memory preservation purposes). This is ultimately better than packaging the large models within the extension for the time being and for most models, the download speed will be near instant for the general population.
Make sure you are using the correct mic and speak loud, slow, and clear otherwise your voice may not be detected or unintelligble. Change the default model to the larger ones if you experience problems with voice recognition (this WILL impair performance). The extension does not support continuous recording.
If you're using Duolingo or similar and are trying to do the speaking practice of the language that you are learning, set the language in the extension to the one you are learning (navigate to the site -> click extension icon -> set language then click "save for site"). This will help with speech detection since the model now knows what language you are trying to speak. (This isn't necessary for every site, one example is google translate which has the auto-language implementation) Look at the images for more help.
The extension will take ~1GB of ram on normal use. Remember it is using a local AI Model so, I've implemented decent memory management to compensate.
~~~~~~~~~~~~~~~~~~~~
❗ General Recommendations:
• 8GB of RAM is a minimum requirement since it could easily take up to 3GB when utilizing larger models.
• A modern CPU is recommended.
• An internet connection. Even though the model runs locally, the extension re-downloads it either when idle or after closing the tab/opening a new one that utilizes the extension (for memory preservation purposes). This is ultimately better than packaging the large models within the extension for the time being and for most models, the download speed will be near instant for the general population.
Ocenione na 0 przez 0 recenzentów
Uprawnienia i dane
Wymagane uprawnienia:
- Mieć dostęp do kart przeglądarki
- Mieć dostęp do danych użytkownika na wszystkich stronach
Zbieranie danych:
- Autorzy tego rozszerzenia twierdzą, że nie wymaga ono zbierania danych.
Więcej informacji
- Wersja
- 0.2.3
- Rozmiar
- 44,94 KB
- Ostatnia aktualizacja
- 4 dni temu (13 sty 2026)
- Powiązane kategorie
- Licencja
- Mozilla Public License 2.0
- Historia wersji
- Etykiety
- Dodaj do kolekcji
Autor tego rozszerzenia prosi o pomoc we wspieraniu jego rozwoju przez drobny datek.