Speech Recognition Polyfill by apersongithub
Enables speech recognition on websites such as Google Translate, Duolingo, etc. using OpenAI's Whisper API locally.
Experimental
Extension metadata
Screenshots
About this extension
On first install, this extension opens the options page. The default model language is English, but this can be changed. The extension supports per-site customization and offers a range of models for recognizing speech. Keep in mind that this is not a complete solution and the API is not fully supported. Speech detection is not as instantaneous as Google Chrome's API, which sends audio to a server, but it is manageable. The extension icon's color/indicator changes depending on the current process, so pin it to your menu to verify that the extension is working as intended. A red mic/error icon does not necessarily mean your mic isn't working; rather, the speech may have been cancelled by user input or may have been unintelligible (usually it's the latter).
Make sure you are using the correct mic and speak loudly, slowly, and clearly; otherwise your voice may not be detected or may be unintelligible. Switch the default model to one of the larger ones if you experience problems with voice recognition (this WILL impair performance). The extension does not support continuous recording.
If you're using Duolingo or a similar site and want to do speaking practice in the language you are learning, set the language in the extension to that language (navigate to the site -> click the extension icon -> set the language, then click "save for site"). This helps with speech detection, since the model then knows which language you are trying to speak. (This isn't necessary for every site; one example is Google Translate, which has its own auto-language implementation.) Look at the images for more help.
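As a rough illustration of the "save for site" idea above, here is a minimal sketch of how a per-site language override with a global default could be resolved. The `LanguageSettings` shape and the `resolveLanguage` and `saveForSite` names are hypothetical and chosen for this example only; the extension's actual storage format and code are not described in this listing.

```typescript
// Hypothetical settings shape, assumed for illustration only.
interface LanguageSettings {
  defaultLanguage: string;        // global default, e.g. "en"
  perSite: Map<string, string>;   // hostname -> language code
}

// Return the language saved for this hostname, or the default if none is saved.
function resolveLanguage(settings: LanguageSettings, hostname: string): string {
  const override = settings.perSite.get(hostname);
  return override !== undefined ? override : settings.defaultLanguage;
}

// Return a new settings object with the language saved for one site.
// The input settings object is left untouched.
function saveForSite(
  settings: LanguageSettings,
  hostname: string,
  language: string
): LanguageSettings {
  const perSite = new Map(settings.perSite);
  perSite.set(hostname, language);
  return { defaultLanguage: settings.defaultLanguage, perSite };
}
```

With settings like these, a site that has no saved entry simply falls back to the default language, which matches the behavior described for sites where no override is needed.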
The extension uses ~1GB of RAM in normal use. Remember that it runs a local AI model, so I've implemented decent memory management to compensate.
~~~~~~~~~~~~~~~~~~~~
❗ General Recommendations:
• 8GB of RAM is the minimum requirement, since the extension can easily use up to 3GB with larger models.
• A modern CPU is recommended.
• An internet connection. Even though the model runs locally, the extension re-downloads it either when idle or after you close the tab or open a new one that uses the extension (to preserve memory). For the time being, this is better than packaging the large models within the extension, and for most models the download will be near-instant for most users.
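The memory trade-off described above (drop the model to save RAM, fetch it again when needed) can be sketched roughly as follows. This is only an illustration under stated assumptions: `IdleModelCache`, `LoadedModel`, the idle threshold, and the synchronous loader are all hypothetical, and a real extension would download the model asynchronously. It is not the extension's actual memory management.

```typescript
// Hypothetical stand-in for a loaded speech model.
interface LoadedModel {
  name: string;
}

// Keeps a model in memory only while it is being used; after `idleMs`
// without use, the reference is dropped and the next call loads it again.
class IdleModelCache {
  private model: LoadedModel | undefined = undefined;
  private lastUsed = 0;
  private readonly load: () => LoadedModel;
  private readonly idleMs: number;
  private readonly now: () => number;

  constructor(
    load: () => LoadedModel,
    idleMs: number,
    now: () => number = () => Date.now()
  ) {
    this.load = load;
    this.idleMs = idleMs;
    this.now = now;
  }

  // Drop the model if it has not been used for at least `idleMs`.
  evictIfIdle(): void {
    if (this.model !== undefined && this.now() - this.lastUsed >= this.idleMs) {
      this.model = undefined;
    }
  }

  // Return the model, reloading it first if it was evicted or never loaded.
  get(): LoadedModel {
    this.evictIfIdle();
    let model = this.model;
    if (model === undefined) {
      model = this.load(); // stands in for the re-download
      this.model = model;
    }
    this.lastUsed = this.now();
    return model;
  }

  isLoaded(): boolean {
    return this.model !== undefined;
  }
}
```

The clock is passed in as a parameter so the idle behavior can be checked without waiting in real time; with the default it simply uses `Date.now()`.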
Rated 0 by 0 reviewers
Permissions and data
Required permissions:
- Access browser tabs
- Access your data for all websites
Data collection:
- The developer says this extension doesn't require data collection.
More information
- Version
- 0.2.3
- Size
- 44.94 KB
- Last updated
- 4 days ago (Jan 13, 2026)
- License
- Mozilla Public License 2.0
The developer of this extension asks that you help support its continued development by making a small contribution.