Descripteur de contenu IA
Par Carter Temm , various contributors and translators
This add-on makes it possible to describe the focus object, navigator object, or screen using popular vision capable AI language models, like Claude, Gemini, or GPT4. It also lets one understand where their face is positioned in the frame of a connected camera. Though content descriptions are quite detailed, they may not always be completely accurate or reflect real world information. Press NVDA+shift+i to pop up a menu asking how you wish to describe based on the current position, or NVDA+shift+u to describe the navigator object, or NVDA+shift+y for an image that has been copied to the clipboard such as in windows explorer, or NVDA+shift+c to ask additional questions about a description. Other keystrokes are customizable from the input gestures dialog. By default, usage of GPT4 is free, thanks to the generocity of the team at PollinationsAI. If you would like to use other models from OpenAI, head to https://platform.openai.com/account/api-keys and create an account, then create a key for interacting with the API. Then, choose the "AI content describer" category from NVDA's settings dialog -> manage models and enter your API key. The process is similar for other model providers, see add-on documentation for more information on this.
Téléchargements disponibles
Autres détails
- Compatibilité NVDA : de 2023.1 à 2024.1
- Code source
- Licence : GPL v2
Partager cette page sur :