GAVPI (Speech Recognition Video Game and Application Voice Interface) is a powerful, open-source tool that translates voice commands into keyboard and mouse actions. While originally designed for gamers, it serves as an excellent, lightweight solution for full, hands-free Windows automation.
Here is how to set up GAVPI to control your PC entirely with your voice. Prerequisites and Installation
Before configuring GAVPI, you must ensure your operating system has the necessary speech infrastructure installed.
Enable Windows Speech: GAVPI relies on the native Microsoft Speech API. Open your Windows Settings, navigate to Time & Language > Speech, and ensure your microphone is calibrated.
Download GAVPI: Download the latest release from the official GitHub repository or trusted hosting source.
Extract and Run: GAVPI is portable. Extract the downloaded ZIP folder to a permanent location (e.g., C:\GAVPI) and run the GAVPI.exe file as an Administrator to ensure it can inject keystrokes into elevated applications. Creating Your First Voice Profile
GAVPI organizes your voice commands into files called profiles.
Launch GAVPI and click on File > New to create a fresh profile.
Click File > Save As and name your profile (e.g., Windows_Automation.xml).
Locate the Trigger Phrase box. This is the global “wake word” or button activation toggle, though you can leave it blank for an “always listening” setup. Programming Voice Commands
To automate a task, you must link a spoken phrase to a specific PC action.
Click the Plus (+) icon or New Command button in the main interface.
In the Phrase field, type exactly what you want to say (e.g., “Open Notepad” or “Scroll Down”).
Navigate to the Actions panel on the right to assign the behavior:
For Keyboard Shortcuts: Select Keys, type the key combination (like Ctrl+C), and choose Tap, Press, or Release.
For Mouse Control: Select Mouse, then input specific X and Y screen coordinates, or choose a click action (e.g., Left Click).
For Launching Apps: Select Execute, and paste the file path of the application (like C:\Windows\System32\notepad.exe).
Click Insert to add the action to the command sequence, then click Save. Managing Profiles and Testing
Once your commands are programmed, you need to activate the engine.
Click the Start button (microphone icon) on the GAVPI dashboard to begin listening.
Speak your command clearly into your microphone. The GAVPI log window will display the recognized text and confirm if the action triggered successfully.
Click Stop whenever you need to edit commands or pause voice tracking. Advanced Optimization Tips
To make your hands-free setup reliable, implement these advanced configurations:
Adjust Confidence Thresholds: In the settings menu, adjust the threshold slider. A higher threshold (e.g., 0.80) prevents random background noise from triggering commands accidentally, while a lower threshold makes it easier to trigger commands if you have a faint accent.
Chain Multiple Actions: You can stack multiple actions into a single voice command. For example, saying “Check Email” can be programmed to launch your browser, wait two seconds (using a Delay action), and then type your password.
Use Dictation Mode: For typing long texts hands-free, toggle the profile into Dictation mode, which passes raw speech straight to your active text field rather than looking for specific macro triggers. To help you customize this guide, tell me:
What specific applications or games are you trying to automate?
Do you need help mapping complex mouse coordinates or multi-key macros? Are you experiencing any microphone recognition errors?
I can provide the exact command strings and action sequences for your specific workflow.
Leave a Reply