TalkType

Voice dictation for Linux Wayland - Press F8 to talk, release to type. Powered by OpenAI's Whisper AI for accurate speech recognition.

Screenshots

System tray menu • Recording indicator with timer

General settings with model selection • Advanced settings with GPU acceleration

Audio settings with microphone test • Custom voice commands

Built-in help with getting started guide • Complete voice commands reference

Features

Push-to-Talk or Toggle Mode - F8 (hold) or F9 (toggle) - fully customizable hotkeys
AI-Powered Transcription - Uses OpenAI's Whisper models (tiny to large-v3)
GPU Acceleration - Optional NVIDIA CUDA support for 3-5x faster transcription
Smart Text Processing - Auto-punctuation, smart quotes, auto-spacing
Voice Commands - Say "comma", "period", "new paragraph", "undo last word", and more
Custom Commands - Define your own phrase shortcuts (e.g., "my email" → [email protected])
Visual Feedback - On-screen recording indicator with timer
GNOME Integration - Native shell extension for GNOME desktop
Auto-Updates - Built-in update checker with one-click downloads
Wayland Native - Works seamlessly on modern Linux desktops

Installation

Arch Linux (AUR)

yay -S talktype-appimage
# or
paru -S talktype-appimage

AppImage (All Distros)

Download the latest AppImage from Releases:

chmod +x TalkType-v*.AppImage
./TalkType-v*.AppImage

The AppImage includes everything needed - just download and run!

Note: AppImages require FUSE 2 (libfuse.so.2). Install if needed:

Fedora/RHEL: sudo dnf install fuse

Ubuntu/Debian: sudo apt install libfuse2

Arch/Manjaro: sudo pacman -S fuse2

openSUSE: sudo zypper install libfuse2

System Requirements

Requirement	Details
OS	Linux with Wayland
Dependencies	ydotool, wl-clipboard (installed automatically on first run)
Audio	Working microphone
GPU (optional)	NVIDIA GPU for CUDA acceleration

Quick Start

Launch TalkType - Run the AppImage or use your app launcher
First-run setup - TalkType will guide you through initial configuration
Start dictating - Press F8 (hold to record) or F9 (toggle mode)
Speak naturally - Text appears where your cursor is
Use voice commands - Say "comma", "period", "new line", etc.

Hotkey Modes

Mode	Hotkey	How it works
Push-to-Talk	F8	Hold to record, release to transcribe
Toggle	F9	Press to start, press again to stop

Voice Commands

Punctuation

Say This	Result
"comma"	,
"period" / "full stop"	.
"question mark"	?
"exclamation point"	!
"colon"	:
"semicolon"	;
"open quote" / "close quote"	" " (smart quotes)
"dot dot dot" / "ellipsis"	...

Formatting

Say This	Result
"new line"	Line break
"new paragraph"	Double line break
"tab"	Tab character

Editing

Say This	Result
"undo last word"	Deletes last word
"undo last sentence"	Deletes to previous sentence
"undo everything"	Clears all dictated text

Literal Words

Say "literal" before any command to output the word instead:

"literal comma" → types "comma" (not ,)
"literal period" → types "period" (not .)

AI Models

Choose the right model for your needs in Preferences → General:

Model	Size	Speed	Accuracy	Best For
tiny	39 MB	Fastest	Basic	Quick notes
base	74 MB	Fast	Good	Casual use
small	244 MB	Balanced	Very Good	Recommended
medium	769 MB	Slower	Excellent	Professional
large-v3	~3 GB	Slowest	Best	Technical work

Tip: Start with "small" for everyday use. Enable GPU acceleration for larger models.

GPU Acceleration

TalkType supports NVIDIA CUDA for 3-5x faster transcription:

Automatic detection - TalkType detects your NVIDIA GPU on first run
One-click download - Download CUDA libraries (~800MB) when prompted
Automatic activation - GPU mode enables after download

You can also enable GPU later: Preferences → Advanced → Download CUDA Libraries

Configuration

Settings are stored in ~/.config/talktype/config.toml:

model = "small"           # AI model: tiny, base, small, medium, large-v3
device = "cpu"            # "cpu" or "cuda" (GPU)
hold_hotkey = "F8"        # Push-to-talk key
toggle_hotkey = "F9"      # Toggle recording key
mode = "hold"             # Default mode: "hold" or "toggle"
language_mode = "auto"    # "auto" or specific language code
beeps = true              # Audio feedback sounds
smart_quotes = true       # Use curly quotes " "
auto_space = true         # Auto-space between utterances
auto_period = true        # Add period at end of sentences

Development

From Source

# Prerequisites (Fedora/Nobara)
sudo dnf install -y portaudio-devel ffmpeg ydotool wl-clipboard \
                    python3-gobject libappindicator-gtk3 libnotify

# Clone and install
git clone https://github.com/ronb1964/TalkType.git
cd TalkType
poetry install

# Run
poetry run dictate-tray

ydotool Setup

TalkType requires ydotool for text injection:

# Create systemd service
mkdir -p ~/.config/systemd/user
cat > ~/.config/systemd/user/ydotoold.service <<'EOF'
[Unit]
Description=ydotool daemon
After=graphical-session.target

[Service]
Environment=XDG_RUNTIME_DIR=%t
ExecStart=/usr/bin/ydotoold --socket-path=%t/.ydotool_socket
Restart=on-failure

[Install]
WantedBy=default.target
EOF

# Enable and start
systemctl --user daemon-reload
systemctl --user enable --now ydotoold.service

Troubleshooting

Text not appearing?

Check ydotoold is running: systemctl --user status ydotoold
Verify socket exists: ls $XDG_RUNTIME_DIR/.ydotool_socket

Hotkey not working?

Another app may be using F8/F9 - try different keys in Preferences
Ensure TalkType service is running (check tray icon)

Transcription slow?

Enable GPU acceleration if you have NVIDIA GPU
Try a smaller model (tiny or base)
Use Performance presets in tray menu

Tray icon not visible (GNOME)?

TalkType offers to install its GNOME extension on first run
Or manually: Preferences → Advanced → Install Extension

License

MIT License - see LICENSE file for details.

TalkType - Voice dictation that just works.
Download • Report Bug • Request Feature

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
aur-repo		aur-repo
aur		aur
docker-testing		docker-testing
gnome-extension		gnome-extension
screenshots		screenshots
scripts		scripts
src/talktype		src/talktype
test-screenshots		test-screenshots
tests		tests
.gitignore		.gitignore
APPIMAGEHUB_STATUS.md		APPIMAGEHUB_STATUS.md
APPIMAGEHUB_SUBMISSION.md		APPIMAGEHUB_SUBMISSION.md
AppImageBuilder.yml		AppImageBuilder.yml
CLAUDE.md		CLAUDE.md
CLAUDE_RULES.md		CLAUDE_RULES.md
CROSS_PLATFORM_PORTING_GUIDE.md		CROSS_PLATFORM_PORTING_GUIDE.md
DEVELOPMENT_PLAN.md		DEVELOPMENT_PLAN.md
DEVELOPMENT_SEPARATION.md		DEVELOPMENT_SEPARATION.md
DEV_NOTES.md		DEV_NOTES.md
DEV_SETUP.md		DEV_SETUP.md
GNOME_EXTENSION_FIXES_NEEDED.md		GNOME_EXTENSION_FIXES_NEEDED.md
GNOME_EXTENSION_IDEAS.md		GNOME_EXTENSION_IDEAS.md
ICON_DOCUMENTATION.md		ICON_DOCUMENTATION.md
IMAGE_GENERATION_TOOLS.md		IMAGE_GENERATION_TOOLS.md
LICENSE		LICENSE
MARKETING_PLAN.md		MARKETING_PLAN.md
MCP_SCREENSHOT_USAGE.md		MCP_SCREENSHOT_USAGE.md
MCP_SERVER_DEBUG_SUMMARY.md		MCP_SERVER_DEBUG_SUMMARY.md
NO_PREINSTALL_BUNDLING_NOTES.md		NO_PREINSTALL_BUNDLING_NOTES.md
README.md		README.md
README_DEV.md		README_DEV.md
REPOSITORY_STRUCTURE.md		REPOSITORY_STRUCTURE.md
ROADMAP_v0.4.0.md		ROADMAP_v0.4.0.md
STABLE_DIFFUSION_SETUP.md		STABLE_DIFFUSION_SETUP.md
TESTING_PROCEDURES.md		TESTING_PROCEDURES.md
TalkType-v0.3.8-x86_64.AppImage.zsync		TalkType-v0.3.8-x86_64.AppImage.zsync
VISUAL_STYLE_GUIDE.md		VISUAL_STYLE_GUIDE.md
build-release.sh		build-release.sh
build-with-appimage-builder.sh		build-with-appimage-builder.sh
cleanup-docker-artifacts.sh		cleanup-docker-artifacts.sh
container-build.sh		container-build.sh
fresh-start-dev.sh		fresh-start-dev.sh
fresh-start-for-testing.sh		fresh-start-for-testing.sh
fresh-start-kubuntu.sh		fresh-start-kubuntu.sh
fresh-test-env.sh		fresh-test-env.sh
install-xdotool.sh		install-xdotool.sh
io.github.ronb1964.TalkType.appdata.xml		io.github.ronb1964.TalkType.appdata.xml
io.github.ronb1964.TalkType.desktop		io.github.ronb1964.TalkType.desktop
io.github.ronb1964.TalkType.png		io.github.ronb1964.TalkType.png
launch-gtk-inspector.sh		launch-gtk-inspector.sh
list-atspi-apps.py		list-atspi-apps.py
marketing-posts.md		marketing-posts.md
mcp-screenshot-server.py		mcp-screenshot-server.py
package-extension.sh		package-extension.sh
patch_pytorch.py		patch_pytorch.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
regenerate-autostart-dev.py		regenerate-autostart-dev.py
regenerate-autostart.py		regenerate-autostart.py
run-dev.sh		run-dev.sh
talktype-gnome-extension.zip		talktype-gnome-extension.zip
test-all-de.sh		test-all-de.sh
test-atspi.py		test-atspi.py
test-autostart.sh		test-autostart.sh
test-cuda-cancel.py		test-cuda-cancel.py
test-cuda-confirm.py		test-cuda-confirm.py
test-dev-paths.py		test-dev-paths.py
test-dev.sh		test-dev.sh
test-extension-auto-enable.sh		test-extension-auto-enable.sh
test-extension-enable-verbose.py		test-extension-enable-verbose.py
test-fresh-extension-install.py		test-fresh-extension-install.py
test-logout-glow.py		test-logout-glow.py
test-model-download.py		test-model-download.py
test-no-extension.py		test-no-extension.py
test-splash.py		test-splash.py
test-terminal-detection.py		test-terminal-detection.py
test-unified-download.py		test-unified-download.py
test-welcome-height.py		test-welcome-height.py
test-welcome-pulse.py		test-welcome-pulse.py
test_download_progress.py		test_download_progress.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TalkType

Screenshots

Features

Installation

Arch Linux (AUR)

AppImage (All Distros)

System Requirements

Quick Start

Hotkey Modes

Voice Commands

Punctuation

Formatting

Editing

Literal Words

AI Models

GPU Acceleration

Configuration

Development

From Source

ydotool Setup

Troubleshooting

Text not appearing?

Hotkey not working?

Transcription slow?

Tray icon not visible (GNOME)?

License

About

Uh oh!

Releases 19

Packages

Contributors 2

Uh oh!

Languages

License

ronb1964/TalkType

Folders and files

Latest commit

History

Repository files navigation

TalkType

Screenshots

Features

Installation

Arch Linux (AUR)

AppImage (All Distros)

System Requirements

Quick Start

Hotkey Modes

Voice Commands

Punctuation

Formatting

Editing

Literal Words

AI Models

GPU Acceleration

Configuration

Development

From Source

ydotool Setup

Troubleshooting

Text not appearing?

Hotkey not working?

Transcription slow?

Tray icon not visible (GNOME)?

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 19

Packages 0

Contributors 2

Uh oh!

Languages

Packages