feat(Text2Speech): Add support for text to speech #210

lukasdotcom · 2025-04-30T01:24:05Z

This creates an implementation for text to speech. (nextcloud/server#52051)

I don't really know for sure what to do about the voices because I couldn't find an endpoint that lists the voices. Right now its just hardcoded to one of openai's voices. Some ideas that I have are:

Make the input field text for the voice and just fail when an invalid voice is given
Detect when open ai endpoint is being used and then use hardcoded list
Let admin create a list of valid values to use

Openai's documentation for text to speech: https://platform.openai.com/docs/guides/text-to-speech#voice-options

julien-nc

Nicely done!
Can you use tabs for indentation in Php files?
You can fix all your files at once with composer run cs:fix

src/components/AdminSettings.vue

github-actions · 2025-05-14T02:53:27Z

Hello there,
Thank you so much for taking the time and effort to create a pull request to our Nextcloud project.

We hope that the review process is going smooth and is helpful for you. We want to ensure your pull request is reviewed to your satisfaction. If you have a moment, our community management team would very much appreciate your feedback on your experience with this PR review process.

Your feedback is valuable to us as we continuously strive to improve our community developer experience. Please take a moment to complete our short survey by clicking on the following link: https://cloud.nextcloud.com/apps/forms/s/i9Ago4EQRZ7TWxjfmeEpPkf6

Thank you for contributing to Nextcloud and we hope to hear from you soon!

(If you believe you should not receive this message, you can add yourself to the blocklist.)

kyteinsky

thanks for the nice work!
It works wonderfully.

btw, the limit for the input is 4096 characters (https://platform.openai.com/docs/api-reference/audio/createSpeech#audio-createspeech-input). It would be a QoL improvement to split the sentences as such but it can be a feature for later.

lib/AppInfo/Application.php

lib/Service/OpenAiAPIService.php

lib/Service/OpenAiSettingsService.php

lib/Service/OpenAiAPIService.php

src/components/AdminSettings.vue

lib/TaskProcessing/TextToSpeechProvider.php

lukasdotcom · 2025-05-27T16:37:43Z

Also I rebased this branch to make sure that the change I had to add for every test also was added to the emoji provider.

$iResponse->method('getHeader')->with('Content-Type')->willReturn('application/json');

lib/Service/OpenAiAPIService.php

Signed-off-by: Lukas Schaefer <[email protected]>

… to pass Signed-off-by: Lukas Schaefer <[email protected]>

Signed-off-by: Lukas Schaefer <[email protected]>

kyteinsky

some minor changes

tests/unit/Providers/OpenAiProviderTest.php

lib/TaskProcessing/TextToSpeechProvider.php

lib/Service/OpenAiAPIService.php

src/components/AdminSettings.vue

Signed-off-by: Lukas Schaefer <[email protected]>

kyteinsky

thanks a lot 🚀

lukasdotcom · 2025-06-25T13:55:52Z

Note: Will need to fix the failing unit tests.

lukasdotcom mentioned this pull request Apr 30, 2025

docs for text to speech nextcloud/documentation#13096

Merged

julien-nc requested changes Apr 30, 2025

View reviewed changes

src/components/AdminSettings.vue Outdated Show resolved Hide resolved

lukasdotcom marked this pull request as ready for review May 5, 2025 23:08

lukasdotcom mentioned this pull request May 5, 2025

Fix: phpdoc is incorrect #211

Merged

github-actions bot added the feedback-requested label May 14, 2025

kyteinsky requested changes May 18, 2025

View reviewed changes

kyteinsky requested a review from julien-nc May 18, 2025 15:19

lukasdotcom requested a review from kyteinsky May 26, 2025 11:42

kyteinsky reviewed May 28, 2025

View reviewed changes

lib/Service/OpenAiAPIService.php Outdated Show resolved Hide resolved

lukasdotcom added 13 commits June 20, 2025 08:39

feat(Text2Speech): Add support for text to speech

8a73d24

Signed-off-by: Lukas Schaefer <[email protected]>

cs:fix, add quota, and default_tts model

6482df5

Signed-off-by: Lukas Schaefer <[email protected]>

Allow for specifying voices and admin default

92ae7c0

Signed-off-by: Lukas Schaefer <[email protected]>

Add Rating for openai tts

66efc68

Signed-off-by: Lukas Schaefer <[email protected]>

add tests

34c008a

Signed-off-by: Lukas Schaefer <[email protected]>

Improve formating for voice selection

ae5985f

Signed-off-by: Lukas Schaefer <[email protected]>

Decode json only when content type is json

3112f77

Signed-off-by: Lukas Schaefer <[email protected]>

Create task type if task type doesn't exist

f0a3c58

Signed-off-by: Lukas Schaefer <[email protected]>

Forgot to run composer cs:fix and rebased updating emojiprovider test…

67052c1

… to pass Signed-off-by: Lukas Schaefer <[email protected]>

Add back speed correctly

aaf8707

Signed-off-by: Lukas Schaefer <[email protected]>

switch wav to mp3 and remove elses

ba7cf46

Signed-off-by: Lukas Schaefer <[email protected]>

Oops typo

4a8b3c9

Signed-off-by: Lukas Schaefer <[email protected]>

limit bounds for speed in openai api

0e80763

Signed-off-by: Lukas Schaefer <[email protected]>

kyteinsky reviewed Jun 24, 2025

View reviewed changes

lukasdotcom added 5 commits June 24, 2025 08:40

Vue 3 and other minor changes

f69d675

Signed-off-by: Lukas Schaefer <[email protected]>

typo in test

719fd29

Signed-off-by: Lukas Schaefer <[email protected]>

used model-value instead v-model

84e60f0

Signed-off-by: Lukas Schaefer <[email protected]>

forgot docstring

2ec8dee

Signed-off-by: Lukas Schaefer <[email protected]>

cs:fix changes

f23ae0c

Signed-off-by: Lukas Schaefer <[email protected]>

lukasdotcom added 3 commits June 25, 2025 09:26

composer update

d479b5d

Signed-off-by: Lukas Schaefer <[email protected]>

alphabetize authors

8e5ccd0

Signed-off-by: Lukas Schaefer <[email protected]>

Add note about input limit in openai

37a3fce

Signed-off-by: Lukas Schaefer <[email protected]>

kyteinsky approved these changes Jun 25, 2025

View reviewed changes

lukasdotcom merged commit 4bfb7ae into nextcloud:main Jun 25, 2025
23 of 29 checks passed

kyteinsky mentioned this pull request Jul 9, 2025

3.6.0 #232

Merged

lukasdotcom mentioned this pull request Jul 31, 2025

fix: default tts model selector #247

Merged

feat(Text2Speech): Add support for text to speech #210

feat(Text2Speech): Add support for text to speech #210

Uh oh!

Conversation

lukasdotcom commented Apr 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

julien-nc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented May 14, 2025

Uh oh!

kyteinsky left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lukasdotcom commented May 27, 2025

Uh oh!

Uh oh!

kyteinsky left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kyteinsky left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lukasdotcom commented Jun 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lukasdotcom commented Apr 30, 2025 •

edited

Loading

lukasdotcom commented Jun 25, 2025 •

edited

Loading