Releases · fishaudio/fish-speech · GitHub

15 Sep 09:49

leng-yue

V1.4.1 Latest

This release includes bug fixes and container optimizations.

12 Sep 14:38

leng-yue

Fish Speech V1.4 Release

Fish Speech V1.4 is a leading text-to-speech (TTS) model trained on 700k hours of multilingual audio data.

Supported languages:

English (en) ~300k hours
Chinese (zh) ~300k hours
German (de) ~20k hours
Japanese (ja) ~20k hours
French (fr) ~20k hours
Spanish (es) ~20k hours
Korean (ko) ~20k hours
Arabic (ar) ~20k hours

Have fun :)

10 Sep 00:24

leng-yue

V1.2.1

This is the final stable release before the 1.4 release on Sep 10.

18 Jul 16:41

leng-yue

Fish Speech V1.2 Release

In this release, we roll out both the 1.2 pretrained and SFT models, and add support for auto-reranking for more stable generation.

02 Jul 04:55

leng-yue

V1.1.2

This is the final stable release before 1.2.

08 Jun 16:58

leng-yue

v1.1.1

This release improves overall performance and user experience, and includes many bug fixes.

11 May 14:20

leng-yue

v1.1.0

In this release, we added the VITS decoder module, which provides better phone-level accuracy and semantic similarity.

30 Apr 06:49

leng-yue

v1.0.0

This is a major release of Fish Speech. Models can be found at HuggingFace.
A live demo can be found at HuggingFace Space and Fish Audio.

Models are released under the CC BY-NC-SA 4.0 license.

25 Dec 11:55

leng-yue

v0.2.0

This version provides a basic model (architecture) implementation, inference acceleration, and a pretrained model.
Most functions and pipelines are tested and work properly.
