Hi everyone!

A few days ago I released Whishper, a new version of a project I’ve been working for about a year now.

It’s a self-hosted audio transcription suite, you can transcribe audio to text, generate subtitles, translate subtitles and edit them all from one UI and 100% locally (it even works offline).

I hope you like it, check out the website for self-hosting instructions: https://whishper.net

  • Axiochus@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    ·
    10 months ago

    Oh, awesome! Does it do speaker detection? That’s been one of my main gripes with Whisper.

    • pluja@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      7
      ·
      edit-2
      10 months ago

      Unfortunately, not yet. Whisper per se is not able to do that. Currently, there are few viable solutions for integration, and I’m looking at this one, but all current solutions I know about need GPU for this.

      • jherazob@kbin.social
        link
        fedilink
        arrow-up
        2
        ·
        10 months ago

        VERY understandable, requiring a GPU would limit it’s application and spread, i hope a good GPU-less solution is found eventually