Blog

111 articles

The latest MikuTools articles: tool tutorials, product updates, AI tool practice, and engineering notes.

  1. What SSML support actually looks like across TTS APIs in 2026

    SSML is the standard for telling a text-to-speech engine how to read your text. Half the modern providers ignore it. Here is what is actually implemented across the major TTS APIs in 2026 and what to use when SSML is not on the menu.

    7 min read
  2. How to match TTS voices to narration jobs without guessing

    The text-to-speech tool ships dozens of preset voices across nine languages. Which one fits a tutorial? A meditation? An ad read? Here is a practical taxonomy of when to reach for which voice, and how to test fast.

    8 min read
  3. Pick the right text-to-speech format for your output

    MP3, Opus, AAC, FLAC, WAV, OGG, and PCM all come out of the same TTS endpoint. They are not interchangeable. Here is which format to ask for, and when reaching for the wrong one quietly costs you bandwidth, quality, or compatibility.

    7 min read
  4. Live captions are not the same as accessible captions

    Auto-generated captions look like an accessibility solution but usually are not one. Here is the difference between live captions and accessible captions, why ADA and WCAG care, and what good practice looks like in 2026.

    6 min read
  5. Turn a podcast episode into show notes in 20 minutes

    A practical recipe for taking a 60-minute recorded podcast episode through transcription to ready-to-publish show notes, chapters, and social cuts in roughly 20 minutes of working time.

    6 min read
  6. Why I stopped uploading client photos to image compressors

    An opinion piece on why browser-side image compression has quietly become the right default for client-bound work, where cloud compressors still earn their keep, and the honest limitations of staying local.

    11 min read