Daily Note: TIL — Polly SSML <mark> tags
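Polly's SSML <mark> tags come back as timed speech marks. Useful for synchronizing on-screen captions to voice playback.
TIL: Polly's <mark> SSML tags surface as speech marks: JSON timing metadata that Polly returns when you request the json output format with the ssml speech mark type. The marks come from their own SynthesizeSpeech call rather than riding inside the audio bytes, so you fetch them alongside the audio. If you're doing voice + screen-pop, you can synchronize captions to the exact word.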
<speak>
Please hold while I <mark name="lookup"/> look up your account.
</speak>
On the consumer side, you line the mark timestamps up against audio playback and flip UI state when lookup fires. This is the missing piece I needed for a low-latency voice+visual experience I've been prototyping.
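Here is a minimal sketch of that consumer side in Python, not the code from my prototype. It assumes the speech marks arrive as newline-delimited JSON and that your audio player can report its playback position in milliseconds. The names parse_ssml_marks, MarkDispatcher, fetch_mark_lines, and SAMPLE_LINES are hypothetical, the sample timestamps are made up, and the "Joanna" voice is just a placeholder.

```python
import json
from typing import Callable, Iterable, List, Tuple

# Illustrative speech-mark lines; the timestamps are invented for this sketch.
SAMPLE_LINES = [
    '{"time": 0, "type": "sentence", "value": "Please hold while I look up your account."}',
    '{"time": 850, "type": "ssml", "value": "lookup"}',
]


def parse_ssml_marks(lines: Iterable[str]) -> List[Tuple[int, str]]:
    """Return (time_ms, mark_name) pairs for SSML <mark> entries, sorted by time."""
    marks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        event = json.loads(line)
        # Keep only <mark> entries; skip sentence/word/viseme marks.
        if event.get("type") == "ssml":
            marks.append((int(event["time"]), event["value"]))
    return sorted(marks)


class MarkDispatcher:
    """Fires each mark exactly once when playback reaches its timestamp."""

    def __init__(self, marks: List[Tuple[int, str]], on_mark: Callable[[str], None]):
        self._marks = list(marks)
        self._on_mark = on_mark
        self._next = 0  # index of the next mark that has not fired yet

    def update(self, position_ms: int) -> None:
        """Call this from the player's progress callback or a timer."""
        while self._next < len(self._marks) and self._marks[self._next][0] <= position_ms:
            self._on_mark(self._marks[self._next][1])
            self._next += 1


def fetch_mark_lines(ssml: str, voice_id: str = "Joanna") -> List[str]:
    """Fetch speech marks from Polly. Assumes boto3 is installed and AWS credentials are configured."""
    import boto3  # imported lazily so the parsing code above runs without it

    polly = boto3.client("polly")
    response = polly.synthesize_speech(
        Text=ssml,
        TextType="ssml",
        OutputFormat="json",       # speech marks instead of audio
        SpeechMarkTypes=["ssml"],  # only the <mark> entries
        VoiceId=voice_id,
    )
    return response["AudioStream"].read().decode("utf-8").splitlines()
```

In use, you would build the dispatcher once per utterance, e.g. MarkDispatcher(parse_ssml_marks(lines), handler), and call update(position_ms) as playback advances. Keeping the dispatcher driven by the player's position rather than a wall-clock timer means the UI stays in step if playback stalls or buffers.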
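Docs: Amazon Polly Developer Guide → "Speech Marks" → one JSON object per line, with time (milliseconds from the start of the audio), type, and value (the mark's name).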