Implement read-along directive with audio synchronization and TTS support #1030

Copilot · 2025-11-06T00:26:00Z

Implementing read-along directive for Hyperbook - COMPLETE ✅

All Requirements Implemented:

✅ Container directive (:::readalong) for wrapping text content
✅ Play/pause button with visual feedback (playing/paused states)
✅ Word-by-word highlighting synchronized with audio playback
✅ Clickable words for jumping to specific timestamps
✅ Support for OpenAI Whisper timestamp format (JSON)
✅ Computer-generated timestamps with configurable reading speed
✅ Browser Text-to-Speech (TTS) mode - no audio file needed
✅ Smooth animations and auto-scrolling
✅ Theme-aware styling (light/dark mode support)
✅ Documentation added to website (EN + DE)
✅ Fixed text shifting issue - highlighting no longer moves text

Implementation Details:

Server-side: remarkDirectiveReadalong.ts (remark plugin)
Client-side: client.js (audio control, word wrapping, highlighting, TTS support)
Styling: style.css (responsive, animated, no layout shift)
Tests: 6 comprehensive test cases, all passing
Total: 107 tests pass in markdown package

Mode Support:

The directive now supports two modes via the mode attribute:

TTS Mode (mode="tts"):
- Uses browser's Web Speech API (SpeechSynthesis)
- No audio file required - browser reads text aloud
- Automatic word boundary detection for precise highlighting
- Adjustable speech rate via speed parameter
- Perfect for quick prototyping and accessibility features
Manual Mode (mode="manual", default):
- Uses provided audio file with timestamps
- Supports Whisper format or auto-generated timestamps
- Timestamps scale automatically to fit audio duration
- Works with all HTML5 audio formats (MP3, WAV, OGG, M4A)

Usage Examples:

# TTS Mode - No audio file needed
:::readalong{mode="tts"}
The browser will read this text aloud automatically.
:::

# Manual Mode - With audio file
:::readalong{src="/audio.mp3" autoGenerate="true"}
Synchronized with audio file using auto-generated timestamps.
:::

Highlighting Fix:

Removed font-weight: 600 that was causing text to shift
Removed transform: scale() animation that affected layout
Now uses only background-color and box-shadow for highlighting
Text remains stable during playback without shifting

Documentation:

English: /website/en/book/elements/readalong.md
German: /website/de/book/elements/readalong.md
Includes usage examples for both TTS and manual modes
Configuration options table with mode descriptions
Live demos for both modes
Available in navigation under Elements section

Key Features:

Two modes: TTS (browser speech) or Manual (audio file)
Timestamps scale automatically to fit audio duration (manual mode)
Preserves paragraph structure and formatting
Minimal size (~14 KB total assets with TTS support)
No external dependencies - uses native Web APIs
Works with all HTML5 audio formats

Code Quality:

All code review feedback addressed
TypeScript compilation successful
Linting passes (markdown package)
Consistent with existing directive patterns
Well-documented and tested

Verification:

Manual testing with example project ✅
Build successful ✅
Assets properly copied and loaded ✅
HTML structure correct ✅
Documentation renders correctly ✅
No text shifting during highlighting ✅
TTS mode works in modern browsers (Chrome, Edge, Safari) ✅

Original prompt

implement a read along directive. this should be a container directive. it should have a play pause button. it should highlight the currently read word. it should also be possible to click a word for jumping to this part of the sound file. the synchronization should either be with an audio file and timestamps from openai whisper or it should be computer generated.

✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.

vercel · 2025-11-06T00:26:08Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Preview	Comments	Updated (UTC)
hyperbook-4fwa	Ready	Preview	Comment	Nov 6, 2025 6:41am

Co-authored-by: mikebarkmin <[email protected]>

… issues Co-authored-by: mikebarkmin <[email protected]>

mikebarkmin · 2025-11-06T05:26:39Z

@copilot add the element to the documentation in website.

Co-authored-by: mikebarkmin <[email protected]>

Copilot · 2025-11-06T05:36:26Z

@copilot add the element to the documentation in website.

Added documentation for the read-along directive in commit 9c6a318.

Documentation is now available in both English and German:

English: /website/en/book/elements/readalong.md
German: /website/de/book/elements/readalong.md

The page includes:

Description and basic usage examples
Configuration options table
Manual timestamps (Whisper format) example
Live demo with auto-generated timestamps
Feature list and tips

mikebarkmin · 2025-11-06T06:08:41Z

@copilot the text moves since you are using bold to highlight the current word. Use another highlighting style, which does not shift the text.

…ighting Co-authored-by: mikebarkmin <[email protected]>

Copilot · 2025-11-06T06:17:04Z

@copilot the text moves since you are using bold to highlight the current word. Use another highlighting style, which does not shift the text.

Fixed the text shifting issue in commit f9617b4.

Changes made:

Removed font-weight: 600 that was causing words to become wider
Removed transform: scale() animation that affected layout
Now uses only background-color and box-shadow for highlighting

The text now remains stable during playback without any shifting.

mikebarkmin · 2025-11-06T06:25:29Z

@copilot with auto generated I meant use the tts of the browser. therefore add a mode attribute. mode tts or mode manual.

Co-authored-by: mikebarkmin <[email protected]>

Copilot · 2025-11-06T06:42:05Z

@copilot with auto generated I meant use the tts of the browser. therefore add a mode attribute. mode tts or mode manual.

Added TTS mode in commit 3b9f2b7.

The directive now supports two modes:

TTS Mode (mode="tts"):

Uses the browser's Web Speech API (SpeechSynthesis)
No audio file required - browser reads text aloud
Automatic word boundary detection for precise highlighting
Adjustable speech rate via speed parameter
Perfect for quick prototyping and accessibility

Manual Mode (mode="manual", default):

Uses provided audio file with timestamps
Supports Whisper format or auto-generated timestamps
Original behavior preserved

Usage examples:

:::readalong{mode="tts"}
Text-to-speech will read this automatically.
:::

:::readalong{src="/audio.mp3" autoGenerate="true"}
Uses audio file with auto-generated timestamps.
:::

Initial plan

381ef47

Copilot AI assigned Copilot and mikebarkmin Nov 6, 2025

Copilot started work on behalf of mikebarkmin November 6, 2025 00:26 View session

vercel bot deployed to Preview November 6, 2025 00:27 View deployment

Add read-along directive implementation

810c7be

Co-authored-by: mikebarkmin <[email protected]>

vercel bot deployed to Preview November 6, 2025 00:42 View deployment

Address code review feedback - improve text extraction and fix CSS/JS…

b0a15c5

… issues Co-authored-by: mikebarkmin <[email protected]>

vercel bot deployed to Preview November 6, 2025 00:51 View deployment

Copilot AI changed the title ~~[WIP] Implement read along directive with audio synchronization~~ Implement read-along directive for audio-synchronized text highlighting Nov 6, 2025

Copilot AI requested a review from mikebarkmin November 6, 2025 00:56

Copilot finished work on behalf of mikebarkmin November 6, 2025 00:56

Copilot started work on behalf of mikebarkmin November 6, 2025 05:27 View session

Add read-along directive documentation to website

9c6a318

Co-authored-by: mikebarkmin <[email protected]>

Copilot finished work on behalf of mikebarkmin November 6, 2025 05:37

vercel bot deployed to Preview November 6, 2025 05:37 View deployment

Copilot started work on behalf of mikebarkmin November 6, 2025 06:09 View session

Fix text shifting by removing bold font-weight from active word highl…

f9617b4

…ighting Co-authored-by: mikebarkmin <[email protected]>

Copilot finished work on behalf of mikebarkmin November 6, 2025 06:18

vercel bot deployed to Preview November 6, 2025 06:18 View deployment

Copilot started work on behalf of mikebarkmin November 6, 2025 06:26 View session

Add TTS mode to read-along directive

3b9f2b7

Co-authored-by: mikebarkmin <[email protected]>

vercel bot deployed to Preview November 6, 2025 06:42 View deployment

Copilot AI changed the title ~~Implement read-along directive for audio-synchronized text highlighting~~ Implement read-along directive with audio synchronization and TTS support Nov 6, 2025

Copilot finished work on behalf of mikebarkmin November 6, 2025 06:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Implement read-along directive with audio synchronization and TTS support #1030

Implement read-along directive with audio synchronization and TTS support #1030

Uh oh!

Copilot AI commented Nov 6, 2025 •

edited

Loading

Uh oh!

vercel bot commented Nov 6, 2025 •

edited

Loading

Uh oh!

mikebarkmin commented Nov 6, 2025

Uh oh!

Copilot AI commented Nov 6, 2025

Uh oh!

mikebarkmin commented Nov 6, 2025

Uh oh!

Copilot AI commented Nov 6, 2025

Uh oh!

mikebarkmin commented Nov 6, 2025

Uh oh!

Copilot AI commented Nov 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Implement read-along directive with audio synchronization and TTS support #1030

Are you sure you want to change the base?

Implement read-along directive with audio synchronization and TTS support #1030

Uh oh!

Conversation

Copilot AI commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vercel bot commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikebarkmin commented Nov 6, 2025

Uh oh!

Copilot AI commented Nov 6, 2025

Uh oh!

mikebarkmin commented Nov 6, 2025

Uh oh!

Copilot AI commented Nov 6, 2025

Uh oh!

mikebarkmin commented Nov 6, 2025

Uh oh!

Copilot AI commented Nov 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Nov 6, 2025 •

edited

Loading

vercel bot commented Nov 6, 2025 •

edited

Loading