Better spectrograms for UI by akashmjn · Pull Request #452 · orcasound/orcahello

akashmjn · 2026-03-13T19:40:45Z

Summary

The spectrogram visualizer now generates a mel spectrogram with frequency labels for better readability. It uses the src.audio_frontend module, the same one used for model inference.
480 mel bins match the 480px image height for 1:1 pixel-per-bin rendering (n_fft=4096, hop_length=1024).
Adds an optional test_spectrogram_viz pytest for local visual inspection (--save-debug).
Changes the colormap from magma to Blues.
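To make the 1:1 pixel-per-bin idea concrete, here is a minimal sketch of how a frequency label could be placed on a 480-row image when each row is one mel bin. It assumes the HTK-style mel formula, which may not be the variant audio_frontend uses, and the 20 Hz and 24 kHz bounds in the usage lines come from the earlier commit message in this PR rather than from the merged code. The function names are hypothetical.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    """HTK-style mel scale (an assumption; the frontend may use another variant)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def label_row(f_hz: float, f_min: float, f_max: float, n_mels: int = 480) -> int:
    """Pixel row (0 = top of image) for a frequency label, with one row per mel bin."""
    m_lo, m_hi = hz_to_mel(f_min), hz_to_mel(f_max)
    # Fractional position along the mel axis: 0.0 at f_min, 1.0 at f_max.
    frac = (hz_to_mel(f_hz) - m_lo) / (m_hi - m_lo)
    bin_index = min(n_mels - 1, max(0, int(frac * n_mels)))
    # Low frequencies are drawn at the bottom, so flip the bin index.
    return (n_mels - 1) - bin_index


# Rows for a few candidate labels, assuming a 20 Hz to 24 kHz mel axis.
rows = {f: label_row(f, 20.0, 24_000.0) for f in (100, 1_000, 10_000)}
```

Because the mel axis is nonlinear in Hz, evenly spaced labels in Hz land on unevenly spaced rows, which is why the labels need to be computed rather than placed at fixed pixel intervals.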

Test plan

Manually run pytest tests/test_audio_preprocessing.py -k "spectrogram_viz" --save-debug, which generates a 1280x480 PNG in tests/tmp/ for inspection

Verify that the orchestrator still runs end-to-end, saving images to Azure with a *LocalDebug orch_config
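For readers unfamiliar with custom pytest flags, the --save-debug option in the test plan could be wired up roughly as follows. This is a sketch, not the PR's code: pytest_addoption and parser.addoption are standard pytest hooks, while maybe_save_debug and its arguments are hypothetical names introduced here for illustration.

```python
import shutil
from pathlib import Path
from typing import Optional


def pytest_addoption(parser):
    """conftest.py hook: register the --save-debug command-line flag."""
    parser.addoption(
        "--save-debug",
        action="store_true",
        default=False,
        help="copy generated artifacts to tests/tmp/ for manual inspection",
    )


def maybe_save_debug(src_png: Path, save_debug: bool, debug_dir: Path) -> Optional[Path]:
    """Copy src_png into debug_dir when the flag is set; return the copy's path."""
    if not save_debug:
        return None
    debug_dir.mkdir(parents=True, exist_ok=True)
    dest = debug_dir / src_png.name
    shutil.copy2(src_png, dest)
    return dest
```

Inside a test, the flag value would typically be read with request.config.getoption("--save-debug") and passed to a helper like this, so the test still asserts on the image in a temporary directory and only copies it out when asked.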

Fixes #429
Fixes #139

Replace the split-in-half STFT approach in spectrogram_visualizer with the model's audio_frontend (load_processed_waveform + featurize_waveform). This produces a single mel spectrogram with uniform color normalization, fixing the purple/blue half-and-half artifacts in the moderator UI.

https://claude.ai/code/session_018X71PrWAjeFTXkEdqH65W7
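One plausible mechanism for the half-and-half artifact is that each half was scaled to the color range independently. The toy example below illustrates that effect with made-up dB values; it is not taken from the old or new visualizer code.

```python
def min_max(values):
    """Scale values linearly to [0, 1], the range a colormap consumes."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0  # avoid division by zero on constant input
    return [(v - lo) / span for v in values]


# Hypothetical dB levels for one frequency bin across a clip:
# a quiet first half and a louder second half.
first_half = [-80.0, -70.0, -60.0]
second_half = [-50.0, -30.0, -10.0]

# Per-half normalization: each half is stretched to the full color range,
# so -60 dB and -10 dB both map to 1.0 and the seam shows a color jump.
split = min_max(first_half) + min_max(second_half)

# Joint normalization: one scale for the whole clip, so color tracks level.
joint = min_max(first_half + second_half)
```

With joint normalization the mapped values rise monotonically with level across the seam, whereas the per-half version resets to the bottom of the colormap at the midpoint.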

…ms (#429)

Spectrogram visualizer now uses its own visualization config instead of the model's inference config:
- No resampling: uses native sample rate (48kHz for hydrophones)
- mel_f_min=20Hz, mel_f_max=Nyquist (full audible bandwidth)
- 960 mel bins matching 960px image height for 1:1 pixel rendering
- n_fft=8192, hop_length=2048 for good frequency resolution
- Output: 1920x960 PNG with uniform magma colormap

Also adds an optional pytest (test_spectrogram_viz) under test_audio_preprocessing for local visual inspection via --save-debug.

https://claude.ai/code/session_018X71PrWAjeFTXkEdqH65W7
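The parameters in this commit message can be collected into a small config object to see what they imply. The class below is a hypothetical sketch (VizConfig and its property names are not from the PR); the numeric defaults are the ones quoted in the commit message, which were later changed to 480 bins, n_fft=4096 and hop_length=1024 according to the PR summary.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VizConfig:
    """Hypothetical container for the visualization parameters in the commit message."""

    sample_rate: int  # native rate of the WAV, no resampling
    n_fft: int = 8192
    hop_length: int = 2048
    n_mels: int = 960  # one mel bin per pixel row
    mel_f_min: float = 20.0

    @property
    def mel_f_max(self) -> float:
        # Nyquist frequency: half the sample rate.
        return self.sample_rate / 2

    @property
    def bin_width_hz(self) -> float:
        # Spacing between adjacent FFT bins.
        return self.sample_rate / self.n_fft

    @property
    def hop_seconds(self) -> float:
        # Time step between adjacent spectrogram frames.
        return self.hop_length / self.sample_rate


cfg = VizConfig(sample_rate=48_000)
```

At 48 kHz these values give FFT bins about 5.86 Hz apart and a new frame roughly every 42.7 ms. Deriving mel_f_max from the sample rate keeps the config valid for clips recorded at other rates.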

Copilot

Pull request overview

This PR updates spectrogram generation in the InferenceSystem so the moderator UI receives more consistent, visualization-optimized spectrogram images derived from the audio clip’s native sample rate (instead of the model inference config). It also adds a local-only pytest for visually inspecting the generated PNG output.

Changes:

Refactors spectrogram_visualizer.write_spectrogram() to use model.audio_frontend and a visualization-specific config derived from the WAV’s native sample rate.
Adds an optional test_spectrogram_viz test that generates and validates a 1280x480 spectrogram PNG (and can save a debug artifact).
Registers a new optional pytest marker.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

File	Description
InferenceSystem/src/spectrogram_visualizer.py	Replaces the previous “two halves stitched together” approach with a single mel-spectrogram render pipeline using audio_frontend + Matplotlib.
InferenceSystem/tests/test_audio_preprocessing.py	Adds an optional visual-inspection test to generate/validate a spectrogram image.
InferenceSystem/tests/conftest.py	Registers the `optional` pytest marker for local-debug tests.
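Registering a custom marker in conftest.py usually looks like the sketch below. pytest_configure and config.addinivalue_line are standard pytest APIs; the marker description string is illustrative and not copied from the PR.

```python
def pytest_configure(config):
    """conftest.py hook: declare the custom marker so pytest does not warn about it."""
    config.addinivalue_line(
        "markers",
        "optional: local-debug test intended for manual runs (description is illustrative)",
    )
```

Tests can then be tagged with @pytest.mark.optional and selected or excluded with -m on the command line.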

Copilot

Pull request overview

Improves the InferenceSystem’s spectrogram PNG generation used by the orchestrator (and ultimately shown in the moderator UI) by switching the visualizer to the shared model.audio_frontend mel-spectrogram pipeline and adding a small visualization-focused test.

Changes:

Refactors spectrogram_visualizer.write_spectrogram() to compute a mel spectrogram via model.audio_frontend and render it with matplotlib (including frequency labels + new colormap).
Adds a new pytest test that generates a spectrogram from the 1-minute fixture WAV and asserts the output image dimensions (with optional --save-debug output copying).
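The dimension assertion described above can be illustrated without any imaging library, since a PNG stores its width and height in the IHDR chunk right after the file signature. This is a standard-library sketch under that assumption; the PR's test may well use an image library instead, and png_size and fake_png_header are hypothetical helper names.

```python
import struct
import zlib
from typing import Tuple

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_size(data: bytes) -> Tuple[int, int]:
    """Return (width, height) read from the IHDR chunk of PNG bytes."""
    if data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG stream")
    # IHDR payload starts at byte 16: 4-byte big-endian width, then height.
    width, height = struct.unpack(">II", data[16:24])
    return width, height


def fake_png_header(width: int, height: int) -> bytes:
    """Build only the signature and IHDR chunk (enough for png_size, not a full image)."""
    # Width, height, bit depth 8, color type 2 (RGB), then three zero fields.
    payload = struct.pack(">IIBBBBB", width, height, 8, 2, 0, 0, 0)
    crc = zlib.crc32(b"IHDR" + payload)
    return (
        PNG_SIGNATURE
        + struct.pack(">I", len(payload))
        + b"IHDR"
        + payload
        + struct.pack(">I", crc)
    )


assert png_size(fake_png_header(1280, 480)) == (1280, 480)
```

In a real test the bytes would come from reading the generated file, for example the first 24 bytes of the PNG written by write_spectrogram().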

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

File	Description
InferenceSystem/src/spectrogram_visualizer.py	Replaces the previous STFT/concatenation approach with a single mel-spectrogram render path and overlays frequency labels.
InferenceSystem/tests/test_audio_preprocessing.py	Adds a visualization-oriented test that generates and validates the PNG output (and optionally saves it for local inspection).

Co-authored-by: Dave Thaler <dthaler1968@gmail.com>

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

claude and others added 5 commits March 13, 2026 18:46

tweak height

f8321c3

fix bugs, param better

154afe4

rm unused code

c1b6e32

akashmjn changed the title ~~Refactor spectrogram generation to use model's audio frontend~~ Better spectrograms for UI Mar 13, 2026

akashmjn requested a review from dthaler March 13, 2026 21:07

akashmjn marked this pull request as ready for review March 13, 2026 21:09

akashmjn requested review from TruaShamu and micya as code owners March 13, 2026 21:09

update cmap to blues

092fa53

dthaler requested a review from Copilot March 14, 2026 00:59

Copilot started reviewing on behalf of dthaler March 14, 2026 01:00

Copilot AI reviewed Mar 14, 2026

akashmjn added 4 commits March 13, 2026 19:25

pr nits

2b7f972

ruff fix imports

9d62398

nit

01978a6

nit

77dc4b6

dthaler approved these changes Mar 14, 2026

InferenceSystem/tests/test_audio_preprocessing.py (outdated, resolved)

dthaler requested a review from Copilot March 14, 2026 13:33

Copilot started reviewing on behalf of dthaler March 14, 2026 13:33

Copilot AI reviewed Mar 14, 2026

akashmjn and others added 3 commits March 14, 2026 08:53

Merge branch 'main' into claude/debug-orchestrator-images-Ag6nR

bb39c06

Update InferenceSystem/tests/test_audio_preprocessing.py

8a6dd3a

Co-authored-by: Dave Thaler <dthaler1968@gmail.com>

Potential fix for pull request finding

4543297

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

akashmjn merged commit 6a81be9 into main Mar 14, 2026
26 checks passed

akashmjn deleted the claude/debug-orchestrator-images-Ag6nR branch March 14, 2026 16:03

akashmjn merged 13 commits into main from claude/debug-orchestrator-images-Ag6nR
