CNNT: dynamic positional embeddings to support variable input sizes by SarthakJagota · Pull Request #150 · ML4SCI/DeepLense

SarthakJagota · 2026-02-23T10:13:43Z

This PR removes the hard-coded patch count used to initialize positional embeddings in the CNNT model and instead creates them dynamically based on the CNN output sequence length.

Changes

Compute patch sequence length from CNN feature maps
Create positional embeddings dynamically to avoid shape mismatches
Properly register positional embeddings so they are tracked by the optimizer
Maintain existing behavior for default input configurations

Motivation

The previous implementation assumed a fixed input resolution, which limited reuse across DeepLense tasks and could lead to runtime errors when image sizes change.
This update makes CNNT resolution-agnostic and improves model flexibility.

#149

Update CNNT.py

1c9f690

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CNNT: dynamic positional embeddings to support variable input sizes#150

CNNT: dynamic positional embeddings to support variable input sizes#150
SarthakJagota wants to merge 1 commit intoML4SCI:mainfrom
SarthakJagota:fix/cnnt-dynamic-pos-embedding

SarthakJagota commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

SarthakJagota commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant