
Fix scheduler stepping and label dtype handling in training loop#153

Open
SarthakJagota wants to merge 1 commit into ML4SCI:main from SarthakJagota:fix/training-scheduler-dtype

Conversation

@SarthakJagota SarthakJagota commented Feb 24, 2026

This PR introduces two small training stability improvements:

• Replaced scheduler.step(loss) with scheduler.step() for compatibility with schedulers such as CosineAnnealingWarmRestarts, which do not take a loss/metric argument (unlike ReduceLROnPlateau).
• Replaced labels.type(torch.LongTensor).to(device) with labels.long().to(device) to avoid forcing an intermediate CPU tensor and to keep device handling consistent.

These changes do not conceptually alter training behavior, but they improve the correctness and stability of the training loop.
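For context, the two fixes can be sketched as a minimal training-loop fragment (the model, data, and hyperparameters below are hypothetical placeholders, not taken from the repository):

```python
import torch
import torch.nn as nn

# Hypothetical toy model and batch, just to illustrate the two fixes.
model = nn.Linear(4, 3)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
# CosineAnnealingWarmRestarts.step() takes an optional epoch index, not a
# metric, so scheduler.step(loss) would silently misuse the loss value.
scheduler = torch.optim.lr_scheduler.CosineAnnealingWarmRestarts(optimizer, T_0=10)
criterion = nn.CrossEntropyLoss()

device = "cuda" if torch.cuda.is_available() else "cpu"
model.to(device)

inputs = torch.randn(8, 4)
labels = torch.randint(0, 3, (8,), dtype=torch.int32)  # e.g. loaded as int32

inputs = inputs.to(device)
# labels.long() casts the dtype wherever the tensor lives; the old
# labels.type(torch.LongTensor) would always materialize a CPU tensor first.
labels = labels.long().to(device)

optimizer.zero_grad()
loss = criterion(model(inputs), labels)
loss.backward()
optimizer.step()
scheduler.step()  # fixed: no loss argument
```

With ReduceLROnPlateau, by contrast, scheduler.step(loss) is the correct call, which is why the two forms are easy to mix up.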

#152

