
Conversation

@patrocinio (Contributor)

Fixes #3643

Description

Checklist

  • The issue being fixed is referenced in the description (see "Fixes #ISSUE_NUMBER" above)
  • Only one issue is addressed in this pull request
  • Labels from the issue this PR fixes are added to this pull request
  • No unrelated issues are included in this pull request

@pytorch-bot (bot) commented Nov 25, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/tutorials/3662

Note: Links to docs will display an error until the docs builds have been completed.


This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the cla signed label Nov 25, 2025
- Add a dedicated PAD_token (index 2) for proper padding (see the sketch after this list)
- Use pack_padded_sequence in the encoder to handle variable-length sequences (see the encoder sketch further below)
- Ensure the encoder's final hidden state represents actual content, not padding
- Pass ignore_index=PAD_token to the loss function to exclude padding from the gradients
- Set padding_idx on all embedding layers
- Add documentation explaining padding-handling best practices
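
The token and loss changes in the list above amount to a few lines. Here is a minimal sketch of how they fit together; SOS_token and EOS_token follow the tutorial's existing convention, while hidden_size and vocab_size are hypothetical placeholders, not values taken from this PR's diff.

```python
import torch.nn as nn

# Token indices: SOS/EOS follow the tutorial's existing convention;
# PAD_token = 2 is the dedicated padding index this PR introduces.
SOS_token = 0
EOS_token = 1
PAD_token = 2

hidden_size = 128   # hypothetical value, for illustration only
vocab_size = 4000   # hypothetical value, for illustration only

# padding_idx pins the PAD embedding to a zero vector and keeps it
# out of gradient updates.
embedding = nn.Embedding(vocab_size, hidden_size, padding_idx=PAD_token)

# ignore_index skips PAD positions when computing the loss, so padded
# target steps contribute no gradient.
criterion = nn.NLLLoss(ignore_index=PAD_token)
```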

Fixes issues where:
1. the GRU's final hidden state could come from PAD tokens rather than real input
2. the loss was computed on PAD tokens, skewing training
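
The key mechanism behind both fixes is packing. Below is a minimal sketch of a packed-sequence encoder; it mirrors the tutorial's EncoderRNN in spirit, but the forward signature taking explicit lengths and the layer settings here are illustrative assumptions, not the PR's actual code.

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

PAD_token = 2  # dedicated padding index introduced by this PR

class EncoderRNN(nn.Module):
    # Sketch only: the explicit `lengths` argument is an assumption
    # made for illustration.
    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.embedding = nn.Embedding(input_size, hidden_size,
                                      padding_idx=PAD_token)
        self.gru = nn.GRU(hidden_size, hidden_size, batch_first=True)

    def forward(self, input_ids, lengths):
        # input_ids: (batch, max_len), padded with PAD_token
        # lengths:   true sequence lengths, so the GRU only sees real content
        embedded = self.embedding(input_ids)
        packed = pack_padded_sequence(embedded, lengths.cpu(),
                                      batch_first=True, enforce_sorted=False)
        packed_output, hidden = self.gru(packed)
        # `hidden` is read at each sequence's last real step, never at a PAD
        # position, so the state handed to the decoder reflects actual content.
        output, _ = pad_packed_sequence(packed_output, batch_first=True)
        return output, hidden
```

With enforce_sorted=False, batches do not need to be pre-sorted by length, so the surrounding data pipeline can stay unchanged.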


Development

Successfully merging this pull request may close this issue:

Feedback about NLP From Scratch: Translation with a Sequence to Sequence Network and Attention
