Improving metadata extraction efficiency and removing ROI Contour Sequence from metadata #165

jakub-trusina · 2025-11-13T17:44:51Z

Removed ROI Contour Sequence from the extracted metadata: it can be hundreds of MB, which leads to OOM when calling js = ds.to_json_dict().
Binary images are now deleted before the pydicom dataset (read with loading of large items skipped) is converted to JSON, so neither the binary images nor the contours are read from the file.
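
The approach described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the names drop_tags, extract_metadata and HEAVY_TAGS are hypothetical, and the defer_size value is an arbitrary assumption. It relies on pydicom's dcmread(defer_size=...) leaving large element values on disk and on Dataset supporting `in` and `del` with integer tags. Whether a given contour sequence is really skipped may depend on how the file encodes it.

```python
from typing import Any, Iterable, List, MutableMapping

# DICOM tags as 32-bit integers (group << 16 | element).
PIXEL_DATA = 0x7FE00010            # (7FE0,0010) Pixel Data
ROI_CONTOUR_SEQUENCE = 0x30060039  # (3006,0039) ROI Contour Sequence

# Hypothetical list of elements considered too heavy for metadata.
HEAVY_TAGS = (PIXEL_DATA, ROI_CONTOUR_SEQUENCE)


def drop_tags(ds: MutableMapping, tags: Iterable[int] = HEAVY_TAGS) -> List[int]:
    """Delete the given tags from ds in place and return the tags removed."""
    removed = []
    for tag in tags:
        if tag in ds:
            del ds[tag]
            removed.append(tag)
    return removed


def extract_metadata(path: str, defer_size: str = "1 KB") -> dict:
    """Hypothetical helper: read a DICOM file and return JSON-ready metadata."""
    import pydicom  # imported lazily so drop_tags is usable without pydicom

    # Values larger than defer_size are left on disk until first accessed.
    ds = pydicom.dcmread(path, defer_size=defer_size)
    # Remove heavy elements before serialising so their values are not needed.
    drop_tags(ds)
    return ds.to_json_dict()
```

Because drop_tags only uses `in` and `del`, it can be exercised on a plain dict keyed by integer tags as well as on a pydicom Dataset.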

CLAassistant · 2025-11-13T17:44:58Z

All committers have signed the CLA.

dmoore247

@jakub-trusina Interesting. What's the motivation for removing this particular tag?

jakub-trusina · 2025-11-21T08:29:31Z

@dmoore247, it caused OOM in ds.to_json_dict(). In my dataset this sequence holds hundreds of MB. When debugging, I needed a 16 GB cluster to process a single image in plain Python (without a UDF). pydicom probably doesn't do this conversion efficiently in terms of memory usage. BTW, @erinaldidb knows the context of the project.
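
One way to check a memory claim like this outside a cluster is to measure the peak Python allocation of the conversion call. The sketch below uses the standard-library tracemalloc module. The helper name peak_memory_bytes is hypothetical, and it only counts allocations made through Python's allocator, so it is an approximation of real process memory.

```python
import tracemalloc
from typing import Any, Callable, Tuple


def peak_memory_bytes(func: Callable[[], Any]) -> Tuple[Any, int]:
    """Run func() and return (result, peak bytes traced while it ran)."""
    tracemalloc.start()
    try:
        result = func()
        # get_traced_memory() returns (current, peak) in bytes.
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, peak
```

For example, with a dataset ds already read by pydicom, `_, peak = peak_memory_bytes(lambda: ds.to_json_dict())` gives a figure that can be compared with and without ROI Contour Sequence present.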

jakub-trusina added 2 commits November 13, 2025 17:23

removing RTStruct from metadata, improving processing efficiency

40a8849

updated comment

806ef7c

dmoore247 reviewed Nov 20, 2025
