Skip to content

Conversation

@jakub-trusina
Copy link

  • removed ROI Contour Sequence as those can have hundreds of MB which leads to OOM during calling js=ds.to_dict_json() here.
  • deletion of binary images before converting pydicom dataset (with skipped loading of large items) into json - thus we prevent reading of binary images and contours from the file

@CLAassistant
Copy link

CLAassistant commented Nov 13, 2025

CLA assistant check
All committers have signed the CLA.

Copy link
Collaborator

@dmoore247 dmoore247 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jakub-trusina Interesting. What's the motivation for removing this particular tag?

@jakub-trusina
Copy link
Author

@dmoore247, it caused OOM on ds.to_json_dict(). My dataset has there hundreds of MB. When debugging, I needed 16GB cluster to process single image in Python only (not using UDF). Probably pydicom doesn't do this conversion efficiently in terms of memory usage. BTW, @erinaldidb knows the context of the project.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants