-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Support task level network topology constrain #4675
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Welcome @3sunny! |
Summary of ChangesHello @3sunny, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request integrates a new feature into Volcano that enables users to specify network topology constraints for individual tasks within a job. This capability is crucial for optimizing the performance of distributed workloads by ensuring that related tasks are scheduled on nodes that meet specific network proximity criteria. The implementation involves comprehensive updates to CRD schemas, modifications to the core controller logic for managing these new policies, and the addition of robust validation mechanisms to prevent misconfigurations. Highlights
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request introduces support for task-level network topology constraints by adding a partitionPolicy to various CRDs and implementing the corresponding controller logic. The changes are extensive, touching CRD definitions, API types, controller actions, and admission webhooks.
Overall, the implementation looks solid, but I've identified a few issues related to correctness and maintainability. These include a bug in the Clone method for JobInfo, potential for stale data in UpdatePod, insufficient error handling, and some code duplication in the validation logic. I've also pointed out areas in the tests that could be improved for better coverage and correctness. Addressing these points will enhance the robustness and quality of the new feature.
5b852be to
0d3928e
Compare
|
relate talk 【Volcano Weekly Meeting: 2025-11-07】 【精准空降到 31:16】 https://www.bilibili.com/video/BV1st23BdEfh/?share_source=copy_web&vd_source=984444f47d5c4830f6101d7f27154d98&t=1876 |
2c05715 to
3ac3b5d
Compare
|
please update the example in the PR description. |
3a29053 to
7cf2be8
Compare
done |
0c6038a to
70cf826
Compare
740881f to
6f27f6b
Compare
Signed-off-by: 3sunny <[email protected]>
6f27f6b to
81d0a72
Compare
|
/lgtm |
|
/cc @wangyang0616 |
|
/lgtm |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: wangyang0616 The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
/kind feature
What this PR does / why we need it:
Add support for task-level network topology constraints, including deployment file updates and webhooks and controller sections.
Which issue(s) this PR fixes:
Fixes #4188
Special notes for your reviewer:
Does this PR introduce a user-facing change?
Example YAML for vcJob after modification
Example YAML for podGroup after modification