cuD-PDLP#1391
Open
Bubullzz wants to merge 155 commits into
…he cycle seems to be fixed, cuopt compiles
+ style too
compiles and runs
/ok to test 368b3b3
/ok to test 6948bc5
/ok to test 1563cdc
/ok to test d0de284
/ok to test 7bb6945
/ok to test 21cdccc
CI Test Summary: 16 failed · 15 passed · 0 skipped
/ok to test fa88ed9
/ok to test 7b6f96a
/ok to test b7d4d91
/ok to test 89c8878
/ok to test 2fc3add
Implemented METIS-partitioned multi-GPU PDLP.
To run PDLP on multiple GPUs, run:
./cpp/build/cuopt_cli ../path/to/file.mps --method 1 --use-distributed-pdlp true --presolve 0
The exact number of GPUs used can be set with --distributed-pdlp-num-gpus n.
All benchmarking results against D-PDLP and single-GPU cuOpt can be found in this spreadsheet.
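For sweeping over GPU counts when benchmarking, the invocation above can be assembled programmatically. This is a minimal sketch, not part of the PR: the helper name `build_cuopt_command` is hypothetical, the flags are copied verbatim from the command above, and the `.mps` path is the same placeholder. It only builds the argument list and does not launch the solver.

```python
import shlex


def build_cuopt_command(mps_path, num_gpus=None, cli="./cpp/build/cuopt_cli"):
    """Return the argv list for a distributed PDLP run (hypothetical helper).

    Flags are taken as-is from the PR description; nothing here is
    validated against the actual cuopt_cli option parser.
    """
    argv = [
        cli,
        mps_path,
        "--method", "1",
        "--use-distributed-pdlp", "true",
        "--presolve", "0",
    ]
    if num_gpus is not None:
        if num_gpus < 1:
            raise ValueError("num_gpus must be at least 1")
        # Optional flag: only appended when an explicit GPU count is given.
        argv += ["--distributed-pdlp-num-gpus", str(num_gpus)]
    return argv


if __name__ == "__main__":
    # Print one shell-quoted command line per GPU count.
    for n in (1, 2, 4, 8):
        print(shlex.join(build_cuopt_command("../path/to/file.mps", n)))
```

The returned list can be passed directly to `subprocess.run` if the binary has been built at that path.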
Here is the bottom line of the results.
On 8 NVLink-connected B200 GPUs:
Against cuOpt:
Against D-PDLP:
Note: the speedups against D-PDLP are computed with NVLS_SHARP=0, which disables a feature that could give D-PDLP a speedup of 1.1x to 1.75x. I am working with the compute-lab team to make it work.
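To read the reported numbers with this caveat in mind, a measured speedup can be rescaled by the quoted 1.1x to 1.75x range. The sketch below assumes that range divides the D-PDLP runtime uniformly, which is a simplification; the function name is hypothetical and the input value in the usage line is purely illustrative, not a benchmark result.

```python
def adjusted_speedup_range(measured, gain_low=1.1, gain_high=1.75):
    """Return (pessimistic, optimistic) speedups against D-PDLP.

    Assumption: enabling the disabled feature would make D-PDLP faster by a
    factor between gain_low and gain_high, so a speedup measured against
    the slower baseline shrinks by that same factor.
    """
    if measured <= 0:
        raise ValueError("measured speedup must be positive")
    if not 0 < gain_low <= gain_high:
        raise ValueError("expected 0 < gain_low <= gain_high")
    return measured / gain_high, measured / gain_low


if __name__ == "__main__":
    # Illustrative input only: a hypothetical 2.0x measured speedup.
    low, high = adjusted_speedup_range(2.0)
    print(f"adjusted speedup between {low:.2f}x and {high:.2f}x")
```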
closes #891