Collaborator
This is very nice, but it is actually a strong scaling example... :D To do a weak scaling plot, you also have to increase the number of processors when you increase the degrees of freedom, i.e., run an 8 times bigger problem on 8 times more processors. If you ran this same test on 8, 64, 512, and 4096 procs and put all of the results (for each stage) on the same plot, you'd get both strong and weak scaling.
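A minimal Python sketch of that bookkeeping, assuming every run is recorded as a `(n_procs, n_dofs, seconds)` tuple and that both the processor counts and the problem sizes grow by the same factor from one step to the next (8 here). The function names and the timings in the usage part are hypothetical placeholders, not measurements from this PR. Runs sharing a problem size form a strong scaling curve; runs on the same diagonal of the (size, procs) grid form a weak scaling curve.

```python
from collections import defaultdict


def scaling_series(runs):
    """Split runs into strong- and weak-scaling series.

    runs: list of (n_procs, n_dofs, seconds) tuples.
    Strong series: same n_dofs, varying n_procs.
    Weak series: same diagonal of the (size, procs) grid, i.e. the problem
    grows by one step whenever the processor count grows by one step.
    """
    sizes = sorted({d for _, d, _ in runs})
    procs = sorted({p for p, _, _ in runs})
    strong, weak = defaultdict(list), defaultdict(list)
    for p, d, t in runs:
        strong[d].append((p, t))
        weak[sizes.index(d) - procs.index(p)].append((p, t))
    for series in (strong, weak):
        for points in series.values():
            points.sort()
    return dict(strong), dict(weak)


def strong_efficiency(points):
    """Ideal strong scaling: time drops by the factor by which procs grow."""
    p0, t0 = points[0]
    return [(p, (t0 * p0) / (t * p)) for p, t in points]


def weak_efficiency(points):
    """Ideal weak scaling: time stays constant along the series."""
    _, t0 = points[0]
    return [(p, t0 / t) for p, t in points]


# Placeholder data: perfectly scaling timings, NOT real measurements.
proc_counts = [8, 64, 512, 4096]
problem_sizes = [100_000 * 8**k for k in range(4)]
runs = [(p, d, 1e-4 * d / p) for p in proc_counts for d in problem_sizes]

strong, weak = scaling_series(runs)
# weak[0] is the main diagonal: smallest problem on 8 procs,
# an 8 times bigger one on 64 procs, and so on.
main_diagonal = weak[0]
```

Plotting each `strong[...]` list and each `weak[...]` list on the same axes gives the combined picture described above.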
Owner
Author
The classical weak scaling plot looks like this (with 8, 64, and 512 procs). All the new components (i.e., everything besides the AMG preconditioner) scale weakly.


This PR adds an instrumentation file to demonstrate weak scaling. It solves an elliptic 3D problem with polynomial degree $p=2$ by increasing simultaneously:
Runs with 256 processors on CINECA are as follows. Processors to the right of the red vertical line have more than $10^5$ DoFs and follow the classical trend reported in the "Distributed Computing paper", Algorithms and Data Structures for Massively Parallel Generic Adaptive Finite Element Codes (fig. 10), in the case of standard shapes. @luca-heltai
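As a rough guide to where the $10^5$ mark falls, here is a small Python sketch. It rests on assumptions that are not stated in this PR: a continuous Lagrange space of degree $p$ on a uniformly refined hexahedral mesh of a cube with $2^\ell$ cells per direction, DoFs split evenly across processes, and the threshold read as DoFs per process. The helper names are hypothetical.

```python
def q_dofs_cube(cells_per_dir, degree):
    """DoFs of a continuous Q_p space on a uniform hex mesh of a cube (assumed setup)."""
    return (degree * cells_per_dir + 1) ** 3


def first_level_above(n_procs, threshold, degree, max_level=12):
    """Smallest refinement level whose DoFs per process exceed the threshold.

    Assumes an even split of the DoFs over n_procs processes.
    Returns None if no level up to max_level qualifies.
    """
    for level in range(max_level + 1):
        dofs = q_dofs_cube(2**level, degree)
        if dofs / n_procs > threshold:
            return level
    return None


# Each global refinement multiplies the cell count by 8 in 3D,
# so the DoF count grows by roughly the same factor.
level = first_level_above(n_procs=256, threshold=10**5, degree=2)
```

Under these assumptions each refinement step moves a run roughly a factor of 8 along the DoF axis, which is why a weak scaling study pairs one refinement with 8 times more processes.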