Bug for client optimizer

Hi @samiul272, after some careful debugging I finally found the problem. If I take the repository code as-is and run the default command from README.md:
```
python main_resnet.py --data_name CIFAR10 \
                      --model_name resnet18 \
                      --control_name 1_100_0.1_non-iid-2_dynamic_a1-b1-c1-d1-e1_bn_1_1 \
                      --exp_name roll_test \
                      --algo roll \
                      --g_epoch 3200 \
                      --l_epoch 1 \
                      --lr 2e-4 \
                      --schedule 1200 \
                      --seed 31 \
                      --num_experiments 3 \
                      --devices 0 1 2
```
then each client locally uses the **Adam** optimizer instead of the **SGD** optimizer! 

I think the cause is that the default optimizer in `config.yml` is **Adam**. The `process_control` function in `utils.py` does change `cfg['optimizer_name']`, but that change only takes effect in the main process. Ray runs the clients in parallel and assigns a separate process to each one, so when a client creates a new optimizer in its `step` function, every value it reads from `cfg` still comes from `config.yml`. In other words, the client is actually running the `Adam` optimizer. To test this, I printed the optimizer information on the client side, as shown below.
![image](https://user-images.githubusercontent.com/60345931/225015324-8cf27f5f-de12-4f63-b328-66ab7f8346cc.png)
Then, ① with `optimizer_name` set to **Adam** in `config.yml` (which is also the default in your code), running the command above gives the following result:
![image](https://user-images.githubusercontent.com/60345931/225015890-13095976-fe6b-412f-bed7-acb15815cf28.png)
② With `optimizer_name` set to **SGD** in `config.yml`, running the same command gives:
![image](https://user-images.githubusercontent.com/60345931/225016116-766cd7fb-be88-4720-a8a4-610b811a94b9.png)
In my tests, setting ① reproduces the results in Table 3, while with setting ② the model fails to train. I believe this also explains the problem I reported in issue #7.
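
For what it's worth, here is a minimal sketch of the behaviour I am describing, not the repository's actual code. It uses a plain `subprocess` as a stand-in for a Ray worker process, and the names `WORKER_CODE` and `run_worker` and the hard-coded defaults are my own assumptions for illustration. The point is only that a change made to `cfg` in the main process is not visible in another process unless it is passed along explicitly.

```python
import json
import subprocess
import sys

# Code run by a fresh interpreter. It mimics a worker process that builds
# its own cfg from the file defaults (hypothetical stand-in for config.yml).
WORKER_CODE = """
import json, sys
cfg = {"optimizer_name": "Adam"}          # defaults, as if read from config.yml
if len(sys.argv) > 1:
    cfg.update(json.loads(sys.argv[1]))   # overrides passed explicitly by the caller
print(cfg["optimizer_name"])
"""


def run_worker(overrides=None):
    """Start a new Python process and return the optimizer name it sees."""
    args = [sys.executable, "-c", WORKER_CODE]
    if overrides is not None:
        args.append(json.dumps(overrides))
    result = subprocess.run(args, capture_output=True, text=True, check=True)
    return result.stdout.strip()


# The main process changes its own copy of cfg, similar to what
# process_control does for cfg['optimizer_name'].
cfg = {"optimizer_name": "Adam"}
cfg["optimizer_name"] = "SGD"

# Without passing anything, the worker still sees the file default.
seen_without_passing = run_worker()

# Passing the updated value explicitly makes the worker use it.
seen_with_passing = run_worker({"optimizer_name": cfg["optimizer_name"]})

print("worker without explicit cfg:", seen_without_passing)
print("worker with explicit cfg:   ", seen_with_passing)
```

If this matches what happens in the repository, one possible fix would be to pass the processed `cfg` (or at least the optimizer settings) as an argument to each client instead of reading the module-level `cfg` inside `step`. I have not checked how that fits with the rest of the code.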



