Conversation
machine-learning/jax-haiku-dask-dataframe-distributed-example.ipynb
| " df_one_partition = ddf_one_partition.compute()\n", | ||
| " scaled_x = jnp.array(df_one_partition[[\"scaled_x\"]].values)\n", | ||
| " y = jnp.array(df_one_partition[[\"y\"]].values)\n", | ||
| " params, opt_state = update(params, opt_state, scaled_x, y)" |
It might be worth taking a look at some of the functionality in dask-ml, which might do some of these things for you already if you're interested.
cc'ing @stsievert and @TomAugspurger
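For context, a minimal sketch of the kind of feature scaling dask-ml already provides, using `dask_ml.preprocessing.StandardScaler`; the DataFrame and column names here are stand-ins, not the notebook's code:

    # Illustrative only: dask-ml can handle the column scaling that the
    # notebook currently does by hand before training.
    import dask.dataframe as dd
    import pandas as pd
    from dask_ml.preprocessing import StandardScaler

    # Stand-in for the notebook's training DataFrame.
    pdf = pd.DataFrame({"x": [float(i) for i in range(100)]})
    ddf_train = dd.from_pandas(pdf, npartitions=4)

    scaler = StandardScaler()
    # fit_transform accepts a Dask DataFrame and returns one lazily, so the
    # scaled column can feed straight into the per-partition training step.
    scaled = scaler.fit_transform(ddf_train[["x"]])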
| " futures = []\n", | ||
| " for ddf_one_partition in ddf_train.partitions:\n", | ||
| " # Compute the gradients in parallel\n", | ||
| " futures.append(client.submit(dask_compute_grads_one_partition_wrapper, ddf_one_partition, params))\n", |
I recommend instead:

    from dask.distributed import futures_of
    futures = futures_of(df.map_partitions(func, **params).persist())
Thanks, I've tried this, but .map_partitions() requires the mapped function to return a DataFrame or Series (I think?). My function returns a set of gradients, grads, which is a Python dictionary (with more Python dicts nested inside, i.e. a tree-like structure), so I don't think it will work in this case (please correct me if I'm mistaken).
You can probably work around that with to_delayed() instead of map_partitions. I can take a closer look later.
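A hedged sketch of what that to_delayed() workaround might look like, reusing the names from the diff above (`ddf_train`, `dask_compute_grads_one_partition_wrapper`, `params`, `client`); this is an illustration, not the notebook's code:

    import dask

    # One Delayed object per partition; the mapped function can return any
    # Python object (e.g. the dict-of-dicts gradient tree), which is the
    # limitation hit with map_partitions above.
    partition_delayeds = ddf_train.to_delayed()

    grad_tasks = [
        dask.delayed(dask_compute_grads_one_partition_wrapper)(part, params)
        for part in partition_delayeds
    ]

    # Turn the delayed graph into futures running on the cluster.
    # Note: each task receives an in-memory pandas DataFrame, so the wrapper
    # would no longer need its internal .compute() call.
    futures = client.compute(grad_tasks)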
| " # Bring the gradients back to the client, and update the model with the optimizer on the client\n", | ||
| " grads = future.result()\n", | ||
| " updates, opt_state = optimizer.update(grads, opt_state)\n", | ||
| " params = optix.apply_updates(params, updates)" |
This is also the kind of thing for which Actors is probably a decent fit.
Yes, I've been trying to work out how to perform training with shared parameters (and optimizer state) across workers via Actors. I haven't quite got my head around how this might work yet.
This might be a start: https://docs.dask.org/en/latest/futures.html#example-parameter-server
That example doesn't run, maybe a bad merge. I've put in a PR to correct that: dask/dask#6449
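For reference, a minimal sketch of the Actor-based parameter-server pattern discussed above, adapted to this notebook's names (`params`, `opt_state`, `optimizer`, `optix` are borrowed from the diff; the `ParameterServer` class itself is hypothetical, not part of Dask or the notebook):

    from dask.distributed import Client
    from jax.experimental import optix  # as used in the notebook; newer code would use optax

    class ParameterServer:
        """Holds the model parameters and optimizer state on a single worker."""

        def __init__(self, params, opt_state, optimizer):
            self.params = params
            self.opt_state = opt_state
            self.optimizer = optimizer

        def get_params(self):
            return self.params

        def apply_grads(self, grads):
            # Same update step as in the notebook, but applied inside the actor
            # so all workers share one copy of params and opt_state.
            updates, self.opt_state = self.optimizer.update(grads, self.opt_state)
            self.params = optix.apply_updates(self.params, updates)

    client = Client()
    ps = client.submit(ParameterServer, params, opt_state, optimizer, actor=True).result()

    # Workers would then pull parameters, compute gradients on their partition,
    # and push them back, e.g.:
    #   current_params = ps.get_params().result()
    #   ps.apply_grads(grads).result()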
This notebook example is a learning exercise from the SciPy 2020 Dask sprint, exploring how Dask might be used to parallelize JAX/dm-haiku deep learning model training and prediction.
I've committed a notebook that works end-to-end and demonstrates a neural network learning the sine function.
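For readers who want the core idea without the Dask layer, here is a minimal single-machine sketch of a Haiku/JAX network learning the sine function; it uses optax (the successor to the jax.experimental.optix used in the notebook), and all names are illustrative rather than the notebook's:

    import jax
    import jax.numpy as jnp
    import haiku as hk
    import optax

    def net_fn(x):
        # Small MLP mapping x -> sin(x).
        mlp = hk.Sequential([hk.Linear(64), jax.nn.relu, hk.Linear(64), jax.nn.relu, hk.Linear(1)])
        return mlp(x)

    model = hk.without_apply_rng(hk.transform(net_fn))

    x = jnp.linspace(-3.0, 3.0, 256).reshape(-1, 1)
    y = jnp.sin(x)

    params = model.init(jax.random.PRNGKey(42), x)
    optimizer = optax.adam(1e-3)
    opt_state = optimizer.init(params)

    def loss_fn(params, x, y):
        return jnp.mean((model.apply(params, x) - y) ** 2)

    @jax.jit
    def update(params, opt_state, x, y):
        grads = jax.grad(loss_fn)(params, x, y)
        updates, opt_state = optimizer.update(grads, opt_state)
        return optax.apply_updates(params, updates), opt_state

    for step in range(2000):
        params, opt_state = update(params, opt_state, x, y)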