
update recurrent branch with latest BVLC/master#9

Open
dribnet wants to merge 27 commits into jeffdonahue:recurrent from dribnet:recurrent-net-fix

Conversation

@dribnet

@dribnet dribnet commented Apr 27, 2015

Merged in latest BVLC/master to keep BVLC#2033 up to date and mergeable. Resolved seemingly straightforward conflict introduced by BVLC#2370 vs a7eaaf5. Confirmed make runtest completes successfully (659 tests).

longjon and others added 27 commits March 13, 2015 13:12
With layers whose backward passes accumulate gradients, this effectively
decouples the computational batch from the SGD minibatch. Each
iteration accumulates gradients over iter_size batches; the parameters
are then updated once.
(The double-precision implementation follows the NVIDIA developer docs;
the float implementation is already provided by CUDA as "atomicAdd".)
Removed the CPU_ONLY fix introduced in
BVLC#2370 because the surrounding
Net<Dtype>::Update() logic was previously
removed in this feature branch at a7eaaf5.

Merge remote-tracking branch 'jeff/recurrent'

* jeff/recurrent: (26 commits)
  RecurrentLayer bugfix: params still need backprop
  Prototxts + script for training LRCN COCO image captioning model
  Prototxts + script for training COCO caption language model
  Add scripts to create HDF5 datasets from COCO captions
  Add scripts for downloading COCO2014 tools & data
  Add LSTMLayer and LSTMUnitLayer, with tests
  Add RNNLayer, with tests
  Add RecurrentLayer: an abstract superclass for other recurrent layer types
  TestNet fixes for Net weight sharing modifications
  Modifications to Net to facilitate unrolled recurrent networks
  Allow ConcatLayer to take a single bottom Blob (for testing)
  Allow SliceLayer to have a single top Blob (for testing)
  EltwiseLayer with coeff blob GPU kernel
  EltwiseLayer can take a blob of per-num coefficients
  AccuracyLayer: add 'denominator' param
  FlattenLayer fix -- top should always Share* from bottom (and do everything in Reshape)
  Add (very simple version of) ReshapeLayer
  EmbedBackward with no loops -- use caffe_gpu_atomic_add instead
  Add EmbedLayer for inner products with sparse input (one-hot vectors), with unit tests
  test_gradient_check_util: check_bottom < -1 only checks params
  ...

Conflicts:
  src/caffe/net.cpp