Skip to content

Commit 4b9632a

Browse files
committed
Merge remote-tracking branch 'upstream/master' into graphics_fix
2 parents df62ffc + 37502f5 commit 4b9632a

File tree

86 files changed

+820269
-69027
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

86 files changed

+820269
-69027
lines changed

.github/workflows/test.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -31,6 +31,7 @@ jobs:
3131
- {test: "vtr_reg_nightly_test3", cores: "16", options: "", cmake: "", extra_pkgs: ""}
3232
- {test: "vtr_reg_nightly_test4", cores: "16", options: "", cmake: "", extra_pkgs: ""}
3333
- {test: "vtr_reg_nightly_test5", cores: "16", options: "", cmake: "", extra_pkgs: ""}
34+
- {test: "vtr_reg_nightly_test6", cores: "16", options: "", cmake: "-DODIN_USE_YOSYS=ON", extra_pkgs: ""}
3435
- {test: "vtr_reg_strong", cores: "16", options: "", cmake: "-DVTR_ASSERT_LEVEL=3", extra_pkgs: "libeigen3-dev"}
3536
- {test: "vtr_reg_strong", cores: "16", options: "-skip_qor", cmake: "-DVTR_ASSERT_LEVEL=3 -DVTR_ENABLE_SANITIZE=ON", extra_pkgs: "libeigen3-dev"}
3637
- {test: "vtr_reg_yosys", cores: "16", options: "", cmake: "-DWITH_YOSYS=ON -DYOSYS_SV_UHDM_PLUGIN=ON", extra_pkgs: ""}

ODIN_II/regression_test/benchmark/task/koios/synthesis_result.json

Lines changed: 25802 additions & 13897 deletions
Large diffs are not rendered by default.

ODIN_II/regression_test/benchmark/task/koios/task.conf

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,8 @@ circuits_dir=../../../../vtr_flow/benchmarks/verilog/koios
2424
#-------------------------------------------------------
2525
# specify the benchmarks
2626
#-------------------------------------------------------
27-
circuit_list_add=tpu_like.small.v
27+
circuit_list_add=tpu_like.small.os.v
28+
circuit_list_add=tpu_like.small.ws.v
2829
circuit_list_add=dla_like.small.v
2930
circuit_list_add=bnn.v
3031
circuit_list_add=attention_layer.v
@@ -45,10 +46,10 @@ includes_dir=../../../../vtr_flow/benchmarks/verilog/koios
4546
#-------------------------------------------------------
4647
# specify the include files
4748
#-------------------------------------------------------
48-
# Some benchmarks instantiate complex dsp blocks to implement features
49-
# like native floating point math, cascade chains, etc. This functionality
50-
# is guarded under the `complex_dsp` macro. The complex_dsp_include.v file
51-
# defines this macro, thereby enabling instantiations of the complex dsp.
52-
include_list_add=complex_dsp_include.v
49+
# Some benchmarks instantiate hard dsp and memory blocks
50+
# This functionality is guarded under the `complex_dsp` and `hard_mem` macros.
51+
# The hard_block_include.v file
52+
# defines this macros, thereby enabling instantiations of the hard blocks
53+
include_list_add=hard_block_include.v
5354

5455
synthesis_parse_file=regression_test/parse_result/conf/synth.toml

ODIN_II/regression_test/benchmark/task/yosys+odin/koios/synthesis_result.json

Lines changed: 217 additions & 974 deletions
Large diffs are not rendered by default.

ODIN_II/regression_test/benchmark/task/yosys+odin/koios/task.conf

Lines changed: 13 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -25,18 +25,20 @@ circuits_dir=../vtr_flow/benchmarks/verilog/koios
2525
#-------------------------------------------------------
2626
# specify the benchmarks
2727
#-------------------------------------------------------
28-
circuit_list_add=tpu_like.small.v
28+
circuit_list_add=tpu_like.small.os.v
29+
circuit_list_add=tpu_like.small.ws.v
2930
circuit_list_add=dla_like.small.v
3031
circuit_list_add=bnn.v
3132
circuit_list_add=attention_layer.v
3233
circuit_list_add=conv_layer_hls.v
3334
circuit_list_add=conv_layer.v
3435
circuit_list_add=gemm_layer.v
35-
circuit_list_add=eltwise_layer.v
36-
circuit_list_add=robot_rl.v
37-
circuit_list_add=reduction_layer.v
38-
circuit_list_add=spmv.v
39-
circuit_list_add=softmax.v
36+
circuit_list_add=bwave_like.fixed.small.v
37+
circuit_list_add=bwave_like.fixed.large.v
38+
circuit_list_add=bwave_like.float.small.v
39+
circuit_list_add=bwave_like.float.large.v
40+
circuit_list_add=proxy.5.v
41+
circuit_list_add=proxy.8.v
4042

4143
#-------------------------------------------------------
4244
# specify the directory to look for include file in
@@ -46,10 +48,10 @@ includes_dir=../vtr_flow/benchmarks/verilog/koios
4648
#-------------------------------------------------------
4749
# specify the include files
4850
#-------------------------------------------------------
49-
# Some benchmarks instantiate complex dsp blocks to implement features
50-
# like native floating point math, cascade chains, etc. This functionality
51-
# is guarded under the `complex_dsp` macro. The complex_dsp_include.v file
52-
# defines this macro, thereby enabling instantiations of the complex dsp.
53-
include_list_add=complex_dsp_include.v
51+
# Some benchmarks instantiate hard dsp and memory blocks
52+
# This functionality is guarded under the `complex_dsp` and `hard_mem` macros.
53+
# The hard_block_include.v file
54+
# defines this macros, thereby enabling instantiations of the hard blocks
55+
include_list_add=hard_block_include.v
5456

5557
synthesis_parse_file=regression_test/parse_result/conf/synth.toml

doc/src/vtr/benchmarks.rst

Lines changed: 12 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -64,11 +64,11 @@ The Titan benchmarks are suitable for large-scale FPGA CAD research, and FPGA ar
6464

6565
.. seealso:: :ref:`titan_benchmarks_tutorial`
6666

67-
Koios Benchmarks
67+
Koios 2.0 Benchmarks
6868
-----------------
6969
The Koios benchmarks :cite:`koios_benchmarks` are a set of Deep Learning (DL) benchmarks.
7070
They are suitable for DL related architecture and CAD research.
71-
There are 19 designs that include several medium-sized benchmarks and some large benchmarks.
71+
There are 40 designs that include several medium-sized benchmarks and some large benchmarks.
7272
The designs target different network types (CNNs, RNNs, MLPs, RL) and layer types (fully-connected, convolution, activation, softmax, reduction, eltwise).
7373
Some of the designs are generated from HLS tools as well.
7474
These designs use many precisions including binary, different fixed point types int8/16/32, brain floating point (bfloat16), and IEEE half-precision floating point (fp16).
@@ -80,21 +80,26 @@ These designs use many precisions including binary, different fixed point types
8080
================= ======================================
8181
Benchmark Description
8282
================= ======================================
83-
clstm_like CLSTM-like accelerator
8483
dla_like Intel-DLA-like accelerator
84+
clstm_like CLSTM-like accelerator
85+
deepfreeze ARM FixyNN design
86+
tdarknet_like Accelerator for Tiny Darknet
87+
bwave_like Microsoft-Brainwave-like design
8588
lstm LSTM engine
86-
tpu_like Google-TPU-v1-like accelerator
8789
bnn 4-layer binary neural network
88-
tiny_darknet_like Accelerator for Tiny Darknet
90+
lenet Accelerator for LeNet-5
91+
dnnweaver DNNWeaver accelerator
92+
tpu_like Google-TPU-v1-like accelerator
8993
gemm_layer 20x20 matrix multiplication engine
9094
attention_layer Transformer self-attention layer
9195
conv_layer GEMM based convolution
92-
spmv Sparse matrix vector multiplication
9396
robot_rl Robot+maze application
9497
reduction_layer Add/max/min reduction tree
98+
spmv Sparse matrix vector multiplication
99+
eltwise_layer Matrix elementwise add/sub/mult
95100
softmax Softmax classification layer
96101
conv_layer_hls Sliding window convolution
97-
eltwise_layer Matrix elementwise add/sub/mult
102+
proxy Proxy/synthetic benchmarks
98103
================= ======================================
99104

100105
The VTR benchmarks are provided as Verilog (enabling full flexibility to modify and change how the designs are implemented) under: ::

0 commit comments

Comments
 (0)