Numa patch by YWHyuk · Pull Request #3 · linuxgeek-Inc/GiantVM

YWHyuk · 2022-07-12T07:58:45Z

Revert highly contended benchmark. This patch is experimental feature.

Plus, Add list benchmark infrastructure.

CC-lock is based on flat-lock combining algorithm. In this lock, only one thread, called combiner thread the request of critical section. So, combiner thread can exploit locality and aviod high contetetion between lock varible. When each cpu use only one node, let's assume lock hold node A. In this case, node A's (wait, completed) status should be (false, false). Lock | A When A,B cpu race occured, Let's assume that B is win. Then, B will try to spin on A's wait Status. A -> B w:F w:T At the same time, A was enqueued. So, A's wait status was set to True like below. A -> B -> A w:T w:T w:T This lead to deadlock. To avoid above node-reusing problem, each cpu has two cc_node. Those node are used alternately. A_0 -> B_0 -> A_1 w:f w:T w:T Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Test reported that there is a deadlock. Situation are below. Node(0, 1) { req = 00000000d0495726, params = 000000002f36f5ac, wait = 0, completed = 1, refcount = 0, Next (2, 0) Prev (0, 0) } Node(2, 0) { req = 00000000d0495726, params = 000000002f36f5ac, wait = 1, completed = 0, refcount = 0, Next (2, 1) Prev (0, 1) } Node (0, 1)'s request are handled. So, it wait, completed status are (0, 1). But, it's next node Node(2, 0)'s wait are still 1. The combiner thread should set Node(2, 0) wait = 0. Previous logic set wait = 0, when DECODE_CPU(pending_cpu) != NR_CPUS. But there can be race between combiner thread and normal thread. In the combiner thread it check node->req first, then it check node->next. So there could be a situation below Node(0, 1) Node(2, 0) prev->req = req if(pending->req) ... DECODE_CPU(pending->next) prev->next = this_cpu To fix this, combiner thread check node->next first. Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Previous, test thread used jiffes to measure the spent time. But, it's resolution is low. So all the results are zero or one. So use sched_clock. Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

To keep order of reading node->next and writing of node->wait, node->completed, smp_mb should be used instead of smp_mb(). So fix it Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Using the "echo 2 > trigger", spinlock based benchmark can be run. Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

This script provide measurement result parsing and plotting features.

To optmize, enable debug code when DEBUG is defined Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

To reduce inter-node traffic, add delay when it fail to get global lock. The time of delay is a value from experimental result. Signed-off-by: wonhyuk yang <vvghjk1234@gmail.com>

This reverts commit d4ee7b4.

Add new benchmark that measure the time of list operation.

YWHyuk added 23 commits April 28, 2022 19:24

Use sched_clock to measure time with high-resolution

7d9bd92

Previous, test thread used jiffes to measure the spent time. But, it's resolution is low. So all the results are zero or one. So use sched_clock. Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Change smp_rmb to smp_mb

be59754

To keep order of reading node->next and writing of node->wait, node->completed, smp_mb should be used instead of smp_mb(). So fix it Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Make quit file to trigger reinit cc-lock

6568b49

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Add spinlock benchmark

aac3ab2

Using the "echo 2 > trigger", spinlock based benchmark can be run. Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Add dmesg result parser

12a2817

This script provide measurement result parsing and plotting features.

Seperate debug code with ifdef macro

a0c9d0c

To optmize, enable debug code when DEBUG is defined Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Introduce control file

0e1ae62

WIP

34b45cb

Call printk after measurement done

579fe25

Add helper script to run cclock, spinlock benchmark

9cb9ab0

Introuduce more fair start

2a9e2c4

Integrate wait/complete flag to optmize

fdb6460

tune bench

8d532e9

Add benchmark ready debubfs for polling

f884e0a

Signed-off-by: Wonhyuk Yang <vvghjk1234@gmail.com>

Optimize data structure by aligning 4KB

026c44c

Add per node cclock

af20905

Fix NUMA awareness cclock

fd53cc9

To reduce inter-node traffic, add delay when it fail to get global lock. The time of delay is a value from experimental result. Signed-off-by: wonhyuk yang <vvghjk1234@gmail.com>

Change benchmark to mearsure highly contended cost

d4ee7b4

Revert "Change benchmark to mearsure highly contended cost"

db7ee66

This reverts commit d4ee7b4.

Add list benchmark with helper script

de45fa3

Add new benchmark that measure the time of list operation.

Rename benchmark function

a822d97

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Numa patch#3

Numa patch#3
YWHyuk wants to merge 23 commits intolinuxgeek-Inc:masterfrom
YWHyuk:NUMA

YWHyuk commented Jul 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

YWHyuk commented Jul 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant