CoCalc -- litmus-tests.txt

GitHub Repository: torvalds/linux
Path: blob/master/tools/memory-model/Documentation/litmus-tests.txt
²⁶²⁸⁸ views
1
Linux-Kernel Memory Model Litmus Tests
2
======================================
3

4
This file describes the LKMM litmus-test format by example, describes
5
some tricks and traps, and finally outlines LKMM's limitations.  Earlier
6
versions of this material appeared in a number of LWN articles, including:
7

8
https://lwn.net/Articles/720550/
9
	A formal kernel memory-ordering model (part 2)
10
https://lwn.net/Articles/608550/
11
	Axiomatic validation of memory barriers and atomic instructions
12
https://lwn.net/Articles/470681/
13
	Validating Memory Barriers and Atomic Instructions
14

15
This document presents information in decreasing order of applicability,
16
so that, where possible, the information that has proven more commonly
17
useful is shown near the beginning.
18

19
For information on installing LKMM, including the underlying "herd7"
20
tool, please see tools/memory-model/README.
21

22

23
Copy-Pasta
24
==========
25

26
As with other software, it is often better (if less macho) to adapt an
27
existing litmus test than it is to create one from scratch.  A number
28
of litmus tests may be found in the kernel source tree:
29

30
	tools/memory-model/litmus-tests/
31
	Documentation/litmus-tests/
32

33
Several thousand more example litmus tests are available on github
34
and kernel.org:
35

36
	https://github.com/paulmckrcu/litmus
37
	https://git.kernel.org/pub/scm/linux/kernel/git/paulmck/perfbook.git/tree/CodeSamples/formal/herd
38
	https://git.kernel.org/pub/scm/linux/kernel/git/paulmck/perfbook.git/tree/CodeSamples/formal/litmus
39

40
The -l and -L arguments to "git grep" can be quite helpful in identifying
41
existing litmus tests that are similar to the one you need.  But even if
42
you start with an existing litmus test, it is still helpful to have a
43
good understanding of the litmus-test format.
44

45

46
Examples and Format
47
===================
48

49
This section describes the overall format of litmus tests, starting
50
with a small example of the message-passing pattern and moving on to
51
more complex examples that illustrate explicit initialization and LKMM's
52
minimalistic set of flow-control statements.
53

54

55
Message-Passing Example
56
-----------------------
57

58
This section gives an overview of the format of a litmus test using an
59
example based on the common message-passing use case.  This use case
60
appears often in the Linux kernel.  For example, a flag (modeled by "y"
61
below) indicates that a buffer (modeled by "x" below) is now completely
62
filled in and ready for use.  It would be very bad if the consumer saw the
63
flag set, but, due to memory misordering, saw old values in the buffer.
64

65
This example asks whether smp_store_release() and smp_load_acquire()
66
suffices to avoid this bad outcome:
67

68
 1 C MP+pooncerelease+poacquireonce
69
 2
70
 3 {}
71
 4
72
 5 P0(int *x, int *y)
73
 6 {
74
 7   WRITE_ONCE(*x, 1);
75
 8   smp_store_release(y, 1);
76
 9 }
77
10
78
11 P1(int *x, int *y)
79
12 {
80
13   int r0;
81
14   int r1;
82
15
83
16   r0 = smp_load_acquire(y);
84
17   r1 = READ_ONCE(*x);
85
18 }
86
19
87
20 exists (1:r0=1 /\ 1:r1=0)
88

89
Line 1 starts with "C", which identifies this file as being in the
90
LKMM C-language format (which, as we will see, is a small fragment
91
of the full C language).  The remainder of line 1 is the name of
92
the test, which by convention is the filename with the ".litmus"
93
suffix stripped.  In this case, the actual test may be found in
94
tools/memory-model/litmus-tests/MP+pooncerelease+poacquireonce.litmus
95
in the Linux-kernel source tree.
96

97
Mechanically generated litmus tests will often have an optional
98
double-quoted comment string on the second line.  Such strings are ignored
99
when running the test.  Yes, you can add your own comments to litmus
100
tests, but this is a bit involved due to the use of multiple parsers.
101
For now, you can use C-language comments in the C code, and these comments
102
may be in either the "/* */" or the "//" style.  A later section will
103
cover the full litmus-test commenting story.
104

105
Line 3 is the initialization section.  Because the default initialization
106
to zero suffices for this test, the "{}" syntax is used, which mean the
107
initialization section is empty.  Litmus tests requiring non-default
108
initialization must have non-empty initialization sections, as in the
109
example that will be presented later in this document.
110

111
Lines 5-9 show the first process and lines 11-18 the second process.  Each
112
process corresponds to a Linux-kernel task (or kthread, workqueue, thread,
113
and so on; LKMM discussions often use these terms interchangeably).
114
The name of the first process is "P0" and that of the second "P1".
115
You can name your processes anything you like as long as the names consist
116
of a single "P" followed by a number, and as long as the numbers are
117
consecutive starting with zero.  This can actually be quite helpful,
118
for example, a .litmus file matching "^P1(" but not matching "^P2("
119
must contain a two-process litmus test.
120

121
The argument list for each function are pointers to the global variables
122
used by that function.  Unlike normal C-language function parameters, the
123
names are significant.  The fact that both P0() and P1() have a formal
124
parameter named "x" means that these two processes are working with the
125
same global variable, also named "x".  So the "int *x, int *y" on P0()
126
and P1() mean that both processes are working with two shared global
127
variables, "x" and "y".  Global variables are always passed to processes
128
by reference, hence "P0(int *x, int *y)", but *never* "P0(int x, int y)".
129

130
P0() has no local variables, but P1() has two of them named "r0" and "r1".
131
These names may be freely chosen, but for historical reasons stemming from
132
other litmus-test formats, it is conventional to use names consisting of
133
"r" followed by a number as shown here.  A common bug in litmus tests
134
is forgetting to add a global variable to a process's parameter list.
135
This will sometimes result in an error message, but can also cause the
136
intended global to instead be silently treated as an undeclared local
137
variable.
138

139
Each process's code is similar to Linux-kernel C, as can be seen on lines
140
7-8 and 13-17.  This code may use many of the Linux kernel's atomic
141
operations, some of its exclusive-lock functions, and some of its RCU
142
and SRCU functions.  An approximate list of the currently supported
143
functions may be found in the linux-kernel.def file.
144

145
The P0() process does "WRITE_ONCE(*x, 1)" on line 7.  Because "x" is a
146
pointer in P0()'s parameter list, this does an unordered store to global
147
variable "x".  Line 8 does "smp_store_release(y, 1)", and because "y"
148
is also in P0()'s parameter list, this does a release store to global
149
variable "y".
150

151
The P1() process declares two local variables on lines 13 and 14.
152
Line 16 does "r0 = smp_load_acquire(y)" which does an acquire load
153
from global variable "y" into local variable "r0".  Line 17 does a
154
"r1 = READ_ONCE(*x)", which does an unordered load from "*x" into local
155
variable "r1".  Both "x" and "y" are in P1()'s parameter list, so both
156
reference the same global variables that are used by P0().
157

158
Line 20 is the "exists" assertion expression to evaluate the final state.
159
This final state is evaluated after the dust has settled: both processes
160
have completed and all of their memory references and memory barriers
161
have propagated to all parts of the system.  The references to the local
162
variables "r0" and "r1" in line 24 must be prefixed with "1:" to specify
163
which process they are local to.
164

165
Note that the assertion expression is written in the litmus-test
166
language rather than in C.  For example, single "=" is an equality
167
operator rather than an assignment.  The "/\" character combination means
168
"and".  Similarly, "\/" stands for "or".  Both of these are ASCII-art
169
representations of the corresponding mathematical symbols.  Finally,
170
"~" stands for "logical not", which is "!" in C, and not to be confused
171
with the C-language "~" operator which instead stands for "bitwise not".
172
Parentheses may be used to override precedence.
173

174
The "exists" assertion on line 20 is satisfied if the consumer sees the
175
flag ("y") set but the buffer ("x") as not yet filled in, that is, if P1()
176
loaded a value from "x" that was equal to 1 but loaded a value from "y"
177
that was still equal to zero.
178

179
This example can be checked by running the following command, which
180
absolutely must be run from the tools/memory-model directory and from
181
this directory only:
182

183
herd7 -conf linux-kernel.cfg litmus-tests/MP+pooncerelease+poacquireonce.litmus
184

185
The output is the result of something similar to a full state-space
186
search, and is as follows:
187

188
 1 Test MP+pooncerelease+poacquireonce Allowed
189
 2 States 3
190
 3 1:r0=0; 1:r1=0;
191
 4 1:r0=0; 1:r1=1;
192
 5 1:r0=1; 1:r1=1;
193
 6 No
194
 7 Witnesses
195
 8 Positive: 0 Negative: 3
196
 9 Condition exists (1:r0=1 /\ 1:r1=0)
197
10 Observation MP+pooncerelease+poacquireonce Never 0 3
198
11 Time MP+pooncerelease+poacquireonce 0.00
199
12 Hash=579aaa14d8c35a39429b02e698241d09
200

201
The most pertinent line is line 10, which contains "Never 0 3", which
202
indicates that the bad result flagged by the "exists" clause never
203
happens.  This line might instead say "Sometimes" to indicate that the
204
bad result happened in some but not all executions, or it might say
205
"Always" to indicate that the bad result happened in all executions.
206
(The herd7 tool doesn't judge, so it is only an LKMM convention that the
207
"exists" clause indicates a bad result.  To see this, invert the "exists"
208
clause's condition and run the test.)  The numbers ("0 3") at the end
209
of this line indicate the number of end states satisfying the "exists"
210
clause (0) and the number not not satisfying that clause (3).
211

212
Another important part of this output is shown in lines 2-5, repeated here:
213

214
 2 States 3
215
 3 1:r0=0; 1:r1=0;
216
 4 1:r0=0; 1:r1=1;
217
 5 1:r0=1; 1:r1=1;
218

219
Line 2 gives the total number of end states, and each of lines 3-5 list
220
one of these states, with the first ("1:r0=0; 1:r1=0;") indicating that
221
both of P1()'s loads returned the value "0".  As expected, given the
222
"Never" on line 10, the state flagged by the "exists" clause is not
223
listed.  This full list of states can be helpful when debugging a new
224
litmus test.
225

226
The rest of the output is not normally needed, either due to irrelevance
227
or due to being redundant with the lines discussed above.  However, the
228
following paragraph lists them for the benefit of readers possessed of
229
an insatiable curiosity.  Other readers should feel free to skip ahead.
230

231
Line 1 echos the test name, along with the "Test" and "Allowed".  Line 6's
232
"No" says that the "exists" clause was not satisfied by any execution,
233
and as such it has the same meaning as line 10's "Never".  Line 7 is a
234
lead-in to line 8's "Positive: 0 Negative: 3", which lists the number
235
of end states satisfying and not satisfying the "exists" clause, just
236
like the two numbers at the end of line 10.  Line 9 repeats the "exists"
237
clause so that you don't have to look it up in the litmus-test file.
238
The number at the end of line 11 (which begins with "Time") gives the
239
time in seconds required to analyze the litmus test.  Small tests such
240
as this one complete in a few milliseconds, so "0.00" is quite common.
241
Line 12 gives a hash of the contents for the litmus-test file, and is used
242
by tooling that manages litmus tests and their output.  This tooling is
243
used by people modifying LKMM itself, and among other things lets such
244
people know which of the several thousand relevant litmus tests were
245
affected by a given change to LKMM.
246

247

248
Initialization
249
--------------
250

251
The previous example relied on the default zero initialization for
252
"x" and "y", but a similar litmus test could instead initialize them
253
to some other value:
254

255
 1 C MP+pooncerelease+poacquireonce
256
 2
257
 3 {
258
 4   x=42;
259
 5   y=42;
260
 6 }
261
 7
262
 8 P0(int *x, int *y)
263
 9 {
264
10   WRITE_ONCE(*x, 1);
265
11   smp_store_release(y, 1);
266
12 }
267
13
268
14 P1(int *x, int *y)
269
15 {
270
16   int r0;
271
17   int r1;
272
18
273
19   r0 = smp_load_acquire(y);
274
20   r1 = READ_ONCE(*x);
275
21 }
276
22
277
23 exists (1:r0=1 /\ 1:r1=42)
278

279
Lines 3-6 now initialize both "x" and "y" to the value 42.  This also
280
means that the "exists" clause on line 23 must change "1:r1=0" to
281
"1:r1=42".
282

283
Running the test gives the same overall result as before, but with the
284
value 42 appearing in place of the value zero:
285

286
 1 Test MP+pooncerelease+poacquireonce Allowed
287
 2 States 3
288
 3 1:r0=1; 1:r1=1;
289
 4 1:r0=42; 1:r1=1;
290
 5 1:r0=42; 1:r1=42;
291
 6 No
292
 7 Witnesses
293
 8 Positive: 0 Negative: 3
294
 9 Condition exists (1:r0=1 /\ 1:r1=42)
295
10 Observation MP+pooncerelease+poacquireonce Never 0 3
296
11 Time MP+pooncerelease+poacquireonce 0.02
297
12 Hash=ab9a9b7940a75a792266be279a980156
298

299
It is tempting to avoid the open-coded repetitions of the value "42"
300
by defining another global variable "initval=42" and replacing all
301
occurrences of "42" with "initval".  This will not, repeat *not*,
302
initialize "x" and "y" to 42, but instead to the address of "initval"
303
(try it!).  See the section below on linked lists to learn more about
304
why this approach to initialization can be useful.
305

306

307
Control Structures
308
------------------
309

310
LKMM supports the C-language "if" statement, which allows modeling of
311
conditional branches.  In LKMM, conditional branches can affect ordering,
312
but only if you are *very* careful (compilers are surprisingly able
313
to optimize away conditional branches).  The following example shows
314
the "load buffering" (LB) use case that is used in the Linux kernel to
315
synchronize between ring-buffer producers and consumers.  In the example
316
below, P0() is one side checking to see if an operation may proceed and
317
P1() is the other side completing its update.
318

319
 1 C LB+fencembonceonce+ctrlonceonce
320
 2
321
 3 {}
322
 4
323
 5 P0(int *x, int *y)
324
 6 {
325
 7   int r0;
326
 8
327
 9   r0 = READ_ONCE(*x);
328
10   if (r0)
329
11     WRITE_ONCE(*y, 1);
330
12 }
331
13
332
14 P1(int *x, int *y)
333
15 {
334
16   int r0;
335
17
336
18   r0 = READ_ONCE(*y);
337
19   smp_mb();
338
20   WRITE_ONCE(*x, 1);
339
21 }
340
22
341
23 exists (0:r0=1 /\ 1:r0=1)
342

343
P1()'s "if" statement on line 10 works as expected, so that line 11 is
344
executed only if line 9 loads a non-zero value from "x".  Because P1()'s
345
write of "1" to "x" happens only after P1()'s read from "y", one would
346
hope that the "exists" clause cannot be satisfied.  LKMM agrees:
347

348
 1 Test LB+fencembonceonce+ctrlonceonce Allowed
349
 2 States 2
350
 3 0:r0=0; 1:r0=0;
351
 4 0:r0=1; 1:r0=0;
352
 5 No
353
 6 Witnesses
354
 7 Positive: 0 Negative: 2
355
 8 Condition exists (0:r0=1 /\ 1:r0=1)
356
 9 Observation LB+fencembonceonce+ctrlonceonce Never 0 2
357
10 Time LB+fencembonceonce+ctrlonceonce 0.00
358
11 Hash=e5260556f6de495fd39b556d1b831c3b
359

360
However, there is no "while" statement due to the fact that full
361
state-space search has some difficulty with iteration.  However, there
362
are tricks that may be used to handle some special cases, which are
363
discussed below.  In addition, loop-unrolling tricks may be applied,
364
albeit sparingly.
365

366

367
Tricks and Traps
368
================
369

370
This section covers extracting debug output from herd7, emulating
371
spin loops, handling trivial linked lists, adding comments to litmus tests,
372
emulating call_rcu(), and finally tricks to improve herd7 performance
373
in order to better handle large litmus tests.
374

375

376
Debug Output
377
------------
378

379
By default, the herd7 state output includes all variables mentioned
380
in the "exists" clause.  But sometimes debugging efforts are greatly
381
aided by the values of other variables.  Consider this litmus test
382
(tools/memory-order/litmus-tests/SB+rfionceonce-poonceonces.litmus but
383
slightly modified), which probes an obscure corner of hardware memory
384
ordering:
385

386
 1 C SB+rfionceonce-poonceonces
387
 2
388
 3 {}
389
 4
390
 5 P0(int *x, int *y)
391
 6 {
392
 7   int r1;
393
 8   int r2;
394
 9
395
10   WRITE_ONCE(*x, 1);
396
11   r1 = READ_ONCE(*x);
397
12   r2 = READ_ONCE(*y);
398
13 }
399
14
400
15 P1(int *x, int *y)
401
16 {
402
17   int r3;
403
18   int r4;
404
19
405
20   WRITE_ONCE(*y, 1);
406
21   r3 = READ_ONCE(*y);
407
22   r4 = READ_ONCE(*x);
408
23 }
409
24
410
25 exists (0:r2=0 /\ 1:r4=0)
411

412
The herd7 output is as follows:
413

414
 1 Test SB+rfionceonce-poonceonces Allowed
415
 2 States 4
416
 3 0:r2=0; 1:r4=0;
417
 4 0:r2=0; 1:r4=1;
418
 5 0:r2=1; 1:r4=0;
419
 6 0:r2=1; 1:r4=1;
420
 7 Ok
421
 8 Witnesses
422
 9 Positive: 1 Negative: 3
423
10 Condition exists (0:r2=0 /\ 1:r4=0)
424
11 Observation SB+rfionceonce-poonceonces Sometimes 1 3
425
12 Time SB+rfionceonce-poonceonces 0.01
426
13 Hash=c7f30fe0faebb7d565405d55b7318ada
427

428
(This output indicates that CPUs are permitted to "snoop their own
429
store buffers", which all of Linux's CPU families other than s390 will
430
happily do.  Such snooping results in disagreement among CPUs on the
431
order of stores from different CPUs, which is rarely an issue.)
432

433
But the herd7 output shows only the two variables mentioned in the
434
"exists" clause.  Someone modifying this test might wish to know the
435
values of "x", "y", "0:r1", and "0:r3" as well.  The "locations"
436
statement on line 25 shows how to cause herd7 to display additional
437
variables:
438

439
 1 C SB+rfionceonce-poonceonces
440
 2
441
 3 {}
442
 4
443
 5 P0(int *x, int *y)
444
 6 {
445
 7   int r1;
446
 8   int r2;
447
 9
448
10   WRITE_ONCE(*x, 1);
449
11   r1 = READ_ONCE(*x);
450
12   r2 = READ_ONCE(*y);
451
13 }
452
14
453
15 P1(int *x, int *y)
454
16 {
455
17   int r3;
456
18   int r4;
457
19
458
20   WRITE_ONCE(*y, 1);
459
21   r3 = READ_ONCE(*y);
460
22   r4 = READ_ONCE(*x);
461
23 }
462
24
463
25 locations [0:r1; 1:r3; x; y]
464
26 exists (0:r2=0 /\ 1:r4=0)
465

466
The herd7 output then displays the values of all the variables:
467

468
 1 Test SB+rfionceonce-poonceonces Allowed
469
 2 States 4
470
 3 0:r1=1; 0:r2=0; 1:r3=1; 1:r4=0; x=1; y=1;
471
 4 0:r1=1; 0:r2=0; 1:r3=1; 1:r4=1; x=1; y=1;
472
 5 0:r1=1; 0:r2=1; 1:r3=1; 1:r4=0; x=1; y=1;
473
 6 0:r1=1; 0:r2=1; 1:r3=1; 1:r4=1; x=1; y=1;
474
 7 Ok
475
 8 Witnesses
476
 9 Positive: 1 Negative: 3
477
10 Condition exists (0:r2=0 /\ 1:r4=0)
478
11 Observation SB+rfionceonce-poonceonces Sometimes 1 3
479
12 Time SB+rfionceonce-poonceonces 0.01
480
13 Hash=40de8418c4b395388f6501cafd1ed38d
481

482
What if you would like to know the value of a particular global variable
483
at some particular point in a given process's execution?  One approach
484
is to use a READ_ONCE() to load that global variable into a new local
485
variable, then add that local variable to the "locations" clause.
486
But be careful:  In some litmus tests, adding a READ_ONCE() will change
487
the outcome!  For one example, please see the C-READ_ONCE.litmus and
488
C-READ_ONCE-omitted.litmus tests located here:
489

490
	https://github.com/paulmckrcu/litmus/blob/master/manual/kernel/
491

492

493
Spin Loops
494
----------
495

496
The analysis carried out by herd7 explores full state space, which is
497
at best of exponential time complexity.  Adding processes and increasing
498
the amount of code in a give process can greatly increase execution time.
499
Potentially infinite loops, such as those used to wait for locks to
500
become available, are clearly problematic.
501

502
Fortunately, it is possible to avoid state-space explosion by specially
503
modeling such loops.  For example, the following litmus tests emulates
504
locking using xchg_acquire(), but instead of enclosing xchg_acquire()
505
in a spin loop, it instead excludes executions that fail to acquire the
506
lock using a herd7 "filter" clause.  Note that for exclusive locking, you
507
are better off using the spin_lock() and spin_unlock() that LKMM directly
508
models, if for no other reason that these are much faster.  However, the
509
techniques illustrated in this section can be used for other purposes,
510
such as emulating reader-writer locking, which LKMM does not yet model.
511

512
 1 C C-SB+l-o-o-u+l-o-o-u-X
513
 2
514
 3 {
515
 4 }
516
 5
517
 6 P0(int *sl, int *x0, int *x1)
518
 7 {
519
 8   int r2;
520
 9   int r1;
521
10
522
11   r2 = xchg_acquire(sl, 1);
523
12   WRITE_ONCE(*x0, 1);
524
13   r1 = READ_ONCE(*x1);
525
14   smp_store_release(sl, 0);
526
15 }
527
16
528
17 P1(int *sl, int *x0, int *x1)
529
18 {
530
19   int r2;
531
20   int r1;
532
21
533
22   r2 = xchg_acquire(sl, 1);
534
23   WRITE_ONCE(*x1, 1);
535
24   r1 = READ_ONCE(*x0);
536
25   smp_store_release(sl, 0);
537
26 }
538
27
539
28 filter (0:r2=0 /\ 1:r2=0)
540
29 exists (0:r1=0 /\ 1:r1=0)
541

542
This litmus test may be found here:
543

544
https://git.kernel.org/pub/scm/linux/kernel/git/paulmck/perfbook.git/tree/CodeSamples/formal/herd/C-SB+l-o-o-u+l-o-o-u-X.litmus
545

546
This test uses two global variables, "x1" and "x2", and also emulates a
547
single global spinlock named "sl".  This spinlock is held by whichever
548
process changes the value of "sl" from "0" to "1", and is released when
549
that process sets "sl" back to "0".  P0()'s lock acquisition is emulated
550
on line 11 using xchg_acquire(), which unconditionally stores the value
551
"1" to "sl" and stores either "0" or "1" to "r2", depending on whether
552
the lock acquisition was successful or unsuccessful (due to "sl" already
553
having the value "1"), respectively.  P1() operates in a similar manner.
554

555
Rather unconventionally, execution appears to proceed to the critical
556
section on lines 12 and 13 in either case.  Line 14 then uses an
557
smp_store_release() to store zero to "sl", thus emulating lock release.
558

559
The case where xchg_acquire() fails to acquire the lock is handled by
560
the "filter" clause on line 28, which tells herd7 to keep only those
561
executions in which both "0:r2" and "1:r2" are zero, that is to pay
562
attention only to those executions in which both locks are actually
563
acquired.  Thus, the bogus executions that would execute the critical
564
sections are discarded and any effects that they might have had are
565
ignored.  Note well that the "filter" clause keeps those executions
566
for which its expression is satisfied, that is, for which the expression
567
evaluates to true.  In other words, the "filter" clause says what to
568
keep, not what to discard.
569

570
The result of running this test is as follows:
571

572
 1 Test C-SB+l-o-o-u+l-o-o-u-X Allowed
573
 2 States 2
574
 3 0:r1=0; 1:r1=1;
575
 4 0:r1=1; 1:r1=0;
576
 5 No
577
 6 Witnesses
578
 7 Positive: 0 Negative: 2
579
 8 Condition exists (0:r1=0 /\ 1:r1=0)
580
 9 Observation C-SB+l-o-o-u+l-o-o-u-X Never 0 2
581
10 Time C-SB+l-o-o-u+l-o-o-u-X 0.03
582

583
The "Never" on line 9 indicates that this use of xchg_acquire() and
584
smp_store_release() really does correctly emulate locking.
585

586
Why doesn't the litmus test take the simpler approach of using a spin loop
587
to handle failed spinlock acquisitions, like the kernel does?  The key
588
insight behind this litmus test is that spin loops have no effect on the
589
possible "exists"-clause outcomes of program execution in the absence
590
of deadlock.  In other words, given a high-quality lock-acquisition
591
primitive in a deadlock-free program running on high-quality hardware,
592
each lock acquisition will eventually succeed.  Because herd7 already
593
explores the full state space, the length of time required to actually
594
acquire the lock does not matter.  After all, herd7 already models all
595
possible durations of the xchg_acquire() statements.
596

597
Why not just add the "filter" clause to the "exists" clause, thus
598
avoiding the "filter" clause entirely?  This does work, but is slower.
599
The reason that the "filter" clause is faster is that (in the common case)
600
herd7 knows to abandon an execution as soon as the "filter" expression
601
fails to be satisfied.  In contrast, the "exists" clause is evaluated
602
only at the end of time, thus requiring herd7 to waste time on bogus
603
executions in which both critical sections proceed concurrently.  In
604
addition, some LKMM users like the separation of concerns provided by
605
using the both the "filter" and "exists" clauses.
606

607
Readers lacking a pathological interest in odd corner cases should feel
608
free to skip the remainder of this section.
609

610
But what if the litmus test were to temporarily set "0:r2" to a non-zero
611
value?  Wouldn't that cause herd7 to abandon the execution prematurely
612
due to an early mismatch of the "filter" clause?
613

614
Why not just try it?  Line 4 of the following modified litmus test
615
introduces a new global variable "x2" that is initialized to "1".  Line 23
616
of P1() reads that variable into "1:r2" to force an early mismatch with
617
the "filter" clause.  Line 24 does a known-true "if" condition to avoid
618
and static analysis that herd7 might do.  Finally the "exists" clause
619
on line 32 is updated to a condition that is alway satisfied at the end
620
of the test.
621

622
 1 C C-SB+l-o-o-u+l-o-o-u-X
623
 2
624
 3 {
625
 4   x2=1;
626
 5 }
627
 6
628
 7 P0(int *sl, int *x0, int *x1)
629
 8 {
630
 9   int r2;
631
10   int r1;
632
11
633
12   r2 = xchg_acquire(sl, 1);
634
13   WRITE_ONCE(*x0, 1);
635
14   r1 = READ_ONCE(*x1);
636
15   smp_store_release(sl, 0);
637
16 }
638
17
639
18 P1(int *sl, int *x0, int *x1, int *x2)
640
19 {
641
20   int r2;
642
21   int r1;
643
22
644
23   r2 = READ_ONCE(*x2);
645
24   if (r2)
646
25     r2 = xchg_acquire(sl, 1);
647
26   WRITE_ONCE(*x1, 1);
648
27   r1 = READ_ONCE(*x0);
649
28   smp_store_release(sl, 0);
650
29 }
651
30
652
31 filter (0:r2=0 /\ 1:r2=0)
653
32 exists (x1=1)
654

655
If the "filter" clause were to check each variable at each point in the
656
execution, running this litmus test would display no executions because
657
all executions would be filtered out at line 23.  However, the output
658
is instead as follows:
659

660
 1 Test C-SB+l-o-o-u+l-o-o-u-X Allowed
661
 2 States 1
662
 3 x1=1;
663
 4 Ok
664
 5 Witnesses
665
 6 Positive: 2 Negative: 0
666
 7 Condition exists (x1=1)
667
 8 Observation C-SB+l-o-o-u+l-o-o-u-X Always 2 0
668
 9 Time C-SB+l-o-o-u+l-o-o-u-X 0.04
669
10 Hash=080bc508da7f291e122c6de76c0088e3
670

671
Line 3 shows that there is one execution that did not get filtered out,
672
so the "filter" clause is evaluated only on the last assignment to
673
the variables that it checks.  In this case, the "filter" clause is a
674
disjunction, so it might be evaluated twice, once at the final (and only)
675
assignment to "0:r2" and once at the final assignment to "1:r2".
676

677

678
Linked Lists
679
------------
680

681
LKMM can handle linked lists, but only linked lists in which each node
682
contains nothing except a pointer to the next node in the list.  This is
683
of course quite restrictive, but there is nevertheless quite a bit that
684
can be done within these confines, as can be seen in the litmus test
685
at tools/memory-model/litmus-tests/MP+onceassign+derefonce.litmus:
686

687
 1 C MP+onceassign+derefonce
688
 2
689
 3 {
690
 4 y=z;
691
 5 z=0;
692
 6 }
693
 7
694
 8 P0(int *x, int **y)
695
 9 {
696
10   WRITE_ONCE(*x, 1);
697
11   rcu_assign_pointer(*y, x);
698
12 }
699
13
700
14 P1(int *x, int **y)
701
15 {
702
16   int *r0;
703
17   int r1;
704
18
705
19   rcu_read_lock();
706
20   r0 = rcu_dereference(*y);
707
21   r1 = READ_ONCE(*r0);
708
22   rcu_read_unlock();
709
23 }
710
24
711
25 exists (1:r0=x /\ 1:r1=0)
712

713
Line 4's "y=z" may seem odd, given that "z" has not yet been initialized.
714
But "y=z" does not set the value of "y" to that of "z", but instead
715
sets the value of "y" to the *address* of "z".  Lines 4 and 5 therefore
716
create a simple linked list, with "y" pointing to "z" and "z" having a
717
NULL pointer.  A much longer linked list could be created if desired,
718
and circular singly linked lists can also be created and manipulated.
719

720
The "exists" clause works the same way, with the "1:r0=x" comparing P1()'s
721
"r0" not to the value of "x", but again to its address.  This term of the
722
"exists" clause therefore tests whether line 20's load from "y" saw the
723
value stored by line 11, which is in fact what is required in this case.
724

725
P0()'s line 10 initializes "x" to the value 1 then line 11 links to "x"
726
from "y", replacing "z".
727

728
P1()'s line 20 loads a pointer from "y", and line 21 dereferences that
729
pointer.  The RCU read-side critical section spanning lines 19-22 is just
730
for show in this example.  Note that the address used for line 21's load
731
depends on (in this case, "is exactly the same as") the value loaded by
732
line 20.  This is an example of what is called an "address dependency".
733
This particular address dependency extends from the load on line 20 to the
734
load on line 21.  Address dependencies provide a weak form of ordering.
735

736
Running this test results in the following:
737

738
 1 Test MP+onceassign+derefonce Allowed
739
 2 States 2
740
 3 1:r0=x; 1:r1=1;
741
 4 1:r0=z; 1:r1=0;
742
 5 No
743
 6 Witnesses
744
 7 Positive: 0 Negative: 2
745
 8 Condition exists (1:r0=x /\ 1:r1=0)
746
 9 Observation MP+onceassign+derefonce Never 0 2
747
10 Time MP+onceassign+derefonce 0.00
748
11 Hash=49ef7a741563570102448a256a0c8568
749

750
The only possible outcomes feature P1() loading a pointer to "z"
751
(which contains zero) on the one hand and P1() loading a pointer to "x"
752
(which contains the value one) on the other.  This should be reassuring
753
because it says that RCU readers cannot see the old preinitialization
754
values when accessing a newly inserted list node.  This undesirable
755
scenario is flagged by the "exists" clause, and would occur if P1()
756
loaded a pointer to "x", but obtained the pre-initialization value of
757
zero after dereferencing that pointer.
758

759

760
Comments
761
--------
762

763
Different portions of a litmus test are processed by different parsers,
764
which has the charming effect of requiring different comment syntax in
765
different portions of the litmus test.  The C-syntax portions use
766
C-language comments (either "/* */" or "//"), while the other portions
767
use Ocaml comments "(* *)".
768

769
The following litmus test illustrates the comment style corresponding
770
to each syntactic unit of the test:
771

772
 1 C MP+onceassign+derefonce (* A *)
773
 2
774
 3 (* B *)
775
 4
776
 5 {
777
 6 y=z; (* C *)
778
 7 z=0;
779
 8 } // D
780
 9
781
10 // E
782
11
783
12 P0(int *x, int **y) // F
784
13 {
785
14   WRITE_ONCE(*x, 1);  // G
786
15   rcu_assign_pointer(*y, x);
787
16 }
788
17
789
18 // H
790
19
791
20 P1(int *x, int **y)
792
21 {
793
22   int *r0;
794
23   int r1;
795
24
796
25   rcu_read_lock();
797
26   r0 = rcu_dereference(*y);
798
27   r1 = READ_ONCE(*r0);
799
28   rcu_read_unlock();
800
29 }
801
30
802
31 // I
803
32
804
33 exists (* J *) (1:r0=x /\ (* K *) 1:r1=0) (* L *)
805

806
In short, use C-language comments in the C code and Ocaml comments in
807
the rest of the litmus test.
808

809
On the other hand, if you prefer C-style comments everywhere, the
810
C preprocessor is your friend.
811

812

813
Asynchronous RCU Grace Periods
814
------------------------------
815

816
The following litmus test is derived from the example show in
817
Documentation/litmus-tests/rcu/RCU+sync+free.litmus, but converted to
818
emulate call_rcu():
819

820
 1 C RCU+sync+free
821
 2
822
 3 {
823
 4 int x = 1;
824
 5 int *y = &x;
825
 6 int z = 1;
826
 7 }
827
 8
828
 9 P0(int *x, int *z, int **y)
829
10 {
830
11   int *r0;
831
12   int r1;
832
13
833
14   rcu_read_lock();
834
15   r0 = rcu_dereference(*y);
835
16   r1 = READ_ONCE(*r0);
836
17   rcu_read_unlock();
837
18 }
838
19
839
20 P1(int *z, int **y, int *c)
840
21 {
841
22   rcu_assign_pointer(*y, z);
842
23   smp_store_release(*c, 1); // Emulate call_rcu().
843
24 }
844
25
845
26 P2(int *x, int *z, int **y, int *c)
846
27 {
847
28   int r0;
848
29
849
30   r0 = smp_load_acquire(*c); // Note call_rcu() request.
850
31   synchronize_rcu(); // Wait one grace period.
851
32   WRITE_ONCE(*x, 0); // Emulate the RCU callback.
852
33 }
853
34
854
35 filter (2:r0=1) (* Reject too-early starts. *)
855
36 exists (0:r0=x /\ 0:r1=0)
856

857
Lines 4-6 initialize a linked list headed by "y" that initially contains
858
"x".  In addition, "z" is pre-initialized to prepare for P1(), which
859
will replace "x" with "z" in this list.
860

861
P0() on lines 9-18 enters an RCU read-side critical section, loads the
862
list header "y" and dereferences it, leaving the node in "0:r0" and
863
the node's value in "0:r1".
864

865
P1() on lines 20-24 updates the list header to instead reference "z",
866
then emulates call_rcu() by doing a release store into "c".
867

868
P2() on lines 27-33 emulates the behind-the-scenes effect of doing a
869
call_rcu().  Line 30 first does an acquire load from "c", then line 31
870
waits for an RCU grace period to elapse, and finally line 32 emulates
871
the RCU callback, which in turn emulates a call to kfree().
872

873
Of course, it is possible for P2() to start too soon, so that the
874
value of "2:r0" is zero rather than the required value of "1".
875
The "filter" clause on line 35 handles this possibility, rejecting
876
all executions in which "2:r0" is not equal to the value "1".
877

878

879
Performance
880
-----------
881

882
LKMM's exploration of the full state-space can be extremely helpful,
883
but it does not come for free.  The price is exponential computational
884
complexity in terms of the number of processes, the average number
885
of statements in each process, and the total number of stores in the
886
litmus test.
887

888
So it is best to start small and then work up.  Where possible, break
889
your code down into small pieces each representing a core concurrency
890
requirement.
891

892
That said, herd7 is quite fast.  On an unprepossessing x86 laptop, it
893
was able to analyze the following 10-process RCU litmus test in about
894
six seconds.
895

896
https://github.com/paulmckrcu/litmus/blob/master/auto/C-RW-R+RW-R+RW-G+RW-G+RW-G+RW-G+RW-R+RW-R+RW-R+RW-R.litmus
897

898
One way to make herd7 run faster is to use the "-speedcheck true" option.
899
This option prevents herd7 from generating all possible end states,
900
instead causing it to focus solely on whether or not the "exists"
901
clause can be satisfied.  With this option, herd7 evaluates the above
902
litmus test in about 300 milliseconds, for more than an order of magnitude
903
improvement in performance.
904

905
Larger 16-process litmus tests that would normally consume 15 minutes
906
of time complete in about 40 seconds with this option.  To be fair,
907
you do get an extra 65,535 states when you leave off the "-speedcheck
908
true" option.
909

910
https://github.com/paulmckrcu/litmus/blob/master/auto/C-RW-R+RW-R+RW-G+RW-G+RW-G+RW-G+RW-R+RW-R+RW-R+RW-R+RW-G+RW-G+RW-G+RW-G+RW-R+RW-R.litmus
911

912
Nevertheless, litmus-test analysis really is of exponential complexity,
913
whether with or without "-speedcheck true".  Increasing by just three
914
processes to a 19-process litmus test requires 2 hours and 40 minutes
915
without, and about 8 minutes with "-speedcheck true".  Each of these
916
results represent roughly an order of magnitude slowdown compared to the
917
16-process litmus test.  Again, to be fair, the multi-hour run explores
918
no fewer than 524,287 additional states compared to the shorter one.
919

920
https://github.com/paulmckrcu/litmus/blob/master/auto/C-RW-R+RW-R+RW-G+RW-G+RW-G+RW-G+RW-R+RW-R+RW-R+RW-R+RW-R+RW-R+RW-G+RW-G+RW-G+RW-G+RW-R+RW-R+RW-R.litmus
921

922
If you don't like command-line arguments, you can obtain a similar speedup
923
by adding a "filter" clause with exactly the same expression as your
924
"exists" clause.
925

926
However, please note that seeing the full set of states can be extremely
927
helpful when developing and debugging litmus tests.
928

929

930
LIMITATIONS
931
===========
932

933
Limitations of the Linux-kernel memory model (LKMM) include:
934

935
1.	Compiler optimizations are not accurately modeled.  Of course,
936
	the use of READ_ONCE() and WRITE_ONCE() limits the compiler's
937
	ability to optimize, but under some circumstances it is possible
938
	for the compiler to undermine the memory model.  For more
939
	information, see Documentation/explanation.txt (in particular,
940
	the "THE PROGRAM ORDER RELATION: po AND po-loc" and "A WARNING"
941
	sections).
942

943
	Note that this limitation in turn limits LKMM's ability to
944
	accurately model address, control, and data dependencies.
945
	For example, if the compiler can deduce the value of some variable
946
	carrying a dependency, then the compiler can break that dependency
947
	by substituting a constant of that value.
948

949
	Conversely, LKMM will sometimes overestimate the amount of
950
	reordering compilers and CPUs can carry out, leading it to miss
951
	some pretty obvious cases of ordering.  A simple example is:
952

953
		r1 = READ_ONCE(x);
954
		if (r1 == 0)
955
			smp_mb();
956
		WRITE_ONCE(y, 1);
957

958
	The WRITE_ONCE() does not depend on the READ_ONCE(), and as a
959
	result, LKMM does not claim ordering.  However, even though no
960
	dependency is present, the WRITE_ONCE() will not be executed before
961
	the READ_ONCE().  There are two reasons for this:
962

963
                The presence of the smp_mb() in one of the branches
964
                prevents the compiler from moving the WRITE_ONCE()
965
                up before the "if" statement, since the compiler has
966
                to assume that r1 will sometimes be 0 (but see the
967
                comment below);
968

969
                CPUs do not execute stores before po-earlier conditional
970
                branches, even in cases where the store occurs after the
971
                two arms of the branch have recombined.
972

973
	It is clear that it is not dangerous in the slightest for LKMM to
974
	make weaker guarantees than architectures.  In fact, it is
975
	desirable, as it gives compilers room for making optimizations.
976
	For instance, suppose that a 0 value in r1 would trigger undefined
977
	behavior elsewhere.  Then a clever compiler might deduce that r1
978
	can never be 0 in the if condition.  As a result, said clever
979
	compiler might deem it safe to optimize away the smp_mb(),
980
	eliminating the branch and any ordering an architecture would
981
	guarantee otherwise.
982

983
2.	Multiple access sizes for a single variable are not supported,
984
	and neither are misaligned or partially overlapping accesses.
985

986
3.	Exceptions and interrupts are not modeled.  In some cases,
987
	this limitation can be overcome by modeling the interrupt or
988
	exception with an additional process.
989

990
4.	I/O such as MMIO or DMA is not supported.
991

992
5.	Self-modifying code (such as that found in the kernel's
993
	alternatives mechanism, function tracer, Berkeley Packet Filter
994
	JIT compiler, and module loader) is not supported.
995

996
6.	Complete modeling of all variants of atomic read-modify-write
997
	operations, locking primitives, and RCU is not provided.
998
	For example, call_rcu() and rcu_barrier() are not supported.
999
	However, a substantial amount of support is provided for these
1000
	operations, as shown in the linux-kernel.def file.
1001

1002
	Here are specific limitations:
1003

1004
	a.	When rcu_assign_pointer() is passed NULL, the Linux
1005
		kernel provides no ordering, but LKMM models this
1006
		case as a store release.
1007

1008
	b.	The "unless" RMW operations are not currently modeled:
1009
		atomic_long_add_unless(), atomic_inc_unless_negative(),
1010
		and atomic_dec_unless_positive().  These can be emulated
1011
		in litmus tests, for example, by using atomic_cmpxchg().
1012

1013
		One exception of this limitation is atomic_add_unless(),
1014
		which is provided directly by herd7 (so no corresponding
1015
		definition in linux-kernel.def).  atomic_add_unless() is
1016
		modeled by herd7 therefore it can be used in litmus tests.
1017

1018
	c.	The call_rcu() function is not modeled.  As was shown above,
1019
		it can be emulated in litmus tests by adding another
1020
		process that invokes synchronize_rcu() and the body of the
1021
		callback function, with (for example) a release-acquire
1022
		from the site of the emulated call_rcu() to the beginning
1023
		of the additional process.
1024

1025
	d.	The rcu_barrier() function is not modeled.  It can be
1026
		emulated in litmus tests emulating call_rcu() via
1027
		(for example) a release-acquire from the end of each
1028
		additional call_rcu() process to the site of the
1029
		emulated rcu-barrier().
1030

1031
	e.	Reader-writer locking is not modeled.  It can be
1032
		emulated in litmus tests using atomic read-modify-write
1033
		operations.
1034

1035
The fragment of the C language supported by these litmus tests is quite
1036
limited and in some ways non-standard:
1037

1038
1.	There is no automatic C-preprocessor pass.  You can of course
1039
	run it manually, if you choose.
1040

1041
2.	There is no way to create functions other than the Pn() functions
1042
	that model the concurrent processes.
1043

1044
3.	The Pn() functions' formal parameters must be pointers to the
1045
	global shared variables.  Nothing can be passed by value into
1046
	these functions.
1047

1048
4.	The only functions that can be invoked are those built directly
1049
	into herd7 or that are defined in the linux-kernel.def file.
1050

1051
5.	The "switch", "do", "for", "while", and "goto" C statements are
1052
	not supported.	The "switch" statement can be emulated by the
1053
	"if" statement.  The "do", "for", and "while" statements can
1054
	often be emulated by manually unrolling the loop, or perhaps by
1055
	enlisting the aid of the C preprocessor to minimize the resulting
1056
	code duplication.  Some uses of "goto" can be emulated by "if",
1057
	and some others by unrolling.
1058

1059
6.	Although you can use a wide variety of types in litmus-test
1060
	variable declarations, and especially in global-variable
1061
	declarations, the "herd7" tool understands only int and
1062
	pointer types.	There is no support for floating-point types,
1063
	enumerations, characters, strings, arrays, or structures.
1064

1065
7.	Parsing of variable declarations is very loose, with almost no
1066
	type checking.
1067

1068
8.	Initializers differ from their C-language counterparts.
1069
	For example, when an initializer contains the name of a shared
1070
	variable, that name denotes a pointer to that variable, not
1071
	the current value of that variable.  For example, "int x = y"
1072
	is interpreted the way "int x = &y" would be in C.
1073

1074
9.	Dynamic memory allocation is not supported, although this can
1075
	be worked around in some cases by supplying multiple statically
1076
	allocated variables.
1077

1078
Some of these limitations may be overcome in the future, but others are
1079
more likely to be addressed by incorporating the Linux-kernel memory model
1080
into other tools.
1081

1082
Finally, please note that LKMM is subject to change as hardware, use cases,
1083
and compilers evolve.
1084

1085
Product

Resources

Company