Task Parallelism and Synchronization

Chapel supports both task parallelism and data parallelism. This chapter details task parallelism as follows:

Tasks and Task Parallelism introduces tasks and task parallelism.
The Begin Statement describes the begin statement, an unstructured way to introduce concurrency into a program.
Synchronization Variables describes synchronization variables, an unstructured mechanism for synchronizing tasks.
Atomic Variables describes atomic variables, a mechanism for supporting atomic operations.
The Cobegin Statement describes the cobegin statement, a structured way to introduce concurrency into a program.
The Coforall Loop describes the coforall loop, another structured way to introduce concurrency into a program.
Task Intents specifies how variables from outer scopes are handled within begin, cobegin and coforall statements. Task-Private Variables are also available.
The Sync Statement describes the sync statement, a structured way to control parallelism.
The Serial Statement describes the serial statement, a structured way to suppress parallelism.
Yielding Task Execution describes yielding the current task's execution.

Tasks and Task Parallelism

A Chapel task is a distinct context of execution that may be running concurrently with other tasks. Chapel provides a simple construct, the begin statement, to create tasks, introducing concurrency into a program in an unstructured way. In addition, Chapel introduces the type qualifier sync for synchronization between tasks.

Chapel provides two constructs, the cobegin and coforall statements, to introduce concurrency in a more structured way. These constructs create multiple tasks but do not continue until these tasks have completed. In addition, Chapel provides two constructs, the sync and serial statements, to insert synchronization and suppress parallelism. All four of these constructs can be implemented through judicious uses of the unstructured task-parallel constructs described in the previous paragraph.
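To illustrate the last claim, the following is a minimal sketch of how the join behavior of a cobegin with two statements might be approximated using only begin statements and sync variables. The procedures taskA and taskB and the variables doneA and doneB are hypothetical names chosen for this illustration; this is not a description of how an implementation actually lowers cobegin.

```chapel
// Hypothetical stand-ins for two independent pieces of work.
proc taskA() { writeln("A finished"); }
proc taskB() { writeln("B finished"); }

var doneA, doneB: sync bool;   // default-initialized, so both start empty

// Each task signals completion by filling its own sync variable.
begin { taskA(); doneA.writeEF(true); }
begin { taskB(); doneB.writeEF(true); }

// Block until both tasks have signaled, as the end of a cobegin would.
doneA.readFE();
doneB.readFE();
writeln("both tasks complete");
```

The order of the first two output lines is unspecified, but the final line is always printed last because the main task cannot pass the two readFE calls until both tasks have written.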

Tasks are considered to be created when execution reaches the start of a begin, cobegin, or coforall statement. When the tasks are actually executed depends on the Chapel implementation and run-time execution state.

Tasks created by begin, cobegin, and coforall can depend upon each other, even if that leads to the program not being serializable.

A task is implemented as a call to a task function, whose body contains the Chapel code for the task. Variables defined in outer scopes are considered to be passed into a task function by default intent, unless a different task intent is specified explicitly by a task-intent-clause.

Accesses to the same variable from different tasks are subject to the Memory Consistency Model. Such accesses can result from aliasing due to ref argument intents or task intents, among others.

The Begin Statement

The begin statement creates a task to execute a statement. The syntax for the begin statement is given by

begin-statement:
  'begin' task-intent-clause[OPT] statement

Control continues concurrently with the statement following the begin statement.

Example (beginUnordered.chpl).

The code
begin writeln("output from spawned task");
writeln("output from main task");
executes two writeln statements that output the strings to the terminal, but the ordering is purposely unspecified. There is no guarantee as to which statement will execute first. When the begin statement is executed, a new task is created that will execute the writeln statement within it. However, execution will continue immediately after task creation with the next statement.

A begin statement creates a single task function, whose body is the body of the begin statement. The handling of the outer variables within the task function and the role of task-intent-clause are defined in Task Intents.

Yield and return statements are not allowed in begin blocks. Break and continue statements may not be used to exit a begin block.

Synchronization Variables

Synchronization variables have a logical state associated with the value. The state of the variable is either full or empty. Normal reads of a synchronization variable cannot proceed until the variable’s state is full. Normal writes of a synchronization variable cannot proceed until the variable’s state is empty.

The sync type qualifier precedes the type of the variable’s value in the declaration. sync is supported for the primitive types nothing, bool, int, uint, real, imag, complex, range, bytes, and string (Primitive Types); for enumerated types (Enumerated Types); and for class types (Class Types) and record types (Record Types). For sync variables of class type, the full/empty state applies to the reference to the class object, not to its member fields.

If a task attempts to read or write a synchronization variable that is not in the correct state, the task is suspended. When the variable transitions to the correct state, the task is resumed. If multiple tasks are blocked waiting for the state transition, one task is non-deterministically selected to proceed and the others continue to wait.
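The blocking behavior described above is enough to build a simple mutual-exclusion lock. The sketch below is one possible illustration, not a prescribed idiom: the names lock and total are hypothetical, and the convention that a full variable means "available" is an assumption of this example. It uses the coforall loop and a ref task intent, both described elsewhere in this chapter.

```chapel
var lock: sync bool = true;   // initialized, so it starts full: "available"
var total = 0;

coforall i in 1..4 with (ref total) {
  lock.readFE();        // acquire: blocks while the variable is empty
  total += i;           // critical section, executed by one task at a time
  lock.writeEF(true);   // release: refill the variable so one waiter can resume
}
writeln(total);         // prints 10
```

Because readFE leaves the variable empty, every other task that reaches readFE is suspended until the holder calls writeEF.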

A synchronization variable is specified with a sync type given by the following syntax:

sync-type:
  'sync' type-expression

A default-initialized synchronization variable will be empty. A synchronization variable initialized from another expression will be full and store the value from that expression.
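As a small illustration of these two initial states, the sketch below declares one sync variable of each kind. The variable names are hypothetical; the methods readFE, writeEF, and readFF are the predefined sync methods documented later in this section.

```chapel
var ready: sync int = 42;   // initialized from an expression: full, holding 42
var slot: sync int;         // default-initialized: empty

const v = ready.readFE();   // does not block, since ready is full; leaves it empty
slot.writeEF(v + 1);        // does not block, since slot is empty; leaves it full
writeln(slot.readFF());     // prints 43 and leaves slot full
```

Swapping the two operations (calling readFE on slot or writeEF on ready first) would suspend the task, since each variable would be in the wrong state.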

Example (beginWithSyncVar.chpl).

In the code
class Tree {
  var isLeaf: bool;
  var left, right: unmanaged Tree?;
  var value: int;

  proc sum():int {
    if (isLeaf) then
       return value;

    var x: sync int;
    begin x.writeEF(left!.sum());
    var y = right!.sum();
    return x.readFE() + y;
  }
}
the sync variable x is assigned by an asynchronous task created with the begin statement. The task returning the sum waits on the reading of x until it has been assigned.

Example (syncVar.chpl).

The following code implements a simple split-phase barrier using a sync variable.
var count: sync int = n;  // counter which also serves as a lock
var release: sync bool; // barrier release

forall t in 1..n do begin {
  work(t);
  var myc = count.readFE();  // read the count, set state to empty
  if myc!=1 {
    write(".");
    count.writeEF(myc-1);   // update the count, set state to full
    // we could also do some work here before blocking
    release.readFF();
  } else {
    release.writeEF(true);  // last one here, release everyone
    writeln("done");
  }
}
In each iteration of the forall loop after the work is completed, the task reads the count variable, which is used to tally the number of tasks that have arrived. All tasks except the last task to arrive will block while trying to read the variable release. The last task to arrive will write to release, setting its state to full at which time all the other tasks can be unblocked and run.

If a formal argument with a default intent either has a synchronization type or the formal is generic (Formal Arguments of Generic Type) and the actual has a synchronization type, the actual must be an lvalue and is passed by reference. In these cases the formal itself is an lvalue, too. The actual argument is not read or written during argument passing; its state is not changed or waited on. The qualifier sync without the value type can be used to specify a generic formal argument that requires a sync actual.
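A brief sketch of this rule follows. The procedure signal and the variable done are hypothetical names used only for illustration. Passing done to signal neither reads nor writes it; only the explicit writeEF inside the procedure changes its state.

```chapel
// The formal has the default intent and a sync type,
// so the actual is passed by reference without being read.
proc signal(flag: sync bool) {
  flag.writeEF(true);          // fills the caller's variable
}

var done: sync bool;           // starts empty
begin signal(done);            // argument passing does not block or change state
const seen = done.readFE();    // blocks until signal has written
writeln(seen);                 // prints true
```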

Predefined Sync Methods

The following methods are defined for variables of sync type:

proc sync.readFE()

Read a full sync variable, leaving it empty.

Block until the sync variable is full.
Read the value of the sync variable and set the variable to empty.

Returns: The value of the sync variable.

proc sync.readFF()

Read a full sync variable, leaving it full.

Block until the sync variable is full.
Read the value of the sync variable and leave the variable full.

Returns: The value of the sync variable.

proc sync.readXX()

Warning

‘readXX’ is unstable

Read a sync variable regardless of its state, leaving its state unchanged.

Without blocking, read the value of the sync variable.
Leaving the state unchanged, return a value based on the current state:

full: return a copy of the stored value.

empty: return either a new default-initialized value of the stored type or the last value stored (implementation dependent).

Returns: The value of the sync variable.

proc ref sync.writeEF(in val: valType)

Write into an empty sync variable, leaving it full.

Block until the sync variable is empty.
Write val into the sync variable and leave the variable full.

Arguments: val – New value of the sync variable.
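Pairing writeEF with readFE yields a one-element handoff buffer between two tasks. The following is a minimal sketch under that reading; the names slot, n, and sum are hypothetical, and the cobegin statement and ref task intent it relies on are described elsewhere in this chapter.

```chapel
var slot: sync int;   // starts empty
const n = 5;
var sum = 0;

cobegin with (ref sum) {
  // Producer: each writeEF blocks until the consumer has emptied the slot.
  for i in 1..n do slot.writeEF(i);
  // Consumer: each readFE blocks until the producer has filled the slot.
  for 1..n do sum += slot.readFE();
}
writeln(sum);         // prints 15
```

Each value is consumed exactly once because readFE empties the slot and writeEF cannot overwrite a value that has not yet been read.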

proc ref sync.writeFF(in val: valType)

Warning

‘writeFF’ is unstable

Write into a full sync variable, leaving it full.

Block until the sync variable is full.
Write val into the sync variable and leave the variable full.

Arguments: val – New value of the sync variable.

proc ref sync.writeXF(in val: valType)

Warning

‘writeXF’ is unstable

Write into a sync variable regardless of its state, leaving it full.

Do not block.
Write val into the sync variable and leave its state full.

Arguments: val – New value of the sync variable.

proc ref sync.reset()

Warning

‘reset’ is unstable

Resets the value of this sync variable to the default value of its type. This method is non-blocking and the state of the sync variable is set to empty when this method completes.

proc sync.isFull

Warning

‘isFull’ is unstable

Determine if the sync variable is full without blocking. Does not alter the state of the sync variable.

Returns: true if the state of the sync variable is full, false if it’s empty.

Atomic Variables

atomic is a type qualifier that precedes the variable’s type in the declaration. An atomic variable is specified with an atomic type given by the following syntax:

atomic-type:
  'atomic' type-expression

For example, the following code declares an atomic variable x that stores an int:

var x: atomic int;

An atomic variable declared without an initialization expression stores the default value of the contained type (i.e., 0 or false).

Atomic variables can also be declared with an initial value:

var y: atomic int = 1;

Similarly, a temporary atomic value can be created by casting to atomic:

var one: int = 1;
... one : atomic int... // creates an `atomic int` initialized with 1

Assignment is supported between atomic variables as well:

var x: atomic int = 1;
var y: atomic int = 2;

x = y; // equivalent to x.write(y.read())

Chapel currently supports atomic operations for bools, all supported sizes of signed and unsigned integers, as well as all supported sizes of reals. Note that not all operations are supported for all atomic types. The supported types are listed for each operation.

Rationale.

The choice of supported atomic variable types as well as the atomic operations was strongly influenced by the C11 standard.

Most atomic methods accept an optional argument named order of type memoryOrder. The order argument is used to specify the ordering constraints of atomic operations. The supported memoryOrder values are:

memoryOrder.relaxed

memoryOrder.acquire

memoryOrder.release

memoryOrder.acqRel

memoryOrder.seqCst

See also Memory Consistency Model and in particular Non-Sequentially Consistent Atomic Operations for more information on the meaning of these memory orders.

Unless specified otherwise, the order argument defaults to memoryOrder.seqCst.
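The sketch below shows how the order argument might be supplied, using the read, write, and add methods documented later in this section. The variable name counter is hypothetical, and the particular orderings are chosen only to demonstrate the syntax, not to recommend them for any specific algorithm.

```chapel
var counter: atomic int;                             // starts at 0

counter.add(1);                                      // order defaults to memoryOrder.seqCst
counter.add(1, order=memoryOrder.relaxed);           // explicit, weaker ordering
counter.write(10, order=memoryOrder.release);
const v = counter.read(order=memoryOrder.acquire);
writeln(v);                                          // prints 10
```

Because order is a param argument, it must be known at compile time; the memoryOrder constants listed above satisfy this.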

Implementors’ note.

Not all architectures or implementations may support all memoryOrder values. In these cases, the implementation should default to a more conservative ordering than specified.

proc atomicFence(param order: memoryOrder = memoryOrder.seqCst): An atomic fence that establishes an ordering of non-atomic and relaxed atomic operations.

atomic (bool) : writeSerializable

proc read(param order: memoryOrder = memoryOrder.seqCst) : bool: Returns the stored value.

proc ref write(val: bool, param order: memoryOrder = memoryOrder.seqCst) : void: Stores val as the new value.

proc ref exchange(val: bool, param order: memoryOrder = memoryOrder.seqCst) : bool: Stores val as the new value and returns the original value.

proc ref compareExchange(ref expected: bool, desired: bool, param order: memoryOrder = memoryOrder.seqCst) : bool: Stores desired as the new value, if and only if the original value is equal to expected. Returns true if desired was stored; otherwise, updates expected to the original value and returns false.

proc ref compareExchange(ref expected: bool, desired: bool, param success: memoryOrder, param failure: memoryOrder) : bool

proc ref compareExchangeWeak(ref expected: bool, desired: bool, param order: memoryOrder = memoryOrder.seqCst) : bool

Similar to compareExchange, except that this function may return false even if the original value was equal to expected. This may happen if the value could not be updated atomically.

This weak version is allowed to spuriously fail, but when compareExchange is already in a loop, it can offer better performance on some platforms.

proc ref compareExchangeWeak(ref expected: bool, desired: bool, param success: memoryOrder, param failure: memoryOrder) : bool

proc ref compareAndSwap(expected: bool, desired: bool, param order: memoryOrder = memoryOrder.seqCst) : bool

Warning

‘compareAndSwap’ is unstable

Stores desired as the new value, if and only if the original value is equal to expected. Returns true if desired was stored.

proc ref testAndSet(param order: memoryOrder = memoryOrder.seqCst) : bool: Stores true as the new value and returns the old value.

proc ref clear(param order: memoryOrder = memoryOrder.seqCst) : void: Stores false as the new value.

proc waitFor(val: bool, param order: memoryOrder = memoryOrder.seqCst) : void: Waits until the stored value is equal to val. The implementation may yield the running task while waiting.
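As one possible use of these methods, the sketch below combines testAndSet, waitFor, and clear into a simple spin lock. The names busy and total are hypothetical, and the example assumes the default memoryOrder.seqCst ordering throughout; it is an illustration rather than a recommended locking strategy.

```chapel
var busy: atomic bool;   // default-initialized to false: unlocked
var total = 0;

coforall i in 1..4 with (ref total) {
  // testAndSet returns the old value; true means another task holds the lock.
  while busy.testAndSet() do
    busy.waitFor(false);   // wait (possibly yielding) until the lock looks free
  total += i;              // critical section
  busy.clear();            // unlock by storing false
}
writeln(total);            // prints 10
```

Calling waitFor inside the loop avoids repeatedly writing to the variable while it is held, and gives the implementation a chance to yield the waiting task.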

atomic (valType) : writeSerializable

proc read(param order: memoryOrder = memoryOrder.seqCst) : valType: Returns the stored value.

proc ref write(val: valType, param order: memoryOrder = memoryOrder.seqCst) : void: Stores val as the new value.

proc ref exchange(val: valType, param order: memoryOrder = memoryOrder.seqCst) : valType: Stores val as the new value and returns the original value.

proc ref compareExchange(ref expected: valType, desired: valType, param order: memoryOrder = memoryOrder.seqCst) : bool: Stores desired as the new value, if and only if the original value is equal to expected. Returns true if desired was stored; otherwise, updates expected to the original value and returns false.

proc ref compareExchange(ref expected: valType, desired: valType, param success: memoryOrder, param failure: memoryOrder) : bool

proc ref compareExchangeWeak(ref expected: valType, desired: valType, param order: memoryOrder = memoryOrder.seqCst) : bool

Similar to compareExchange, except that this function may return false even if the original value was equal to expected. This may happen if the value could not be updated atomically.

This weak version is allowed to spuriously fail, but when compareExchange is already in a loop, it can offer better performance on some platforms.
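The retry loop mentioned above might look like the following sketch, which maintains a running maximum across tasks. The names maxSeen, candidate, and current are hypothetical. The loop relies only on the documented behavior that a failed call updates expected to the stored value, so a spurious failure simply causes another iteration.

```chapel
var maxSeen: atomic int;   // starts at 0

coforall t in 1..8 {
  const candidate = t * 10;
  var current = maxSeen.read();
  // Retry while our candidate is still larger than the stored value.
  // On failure, compareExchangeWeak refreshes current with the stored value.
  while candidate > current &&
        !maxSeen.compareExchangeWeak(current, candidate) {
    // nothing to do here: current has already been updated for the next test
  }
}
writeln(maxSeen.read());   // prints 80
```

The loop exits either when the exchange succeeds or when another task has already stored a value at least as large as candidate.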

proc ref compareExchangeWeak(ref expected: valType, desired: valType, param success: memoryOrder, param failure: memoryOrder) : bool

proc ref compareAndSwap(expected: valType, desired: valType, param order: memoryOrder = memoryOrder.seqCst) : bool

Warning

‘compareAndSwap’ is unstable

Stores desired as the new value, if and only if the original value is equal to expected. Returns true if desired was stored.

proc ref fetchAdd(val: valType, param order: memoryOrder = memoryOrder.seqCst) : valType

Adds val to the original value and stores the result. Defined for integer and real atomic types.

Returns: The original value.

proc ref add(val: valType, param order: memoryOrder = memoryOrder.seqCst) : void: Adds val to the original value and stores the result. Defined for integer and real atomic types.
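The difference between add and fetchAdd can be seen in the short sketch below: add is used when the previous value is not needed, while fetchAdd hands each caller the value it replaced. The names hits and nextTicket are hypothetical.

```chapel
var hits: atomic int;                    // starts at 0

// Four tasks each increment the shared counter 1000 times.
coforall t in 1..4 do
  for 1..1000 do
    hits.add(1);
writeln(hits.read());                    // prints 4000

var nextTicket: atomic int;
const first  = nextTicket.fetchAdd(1);   // returns 0, stores 1
const second = nextTicket.fetchAdd(1);   // returns 1, stores 2
writeln(first, " ", second, " ", nextTicket.read());   // prints 0 1 2
```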

proc ref fetchSub(val: valType, param order: memoryOrder = memoryOrder.seqCst) : valType

Subtracts val from the original value and stores the result. Defined for integer and real atomic types.

Returns: The original value.

proc ref sub(val: valType, param order: memoryOrder = memoryOrder.seqCst) : void: Subtracts val from the original value and stores the result. Defined for integer and real atomic types.

proc ref fetchOr(val: valType, param order: memoryOrder = memoryOrder.seqCst) : valType

Applies the | operator to val and the original value, then stores the result.

Returns: The original value.
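A minimal sketch of fetchOr used to set bits in a shared flag word follows. The variable names and the bit patterns are hypothetical and were chosen only to make the returned values easy to follow.

```chapel
var flags: atomic int;                     // starts at 0

const before1 = flags.fetchOr(0b0101);     // returns 0, stores 0b0101 (5)
const before2 = flags.fetchOr(0b0011);     // returns 5, stores 0b0111 (7)
writeln(before1, " ", before2, " ", flags.read());   // prints 0 5 7
```

The returned original value lets a caller tell whether the bits it set were already present.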