Task Parallelism

View taskParallel.chpl on GitHub

This primer illustrates Chapel’s task-parallel features, namely the begin, cobegin, and coforall statements. In Chapel, a task is a computation in the program that can, and typically will, run in parallel with respect to other tasks. These three statements are the only ways to create new tasks within a Chapel program.

Begin Statements

The begin statement creates a new parallel task that’s independent of the original one encountering the begin.

writeln("1: ### The begin statement ###");
begin writeln("1: output from spawned task");

The original task continues on to the next statement. In this example, there is no guarantee which statement will execute first since they can run in parallel and are not synchronized.

writeln("1: output from main task");

Cobegin Statements

For more structured behavior, the cobegin statement can be used to spawn a block of tasks, one for each statement.

writeln("2: ### The cobegin statement ###");

cobegin {
  writeln("2: output from spawned task 1");
  writeln("2: output from spawned task 2");
}

The original task continues execution after the cobegin block, but only after all the child tasks created by the cobegin block have completed. As a result, in this example, the output from within the cobegin statement will always precede the following output from the original task:

writeln("2: output from main task");

If any begin statements are used within a cobegin statement’s tasks, the original task does not wait for those begin tasks to complete. That is, the original task only waits on the cobegin’s child tasks, not all of their descendent tasks.

writeln("3: ### The cobegin statement with nested begin statements ###");

cobegin {
  begin writeln("3: output from spawned task 1");
  begin writeln("3: output from spawned task 2");
}

The order of the output is again undefined because the begin statements in the above cobegin statement are not guaranteed to have been executed before control reaches the following statement.

writeln("3: output from main task");

Coforall Loops

Another structured form of task parallelism is the coforall loop. This loop form is like a for loop, except that each iteration of the loop is executed by a distinct task. Similar to the cobegin statement, the original task does not continue until the tasks created for each iteration have completed.

writeln("4: ### The coforall loop ###");

config const n = 10;

coforall i in 1..n {
  writeln("4: output 1 from spawned task ", i);
  writeln("4: output 2 from spawned task ", i);
}

While the statements within the loop body will execute in the normal way (so in this case, the first writeln() will execute before the second), the order of execution relative to other iterations is undefined.

As with the cobegin statement, the original task will wait until the coforall’s child tasks have completed before proceeding. For this example, this means that all output from within the coforall loop will precede the following output:

writeln("4: output from main task");

Also like the cobegin statement, the orignal task will not wait for any begin tasks spawned by its child tasks.

writeln("5: ### The coforall loop with nested begin statements ###");
coforall i in 1..n {
  begin writeln("5: output from spawned task 1 (iteration ", i, ")");
  begin writeln("5: output from spawned task 2 (iteration ", i, ")");
}

The order of output is undefined.

writeln("5: output from main task");

Task Intents

The body of a task construct may refer to some variables declared outside its lexical scope, known as “outer variables”. When it does, “shadow variables” are introduced. Each task created by the task construct gets its own set of shadow variables, one per outer variable.

  • Each shadow variable behaves as if it were a formal argument of a function that implements the task’s work. (These “task functions” are described in the language spec). The outer variable is passed to this formal argument according to the argument intent associated with the shadow variable, which is called a “task intent”.

  • References within a task that seem to refer to an outer variable will actually be referring to the corresponding shadow variable owned by the task.

  • Each shadow variable is deallocated at the end of its task.

For most types, forall intents use the default argument intent (The Default Intent). For numeric types, this implies capturing the value of the outer variable by the time the task starts executing. Sync and atomic variables are passed by reference (Sync, Atomics). Arrays infer their default intent based upon the declaration of the array. Mutable arrays (e.g. declared with var or passed by ref intent) have a default intent of ref, while immutable arrays (e.g. declared with const or passed by const intent) have a default intent of const. These defaults are described in the language spec.

begin statements currently capture the values of outer variables of numeric types into their shadow variables at task creation time. This means that the shadow variables can be accessed even after the outer variables’ scope exits. This is not the case for some other types such as arrays.

var outerIntVariable = 2;
begin assert(outerIntVariable == 2);

The task intents in, const in, ref, const ref, and reduce can be specified explicitly using a with clause.

An in or const in intent creates a copy of the outer variable for each task. A ref or const ref makes the shadow variable an alias for the outer variable.

var outerArray = [10, 11, 12];
begin with (in outerArray) assert(outerArray[0] == 10);

var outerRealVariable = 1.0;

coforall i in 1..n with (ref outerRealVariable) {
  if i == 1 then            // ensure only one task updates outerIntVariable
    outerRealVariable *= 2; // to avoid the risk of a data race
}

A reduce intent can be used to compute reductions with coforall loops. The values of each reduce-intent shadow variable at the end of its task is combined onto its outer variable according to the specified reduction operation.

 // The values of the outer variables before the loop will be included
 // in the reduction result.
var outerMaxVariable = 0;
var outerMinVariable = 0;

coforall i in 1..n with (+ reduce outerIntVariable,
                         max reduce outerMaxVariable,
                         min reduce outerMinVariable) {
  outerIntVariable = i;

  if i % 2 == 0 then
    outerMaxVariable = i;  // compute the max of even indices
  else
    outerMinVariable = -i; // ... and the min of negated odd ones

  // The loop body can contain other code
  // regardless of reduce-related operations.
}