6.2 Trapezoidal Rule Integration with MPI¶

Our next example is also a classic for demonstrating parallel computing, and we introduced it already in our companion PDC for Beginners book using OpenMP for a shared memory system.

The algorithm is a straightforward mathematical technique for finding the area under a curve by splitting the area into many small trapezoid slices whose area we can estimate, then add those areas together. This type of numerical integration is described on Wikipedia using the following diagram for an arbitrary function:

Consult the above Wikipedia entry for details on the mathematics for the composite trapezoidal rule, which is written into the function called Trap() in the code below.

In our case, we will begin by considering the sine(x) function, which looks like this when x varies from 0 to \(5*\pi\):

We will use this function to make it simpler for us to determine the correctness of our parallel implementation in MPI. To make it even simpler, let’s first concentrate on the range of the first part of the above function, from 0 to \(\pi\), which looks like this:

We will rely on a mathematical theory which tells us that the integral where x ranges from 0 to \(\pi\) of sine(x) is 2.0. A sequential solution would split the curve of this function into many small trapezoids, iterating from a = 0.0 to b = \(\pi\), summing the value of each trapezoid as we go. This is provided in a function called Trap(), which looks like this:

double Trap(double left_endpt, double right_endpt, int trap_count, double base_len) {
    double estimate, x;
    int i;

    estimate = (f(left_endpt) + f(right_endpt))/2.0;
    for (i = 1; i <= trap_count-1; i++) {
        x = left_endpt + i*base_len;
        estimate += f(x);
    }
    estimate = estimate*base_len;
    return estimate;
}

In this case, base_len is the width of each trapezoid, trap_count is the number of trapezoids, and left_endpt and right_endpt are the range we will be integrating over.

Parallelize with common patterns¶

Perhaps unsurprisingly, the patterns used in this solution are the same as the previous Monte Carlo example, because we again have a familiar for loop in this above function that we can decompose into chunks.

The important patterns in this code are:

SPMD, since we have one program that multiple processes all run.
Data decomposition, since each process will compute an equal share of the total number of trapezoids requested.
Parallel for loop split with equal chunks computed by each process.
Reduction communication pattern to combine the results from each process.
There is also a broadcast from process 0. See if you can find that and what it was used for.

To illustrate the use of these patterns, suppose we were using a small number of trapezoids (24) and 4 processes. We can use decomposition of the loop in the Trap() function above so that each process works on computing 6 of the trapezoids, like this, where each color represents a process:

In the following code example, focus on the use of ‘local’ in certain variables below. These indicate values that will be different for each process and represent the range of values that each process can compute. Each process keeps its own sum for the area of the trapezoids, and then the reduction takes place when all processes are completed.

/*
* Peter S. Pacheco, An Introduction to Parallel Programming,
* Morgan Kaufmann Publishers, 2011
* IPP:   Section 3.4.2 (pp. 104 and ff.)
*
* Timing and command line argument added by Hannah Sonsalla,
* Macalester College, 2017
*
* Update to use f(x) as sin(x) by Libby Shoop, 2023
*
* mpi_trap.c
*
* ... Use MPI to implement a parallel version of the trapezoidal
*     rule.  Uses collective communications to distribute the
*     input data and compute the global sum.
*
* Input:    Number of trapezoids
* Output:   Estimate of the integral from a to b of f(x)
*           using the trapezoidal rule and n trapezoids.
*
* Usage:    mpirun -np <number of processes> ./mpi_trap < number of trapezoids>
*
* Algorithm:
*    1.  Each process calculates "its" interval of
*        integration.
*    2.  Each process estimates the integral of f(x)
*        over its interval using the trapezoidal rule.
*    3a. Each process != 0 sends its integral to 0.
*    3b. Process 0 sums the calculations received from
*        the individual processes and prints the result.
*
* Note:  f(x) is sin(x) in this example. The integral from 0 to Pi of
*         sin(x) is known to be 2.0, so we know if our solution is correct.
*
*/
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <math.h>
#include <mpi.h>

const double a = 0;
const double b = M_PI; // pi from math.h

/* Function declarations */
void Get_input(int argc, char* argv[], int my_rank, double* n_p);
double Trap(double left_endpt, double right_endpt, int trap_count,
double base_len);
double f(double x);

int main(int argc, char** argv) {
    int my_rank, comm_sz, local_n;
    double n, h, local_a, local_b;
    double local_int, total_int;
    double start, finish, loc_elapsed, elapsed;

MPI_Init(NULL, NULL);
    MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &comm_sz);

Get_input(argc, argv, my_rank, &n); /*Read user input */

/*Note: h and local_n are the same for all processes*/
    h = (b-a)/n;          /* length of each trapezoid */
    local_n = n/comm_sz;  /* number of trapezoids per process */

/* Length of each process' interval of integration = local_n*h. */
    local_a = a + my_rank*local_n*h;
    local_b = local_a + local_n*h;

MPI_Barrier(MPI_COMM_WORLD);
    start = MPI_Wtime();

/* Calculate each process' local integral using local endpoints*/
    local_int = Trap(local_a, local_b, local_n, h);

finish = MPI_Wtime();
    loc_elapsed = finish-start;
    MPI_Reduce(&loc_elapsed, &elapsed, 1, MPI_DOUBLE, MPI_MAX, 0, MPI_COMM_WORLD);

/* Add up the integrals calculated by each process */
    MPI_Reduce(&local_int, &total_int, 1, MPI_DOUBLE, MPI_SUM, 0,
            MPI_COMM_WORLD);

if (my_rank == 0) {
        printf("With n = %.0f trapezoids, our estimate\n", n);

printf("of the integral from %f to %f of sin(x) = %f\n",
            a, b, total_int);
        printf("Elapsed time = %f milliseconds \n", elapsed * 1000);
    }

/* Shut down MPI */
    MPI_Finalize();

return 0;
} /*  main  */

/*------------------------------------------------------------------
* Function:     Get_input
* Purpose:      Get the user input: the number of trapezoids
* Input args:   my_rank:  process rank in MPI_COMM_WORLD
*               comm_sz:  number of processes in MPI_COMM_WORLD
* Output args:  n_p:  pointer to number of trapezoids
*/
void Get_input(int argc, char* argv[], int my_rank, double* n_p){
    if (my_rank == 0) {
        if (argc!= 2){
            fprintf(stderr, "usage: mpirun -np <N> %s <number of trapezoids> \n", argv[0]);
            fflush(stderr);
            *n_p = -1;
        } else {
            *n_p = atoi(argv[1]);
        }
    }
    // Broadcasts value of n to each process
    MPI_Bcast(n_p, 1, MPI_DOUBLE, 0, MPI_COMM_WORLD);

// negative n ends the program
    if (*n_p <= 0) {
        MPI_Finalize();
        exit(-1);
    }
}  /* Get_input */

/*------------------------------------------------------------------
* Function:     Trap
* Purpose:      Serial function for estimating a definite integral
*               using the trapezoidal rule
* Input args:   left_endpt
*               right_endpt
*               trap_count
*               base_len
* Return val:   Trapezoidal rule estimate of integral from
*               left_endpt to right_endpt using trap_count
*               trapezoids
*/
double Trap(double left_endpt, double right_endpt, int trap_count, double base_len) {
    double estimate, x;
    int i;

estimate = (f(left_endpt) + f(right_endpt))/2.0;
    for (i = 1; i <= trap_count-1; i++) {
        x = left_endpt + i*base_len;
        estimate += f(x);
    }
    estimate = estimate*base_len;

return estimate;
} /*  Trap  */

/*------------------------------------------------------------------
* Function:    f
* Purpose:     Compute value of function to be integrated
* Input args:  x
*/

double f(double x) {
    return sin(x);
} /* f */

/////////////////////////////////// Pacheco test copyright disclaimer
// Redistribution and use in source and binary forms, with or without
// modification, are permitted provided that the following conditions
// are met:

//    - Redistributions of source code must retain the above copyright
//      notice, this list of conditions and the following disclaimer.
//    - Redistributions in binary form must reproduce the above
//      copyright notice, this list of conditions and the following
//      disclaimer in the documentation and/or other materials provided
//      with the distribution.

// THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
// "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
// LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS
// FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE
// COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT,
// INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING,
// BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
// LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
// CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
// LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN
// ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE
// POSSIBILITY OF SUCH DAMAGE.

You have attempted of activities on this page