ts/tutorials/ex26.c

static char help[] = "Transient nonlinear driven cavity in 2d.\n\
  \n\
The 2D driven cavity problem is solved in a velocity-vorticity formulation.\n\
The flow can be driven with the lid or with buoyancy or both:\n\
  -lidvelocity <lid>, where <lid> = dimensionless velocity of lid\n\
  -grashof <gr>, where <gr> = dimensionless temperature gradient\n\
  -prandtl <pr>, where <pr> = dimensionless momentum/thermal diffusivity ratio\n\
  -contours : draw contour plots of solution\n\n";
/*
      See src/snes/tutorials/ex19.c for the steady-state version.

      There used to be a SNES example (src/snes/tutorials/ex27.c) that
      implemented this algorithm without using TS and was used for the numerical
      results in the paper

        Todd S. Coffey and C. T. Kelley and David E. Keyes, Pseudotransient
        Continuation and Differential-Algebraic Equations, 2003.

      That example was removed because it used obsolete interfaces, but the
      algorithms from the paper can be reproduced using this example.

      Note: The paper describes the algorithm as being linearly implicit but the
      numerical results were created using nonlinearly implicit Euler.  The
      algorithm as described (linearly implicit) is more efficient and is the
      default when using TSPSEUDO.  If you want to reproduce the numerical
      results from the paper, you'll have to change the SNES to converge the
      nonlinear solve (e.g., -snes_type newtonls).  The DAE versus ODE variants
      are controlled using the -parabolic option.

      Comment preserved from snes/tutorials/ex27.c, since removed:

        [H]owever Figure 3.1 was generated with a slightly different algorithm
        (see targets runex27 and runex27_p) than described in the paper.  In
        particular, the described algorithm is linearly implicit, advancing to
        the next step after one Newton step, so that the steady state residual
        is always used, but the figure was generated by converging each step to
        a relative tolerance of 1.e-3.  On the example problem, setting
        -snes_type ksponly has only minor impact on number of steps, but
        significantly reduces the required number of linear solves.

      See also https://lists.mcs.anl.gov/pipermail/petsc-dev/2010-March/002362.html
*/
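/*
      Example invocations for the two variants discussed above.  This is only
      a sketch: the executable name ./ex26, the 20x20 grid, and the use of
      -ts_monitor are assumptions, and these runs are not part of the test
      suite at the end of this file.

        Linearly implicit pseudotransient continuation (one Newton step per
        time step):
          ./ex26 -ts_type pseudo -snes_type ksponly -da_grid_x 20 -da_grid_y 20 -ts_monitor

        Each time step converged by Newton with line search to a relative
        tolerance of 1.e-3, as described for the paper's figure:
          ./ex26 -ts_type pseudo -snes_type newtonls -snes_rtol 1e-3 -da_grid_x 20 -da_grid_y 20 -ts_monitor

      Appending -parabolic to either command selects the parabolic (ODE)
      variant instead of the differential-algebraic one.
*/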

/*T
   Concepts: TS^solving a system of nonlinear equations (parallel multicomponent example);
   Concepts: DMDA^using distributed arrays;
   Concepts: TS^multicomponent
   Concepts: TS^differential-algebraic equation
   Processors: n
T*/
/* ------------------------------------------------------------------------

    We thank David E. Keyes for contributing the driven cavity discretization
    within this example code.

    This problem is modeled by the partial differential equation system

        - Lap(U) - Grad_y(Omega) = 0
        - Lap(V) + Grad_x(Omega) = 0
        Omega_t - Lap(Omega) + Div([U*Omega,V*Omega]) - GR*Grad_x(T) = 0
        T_t - Lap(T) + PR*Div([U*T,V*T]) = 0

    in the unit square, which is discretized uniformly in each of x and
    y in this simple encoding.

    No-slip, rigid-wall Dirichlet conditions are used for [U,V].
    Dirichlet conditions are used for Omega, based on the definition of
    vorticity: Omega = - Grad_y(U) + Grad_x(V), where along each
    constant coordinate boundary, the tangential derivative is zero.
    Dirichlet conditions are used for T on the left and right walls,
    and insulating (homogeneous Neumann) conditions are used for T on
    the top and bottom walls.

    A finite difference approximation with the usual 5-point stencil
    is used to discretize the boundary value problem to obtain a
    nonlinear system of equations.  Upwind differencing is used for the
    divergence (convective) terms and central differencing for the
    gradient (source) terms.

    The Jacobian can be either
      * formed via finite differencing using coloring (the default), or
      * applied matrix-free via the option -snes_mf
        (for larger grid problems this variant may not converge
        without a preconditioner due to ill-conditioning).

  ------------------------------------------------------------------------- */

/*
   Include "petscdmda.h" so that we can use distributed arrays (DMDAs).
   Include "petscts.h" so that we can use TS solvers.  Note that petscts.h
   automatically includes:
     petscsys.h    - base PETSc routines   petscvec.h  - vectors
     petscmat.h    - matrices              petscis.h   - index sets
     petscviewer.h - viewers               petscpc.h   - preconditioners
     petscksp.h    - Krylov subspace (linear) solvers
     petscsnes.h   - nonlinear solvers
*/
#include <petscts.h>
#include <petscdm.h>
#include <petscdmda.h>

/*
   User-defined routines and data structures
*/
typedef struct {
  PetscScalar u,v,omega,temp;
} Field;

PetscErrorCode FormIFunctionLocal(DMDALocalInfo*,PetscReal,Field**,Field**,Field**,void*);

typedef struct {
  PetscReal   lidvelocity,prandtl,grashof;   /* physical parameters */
  PetscBool   parabolic;                     /* allow a transient term corresponding roughly to artificial compressibility */
  PetscReal   cfl_initial;                   /* CFL for first time step */
} AppCtx;

PetscErrorCode FormInitialSolution(TS,Vec,AppCtx*);

int main(int argc,char **argv)
{
  AppCtx            user;             /* user-defined work context */
  PetscInt          mx,my,steps;
  PetscErrorCode    ierr;
  TS                ts;
  DM                da;
  Vec               X;
  PetscReal         ftime;
  TSConvergedReason reason;

  ierr = PetscInitialize(&argc,&argv,(char*)0,help);if (ierr) return ierr;
  ierr = TSCreate(PETSC_COMM_WORLD,&ts);CHKERRQ(ierr);
  ierr = DMDACreate2d(PETSC_COMM_WORLD,DM_BOUNDARY_NONE,DM_BOUNDARY_NONE,DMDA_STENCIL_STAR,4,4,PETSC_DECIDE,PETSC_DECIDE,4,1,0,0,&da);CHKERRQ(ierr);
  ierr = DMSetFromOptions(da);CHKERRQ(ierr);
  ierr = DMSetUp(da);CHKERRQ(ierr);
  ierr = TSSetDM(ts,(DM)da);CHKERRQ(ierr);

  ierr = DMDAGetInfo(da,0,&mx,&my,PETSC_IGNORE,PETSC_IGNORE,PETSC_IGNORE,PETSC_IGNORE,PETSC_IGNORE,PETSC_IGNORE,
                     PETSC_IGNORE,PETSC_IGNORE,PETSC_IGNORE,PETSC_IGNORE);CHKERRQ(ierr);
  /*
     Problem parameters (lid velocity, Prandtl and Grashof numbers)
  */
  user.lidvelocity = 1.0/(mx*my);
  user.prandtl     = 1.0;
  user.grashof     = 1.0;
  user.parabolic   = PETSC_FALSE;
  user.cfl_initial = 50.;

  ierr = PetscOptionsBegin(PETSC_COMM_WORLD,NULL,"Driven cavity/natural convection options","");CHKERRQ(ierr);
  ierr = PetscOptionsReal("-lidvelocity","Lid velocity, related to Reynolds number","",user.lidvelocity,&user.lidvelocity,NULL);CHKERRQ(ierr);
  ierr = PetscOptionsReal("-prandtl","Ratio of viscous to thermal diffusivity","",user.prandtl,&user.prandtl,NULL);CHKERRQ(ierr);
  ierr = PetscOptionsReal("-grashof","Ratio of buoyant to viscous forces","",user.grashof,&user.grashof,NULL);CHKERRQ(ierr);
  ierr = PetscOptionsBool("-parabolic","Relax incompressibility to make the system parabolic instead of differential-algebraic","",user.parabolic,&user.parabolic,NULL);CHKERRQ(ierr);
  ierr = PetscOptionsReal("-cfl_initial","Advective CFL for the first time step","",user.cfl_initial,&user.cfl_initial,NULL);CHKERRQ(ierr);
  ierr = PetscOptionsEnd();CHKERRQ(ierr);

  ierr = DMDASetFieldName(da,0,"x-velocity");CHKERRQ(ierr);
  ierr = DMDASetFieldName(da,1,"y-velocity");CHKERRQ(ierr);
  ierr = DMDASetFieldName(da,2,"Omega");CHKERRQ(ierr);
  ierr = DMDASetFieldName(da,3,"temperature");CHKERRQ(ierr);

  /* - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
     Create time integration context: attach the user context and the
     local implicit residual to the DMDA, then set step limits and the
     initial time step.
     - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - */
  ierr = DMSetApplicationContext(da,&user);CHKERRQ(ierr);
  ierr = DMDATSSetIFunctionLocal(da,INSERT_VALUES,(DMDATSIFunctionLocal)FormIFunctionLocal,&user);CHKERRQ(ierr);
  ierr = TSSetMaxSteps(ts,10000);CHKERRQ(ierr);
  ierr = TSSetMaxTime(ts,1e12);CHKERRQ(ierr);
  ierr = TSSetExactFinalTime(ts,TS_EXACTFINALTIME_STEPOVER);CHKERRQ(ierr);
  /* Initial step from the advective CFL number: dt = CFL/(lidvelocity*mx), i.e. CFL*hx/U with hx taken as 1/mx */
  ierr = TSSetTimeStep(ts,user.cfl_initial/(user.lidvelocity*mx));CHKERRQ(ierr);
  ierr = TSSetFromOptions(ts);CHKERRQ(ierr);

  ierr = PetscPrintf(PETSC_COMM_WORLD,"%Dx%D grid, lid velocity = %g, prandtl # = %g, grashof # = %g\n",mx,my,(double)user.lidvelocity,(double)user.prandtl,(double)user.grashof);CHKERRQ(ierr);

  /* - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
     Create the solution vector, set the initial condition, and
     integrate in time
     - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - */

  ierr = DMCreateGlobalVector(da,&X);CHKERRQ(ierr);
  ierr = FormInitialSolution(ts,X,&user);CHKERRQ(ierr);

  ierr = TSSolve(ts,X);CHKERRQ(ierr);
  ierr = TSGetSolveTime(ts,&ftime);CHKERRQ(ierr);
  ierr = TSGetStepNumber(ts,&steps);CHKERRQ(ierr);
  ierr = TSGetConvergedReason(ts,&reason);CHKERRQ(ierr);

  ierr = PetscPrintf(PETSC_COMM_WORLD,"%s at time %g after %D steps\n",TSConvergedReasons[reason],(double)ftime,steps);CHKERRQ(ierr);

  /* - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
     Free work space.  All PETSc objects should be destroyed when they
     are no longer needed.
     - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - */
  ierr = VecDestroy(&X);CHKERRQ(ierr);
  ierr = DMDestroy(&da);CHKERRQ(ierr);
  ierr = TSDestroy(&ts);CHKERRQ(ierr);

  ierr = PetscFinalize();
  return ierr;
}

/* ------------------------------------------------------------------- */

/*
   FormInitialSolution - Forms the initial condition.

   Input Parameters:
   ts   - time-stepping context, used to obtain the DMDA
   user - user-defined application context

   Output Parameter:
   X - vector holding the initial condition
 */
PetscErrorCode FormInitialSolution(TS ts,Vec X,AppCtx *user)
{
  DM             da;
  PetscInt       i,j,mx,xs,ys,xm,ym;
  PetscErrorCode ierr;
  PetscReal      grashof,dx;
  Field          **x;

  PetscFunctionBeginUser;
  grashof = user->grashof;
  ierr    = TSGetDM(ts,&da);CHKERRQ(ierr);
  ierr    = DMDAGetInfo(da,0,&mx,0,0,0,0,0,0,0,0,0,0,0);CHKERRQ(ierr);
  dx      = 1.0/(mx-1);

  /*
     Get local grid boundaries (for 2-dimensional DMDA):
       xs, ys   - starting grid indices (no ghost points)
       xm, ym   - widths of local grid (no ghost points)
  */
  ierr = DMDAGetCorners(da,&xs,&ys,NULL,&xm,&ym,NULL);CHKERRQ(ierr);

  /*
     Get a pointer to vector data.
       - DMDAVecGetArray() returns the data as a 2D array of Field structs
         that is indexed with global grid indices, x[j][i].
       - You MUST call DMDAVecRestoreArray() when you no longer need access to
         the array.
  */
  ierr = DMDAVecGetArray(da,X,&x);CHKERRQ(ierr);

  /*
     Compute the initial condition over the locally owned part of the grid.
     The fluid is motionless.  When grashof > 0 the temperature is the linear
     profile i*dx, which runs from 0 on the left wall to 1 on the right wall;
     otherwise the temperature is zero everywhere.
  */
  for (j=ys; j<ys+ym; j++) {
    for (i=xs; i<xs+xm; i++) {
      x[j][i].u     = 0.0;
      x[j][i].v     = 0.0;
      x[j][i].omega = 0.0;
      x[j][i].temp  = (grashof>0)*i*dx;
    }
  }

  /*
     Restore vector
  */
  ierr = DMDAVecRestoreArray(da,X,&x);CHKERRQ(ierr);
  PetscFunctionReturn(0);
}

PetscErrorCode FormIFunctionLocal(DMDALocalInfo *info,PetscReal ptime,Field **x,Field **xdot,Field **f,void *ptr)
{
  AppCtx         *user = (AppCtx*)ptr;
  PetscErrorCode ierr;
  PetscInt       xints,xinte,yints,yinte,i,j;
  PetscReal      hx,hy,dhx,dhy,hxdhy,hydhx;
  PetscReal      grashof,prandtl,lid;
  PetscScalar    u,udot,uxx,uyy,vx,vy,avx,avy,vxp,vxm,vyp,vym;

  PetscFunctionBeginUser;
  grashof = user->grashof;
  prandtl = user->prandtl;
  lid     = user->lidvelocity;

  /*
     Define mesh spacings and their ratios for the uniform grid.

     Note: The FD formulae below are normalized by multiplying through by the
     local volume element (i.e., hx*hy) to obtain O(1) coefficients in two
     dimensions.
  */
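  /*
     Worked scaling for the interior stencils, as a sketch in the notation of
     the header comment.  The subscripts W, E, S, N are illustrative names for
     the west, east, south and north neighbor values; they are not identifiers
     in this file.

       -u_xx * hx*hy          ~ (2*u - u_W - u_E)/hx^2 * hx*hy     = (2*u - u_W - u_E) * hydhx
       -u_yy * hx*hy          ~ (2*u - u_S - u_N)/hy^2 * hx*hy     = (2*u - u_S - u_N) * hxdhy
       Grad_y(Omega) * hx*hy  ~ (Omega_N - Omega_S)/(2*hy) * hx*hy = .5*(Omega_N - Omega_S) * hx
       Grad_x(Omega) * hx*hy  ~ (Omega_E - Omega_W)/(2*hx) * hx*hy = .5*(Omega_E - Omega_W) * hy

     with hydhx = hy/hx and hxdhy = hx/hy.  In the code below the
     time-derivative terms and the boundary rows are used as written,
     without the hx*hy factor.
  */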
  dhx   = (PetscReal)(info->mx-1);  dhy = (PetscReal)(info->my-1);
  hx    = 1.0/dhx;                   hy = 1.0/dhy;
  hxdhy = hx*dhy;                 hydhx = hy*dhx;

  xints = info->xs; xinte = info->xs+info->xm; yints = info->ys; yinte = info->ys+info->ym;

  /* Test whether we are on the bottom edge of the global array */
  if (yints == 0) {
    j     = 0;
    yints = yints + 1;
    /* bottom edge: no-slip wall, Omega = -Grad_y(U) (one-sided), insulated in T */
    for (i=info->xs; i<info->xs+info->xm; i++) {
      f[j][i].u     = x[j][i].u;
      f[j][i].v     = x[j][i].v;
      f[j][i].omega = x[j][i].omega + (x[j+1][i].u - x[j][i].u)*dhy;
      f[j][i].temp  = x[j][i].temp-x[j+1][i].temp;
    }
  }

  /* Test whether we are on the top edge of the global array */
  if (yinte == info->my) {
    j     = info->my - 1;
    yinte = yinte - 1;
    /* top edge: lid moving with velocity lid, Omega = -Grad_y(U) (one-sided), insulated in T */
    for (i=info->xs; i<info->xs+info->xm; i++) {
      f[j][i].u     = x[j][i].u - lid;
      f[j][i].v     = x[j][i].v;
      f[j][i].omega = x[j][i].omega + (x[j][i].u - x[j-1][i].u)*dhy;
      f[j][i].temp  = x[j][i].temp-x[j-1][i].temp;
    }
  }

  /* Test whether we are on the left edge of the global array */
  if (xints == 0) {
    i     = 0;
    xints = xints + 1;
    /* left edge: no-slip wall, Omega = Grad_x(V) (one-sided), T = 0 */
    for (j=info->ys; j<info->ys+info->ym; j++) {
      f[j][i].u     = x[j][i].u;
      f[j][i].v     = x[j][i].v;
      f[j][i].omega = x[j][i].omega - (x[j][i+1].v - x[j][i].v)*dhx;
      f[j][i].temp  = x[j][i].temp;
    }
  }

  /* Test whether we are on the right edge of the global array */
  if (xinte == info->mx) {
    i     = info->mx - 1;
    xinte = xinte - 1;
    /* right edge: no-slip wall, Omega = Grad_x(V) (one-sided), T = 1 if grashof > 0, else T = 0 */
    for (j=info->ys; j<info->ys+info->ym; j++) {
      f[j][i].u     = x[j][i].u;
      f[j][i].v     = x[j][i].v;
      f[j][i].omega = x[j][i].omega - (x[j][i].v - x[j][i-1].v)*dhx;
      f[j][i].temp  = x[j][i].temp - (PetscReal)(grashof>0);
    }
  }

  /* Compute over the interior points */
  for (j=yints; j<yinte; j++) {
    for (i=xints; i<xinte; i++) {

      /*
        Convective coefficients for upwinding.  For real scalars,
        vxp = max(vx,0) and vxm = min(vx,0), so each convective term
        below differences against the upstream neighbor (likewise for vy).
      */
      vx  = x[j][i].u; avx = PetscAbsScalar(vx);
      vxp = .5*(vx+avx); vxm = .5*(vx-avx);
      vy  = x[j][i].v; avy = PetscAbsScalar(vy);
      vyp = .5*(vy+avy); vym = .5*(vy-avy);

      /* U velocity; the time derivative is included only with -parabolic */
      u         = x[j][i].u;
      udot      = user->parabolic ? xdot[j][i].u : 0.;
      uxx       = (2.0*u - x[j][i-1].u - x[j][i+1].u)*hydhx;
      uyy       = (2.0*u - x[j-1][i].u - x[j+1][i].u)*hxdhy;
      f[j][i].u = udot + uxx + uyy - .5*(x[j+1][i].omega-x[j-1][i].omega)*hx;

      /* V velocity; the time derivative is included only with -parabolic */
      u         = x[j][i].v;
      udot      = user->parabolic ? xdot[j][i].v : 0.;
      uxx       = (2.0*u - x[j][i-1].v - x[j][i+1].v)*hydhx;
      uyy       = (2.0*u - x[j-1][i].v - x[j+1][i].v)*hxdhy;
      f[j][i].v = udot + uxx + uyy + .5*(x[j][i+1].omega-x[j][i-1].omega)*hy;

      /* Omega */
      u             = x[j][i].omega;
      uxx           = (2.0*u - x[j][i-1].omega - x[j][i+1].omega)*hydhx;
      uyy           = (2.0*u - x[j-1][i].omega - x[j+1][i].omega)*hxdhy;
      f[j][i].omega = (xdot[j][i].omega + uxx + uyy
                       + (vxp*(u - x[j][i-1].omega)
                          + vxm*(x[j][i+1].omega - u)) * hy
                       + (vyp*(u - x[j-1][i].omega)
                          + vym*(x[j+1][i].omega - u)) * hx
                       - .5 * grashof * (x[j][i+1].temp - x[j][i-1].temp) * hy);

      /* Temperature */
      u            = x[j][i].temp;
      uxx          = (2.0*u - x[j][i-1].temp - x[j][i+1].temp)*hydhx;
      uyy          = (2.0*u - x[j-1][i].temp - x[j+1][i].temp)*hxdhy;
      f[j][i].temp =  (xdot[j][i].temp + uxx + uyy
                       + prandtl * ((vxp*(u - x[j][i-1].temp)
                                     + vxm*(x[j][i+1].temp - u)) * hy
                                    + (vyp*(u - x[j-1][i].temp)
                                       + vym*(x[j+1][i].temp - u)) * hx));
    }
  }

  /*
     Flop count (multiply-adds are counted as 2 operations)
  */
  ierr = PetscLogFlops(84.0*info->ym*info->xm);CHKERRQ(ierr);
  PetscFunctionReturn(0);
}

/*TEST

    test:
      args: -da_grid_x 20 -da_grid_y 20 -lidvelocity 100 -grashof 1e3 -ts_max_steps 100 -ts_rtol 1e-3 -ts_atol 1e-3 -ts_type rosw -ts_rosw_type ra3pw -ts_monitor -ts_monitor_solution_vtk 'foo-%03D.vts'
      requires: !complex !single

    test:
      suffix: 2
      nsize: 4
      args: -da_grid_x 20 -da_grid_y 20 -lidvelocity 100 -grashof 1e3 -ts_max_steps 100 -ts_rtol 1e-3 -ts_atol 1e-3 -ts_type rosw -ts_rosw_type ra3pw -ts_monitor -ts_monitor_solution_vtk 'foo-%03D.vts'
      requires: !complex !single

    test:
      suffix: 3
      nsize: 4
      args: -da_refine 2 -lidvelocity 100 -grashof 1e3 -ts_max_steps 10 -ts_rtol 1e-3 -ts_atol 1e-3 -pc_type none -ts_type beuler -ts_monitor -snes_monitor_short -snes_type aspin -da_overlap 4
      requires: !complex !single

    test:
      suffix: 4
      nsize: 2
      args: -da_refine 1 -lidvelocity 100 -grashof 1e3 -ts_max_steps 10 -ts_rtol 1e-3 -ts_atol 1e-3
      requires: !complex !single

    test:
      suffix: asm
      nsize: 4
      args: -da_refine 1 -lidvelocity 100 -grashof 1e3 -ts_max_steps 10 -ts_rtol 1e-3 -ts_atol 1e-3
      requires: !complex !single

TEST*/