Standard Forms

PART II: SECOND ORDER EQUATIONS
Section 11
Standard Forms
James V. Herod*

*(c) Copyright 1993,1994,1995 by James V. Herod, herod@math.gatech.edu. All rights reserved.

Web page maintained by Evans M. Harrell, II, harrell@math.gatech.edu.


In this section, we review the procedures to get second-order equations into standard forms, and compare these with the techniques to get second-order partial differential equations with constant coefficients into standard form.

Performing these algebraic procedures corresponds to the geometric notions of translations and rotations.

For example, the equation

x2 - 3y2 - 8x + 30y = 50

represents a hyperbola. To draw the graph of the hyperbola, one algebraically factors the equation or, geometrically, translates the axes:

(x - 4)2 - 3(y - 5)2 = 1.

Now, the response that this is a hyperbola with center {4,5} is expected. More detailed information about the direction of the major and minor axes could be made, but these are not notions that we will wish to carry over to the process of getting second order partial differential equations into standard forms.

There is another idea more appropriate. Rather than keeping the hyperbola in the Euclidean plane where it now has the equation

x2 - 3y2 = 1

in the translated form, think of this hyperbola in the Cartesian plane, and do not insist that the x axis and the y axis have the same scale. In this particular case, keep the x axis the same size and expand the y axis so that every unit is the old unit multiplying by R(3). Algebraically, one commonly writes that there are new coordinates {x',y'} related to the old coordinates by

x = x' , R(3) y = y'.

The algebraic effect is that the equation is transformed into an equation in {x',y'} coordinates:

x' 2 - y' 2 = 1.

Pay attention to the fact that it is a mistake to carry over too much of the geometric language for the form. For example if the original quadratic equation had been

x2 + 3y2 - 8x + 30y = 50

and we had translated axes to produce

x2 + 3y2 = 50,

and then rescaled the axes to get

x2 + y2 = 50

we have not changed an ellipse into a circle for a circle is a geometric object whose very definition involves the notion of distance. The process of changing the scale on the x axis and the y axis certainly destroys the entire notion of distance being the same in all directions.

Rather, the rescaling is an idea that is algebraically simplifying.

Before we pursue the idea of rescaling and translating in second order partial differential equations in order to come up with standard forms, we need to recall that there is also the troublesome need to rotate the axis in order to get some quadratic forms into the standard one. For example, if the equation is

xy = 2,

we quickly recognize this as a quadratic equation. Even more, we could draw the graph. If pushed, we would identify the resulting geometric figure as a hyperbola. We ask for more here since these geometric ideas are more readily transformed into ideas about partial differential equations if they are converted into algebraic ideas. The question, then, is how do we achieve the algebraic representation of the hyperbola in standard form?

One recalls from analytic geometry, or recognizes from simply looking at the picture of the graph of the equation, that this hyperbola has been rotated out of standard form. To see it in standard form, we must rotate the axes. One forgets the details of how this rotation is performed, but should know a reference to find the scheme. From here on, the following may serve as your reference!

Here is the rotation needed to remove the xy term in the equation

ax2 + bxy + cy2 + dx + ey + f = 0. (11.1)

The new coordinates {x', y'} are given by

B(A(x',y')) = B(ACO2( cos([[alpha]]), sin([[alpha]]),-sin([[alpha]]), cos([[alpha]]))) B(A(x,y)), (11.2)

where

[[alpha]] = B(A([[pi]]/4 if a = c, F(1,2) arctan( F(b,a-c) ) if a != c)). (11.3)

And equivalent way to write (11.2) is to multiply both sides by the inverse of the matrix:

B(A(x,y)) = B(ACO2(cos([[alpha]]), -sin([[alpha]]),sin([[alpha]]), cos([[alpha]]))) B(A(x',y')). (11.4)

Thus, substitute x = x' cos([[alpha]]) - y' sin([[alpha]]) and y = x' sin([[alpha]]) + y' cos([[alpha]]) into the equation (11.1), where [[alpha]] is as indicated above and the cross term, bxy, will disappear.

Given a general quadratic in two variables, there are three things that need to be done to get it into standard form: get rid of the xy terms, factor all the x terms and the y terms separately, and rescale the axes so that the coefficients of the x2 term and the y2 terms are the same. Geometrically this corresponds, as we have recalled, to a rotation, translation, and expansion, respectively. From the geometric point of view, it does not matter which is done first: the rotation and then the translation, or vice versa. Algebraically, it is better to remove the xy terms first, for then the factoring is easier.

Finally, it should also be remembered that the sign of b2 - 4ac can be used to predict whether the curve is a hyperbola, parabola, or ellipse.

What follows is a MAPLE program for removing the cross term. Using the program assures fast accurate computation for otherwise tedious calculations to determine the rotation for equations of the form

ax2 + bxy + cy2 +dx +ey + f = 0.

The coefficients are read in first.

* a:=?: b:=?:c:=?:d:=?:e:=?:f:=?:

The angle is determined and the rotation is performed.

*if a=c then alpha := Pi/4

else alpha:=arctan(b/(a-c))/2 fi;

* si:=sin(alpha);

* co:=cos(alpha);

* x:=co*u-si*v; y:=si*u+co*v;

* q:=a*x^2+b*x*y+c*y^2+d*x+e*y+f;

* simplify(");

* evalf(");

The purpose of the previous paragraphs recalling how to change algebraic equations representing two dimensional conic sections into standard form was to suggest that the same ideas carry over almost unchanged for the second degree partial differential equations. The techniques will change these equations into the standard forms for elliptic, hyperbolic, or parabolic partial differential equations.

Here are the forms we want:

Elliptic Equations: L[u] = F([[partialdiff]]2u,[[partialdiff]]x2) + F([[partialdiff]]2u,[[partialdiff]]y2) +/- k u [[equivalence]] --2 u +/- k u,

Hyperbolic Equations: L[u] = F([[partialdiff]]2u,[[partialdiff]]x2) - F([[partialdiff]]2u,[[partialdiff]]y2) +/- k u,

Parabolic Equation: L[u] = F([[partialdiff]]u,[[partialdiff]]x) - F([[partialdiff]]2u,[[partialdiff]]y2) +/- u.

The choice for which form to use is determined by the techniques used to solve the equations.

We turn now to the techniques of arriving at standard forms for partial differential equations.

If one has a second order partial differential equation with constant coefficients that is not in the standard form, there is a method to change it into this form. The techniques are similar to those used in the analytic geometry. Having the standard form, one might then solve the equation. Finally, the solution should be transformed back into the original coordinate system.

We will illustrate the procedure for transformation of a second order equation into standard form. Consider the equation

11 F([[partialdiff]]2u,[[partialdiff]]x2) + 4R(3) F([[partialdiff]]2u,[[partialdiff]]x[[partialdiff]]y) + 7 F([[partialdiff]]2u,[[partialdiff]]x2) - 5u = 0.

In the original equation, if we think of the equation as

a F([[partialdiff]]2u,[[partialdiff]]x2) + b F([[partialdiff]]2u,[[partialdiff]]x[[partialdiff]]y) + c F([[partialdiff]]2u,[[partialdiff]]y2) + d F([[partialdiff]]u,[[partialdiff]]x) + e F([[partialdiff]]u,[[partialdiff]]y) + f u = 0.

Then, a = 11, b = 4 R(3), c = 7, so that b2 - 4 a c = -260 and we identify this as an elliptic equation.

We would like to transform the equation into the form

F([[partialdiff]]2u,[[partialdiff]]x2) + F([[partialdiff]]2u,[[partialdiff]]y2) + cu = 0.

Introduce new coordinates ([[xi]],[[eta]]) by rotation of axes so that in the transformed equation the mixed second partial derivative does not appear. Let

B(A([[xi]],[[eta]])) = B(ACO2( cos([[alpha]]), sin([[alpha]]),-sin([[alpha]]), cos([[alpha]]))) B(A(x,y))

or,

B(A(x,y)) = B(ACO2(cos([[alpha]]), -sin([[alpha]]),sin([[alpha]]), cos([[alpha]]))) B(A([[xi]],[[eta]])).

Using the chain rule, F([[partialdiff]] ,[[partialdiff]]x) = cos([[alpha]]) F([[partialdiff]] ,[[partialdiff]][[xi]]) - sin([[alpha]]) F([[partialdiff]] ,[[partialdiff]][[eta]])

and F([[partialdiff]] ,[[partialdiff]]y) = sin([[alpha]]) F([[partialdiff]] ,[[partialdiff]][[xi]]) + cos([[alpha]]) F([[partialdiff]] ,[[partialdiff]][[eta]]) .

It follows that

F([[partialdiff]]2 ,[[partialdiff]]x2) = F([[partialdiff]] ,[[partialdiff]]x) F([[partialdiff]] ,[[partialdiff]]x) = ( cos([[alpha]]) F([[partialdiff]] ,[[partialdiff]][[xi]]) - sin([[alpha]]) F([[partialdiff]] ,[[partialdiff]][[eta]]) )( cos([[alpha]]) F([[partialdiff]] ,[[partialdiff]][[xi]]) - sin([[alpha]]) F([[partialdiff]] ,[[partialdiff]][[eta]]) )

so that

F([[partialdiff]]2 ,[[partialdiff]]x2) = cos2([[alpha]]) F([[partialdiff]]2 ,[[partialdiff]][[xi]]2) - 2 sin([[alpha]])cos([[alpha]]) F([[partialdiff]]2 ,[[partialdiff]][[xi]][[partialdiff]][[eta]]) + sin2([[alpha]]) F([[partialdiff]]2 ,[[partialdiff]][[eta]]2) .

In a similar manner,

F([[partialdiff]]2 ,[[partialdiff]]x [[partialdiff]]y) =

sin([[alpha]]) cos([[alpha]]) F([[partialdiff]]2 ,[[partialdiff]][[xi]]2) + (cos2([[alpha]]) - sin2([[alpha]]) ) F([[partialdiff]]2 ,[[partialdiff]][[xi]] [[partialdiff]][[eta]]) - sin([[alpha]])cos([[alpha]]) F([[partialdiff]]2 ,[[partialdiff]][[eta]]2) ,

and

F([[partialdiff]]2 ,[[partialdiff]]y) = sin2([[alpha]]) F([[partialdiff]]2 ,[[partialdiff]][[xi]]2) + 2 sin([[alpha]]) cos([[alpha]]) F([[partialdiff]]2 ,[[partialdiff]][[xi]] [[partialdiff]][[eta]]) + cos2([[alpha]]) F([[partialdiff]]2 ,[[partialdiff]][[eta]]2) .

The original equation described u as a function of x and y. We now define v as a function of [[xi]] and [[eta]] by v([[xi]],[[eta]]) = u(x,y). The variables [[xi]] and [[eta]] are related to x and y as described by the rotation above:

v([[xi]],[[eta]]) = u(x([[xi]],[[eta]]), y([[xi]],[[eta]]))

= u(cos([[alpha]]) [[xi]] - sin([[alpha]]) [[eta]], sin([[alpha]]) [[xi]] + cos([[alpha]]) [[eta]])

Of course, we have not specified [[alpha]] yet. This comes next.

The equation satisfied by v is

[11c2 + 4R(3)sc + 7 s2] F([[partialdiff]]2v,[[partialdiff]][[xi]]2) + [-8 sc + 4 R(3) (c2 - s2)] F([[partialdiff]]2v,[[partialdiff]][[xi]][[partialdiff]][[eta]])

+ [ 11s2 + 4R(3) sc + 7 c2] F([[partialdiff]]2v,[[partialdiff]][[eta]]2) - 5 v,

where we have used the abbreviations s = sin([[alpha]]) and c = cos([[alpha]]). The coefficient of the mixed partials will vanish if [[alpha]] is chosen so that

- 8 sin([[alpha]])cos([[alpha]]) + 4 R(3) (cos2([[alpha]]) - sin2([[alpha]])) = 0,

that is,

tan(2[[alpha]]) = R(3) / 2.

This means [[alpha]] = [[pi]]/6, sin([[alpha]]) = R(F(1- cos(2 [[alpha]]),2)) and cos([[alpha]]) = R(F(1+ cos(2 [[alpha]]),2)).

After substitution of these values, the equation satisfied by v becomes

13 F([[partialdiff]]2v,[[partialdiff]][[xi]]2) + 5 F([[partialdiff]]2v,[[partialdiff]][[eta]]2) -5 v = 0.

This special example, together with the foregoing discussions of analytic geometry makes the following statement believable: Every second order partial differential equation with constant coefficients can be transformed into one in which mixed partials are absent. The angle of rotation is, again, given by (11.3).

It is left as an exercise to see that in the general case, b2 - 4 a c is left unchanged by this rotation. Surely, MAPLE is to be used to do this calculation. (See Exercise 11.4.)

We are now ready for the second step: to remove the first order term. For economy of notation, let us assume that the given equation is already in the form

F([[partialdiff]]2u,[[partialdiff]]x2) - 4 F([[partialdiff]]2u,[[partialdiff]]y2) + 3 F([[partialdiff]]u,[[partialdiff]]x) + u = 0.

Define v by

v(x,y) = e-[[beta]]x u(x,y) or u(x,y) = e[[beta]]x v(x,y),

where [[beta]] will be chosen so that the transformed equation will have the first order derivative removed. Differentiating u and substituting into the equation we get that

F([[partialdiff]]2v,[[partialdiff]]x2) - 4 F([[partialdiff]]2v,[[partialdiff]]y2) + (2[[beta]] +3) F([[partialdiff]]v,[[partialdiff]]x) + ([[beta]]2 + 3[[beta]] + 1)v = 0.

If we choose [[beta]] = - 3/2, we have

F([[partialdiff]]2v,[[partialdiff]]x2) - 4 F([[partialdiff]]2v,[[partialdiff]]y2) - F(5,4) v = 0.

Notice that this transformation to achieve an equation lacking the first derivative with respect to x is generally possible when the coefficient on the second derivative with respect to x is not zero, and is otherwise impossible. The same statements hold for derivatives with respect to y.

The final step is rescaling. We choose variables [[xi]] and [[eta]] by

[[xi]] = u x and [[eta]] = [[nu]] y, where u and [[nu]] are chosen so that in the transformed equation the coefficients of F([[partialdiff]]2v,[[partialdiff]][[xi]]2) , F([[partialdiff]]2v,[[partialdiff]][[eta]]2) , and v are equal in absolute value. We have

F([[partialdiff]]2u,[[partialdiff]]x2) = u2 F([[partialdiff]]2v,[[partialdiff]][[xi]]2) and F([[partialdiff]]2u,[[partialdiff]]y2) = [[nu]]2 F([[partialdiff]]2v,[[partialdiff]][[eta]]2) .

Our equation becomes

u2 F([[partialdiff]]2v,[[partialdiff]][[xi]]2) - 4 [[nu]]2 F([[partialdiff]]2v,[[partialdiff]][[eta]]2) - f(5,4) v = 0.

The condition that

u2 = 4 [[nu]]2 = F(5,4)

will be satisfied if u = F(R(5),2) and [[nu]] = F(R(5),4) . Then, we obtain the standard form

F([[partialdiff]]2v,[[partialdiff]][[xi]]2) - F([[partialdiff]]2v,[[partialdiff]][[eta]]2) - v = 0.

Generalized Rotations to Remove Cross Terms

The removal of cross terms as typically presented is complicated by the necessity of finding the angle [[alpha]] of rotation and of working with the ensuing trigonometric functions. These ideas seem inappropriate for higher dimensions.

An alternate approach is to address the problem from the perspective of linear algebra, where the ideas even generalize for large dimensions. We will need the linear algebra package to make this work.

> with(linalg);

We choose the constants as the coefficients for the partial differential equations

a uxx + b uxy + c uyy + d ux + e uy + f = 0.

> a:=2: b:=3: c:=-2: d:=1: e:=1: f:=3/25:

Make a symmetric matrix with a, b/2, and c as the entries.

> A:=matrix([[a,b/2],[b/2,c]]):

It is a theorem in linear algebra that symmetric matrices have a diagonal Jordan Form.

> jordan(A,P);

We want K to be a matrix of eigenvectors.

> K:=inverse(P);

In order to make this process work, we ask that the eigenvectors -- which form the columns of K -- should have norm 1. This will cause the eigenvectors to be orthonormal.

> N1:=norm([K[1,1],K[2,1]],2): N2:=norm([K[1,2],K[2,2]],2):

> L:=matrix([[K[1,1]/N1,K[1,2]/N2],[K[2,1]/N1,K[2,2]/N2]]);

It will now be the case that (transpose(L) * A * L) is the Jordan form.

> evalm(transpose(L) &* A &* L);

To remove the cross-product term, we now perform a change of variables defining s and t.

> s:=(x,y)->evalm(transpose(L) &* vector([x,y]))[1]:

t:=(x,y)->evalm(transpose(L) &* vector([x,y]))[2]:

If we define v as indicated below, then it will satisfy a PDE with the cross-term missing and with u satisfying the original partial differential equation.

> u:=(x,y)->v(s(x,y),t(x,y)):

> a*diff(u(x,y),x,x)+b*diff(u(x,y),x,y)+c*diff(u(x,y),y,y)+

d*diff(u(x,y),x) + e*diff(u(x,y),y) + u(x,y):

> simplify(");

> collect(",D[1,2](v)(s(x,y),t(x,y)));

> pde:=collect(

collect(

collect(

collect(",D[1](v)(s(x,y),t(x,y)))

,D[2](v)(s(x,y),t(x,y)))

,D[2,2](v)(s(x,y),t(x,y)))

,D[1,1](v)(s(x,y),t(x,y)));

Thus, we solve this simpler system and create u that satisfies the original equation.

EXERCISES:

11.1. Transform the following equations into standard form:

(a) 3 F([[partialdiff]]2u,[[partialdiff]]x2) + 4 F([[partialdiff]]2u,[[partialdiff]]y2) - u = 0.

(b) 4 F([[partialdiff]]2u,[[partialdiff]]x2) + F([[partialdiff]]2u,[[partialdiff]]x[[partialdiff]]y) + 4 F([[partialdiff]]2u,[[partialdiff]]y2) + u = 0.

(c) F([[partialdiff]]2u,[[partialdiff]]x2) + F([[partialdiff]]2u,[[partialdiff]]y2) + 3 F([[partialdiff]]u,[[partialdiff]]x) - 4 F([[partialdiff]]u,[[partialdiff]]y) + 25 u = 0.

(d) F([[partialdiff]]2u,[[partialdiff]]x2) - 3 F([[partialdiff]]2u,[[partialdiff]]y2) + 2 F([[partialdiff]]u,[[partialdiff]]x) - F([[partialdiff]]u,[[partialdiff]]y) + u = 0.

(e) F([[partialdiff]]2u,[[partialdiff]]x2) - 2 F([[partialdiff]]2u,[[partialdiff]]x[[partialdiff]]y) + F([[partialdiff]]2u,[[partialdiff]]y2) + 3 u = 0.

11.2. Show that the equation

F([[partialdiff]]2u,[[partialdiff]]x2) - F([[partialdiff]]u,[[partialdiff]]y) + [[gamma]] u = f(x,y)

where [[gamma]] is any constant, can be transformed into

F([[partialdiff]]2v,[[partialdiff]]x2) - F([[partialdiff]]v,[[partialdiff]]y) = g(x,y).

11.3. Show that by rotation of the axis by 45deg. the equations

F([[partialdiff]]2u,[[partialdiff]]x2) - F([[partialdiff]]2u,[[partialdiff]]y2) = 0 and F([[partialdiff]]2u,[[partialdiff]]x[[partialdiff]]y) = 0.

can be transformed into one another. Find the general solution for both equations.

11.4 Show that upon making a rotation to remove the cross terms, then the discriminant b2 - 4ac is unchanged. 


Return to James Herod's Table of Contents

Back to preceding lecture

Onward to the next lecture

Return to Evans Harrell's