Skip to main content

Section 4.3 The General Differentiation Rules

. . . mathematical analysis is to the technical man merely a tool or instrument for doing his work, of which he must have not only the knowledge and understanding but also the expertness which he has with other tools.
Arthur S. Hathaway
 13 
https://en.wikipedia.org/wiki/Arthur_Stafford_Hathaway
, 1855—1934
It is better to do the right problem the wrong way than the wrong problem the right way.
Richard W. Hamming
 14 
https://mathshistory.st-andrews.ac.uk/Biographies/Hamming/
(1915—1998)
The General Differentiation Rules are surprisingly easy to learn and remember. Moreover, despite the fact that we have six entries in our table there are really only three rules. The Constant Rule, The Sum Rule, and the Product Rule are fundamental. The other three, The Constant Multiple Rule, The Power Rule, and the Quotient Rule are straightforward consequences of the first three. We only state them separately for convenience.
We call these rules “general” because each one of them can be applied to a wide variety of expressions. In addition to the general rules there is a large set of specific rules that only apply to specific functions. We will come back to these beginning in Chapter 6
To begin, suppose a point is moving along the \(x\)-axis. As in Chapters 2 and 3 when we think of a point in motion we write \(x(t)\) to express the idea that the position \(x\text{,}\) depends on the time, \(t\text{.}\) At a specific time, \(t_1\text{,}\) the position \(x(t_1)\text{,}\) is expressed more compactly as \(x_1\) and at time \(t_2\) the position is \(x_2\text{..}\)
The (small, finite) change in \(x\) is the difference between \(x_2\) and \(x_1\text{:}\) \(x_2-x_1\text{,}\) and the (small, finite) change in \(t\) is the difference between \(t_2\) and \(t_1:\) \(t_2-t_1\text{.}\)
It is customary to denote these small, finite changes with the notation:
\begin{equation*} \Delta x = x_2-x_1, \text{ and } \Delta t= t_2-t_1. \end{equation*}
We will denote infinitely small changes in \(x\) and \(t\) as \(\dx{x}\) and \(\dx{t}\text{.}\) The symbol \(\dx{x}\) is called the differential of \(x\text{.}\)
Notationally, the distinction between \(\Delta x\) and \(\dx{x}\) is small, but conceptually it is profound. When we write \(\Delta x\) there is some measurable distance between \(x_2\) and \(x_1.\) When we write \(\dx{x}\) the distance between \(x_2\) and \(x_1\) is immeasurably (infinitesimally) small.
If \(x\) is changing in time, \(\dx{x}\) represents how much \(x\) changes in an infinitesimally small increment of time \(\dx{t}\text{.}\) If the concept of an infinitely small increment troubles you (it should) it is safe, for now, to think of \(\dx{x}\) as a very, very, very, small change in \(x\text{.}\)
In general, if two variables, say \(x\) and \(y\text{,}\) are related we will be interested in asking (and answering) the question: If \(x\) is incremented by \(\dx{x}\) how will \(y\) be affected?

Subsection 4.3.1 The Constant Rule

Now that we understand our notation the first differentiation rule is quite obvious. In fact, it is the same whether we are thinking of a small finite change or of a differential. If \(x\) is unchanging, if it is not moving, then “the (infinitesimal) change in \(x\)” between two different times is zero. It doesn’t really matter whether we are talking about a finite \(\Delta x\) or an infinitesimal \(\dx{x}\text{.}\) Both are zero because \(x\) doesn’t change. Thus we have the following rule:

Problem 4.3.2.

Compute the differential of the following.
  1. \(\displaystyle y=1\)
  2. \(\displaystyle y=\pi \)
  3. \(\displaystyle y=\frac{\sqrt{2}}{\pi}\)
  4. \(\displaystyle y=\sqrt{2}\pi\)
  5. \(\displaystyle y=2,000,000,000\)
  6. \(\displaystyle y=\sin^2(\theta)+\cos^2(\theta)\)

Drill 4.3.3.

In Table 4.2.1 we stated the Constant Rule using \(a\) as the constant, in Rule 4.3.1 we used \(x\text{,}\) and in Problem 4.3.2 we used \(y\text{.}\) Does it matter what symbol we use? Explain.

Subsection 4.3.2 The Sum Rule

The Constant Rule 4.3.1 was particularly simple since only one quantity was involved and it was unchanging. The second is only slightly more complex.

Ad Hoc Proof.

If \(\textcolor{red}{x_1}\) and \(\textcolor{red}{x_2}\) are infinitely close together, then \(\textcolor{red}{\dx{x}} = \textcolor{red}{x_2}-\textcolor{red}{x_1}.\) Similarly if \(\textcolor{blue}{y_1}\) and \(\textcolor{blue}{y_2}\) are infinitely close together then \(\textcolor{blue}{\dx{y}}=\textcolor{blue}{y_2}-\textcolor{blue}{y_1}.\) Then:
\begin{align*} \dx(x+y) \amp = ({\color{red}{x_2}}+{\color{blue}{y_2}}) - ({\color{red}{x_1}}+{\color{blue}{y_1}})\\ \amp = ({\color{red}{x_2-x_1}}) + ({\color{blue}{y_2-y_1}})\\ \amp = {\color{red}{\dx x}} +{\color{blue}{\dx y}}, \end{align*}
which verifies the Sum Rule.
Notice that like the Constant Rule, the Sum Rule is true regardless of whether we are talking about finite differences or differentials.
The Sum Rule is stated as if it only applies to the sum of two quantities. This is typical of mathematical writing. We tend to be minimalists so we don’t state more than we absolutely have to. However, if the Sum Rule applies to the sum of two quantities it is straightforward to show that it also applies in the obvious fashion to a sum of three, four, or more quantities.

Problem 4.3.5.

(a)
Show that \(\dx(x+(y+z))=\dx{x}+\dx{y}+\dx{z}\)
Hint.
Notice how we placed the parentheses.
(b)
Show that \(\dx(x+y+z+w)=\dx{x}+\dx{y}+\dx{z}+\dx{w}\)
(c)
Explain how you would show that
\begin{equation*} \dx(x_1+x_2+x_3+\cdots + x_n)=\dx{x_1}+\dx{x_2}+\dx{x_3}+\cdots + \dx{x_n} \end{equation*}
where \(n\) is an arbitrary positive, integer.

Subsection 4.3.3 The Constant Multiple Rule

Ad Hoc Proof.

Suppose \(a\) is a constant and \(x\) is a variable. Since \(\dx{(ax)}= ax_2-ax_1\) is the differential between infinitely close values of \(ax\) we see that the differential of \(ax\) is given by:
\begin{equation*} \dx{(ax)} = ax_2 - ax_1 = a(x_2-x_1) = a\dx{x}. \end{equation*}
which verifies the Constant Multiple Rule.

Example 4.3.7.

To get a sense of how we will be using differentials consider the line which is the graph of the equation \(3x+2y=5\text{.}\) Let’s differentiate both sides of this formula using the tools we have so far. Obviously if two quantities are equal then their differentials are also equal. So,
\begin{equation*} \dx{(3x+2y)}=\dx{(5)}\text{.} \end{equation*}
From the Sum Rule and the Constant Rule we have,
\begin{equation*} \dx{(3x)}+\dx{(2y)}=0 \end{equation*}
and from the Constant Multiple Rule we see that,
\begin{equation} 3\dx{x}+2\dx{y}=0.\tag{4.3} \end{equation}
Notice that equation (4.3) can be rearranged as follows: \(\dfdx{y}{x} = -\frac32. \) So it appears that the differential ratio \(\dfdx{y}{x}\) is the slope of our line. This makes sense since \(\dfdx{y}{x}\) is the (infinitesimal) change in \(y\) divided by the (infinitesimal) change in \(x\text{.}\) From your study of lines in Algebra you know that the change in \(y\) divided by the change in \(x\) is the definition of slope. It continues to be the slope even if the changes in \(x\) and \(y\) are infinitesimal.

Problem 4.3.8.

Compute \(\dx{y}\) for each of the following:
  1. \(\displaystyle y-x=0\)
  2. \(\displaystyle y-x=5\)
  3. \(\displaystyle y-2x=7\)
  4. \(\displaystyle 20x-y=-1\)
  5. \(\displaystyle y=0.0002x\)
  6. \(\displaystyle \frac{y}{x}=\pi\)
  7. \(\displaystyle y=3+2x\)
  8. \(\displaystyle \frac{7-\pi y}{x}= 17\)
  9. \(\displaystyle y=\sqrt{2}x\)
  10. \(\displaystyle y=(\pi+\sqrt{2})x\)
  11. \(\displaystyle y=\pi x+\sqrt{2}x\)
  12. \(\displaystyle y=\dfrac{(\pi+\sqrt{2})x}{x}\)
  13. \(\displaystyle y=\dfrac{(1-x)x^4}{x^3-x^4}\)
In his \(1684\) paper on Calculus, Leibniz presented all of the General Differentiation Rules with no justification whatsoever, stating simply that
“The demonstration of all this will be easy to one who is experienced in these matters . . .”
Our first three rules seem to bear this out. They are really the same as the analogous rules for finite differences. His claim is a bit more suspect for the next differentiation rule.

Subsection 4.3.4 The Product Rule

Ad Hoc Proof.

Think of the product \(xy\) as the (A)rea of the rectangle with sides of length \(x\) and \(y\) so that \(A=xy\text{,}\) as shown in the following sketch.
If we increase the sides by \(\dx{x}\) and \(\dx{y}\text{,}\) respectively, then the area will change by an infinitesimal amount \(\dx{A}\) which is the sum of areas of the blue, yellow, and red regions shown in the sketch. The area of the blue region is \(x\dx{y}\text{,}\) the area of the red region is \(y\dx{x}\) and the area of the yellow region is \(\dx{x}\dx{y}\text{.}\) Since \(\dx{A}\) is the sum of these three regions we have
\begin{equation} \dx{A} =x\dx{y}+y\dx{x}+\dx{x}\dx{y}.\tag{4.4} \end{equation}
But remember that \(\dx{x}\) and \(\dx{y}\) represent infinitely small changes. Thus, their product \(\dx{x}\dx{y}\) is even more infinitely small than they are. (Quadratically infinitely small?)
In fact, the product \(\dx{x}\dx{y}\) is so small compared to any of the other terms in Equation (4.4) that it simply doesn’t matter. So we will ignore it.
This leaves us with \(\dx{A}=\dx(xy)=x\dx{y}+y\dx{x}\) verifying the Product Rule.
You should feel at least a little uncomfortable with the logical sleight of hand in our ad hoc proof of The Product Rule 4.3.9. Some questions present themselves immediately: How can we choose not to ignore infinitesimals like \(\dx{x}\) and \(\dx{y}\text{,}\) but then choose to ignore their product \(\dx{x}\dx{y}\text{?}\) What happens if \(x\) or \(y\) is negative? What happens if \(\dx{x}\) or \(\dx{y}\) is negative?
Perhaps Leibniz left out the demonstrations (proofs) of his rules because he knew they would be controversial. However, he was not lying when he said that those familiar with infinitesimals would understand. Newton also ignored products of infinitesimals in his version of Calculus. As we discussed in Section 4.1, it took mathematicians some \(200\) years to sort this out. We will begin to address this problem in the Chapter 13. For now, we will simply accept that the Product Rule works as we’ve stated it knowing full well that we are on shaky ground that will eventually need to be firmed up.

Problem 4.3.10.

Use the Product Rule to show that if \(y=x^2\text{,}\) then \(\dx{y}=2x\dx{x}\text{.}\)
Even though they could not fully explain their methods, both Newton and Leibniz could see that their methods gave answers that made sense physically and which agreed with the results of their predecessors. That is, their methods worked.
For example, in Problem 3.4.10 you used Fermat’s method for tangents to show that at the point \((1,1)\) the slope of the curve \(y=x^2\) is \(2\) and that at the point \((-2,4)\) the slope of the curve is \(-4\text{.}\) Recall from Example 4.3.7 that the differential ratio \(\dfdx{y}{x}\) represents the slope of the curve. With this in mind we rewrite the formula \(\dx{y}=2x\dx{x}\) from Problem 4.3.10 as \(\dfdx{y}{x}=2x\text{,}\) from which we see that the Product Rule easily recovers Fermat’s results.
But what can it mean to say that the slope of the graph of \(y=x^2\) is \(2x\text{?}\) The expression \(2x\) changes with \(x\) but slope is supposed to be constant isn’t it? Moreover, slopes have always been associated with straight lines in the past, but there is not a straight line to be seen anywhere in this problem.
Or is there?
The notion of a differential is essentially an extension of the following observation: If you look very closely at a curve it looks like a straight line. You can confirm this with any graphing technology. Graph a curve and zoom in repeatedly at any point on the curve. Eventually the curve will look like a straight line, and the slope of that line will be the value of \(\dfdx{y}{x}\) at that point. If we extend this observation to the infinitely small scale we get differentials.
The idea that at the infinitely small scale lines and curves are indistinguishable is known as the Principal of Local Linearity, and it is a fundamental principle underlying all of Calculus. We will formally define the Principle of Local Linearity in Section 5.2. For now, the informal idea is that when you look at a very small section of a curve it looks like a straight line. When we choose to simply ignore the term \(\dx{x}\dx{y}\) in our development of the Product Rule we are invoking the Principle of Local Linearity
Again, Newton and Leibniz relied on the fact that their methods produced solutions that made sense physically and which agreed with known solutions to old problems. Snell’s Law is an example of the latter. The following problem is an example of the former.

Problem 4.3.11.

When a rock is tossed into a quiet lake it generates an expanding, circular wave, as in the diagram below.
The area enclosed by the wave at any time is given by the formula: \(A=\pi r^2\text{,}\) where \(r\) is the expanding radius of the circle. As time advances the radius of the circular wave expands and, as a consequence, the area enclosed by the wave also grows.
(a)
Use the Constant Multiple Rule and the Product Rule to show that \(\dx{A}=\dx(\pi r^2)=2\pi r\dx{r}.\)
(b)
Explain why be the area of the yellow ring in the diagram is also equal to \(\dx{A}\text{.}\)
(c)
Find \(\dx{A}\) by cutting the yellow region at the dotted line, reshaping the ring into a rectangle. How does this compare with what we obtained using our Calculus rules. Do you see any logical issues with this solution? Specifically, why is it important that \(\dx{r}\) must be infinitely small?

DIGRESSION: Differential Notation.

We will pause here for a moment to take explicit notice of the notational convenience we used in part (a) of Problem 4.3.11 because it will be useful later. We have used the expression \(\dx{x}\) to represent an infinitesimal displacement of \(x\text{,}\) but it will be useful to think of the “\(\dx{ }\)” as meaningful in itself. That is, in the expression \(\dx(\pi r^2)\) we think of the “d” as “operating on” the expression \(\pi r^2.\) Thus when \(\dx{}\) “operates on” the expression \(\pi r^2\) it returns the expression \(2\pi r\dx{r}\) and we have \(\dx{A}=\dx(\pi r^2) = 2\pi r \dx{r}.\) This is actually consistent with our previous usage if we think of \(\dx(A)\) as “operating on” the area \(A\) and returning the differential \(\dx{A}.\)
This is akin to the way we think about the square root symbol. The symbol \(\sqrt{\phantom{4}}\) “operates on” the expression underneath it so we have \(\sqrt{4}=2.\) Just as \(\sqrt{\phantom{4}}\) is always reserved for the square root of some quantity, \(\dx{}\) is always reserved for the differential of some quantity. In both cases the quantity needs to be named. It is as meaningless to write \(\pi r^2\dx{}\) as it is to write \(\pi r^2\sqrt{\phantom{4}}.\) It is important to use the notation as it was designed to be used.
Also, notice that on the left side of \(\dx{A}=\dx(\pi r^2) = 2\pi r \dx{r}\) we have the infinitesimal quantity, \(\dx{A}\text{.}\) Since the two sides of the equation are equal it follows that the quantity on the right side is also an infinitesimal. The obvious conclusion is that the ordinary real number \(2\pi r\) multiplied by the infinitesimal \(\dx{r}\) must result in an infinitesimal quantity. Dividing both sides by \(\dx{r}\) we have \(\dfdx{A}{r}=2\pi r\text{,}\) which seems to indicate that the quotient of two differentials will be an ordinary real number. Does this make intuitive sense to you?
Notice that we asked if it makes intuitive sense. When working with differentials it frequently happens that a statement feels right (makes intuitive sense) but begins to fall apart when examined more carefully.
END OF DIGRESSION

Drill 4.3.12.

We mentioned earlier that the Constant Rule, the Sum Rule, and the Product Rule are fundamental, and that the other three can be proved using these three. Show that the Constant Multiple Rule follows from the Constant Rule and the Product Rule.

Subsection 4.3.5 The Power Rule

The simplest version of is the Power Rule is really just repeated use of Fact 4.3.9. We will develop this simple version via Problems 4.3.13 and Problem 4.3.14 below.

Problem 4.3.13. Find the Pattern.

(a)
Compute each of the following differentials.
  1. \(\displaystyle \dx{(x+1)}\)
  2. \(\displaystyle \dx{\left[(x+1)(x+2)\right]}\)
  3. \(\displaystyle \dx{\left[(x+1)(x+2)(x+3)\right]}\)
  4. \(\displaystyle \dx{\left[(x+1)(x+2)(x+3)(x+4)\right]}\)
  5. \(\displaystyle \dx{\left[(x+1)(x+2)(x+3)(x+4)(x+5)\right]}\)
  6. \(\displaystyle \dx{\left[(x+1)(x+2)(x+3)(x+4)(x+5)(x+6)\right]}\)
(b)
Do you see a pattern in the computations you did in part 4.3.13.a? Assume \(n\) is an unknown positive integer and use the pattern to compute:
\begin{equation*} \dx{\left[(x+1)(x+2)(x+3)(x+4)\cdots(x+n)\right]}. \end{equation*}

Problem 4.3.14. The Product Rule for Positive Integers.

By custom, differentials are written at the end of each term where they appear in a given computation so the Product Rule is usually written as
\begin{equation*} \dx{(xy)}=x(\dx{y})+y(\dx{x}). \end{equation*}
But this is just a custom so it is not strictly necessary. We could also use the Product Rule in the form
\begin{equation*} \dx{(xy)}=x(\dx{y})+(\dx{x})y. \end{equation*}
(a)
Use the Product Rule twice to show that
\begin{equation*} \dx{(xyz)}=xy(\dx{z})+x(\dx{y})z+(\dx{x})yz, \end{equation*}
Hint.
The Product Rule requires two factors. But we have three: \(x\text{,}\) \(y\text{,}\) and \(z\text{.}\) We can fix this with the substitution \(\alpha=yz\text{.}\) Then \(\dx{(xyz)}=\dx{(x\alpha)}.\) Now use the Product Rule.
(b)
Emulate the computations in part 4.3.14.a to show that.
\begin{equation*} \dx{(xyzw)}=xyz(\dx{w})+xy(\dx{z})w+x(\dx{y})zw +(\dx{x})yzw. \end{equation*}
(c)
What does the Product Rule look like for \(\dx{(x_1x_2x_3\cdots x_n)}?\)
(d)
Use the result of part (c) with \(x_1=x_2=\cdots = x_n =x\) to show that
\begin{equation} \dx\left(x^n\right)=nx^{n-1}\dx{x}.\tag{4.5} \end{equation}
Equation (4.5) is called the Power Rule for Positive Integers.
Let’s try out our differentiation rules on a relatively simple example. We will include every single step so that you can see how the differentiation rules are used in concert.

Example 4.3.15.

From the Sum Rule we have,
\begin{equation*} \dx\left(5x^6-3x^4+21x^2-7\right) =\dx\left(5x^6\right)-\dx\left(3x^4\right)+\dx\left(21x^2\right)-\dx(7). \end{equation*}
Next, the Constant Rule, and the Constant Multiple Rule give,
\begin{equation*} \dx\left(5x^6-3x^4+21x^2-7\right) = 5\dx\left(x^6\right)-3\dx\left(x^4\right)+21\dx\left(x^2\right)-0, \end{equation*}
and by the Power Rule for Positive Integers we have,
\begin{align*} \dx\left(5x^6-3x^4+21x^2-7\right) \amp{}= 5\left(6x^5\dx{x}\right)-3\left(4x^3\dx{x}\right)+21\left(2x^1\dx{x}\right)\\ \amp{}=30x^5\dx{x}-12x^3\dx{x}+42x\dx{x}. \end{align*}
Finally, factoring out the differential of \(x\) gives,
\begin{equation*} \dx\left(5x^6-3x^4+21x^2-7\right) =\left(30x^5-12x^3+42x\right)\dx{x}. \end{equation*}
Did you notice that the differential \(\dx{x}\) appeared in every term allowing us to factor it out at the end? This will always happen when there is only one independent variable involved (\(x\) in this case). So it provides an easy way to check your computations. If you end with an expression where the differential of the independent variable cannot be factored out then you have made a mistake in your computations. You should find your mistake before going on. Or just begin again.
As you become more skilled you will probably do many of the steps from Example 4.3.15 in your head. But for now, notice how the rules work together and are applied as they are needed.

Example 4.3.16.

Here is another example without all the details spelled out. Make sure you see which rule is being invoked at each step.
\begin{align*} \dx(\textcolor{red}{(2x+5)}\textcolor{blue}{(3x-7)})\amp =\textcolor{red}{(2x+5)}\textcolor{blue}{\dx(3x-7)}+\textcolor{blue}{(3x-7)}\textcolor{red}{\dx(2x+5)}\\ \amp =\textcolor{red}{(2x+5)}(3)\dx{x}+\textcolor{blue}{(3x-7)}(2)\dx{x}\\ \amp =(6x+15+6x-14)\dx{x}\\ \amp =(12x+1)\dx{x}. \end{align*}

Drill 4.3.17.

Identify each of the rules used in each step of Example 4.3.16.
Of course, in Example 4.3.16 we could have multiplied the factors together before differentiating and obtained the same answer:
\begin{align*} \dx((2x+5)(3x-7)) \amp = \dx(6x^2+x-35)\\ \amp = 6(2x\dx{x})+\dx{x}-0\\ \amp = (12x+1)\dx{x}. \end{align*}
Either procedure will work. You are at liberty to do whichever you prefer.

Drill 4.3.18.

Compute each of the following:
  1. \(\displaystyle \dx(5x+3)\)
  2. \(\displaystyle \dx(\pi^2 )\)
  3. \(\displaystyle \dx(x^3-5x^2+7)\)
  4. \(\displaystyle \dx{(\pi^2+2)}\)
  5. \(\displaystyle \dx \left(\left(x-3\right)^ {2}\right)\)
  6. \(\dx \left(\left(x-r \right)^ {2}\right)\) Assume that \(r\) is a constant.
  7. \(\displaystyle \dx((3x+4)(2x-5)(2-6x))\)

Drill 4.3.19.

Compute each of the following:
  1. \(\displaystyle \dx{((x^2+3x+1)(5x-7x^{20}+x^{12}-6))}\)
  2. \(\displaystyle \dx((5x^2-7x+1)(2x^{10}-4x^7+3x^3-2x+8))\)
  3. \(\dx( {a} _ {0} + {a} _ {1} x+ {a} _ {2} {x} ^ {2} + {a} _ {3} {x} ^ {3} +\ldots+ {a} _ {n} {x} ^ {n} )\) Assume that \(a_0, a_1, \ldots,\) and \(a_n\) are all constant.
  4. \(\displaystyle \dx{(x^4(3x^2-2x+45))}\)

Drill 4.3.20.

Compute \(\dx{y}\) for each of the following:
  1. \(\displaystyle y=4.9x^2+15x+6\)
  2. \(\displaystyle y=3.14\)
  3. \(\displaystyle y=3.1416\)
  4. \(\displaystyle y=\pi\)
  5. \(\displaystyle y=x^2-2x+4\)
  6. \(\displaystyle y=3(x^3-4x+2)\)
  7. \(\displaystyle y=\frac{7x^5-3x^3+x^2}{x}\)
  8. \(\displaystyle y=5x^2+2x-3\)
  9. \(\displaystyle x^2-y^2=1\)
  10. \(\displaystyle x^2+y^2+z^2=1\)
  11. \(\displaystyle xy^2=z^3\)
  12. \(\displaystyle xyz=2\)

Example 4.3.21.

Figure 4.3.22.
Differentiating both sides of an ordinary algebraic equation yields a Differential Equation — an equation which relates the differentials of the quantities involved. For example, suppose we have a point \((x,y)\) lying on the unit circle so that the quantities \(x\) and \(y\) are related by the equation \(x^2+y^2=1\text{.}\)
Differentiating, we have the differential equation
\begin{align} 2x\dx{x}+2y\dx{y}\amp =0 \text{ or }\notag\\ y\dx{y}\amp =-x\dx{x}.\tag{4.6} \end{align}
Let’s look at this geometrically and focus on the first quadrant. The sketch above shows what happens to the quantities \(x\) and \(y\) as the point changes its position on the circle.
The smaller triangle is called a differential triangle and it is infinitely small. Differential triangles were a favorite tool of Leibniz. He assumed without justification (as we will, for now) that they are just like ordinary (finite) triangles.
Notice that \(\dx{x}\) is negative since the \(x\) coordinate is moving to the left, and that \(\dx{y}\) is positive since the \(y\) coordinate is moving upward. We need positive quantities to represent the length of the triangle’s legs. Therefore we use \(-\dx{x}\) to represent a positive length for the horizontal leg of our differential triangle.
A little more mysterious is the (correct) observation that the smaller triangle isn’t really a triangle at all. Because it is part of the circle the line connecting \((x,y)\) and \((x+\dx{x}, y+\dx{y})\) — the hypotenuse of the triangle — is curved. But there is nothing lost by thinking of the hypotenuse as a straight line because according to the Principle of Local Linearity at an infinitely small scale lines and curves are indistinguishable. At least in part, this why Newton, Leibniz, and their contemporaries worked with differentials. At that scale they could replace curves with straight lines.

Problem 4.3.23.

(a)
Show that the two triangles in Figure 4.3.22 are similar. Use this to show that
\begin{equation*} y\dx{y}=-x\dx{x}. \end{equation*}
Notice that this problem specifically asks you to use the triangles, not differentiation to show that \(y\dx{y}=-x\dx{x}.\)
(b)
After Problem 4.3.10 we observed that the ratio \(\dfdx{y}{x}\) must represent the slope of both the curve and its tangent line at a given point. Notice that the differential equation for the circle can be written as
\begin{equation*} \dfdx{y}{x}=-\frac{x}{y}. \end{equation*}
Explain how this is related to the geometric fact that the tangent to a circle at a point is perpendicular to the radius through that point.

Example 4.3.24. Brute Force Computation.

Suppose that we wish to find \(\dx{y}\) when \(y=5(x^2+x)^3\text{.}\) One way to do this is simple brute force. That is, we compute
\begin{equation*} y=5\textcolor{red}{(x^2+x)^3} = 5\textcolor{red}{(x^6+3x^5+3x^4+x^3)} = 5x^6+15x^5+15x^4+5x^3. \end{equation*}
Applying the Sum Rule we see that
\begin{equation*} \dx y = \dx \left(5x^6\right)+\dx \left(15x^5\right)+\dx \left(15x^4\right)+\dx \left(5x^3\right). \end{equation*}
Next we use the Constant Multiple Rule to get
\begin{align*} \dx y \amp{} = 5\dx \left(x^6\right)+15\dx \left(x^5\right)+15\dx \left(x^4\right)+5\dx \left(x^3\right),\\ \end{align*}
and finally the Power Rule for Positive Integers gives,
\begin{align*} \dx y \amp = 30x^5\dx x+75x^4\dx x+60x^3\dx x+15x^2\dx x. \end{align*}
In the end we have,
\begin{equation*} \dx{y}= (30x^5+75x^4+60x^3+15 x^2)\dx{x}. \end{equation*}
Brute force works but expanding an expression like \((x^2+x)^3\) is dull (and error prone) and mathematicians are like Nature: We are lazy. We’d really rather avoid doing all of that dull computation if we possibly can. Wouldn’t you?
Suppose, we make this problem a little “easier on the eyes” with a substitution like we did in Section 2.2. Let \(z=x^2+x\text{.}\) Then our original problem becomes \(y=5z^3\) which is much easier to look at. And now \(\dx{y}=15z^2\dx{z}\text{.}\) Substituting \(x^2+x\) back in for \(z\) we get \(\dx y = 15(x^2+x)^2\dx (x^2+x).\) After computing \(\dx (x^2+x) = (2x+1)\dx x\) we see that
\begin{equation*} \dx y = 15(x^2+x)^2(2x+1)\dx x. \end{equation*}

Problem 4.3.25.

(a)
Complete Example 4.3.24 by showing that we get the same \(\dx{y}\) both ways.
(b)
Compute \(\dx{y}\) if \(y=5(x^2+x)^{30}\) and then solve the equation \(\dx y=0\) for \(x\text{.}\)

Comment 4.3.26.

Problem 4.3.25 is completely contrived. There is no obvious reason that we would want or need to solve this equation. However, in the near future this will be the next step in many of the problems we will encounter.

Example 4.3.27. Making Things “Easier on the Eyes”.

Suppose we wish to find \(\dx{y}\) when \(y=(x+2)^2(2x-3)^3\) Once again we could just apply brute force by multiplying out the expression \((x+2)^2(2x-3)^3.\) But this is a lot of work and it is very error prone.
Instead notice that \(y\) is the product of \((x+2)^2\) and \((2x-3)^3,\) so the Product Rule seems a likely place to begin. But first we make it “easier on the eyes” with the substitutions, \(z=x+2\text{,}\) and \(w=2x-3\text{.}\) Thus:
\begin{align*} y\amp =\textcolor{red}{(\underbrace{x+2}_{=z})^2}\textcolor{blue}{(\underbrace{2x-3}_{=w})^3}\\ y\amp =\textcolor{red}{z^2}\textcolor{blue}{w^3}. \end{align*}
Using the Product Rule we now have
\begin{align*} \dx y \amp = \textcolor{red}{z^2}\textcolor{blue}{\dx(w^3)} + \textcolor{blue}{w^3}\textcolor{red}{\dx(z^2)}\\ \dx y \amp = \textcolor{red}{z^2}\textcolor{blue}{\dx(w^3)} + \textcolor{blue}{w^3}\textcolor{red}{\dx(z^2)}\\ \amp = (x+2)^2\textcolor{blue}{{\dx(w^3)}} + (2x-3)^3\textcolor{red}{{\dx(z^2)}}\\ \amp = (x+2)^2\textcolor{blue}{(3w^2)\dx{w}}+(2x-3)^3\textcolor{red}{(2z)\dx{z}}\\ \amp = (x+2)^2(3)(2x-3)^2\textcolor{blue}{{\dx(2x-3)}} + (2x-3)^3(2)(x+2)\textcolor{red}{{\dx(x+2)}}\\ \amp = (x+2)^2(3)(2x-3)^2(2)\dx x + (2x-3)^3(2)(x+2)\dx{x}. \end{align*}
and so
\begin{equation*} \dx{y} = \left[6(x+2)^2(2x-3)^2 + 2(2x-3)^3(x+2)\right]\dx{x}. \end{equation*}
Not only does this involve considerably less Algebra (and is thus less prone to error) it also gives us \(\dx{y}\) in a form that is easy to factor. This is important because, as we mentioned in Comment 4.3.26, many of the problems to come will involve finding a differential, setting it equal to zero, and then solving the resulting equation. If we had computed this by brute force we would have to find the roots of a fully expanded fourth degree polynomial. That would be hard. The form given above is much easier to work with. Let’s go ahead and set \(\dx{y}=0\) and solve the resulting equation:
\begin{equation*} 6\textcolor{red}{(x+2)}^2\textcolor{blue}{(2x-3)}^2 + 2\textcolor{blue}{(2x-3)}^3\textcolor{red}{(x+2)} = 0. \end{equation*}
just for practice.
The prospects for solving the equation, even in this form seem pretty glum at first, but it really isn’t so bad once we notice that \(\textcolor{red}{(x+2)}\) and \(\textcolor{blue}{(2x-3)^2}\) appear as factors in both terms on the left hand side. Factoring these out gives
\begin{align} \textcolor{red}{(x+2)}\textcolor{blue}{(2x-3)^2}\left[6\textcolor{red}{(x+2)}+2\textcolor{blue}{(2x-3)}\right] \amp{} \amp{}= 0\notag\\ \end{align}
so that
\begin{align} \textcolor{red}{(x+2)}\textcolor{blue}{(2x-3)^2}\left[6x+12+4x-6)\right] \amp{}= 0\notag\\ 2(x+2)(2x-3)^2(5x+3) \amp{}= 0,\tag{4.7}\\ \end{align}
so that
\begin{align} \textcolor{red}{(x+2)}\textcolor{blue}{(2x-3)^2}\left[6x+12+4x-6)\right] \amp{}= 0\notag\\ 2(x+2)(2x-3)^2(5x+3) \amp{}= 0.\tag{4.8} \end{align}

Drill 4.3.29.

Compute the differential of \(y=(x^2-4)^3(x^2+3)^5\) and solve \(\dx{y}=0\) for \(x\text{.}\)
As you become more comfortable using these differentiation rules together, you will probably do many of the steps in a long differentiation more informally, perhaps even in your head. However, most of the time differentiation will only be one part of much larger problems. If you don’t differentiate correctly whatever you do after that is wasted time and effort. It is better to proceed slowly to be sure you are right. If it helps to write out each step in detail then you should certainly do that.

Example 4.3.30.

At this point we can find the differentials of a large number of algebraic expressions. But not all. For example suppose \(y=\sqrt{x}.\) Nothing we’ve done so far tells us what \(\dx{y}\) is.
However, you are surely aware that the radical symbol, \(\sqrt{\phantom{4} }\text{,}\) can be expressed as a fractional exponent. For example, \(\sqrt{x} =x^{\frac12}.\) It is tempting to simply assert that, by the Power Rule,
\begin{equation*} \dx(x^{1/2}) = \frac12\cdot x^{1/2-1}\dx{x} = \frac12\cdot\frac{1}{x^{1/2}}\dx{x} = \frac{1}{\sqrt{2x}}\dx{x}. \end{equation*}
In fact this is true, but it doesn’t follow from the way we have developed the Power Rule because we depended quite explicitly on the assumption that the exponent of \(x^n\) is a positive integer.
Here’s one way to prove that the Power Rule works when \(n=1/2\text{.}\) If \(y=\sqrt{x}\) then \(y^2=x.\) Computing the differential of both sides gives:
\begin{align*} \dx(y^2)\amp =\dx x\\ 2y \dx{y}\amp =\dx x\\ \dx{y}\amp =\frac{1}{2y} \dx x=\frac{1}{2x^{1/2}} \dx x=\frac{1}{2}\cdot\frac{1}{x^{1/2}} \dx{x} \end{align*}
or \(\dx{y} =\frac{1}{2}{x^{1/2-1}} \dx{x}\) which is the Power Rule with \(n=\frac12\text{.}\)
Example 4.3.30 is more important than it appears to be. What it shows is that the Power Rule can be extended to include at least the case where the exponent is \(1/2\text{.}\) In fact, the Power Rule is easily extended to allow the exponent to be any positive fraction, say \(2/3\) for example.

Example 4.3.31.

You should be a little bit surprised at this. After all, \(y=x^{2/3}\) is a complicated thing, compared to \(y=x^2\text{,}\) or even \(y=x^{1/3}\text{.}\) Nevertheless, essentially the same procedure works here too. If \(y=x^{2/3}\text{,}\) then \(y^3=x^2\text{.}\) Differentiating both sides gives
\begin{equation*} 3y^2\dx{y}=2x\dx{x} \end{equation*}
and solving for \(\dx{y}\) leaves us with
\begin{equation*} \dx{y}=\frac23 \frac{x}{y^2}\dx{x}. \end{equation*}
This is not yet recognizable as the Power Rule primarily because we need to express the right side entirely in terms of \(x\text{.}\) But recall that \(y=x^{2/3}\) so that \(y^2=x^{4/3}\text{.}\) Making that substitution gives
\begin{equation*} \dx{y} = \frac23\frac{x}{x^{4/3}\dx{x}}= \frac23 x^{-\frac{1}{3}}\dx{x}= \frac23 x^{\frac{2}{3}-1}\dx{x} \end{equation*}
which is the Power Rule with \(n=2/3.\)

Problem 4.3.32. The Power Rule for Positive, Rational Powers.

(b)
Now suppose \(y=x^{\frac{p}{q}},\) where \(p\) and \(q\) are positive integers and show that \(\dx\left( x^{\frac{p}{q}}\right)=\frac{p}{q} x^{\frac{p}{q}-1} \dx x.\)

Example 4.3.33.

Having seen that the Power Rule can be extended to include positive, fractional exponents it is natural to ask if it can be extended to negative exponents Can we find the differential of \(y=1/x = \inverse{x}\text{?}\)
Of course we can. If \(y=1/x\) then \(xy=1.\) By the Product Rule we have
\begin{align*} \dx(xy)=\dx(1) x\dx{y}+y\dx{x}\amp =0\\ x\dx{y}\amp =-y\dx{x}\\ \dx{y}\amp =-\frac{\textcolor{red}{y}}{x}\dx{x}=-\frac{\textcolor{red}{\frac1x}}{x}\dx{x}\\ \amp=-\frac{1}{x^2}\dx{x} \end{align*}
and thus
\begin{equation*} \dx{y} = \dx{\left(\inverse{x}\right)} =(-1)x^{(-1)-1}\dx{x}. \end{equation*}
So the Power Rule works for \(n=-1\) as well.

Problem 4.3.34.

Let \(r\) be a positive rational number and \(y=x^{-r}=1/x^r\text{.}\) As in Example 4.3.33 clear the fraction and use the Product Rule to obtain the Power Rule for negative rational numbers:
\begin{equation*} \dx y=-rx^{-r-1} \dx x\text{.} \end{equation*}

Drill 4.3.35.

Use the formula you found in Problem 4.3.34 to show that
\begin{equation*} \dx\left(\frac{1}{\sqrt[3]{x^4}}\right)=-\frac43x^{-\frac73}\dx{x}\text{.} \end{equation*}
Thus we see that the Power Rule holds for any rational number, positive or negative, which gives us the following.
The Power Rule for Positive, Rational Powers: Let \(p,\) and \(q\) be any integers, positive or negative, with \(q\neq0\text{.}\) Then \(\dx\left(x^{\frac{p}{q}}\right)=\frac{p}{q}x^{\frac{p}{q}-1}\dx{x}\text{.}\)
In the title of his \(1684\) paper Leibniz claimed that his Calculus Differentialis was “. . . Impeded Neither by Fractional nor Irrational Quantities, . . .” But notice that nothing we’ve done so far allows us to find \(\dx{y}\) if say \(y=x^{\sqrt{2}}\text{.}\) This is because \(\sqrt{2}\) is irrational and cannot be represented as a fraction. We have extended the Power Rule to the case where this exponent is any rational number, but we have not addressed the case when the exponent is irrational.

Comment 4.3.36.

Having an irrational exponent is a bigger problem than it seems to be. We have no way (so far) to interpret the meaning of an expression like \(x^{\sqrt{2}}\text{.}\) So trying to find its differential using the techniques we have is hopeless. Take a moment to consider what an expression like \(x^{\sqrt{2}}\) could even mean.
In Section 8.3 we will find two different ways to assign meaning to \(x^{\alpha }\) when \(\alpha{}\) is irrational. (Actually, we will invent a way. Meaning is not something we find, it is something we create.) When we do we will find that the Power Rule still works. In the meantime, we will assume for now that the Power Rule always works, even when the exponent is irrational.
We can now differentiate some pretty complicated expressions. Frequently an appropriate substitution will make this “easier on the eyes”, but otherwise at each step we only need to invoke one of the General Differentiation Rules. When the expression is complicated the key is to be methodical and do one step at a time even if you are doing some of them in your head.

Example 4.3.38.

To differentiate \(\left( \sqrt{x^2-2} \right) \left( \sqrt{x}-2\right)\) we begin by re-expressing the radicals as exponents because we have no differentiation rules for radicals.
\begin{align} \dx{\left(\textcolor{red}{\sqrt{x^2-2}}\right)\amp{}\left(\textcolor{blue}{\sqrt{x}-2}\right)}\notag\\ \amp = \dx\left[\left(\textcolor{red}{(x^2-2)^{1/2}}\right)\left(\textcolor{blue}{x^{1/2}-2}\right)\right]\notag\\ \amp = (x^2-2)^{\frac12}\textcolor{blue}{\dx{\left(x^{\frac12}-2\right)}}+\left(x^{\frac12}-2\right)\textcolor{red}{\dx{(x^2-2)^{\frac12}}}\notag\\ \amp = (x^2-2)^{\frac12}\left(\frac12x^{-\frac12}\right)\dx{x}+\left(x^{\frac12}-2\right)\left(\frac12(x^2-2)^{-\frac12}\right)\textcolor{red}{\dx(x^2-2)}\notag\\ \amp = (x^2-2)^{\frac12}\left(\frac12x^{-\frac12}\right)\dx{x}+\left(x^{\frac12}-2\right)\left(\frac12(x^2-2)^{-\frac12}\right)2x\dx{x}\notag\\ \amp = \left[ (x^2-2)^{\frac12}\left(\frac12x^{-\frac12}\right)+\left(x^{\frac12}-2\right)\left(\frac12(x^2-2)^{-\frac12}\right)2x\right]\dx{x}\tag{4.10} \end{align}
Be sure you can follow all the computations at each step in Example 4.3.38. You may want to make some substitutions as you go. With practice, you will be able to do most of this in your head.

Drill 4.3.39.

Show that equation (4.10) “simplifies” to
\begin{equation*} \dx\left[\left(\sqrt{x^2-2}\right)\left(\sqrt{x}-2\right)\right] = \frac{3x^2-4x^{\frac32}+2}{2\sqrt{x}\sqrt{x^2-2}}\dx{x} \end{equation*}
Does this seem simpler to you?

Drill 4.3.40.

Compute \(\dx{y}\) in terms of \(x\) and \(\dx{x}.\)
  1. \(y=z^3\text{,}\) where \(z=x^2+3x-7\)
  2. \(y=\sqrt{w}+3x\text{,}\) where \(w=x^2+x\)
  3. \(y=z^{-\frac14}\text{,}\) where \(z=3w^5+w^{-5}\text{,}\) and \(w=x^4-5x^{-3}\)

Drill 4.3.41.

Compute each of the following:
  1. \(\displaystyle \dx(2x+2x^{-1}) \)
  2. \(\displaystyle \dx \left( 3x+\frac{1}{\sqrt[3]{x^2}} \right)\)
  3. \(\displaystyle \dx \left( \frac{5\sqrt{x}-x^4}{\sqrt[5]{x}} \right)\)
  4. \(\displaystyle \dx \left (\frac{1}{\sqrt{3}{x}} -5 {x} ^ {2} + \sqrt {5} {{x} ^ {4}} \right )\)
  5. \(\displaystyle \dx{\left(\frac1\pi+\sqrt{\pi}\right)}\)
  6. \(\displaystyle \dx{\left(\sqrt[3]{x^2-\frac7x+4}\right)}\)
  7. \(\displaystyle \dx{\left((t^2+24t-3)^{-7}\right)}\)

Drill 4.3.42.

Compute each of the following.
  1. \(\displaystyle \dx \left(\left(x^2-\sqrt{x}\right)\left(3x^{-\frac14}+5x\right)\right)\)
  2. \(\displaystyle \dx{\left((2x-3\sqrt{x})^5(x^2-x+1)^9\right)}\)
  3. \(\displaystyle \dx{\left(\sqrt{x^{-3}+2x-1}\cdot\sqrt[3]{x^{4/3}+\inverse{x}}\right)}\)
  4. \(\displaystyle \dx{\left(\sqrt[3]{x^{-3}+2x-1}\cdot\sqrt[3]{x^{4/3}+\inverse{x}}\right)}\)

Problem 4.3.43.

Compute \(\dx{y}\) for each of the following:
(a)
\(y=\frac{x^2-2x+4}{x}\)
(b)
\(y=x^2-\sqrt{x}\)
(c)
\(y=\left(\sqrt[3]{x}\right)^2-\left(\sqrt{x}\right)^3\)
(d)
\(y=\left(\frac{1}{x^2}+\frac{1}{x^3}\right)\left(x^2+x^3\right)\)
(e)
\(y=\sqrt{3x^4-7x^3+3}\)
(f)
\(y=\sqrt{1+\sqrt{1+\sqrt{1+x}}}\)
(g)
\(y=\sqrt{x}+\sqrt{z}\)
(h)
\(y=\sqrt{x+z}\)
(i)
\(y=\sqrt{x z}\)
(j)
\(c^2t^2-x^2-y^2-z^2=0\text{,}\) where \(c\) is a constant
 16 
If \(x\text{,}\) \(y\text{,}\) and \(z\) are spatial coordinates, \(t\) is time, and the constant \(c\) is the speed of light then the expression on the left-hand side represents the square of the distance between the origin and the point \((x,y,z,t)\) in \(4\) dimensional, relativistic, spacetime. Setting this equal to zero allows us to identify all of the events that an observer at the origin is observing at a given instant. For example, if there is a solar flare happening on the sun \(93\) million miles away from the earth, then we observe it \(8\) minutes later. The spacetime distance is zero to allow for the simultaneity of the event happening and us observing it. The closest galaxy to the Milky Way Galaxy is the Andromeda Galaxy which is approximately \(2.5\) million light-years away. This means that the light we are observing now originated 2.5 million years ago but the spacetime distance between the two events (called the Minkowski distance) is still zero.
.
In most parts of Problem 4.3.43 \(y\) is given explicitly in terms of \(x\text{.}\) But in Part 4.3.43.j \(y\) is given implicitly. We’ve seen this before. Almost everyone’s first impulse in this situation is to try to solve explicitly for \(y\) and then differentiate, but solving for \(y\) isn’t necessary, it isn’t always possible, and even when it is possible it is often harder that simply differentiating the formula we’re given. Our differentiation rules remain unchanged. After completing the differentiation we simply need to solve for \(\dx{y}\text{.}\)

Example 4.3.44.

For example, how can we compute \(\dx{y}\) given that
\begin{equation} 2x^2-xy+z^3y^2 = 1+y?\tag{4.11} \end{equation}
It is possible to solve equation (4.11) explicitly for \(y\) in terms of \(x\) but it is difficult to do. Since equation (4.11) is quadratic in \(y\) (the highest power of \(y\) is \(2\)) we need only use the Quadratic Formula to get \(y\) explicitly in terms of \(x\) and \(z\) and then differentiate. Try that some time when you have time to kill. Lots of time.
Instead, we’ll just apply our differentiation rules to equation (4.11) as it is. Differentiating we have
\begin{equation} 4x\dx{x}-(x\textcolor{red}{\dx{y}}+y\dx{x})+\left(z^3(2y\textcolor{red}{\dx{y}})+y^2(3z^2\dx{z})\right)=\textcolor{red}{\dx{y}}.\tag{4.12} \end{equation}
At first this may look very difficult to solve for \(\dx{y}\) but look again. This differential equation is actually linear in each of the differentials, \(\dx{x}\text{,}\) \(\dx{y}\text{,}\) and \(\dx{z}\text{.}\) That is, each differential only appears raised to the first power. This will always happen. When we differentiate any expression the result will always be linear in the differentials involved.
The word “linear” has a very specific meaning in mathematics, but the precise definition isn’t relevant at the moment. For our current purpose it simply means that we can rearrange our equation algebraically so that it has the following form.
\begin{equation*} (\text{stuff})\dx{y} = (\text{more stuff})\dx{x} + (\text{even more stuff})\dx{z}. \end{equation*}
Solving for \(\dx{y}\) gives
\begin{equation*} \dx{y} = \frac{(\text{more stuff})\dx{x} + (\text{even more stuff})\dx{z}}{(\text{stuff})}. \end{equation*}
For this example the specific computations are:
\begin{align} 4x\dx{x}-x\textcolor{red}{\dx{y}}-y\dx{x} + 2yz^3\textcolor{red}{\dx{y}}+3y^2z^2\dx{z}\amp=\textcolor{red}{\dx{y}}\notag\\ 2yz^3\textcolor{red}{\dx{y}}-x\textcolor{red}{\dx{y}}-\textcolor{red}{\dx{y}} \amp=y\dx{x}-4x\dx{x}-3y^2z^2\dx{z}\notag\\ (2yz^3-x-1)\dx{y}\amp=(y-4x)\dx{x} - 2y^2z^2\dx{z}\notag\\ \dx{y}\amp=\frac{(y-4x)\dx{x} - 2y^2z^2\dx{z}}{2yz^3-x-1}.\tag{4.13} \end{align}
At this point we have solved our problem so we needn’t do any more. But notice that we could also write equation (4.13) as
\begin{equation*} \dx{y}=\frac{(y-4x)}{2yz^3-x-1}\dx{x} - \frac{2y^2z^2}{2yz^3-x-1}\dx{z} \text{,} \end{equation*}
which may be preferable depending on the next step if this is a part of a larger problem.
Some problems are legitimately hard, and some only look hard. So remember that as bleak as things may look sometimes, you should always keep your goal in view. In this case we were trying to isolate \(\dx{y}\text{,}\) so we gathered all of the terms that had \(\dx{y}\) in them, put all the other terms on the other side of the equation, factored out the \(\dx{y}\) and divided by the “stuff” that formed the coefficient of \(\dx{y}\text{.}\) This process is often easier than trying to solve for \(y\) before differentiating.

Drill 4.3.46.

Compute \(\dx{y}\) for each of the following. (You will see these formulas again in Problem 5.2.11.)
  1. \(\displaystyle x^\frac23+y^\frac23 = 4\)
  2. \(\displaystyle y^2=x^3-x+1\)
  3. \(\displaystyle \left(x^2+y^2\right)^2+4x(x^2+y^2)=4y^2\)
  4. \(\displaystyle (y-2)^2(x^2+y^2)=2y^2\)
  5. \(\displaystyle (x^2+y^2+1)^2-4x^2=\frac32\)
  6. \(\displaystyle y^2\left(1-x^2\right)=\left(x^2+2y-1\right)^2\)
  7. \(\displaystyle x^4=x^2+y^2\)
  8. \(\displaystyle (x^2+y^2)(y^2+x(x+4))=4xy^2\)

Subsection 4.3.6 The Quotient Rule

Example 4.3.47.

Suppose we wish to find \(\dx{y}\) when \(y= \frac{2-3x^2}{\sqrt{x}}\text{.}\) Rewriting this slightly we have \(y= \frac{2-3x^2}{\sqrt{x}} = x^{-\frac12}\left(2-3x^2\right) = 2x^{-\frac12}-3x^{\frac32}\text{.}\) Thus
\begin{align*} \dx{y}=\dx\left(\frac{2-3x^2}{x^{\frac12}}\right) \amp{} =\dx\left(2x^{-\frac12}-3x^{\frac32}\right)\\\\ \amp{} =\left[2\left(-\frac12x^{-\frac32}\dx{x}\right)-3\left(\frac32x^{\frac12}\dx{x}\right)\right]\\ \amp{}=\left(-x^{-\frac32}-\frac92x^{\frac12}\right)\dx{x}. \end{align*}
Now suppose we have \(z=\frac{1}{y}=\frac{\sqrt{x}}{2-3x^2}\text{.}\) Can we compute \(\dx{z}\text{?}\)
There are actually two different approaches to this problem that use the Product Rule. The first is to rewrite \(z\) as \(x^{\frac12}\inverse{(2-3x^2)}\) then apply the Product Rule followed by the Power Rule.
The second approach is to clear the fraction so that \(z(2-3x^2)=\sqrt{x}\text{.}\) From here we can compute the differential of both sides (using the Product Rule on the left side) and solve the result for \(\dx{z}.\)

Problem 4.3.48.

Compute \(\dx\left(\frac{\sqrt{x}}{2-3x^2}\right)\) by each of the two methods indicated in Example 4.3.47. Verify that your results are equivalent either way.
Both of the techniques in Example 4.3.47 are straightforward and can be applied to any fractional expression. However, since using the Power Rule can be a bit cumbersome at times we will calculate \(\dx\left(\frac{x}{y}\right)\) once and for all. This will give us a convenient shortcut to handle fractional expressions called the Quotient Rule.

Problem 4.3.50.

Derive the Quotient Rule, \(\dx{z}=\dfrac{y\dx{x}-x\dx{y}}{y^2}\text{,}\) in two ways:
(a)
By writing \(\dx\left(\frac{x}{y}\right)=\dx(x\inverse{y})\) and applying the Product Rule and the Power Rule.
(b)
By making the substitution \(z=\frac{x}{y}\text{,}\) so that \(yz=x\text{.}\) Apply the Product Rule and solve for \(\dx{z}\text{.}\)
Applying the Quotient Rule directly to Problem 4.3.48 we have
\begin{align} \dx\left(\frac{\textcolor{blue}{\sqrt{x}}}{\textcolor{red}{2-3x^2}}\right) \amp{}=\frac{(\textcolor{red}{2-3x^2})\dx\left(\textcolor{blue}{x^{1/2}}\right)-\textcolor{blue}{x^{1/2}} \dx(\textcolor{red}{2-3x^2})}{(\textcolor{red}{2-3x^2})^2}\notag\\ \amp{}=\frac{(2-3x^2)(\frac12 x^{-1/2})\dx{x}-x^{1/2} (-6x)\dx{x}}{(2-3x^2 )^2}\\\notag\\ \amp{}=\frac{(2-3x^2)(\frac12 x^{-1/2})-x^{1/2} (-6x)}{(2-3x^2 )^2}\dx{x}.\tag{4.15} \end{align}

Drill 4.3.52.

Compute \(\dx{y}\) for each of the following:
  1. \(\displaystyle y=\frac{x+1}{x}\)
  2. \(\displaystyle y=\frac{x}{x+1}\)
  3. \(\displaystyle y=x+\frac{1}{x}\)
  4. \(\displaystyle y=\frac{x^3+2x^2}{\inverse{x}+7x^{15}}\)
  5. \(\displaystyle y=\dfrac{x}{y^2+1}\)
  6. \(\displaystyle \dfrac{xy}{x+z}=x^2+\dfrac{x}{y+1}\)
  7. \(\displaystyle \dfrac{xy^2}{y^2+1}=1\)
  8. \(\displaystyle y=(x^2-3)\inverse{(3x^4+6x+2)}\)
  9. \(\displaystyle y=\dfrac{(3x^2+2x-1)(4x^8-x^{-9}+2)}{\sqrt{x}+\frac{2}{\sqrt{x}}}\)

Problem 4.3.53.

(a)
Use the Quotient Rule to show that the differential of each of the following is: \(\dx{y}=\frac{-1}{(x+2)^2}\dx{x}.\)
  1. \(\displaystyle y=\dfrac{1}{x+2}\)
  2. \(\displaystyle y=\dfrac{x+3}{x+2}\)
  3. \(\displaystyle y=\dfrac{2x+5}{x+2}\)
  4. \(\displaystyle y=\dfrac{-3x-5}{x+2}\)
Can you explain this?
(b)
Show that each of the expressions in part (a) is of the form \(\frac{1}{x+2}+a\) for some constant \(a\text{.}\) Now can you explain it?
(c)
Compute the differential of \(y=\dfrac{7x+13}{x+2}.\) Can you explain why this one is different?

Problem 4.3.54.

Assume that neither \(x\) nor \(y\) is zero. Show that each of the following statements is true.
(a)
The Product Rule can be rearranged as:
\begin{equation*} \dx\left(xy\right) = xy\left(\frac{\dx{x}}{x}+\frac{\dx{y}}{y}\right)\text{.} \end{equation*}
(b)
The Quotient Rule can be rearranged as:
\begin{equation*} \dx\left(\frac{x}{y}\right) = \frac{x}{y}\left(\frac{\dx{x}}{x}-\frac{\dx{y}}{y}\right)\text{.} \end{equation*}
(c)
Why must we assume that neither \(x\) nor \(y\) is zero?
Of the six General Differentiation Rules, the Quotient Rule is the most complicated to use. Many people try to avoid it when possible. In certain situations you can get around using the Quotient Rule by finding algebraic simplifications of the original expression, as demonstrated by the next problem.

Problem 4.3.55.

First compute each of the following using the Quotient Rule, and then find an algebraic simplification that makes the differentiation easier. Verify that you get the same answer either way.
(a)
\(\dx{\left(\frac{x^2+x}{x}\right)}=\dx{x}\)
(b)
\(\dx{\left(\frac{x^2-1}{x+1}\right)}=\dx{x}\)
(c)
\(\dx{\left(\frac{x^3-x}{x-1}\right)}=\dx{x}\)
(d)
\(\dx{\left(\frac{x^2+x-2}{x+2}\right)}=\dx{x}\)
(e)
\(\dx{\left(\frac{x^3+2x^2-x-2}{x+2}\right)}=2x\dx{x}\)
(f)
\(\dx{\left(\frac{x^3+2x^2-x-2}{(x+2)(x-1)}\right)}=\dx{x}\)
(g)
\(\dx{\left(\frac{x^3+3x^2-x}{x^2+3 x-1}\right)}= \dx{x}\)
(h)
\(\dx{\left(\frac{x^3-x^2-2x+2}{x-1}\right)}= 2x\dx{x}\)
(i)
\(\dx{\left( \frac{x^3-x^2-x-2}{x-2}\right)}= (2x+1)\dx{x}\)
(j)
\(\dx{\left(\frac{x^3-x^2-x-2}{x^2+x+1}\right)}= \dx{x}\)
We now have all six of the General Differentiation Rules in our toolbox:
The Constant Rule:
If \(a\) is a constant, then \(\dx{a}=0.\)
The Sum Rule:
\(\displaystyle \dx(x+y)=\dx x+\dx y.\)
The Constant Multiple Rule:
If \(a\) is a constant, then \(\dx(ax)=a\dx x.\)
The Product Rule:
\(\displaystyle \dx{(xy)}=x\dx{y}+y\dx{x}.\)
The Power Rule:
If \(n\) is any real number then, \(\dx(x^n)=nx^{n-1}\dx{x}\text{.}\)
The Quotient Rule:
\(\displaystyle \dx{\left(\dfrac{x}{y}\right)}=\dfrac{y\dx{x}-x\dx{y}}{y^2}.\)
Of these, the Constant Multiple Rule, the Power Rule and the Quotient Rule are really just conveniences. They can all be derived from the other three.
We remind you that the General Differentiation Rules only tell you how to compute differentials. They do not tell you what differentials you need to compute or why. Algebra, Geometry, Arithmetic, Trigonometry, and differentiation are important tools, but tools are only useful if you are skillful with them, if you can use them in a coordinated fashion. This requires practice. Lots of practice.