10. The Inverse Function Theorem and More
≪ 9. Multivariable Calculus Refresher | Table of Contents | 11. A Generalisation of the Implicit Function Theorem ≫Last week, we looked at two different generalisations of the derivative to multiple dimensions. Although partial derivatives are convenient and familiar both conceptually and computationally, we ultimately decided that the Frechét derivative was the better way to go. Again, the intuition is that a differentiable function is a “locally linear” function in a quantitative sense.
More specifically, let be a function that’s differentiable at some . Then, its derivative is a linear function from , and for sufficiently small, one has The difference between the two is negligble.
This numerical estimate can be extended in a more significant way. Recall the -dimensional change of variables formula. It says if is sufficiently nice ( with nonvanishing derivative is enough), then for any Riemann integrable , one has Morally speaking, this is because when you partition the intervals in the right way. A box of width is distorted to a box of width approximately . Draw a picture!
More broadly, you will prove in lecture that given two reasonable subsets and a diffeomorphism , one has for any integrable function that In principle, scales the volume of a small box containing by a factor of , in much the same way scaled the length of a small interval near by a factor of . One can say that and inherit the volume-scaling properties of their derivatives.
The next question to ask is, in what other ways does resemble its derivative? There are some qualitative questions that make sense, such as:
- If is injective, is injective?
- If is surjective, is surjective?
- If is -dimensional, is also -dimensional?
For the third question, it’s not immediately obvious what it means for the level sets of to be -dimensional, but there is some intuition for what -dimensional subsets (as opposed to subspaces) are. For instance, the circle appears to be a -dimensional object, while the sphere appears to be -dimensional. We will revisit this question later.
The Inverse Function Theorem
Those of you that were here last quarter saw the inverse function theorem several times; these two problems provide a good look at which parts of the inverse function theorem hold when one drops the assumption of continuous derivatives. Let us state the theorem as most people know it:
Theorem 1. Inverse Function Theorem
Let be open, and let be continuously differentiable on . Let such that is nonsingular. Then, there exists an open neighbourhood of and an open neighbourhood of such that is a bijection , and its inverse is differentiable.
This is a somewhat complicated statement, but ultimately, this theorem boils down to: if a function locally looks invertible, it’s locally invertible.
I must point out that nonsingular means invertible. When , this is strictly different from having a nonzero derivative. In fact, there are many, many maps with singular but nonzero derivatives that are not locally invertible, such as via .
Besides this all-too-common mistake, let’s highlight a few pitfalls of this theorem:
- When is not continuously differentiable, the inverse function theorem fails. In the proof of the inverse function theorem, the continuity of the derivative is critical to ensuring local injectivity. However, will continue to be locally surjective (see the aforementioned problems).
- The converse statement is only partially true. If is singular, then it’s entirely possible for to still have a local inverse near . However, this local inverse will not be differentiable.
- generally will not be invertible on its entire domain (i.e., it need not be globally injective), even if its derivative is nonsingular everywhere. There are plenty of counterexamples to this fact; an example is via . Draw a picture of what’s going on. ( can take multiple “sheets” of its domain to the same image.)
Exercise 2.
Show that the function for and is differentiable at and that . Show that is not injective on any neighbourhood of .
Exercise 3.
Let be a continuously differentiable function such that is nonsingular for all . Show that is an open map: that is, if is open, then is open.
Challenge: show that this is still true if is differentiable but not necessarily continuously differentiable.
The Implicit Function Theorem
The inverse function addresses all three of the questions that were posed in a very special scenario, assuming continuous derivatives. Let’s return to the more general question of whether or not injectivity/surjectivity are “inherited” from a function’s derivatives.
One glaring issue with the inverse function theorem that I purposefully chose not to mention earlier is that the inverse function theorem only applies to functions whose domain and codomain have equal dimensions. Yet one might expect similar ideas to apply to the more general case. Suppose is differentiable at .
- If and is injective, should be injective?
- If and is surjective, should be surjective? Should the level sets of be -dimensional?
These seem entirely plausible, especially if one assumes a continuous derivative. Although the inverse function theorem does not apply out-of-the-box, one can actually adapt the proof of the theorem to these two scenarios! I’ll leave this as an exercise for the comitted student.
By the rank-nullity theorem, if , it’s impossible for to be surjective. Likewise, if , it’s impossible for to be injective.
Remark 4.
It’s hard to believe that there could even be a surjective function or an injective function . However, the two sets have the same cardinality, and it is possible to construct set-theoretic bijections between the two.
Okay sure, you say, it’s harder yet to believe that there could be a continuous surjection or a continuous injection . Continuous maps have to retain some idea of dimensionality, right? It so turns out that there are ways to continuously and surjectively map via space-filling curves. That is, one can continuously and surjectively map low-dimensional spaces onto high-dimensional spaces. I am unsure if one can continuously and injectively go the other way.
The answer for both of the questions posed prior is yes. We are not equipped to prove the answer, but we can look at a very specific scenario that we are well-equipped to handle.
Theorem 5. Implicit Function Theorem
Let be a continuously differentiable function. Suppose such that and . Then, there is an open subset containing and a continuously differentiable function such that and for all .
In words, what this theorem says is that looks like the graph of a function near . This graph should be an -dimensional object!
Based on the title of today’s discussion, it seems as though this is a consequence of the inverse function theorem. Indeed it is, but we need to circumvent the dimensional mismatch problem from earlier.
Proof
Define the function as is continuously differentiable — it has continuous first-order partial derivatives in every component. Thus, with as above, one has is nonsingular (it is upper triangular, and all of its diagonal entries are nonzero). So, by the inverse function theorem, it must have a continuously differentiable inverse, say , defined on a neighbourhood of .
Write , where each is defined on a neighbourhood of . By using the fact that is the identity, we actually get that , and likewise for . In other words, .
Since , as long as are close enough to , will be defined. Moreover, one has by construction that Unpacking the last coordinate of yields This is the desired result!
It should be added that one cannot weaken any of these conditions, especially the condition that the -th partial is nonzero. Draw a picture to see what could happen when the -th partial is zero!
Bonus: The Rank Theorem
I don’t think I’ll have time to talk about this during discussion, but the rank theorem is a big part of the picture that gives a unified answer to the question: to what extent does a function resemble its derivative?
The answer is, given continuity of the derivative, the resemblance is extremely strong. If is continuously differentiable in a neighbourhood around and has rank on this neighbourhood, then there is a change of coordinates such that looks like .
Specifically, there is an open neighbourhood of , of , open neighbourhoods and both containing the origin, and functions , with inverses such that is given by (where the number of zeroes, possibly none, is chosen to match the dimension ).
In words, what this is saying is that you can move the origin in to and the origin in to , then wiggle the standard coordinate axes so that does look like a rank linear map.
This strictly generalises both the inverse function theorem and the implicit function theorem; in fact, it relies on the technique of padding in such a way that the problem gets reduced to analysing a map with nonsingular derivative.
Indeed, the moral of this story is: continuously differentiable functions behave locally just like their derivatives!