Generalizations of the derivative

In mathematics, the derivative is a fundamental construction of differential calculus and admits many possible generalizations within the fields of mathematical analysis, combinatorics, algebra, and geometry.

Derivatives in analysisEdit

In real, complex, and functional analysis, derivatives are generalized to functions of several real or complex variables and functions between topological vector spaces. An important case is the variational derivative in the calculus of variations. Repeated application of differentiation leads to derivatives of higher order and differential operators.

Multivariable calculusEdit

The derivative is often met for the first time as an operation on a single real function of a single real variable. One of the simplest settings for generalizations is to vector valued functions of several variables (most often the domain forms a vector space as well). This is the field of multivariable calculus.

In one-variable calculus, we say that a function   is differentiable at a point x if the limit


exists. Its value is then the derivative ƒ'(x). A function is differentiable on an interval if it is differentiable at every point within the interval. Since the line   is tangent to the original function at the point   the derivative can be seen as a way to find the best linear approximation of a function. If one ignores the constant term, setting  , L(z) becomes an actual linear operator on R considered as a vector space over itself.

This motivates the following generalization to functions mapping   to  : ƒ is differentiable at x if there exists a linear operator A(x) (depending on x) such that


Although this definition is perhaps not as explicit as the above, if such an operator exists, then it is unique, and in the one-dimensional case coincides with the original definition. (In this case the derivative is represented by a 1-by-1 matrix consisting of the sole entry f'(x).) Note that, in general, we concern ourselves mostly with functions being differentiable in some open neighbourhood of   rather than at individual points, as not doing so tends to lead to many pathological counterexamples.

An n by m matrix, of the linear operator A(x) is known as Jacobian matrix Jx(ƒ) of the mapping ƒ at point x. Each entry of this matrix represents a partial derivative, specifying the rate of change of one range coordinate with respect to a change in a domain coordinate. Of course, the Jacobian matrix of the composition g°f is a product of corresponding Jacobian matrices: Jx(g°f) =Jƒ(x)(g)Jx(ƒ). This is a higher-dimensional statement of the chain rule.

For real valued functions from Rn to R (scalar fields), the total derivative can be interpreted as a vector field called the gradient. An intuitive interpretation of the gradient is that it points "up": in other words, it points in the direction of fastest increase of the function. It can be used to calculate directional derivatives of scalar functions or normal directions.

Several linear combinations of partial derivatives are especially useful in the context of differential equations defined by a vector valued function Rn to Rn. The divergence gives a measure of how much "source" or "sink" near a point there is. It can be used to calculate flux by divergence theorem. The curl measures how much "rotation" a vector field has near a point.

For vector-valued functions from R to Rn (i.e., parametric curves), one can take the derivative of each component separately. The resulting derivative is another vector valued function. This is useful, for example, if the vector-valued function is the position vector of a particle through time, then the derivative is the velocity vector of the particle through time.

The convective derivative takes into account changes due to time dependence and motion through space along vector field.

Convex analysisEdit

The subderivative and subgradient are generalizations of the derivative to convex functions.

Higher-order derivatives and differential operatorsEdit

One can iterate the differentiation process, that is, apply derivatives more than once, obtaining derivatives of second and higher order. A more sophisticated idea is to combine several derivatives, possibly of different orders, in one algebraic expression, a differential operator. This is especially useful in considering ordinary linear differential equations with constant coefficients. For example, if f(x) is a twice differentiable function of one variable, the differential equation

may be rewritten in the form

is a second order linear constant coefficient differential operator acting on functions of x. The key idea here is that we consider a particular linear combination of zeroth, first and second order derivatives "all at once". This allows us to think of the set of solutions of this differential equation as a "generalized antiderivative" of its right hand side 4x − 1, by analogy with ordinary integration, and formally write


Higher derivatives can also be defined for functions of several variables, studied in multivariable calculus. In this case, instead of repeatedly applying the derivative, one repeatedly applies partial derivatives with respect to different variables. For example, the second order partial derivatives of a scalar function of n variables can be organized into an n by n matrix, the Hessian matrix. One of the subtle points is that the higher derivatives are not intrinsically defined, and depend on the choice of the coordinates in a complicated fashion (in particular, the Hessian matrix of a function is not a tensor). Nevertheless, higher derivatives have important applications to analysis of local extrema of a function at its critical points. For an advanced application of this analysis to topology of manifolds, see Morse theory.

As in the case of functions of one variable, we can combine first and higher order partial derivatives to arrive at a notion of a partial differential operator. Some of these operators are so important that they have their own names:

  • The Laplace operator or Laplacian on R3 is a second-order partial differential operator Δ given by the divergence of the gradient of a scalar function of three variables, or explicitly as
    Analogous operators can be defined for functions of any number of variables.
  • The d'Alembertian or wave operator is similar to the Laplacian, but acts on functions of four variables. Its definition uses the indefinite metric tensor of Minkowski space, instead of the Euclidean dot product of R3:

Weak derivativesEdit

Given a function   which is locally integrable, but not necessarily classically differentiable, a weak derivative may be defined by means of integration by parts. First define test functions, which are infinitely differentiable and compactly supported functions  , and multi-indices, which are length   lists of integers   with  . Applied to test functions,  . Then the   weak derivative of   exists if there is a function   such that for all test functions  , we have


If such a function exists, then  , which is unique almost everywhere. This definition coincides with the classical derivative for functions  , and can be extended to a type of generalized functions called distributions, the dual space of test functions. Weak derivatives are particularly useful in the study of partial differential equations, and within parts of functional analysis.

Analysis on fractalsEdit

Laplacians and differential equations can be defined on fractals.

Fractional derivativesEdit

In addition to n th derivatives for any natural number n, there are various ways to define derivatives of fractional or negative orders, which are studied in fractional calculus. The −1 order derivative corresponds to the integral, whence the term differintegral.

Complex analysisEdit

In complex analysis, the central objects of study are holomorphic functions, which are complex-valued functions on the complex numbers satisfying a suitably extended definition of differentiability.

The Schwarzian derivative describes how a complex function is approximated by a fractional-linear map, in much the same way that a normal derivative describes how a function is approximated by a linear map.

The Wirtinger derivatives are a set of differential operators that permit the construction of a differential calculus for complex functions that is entirely analogous to the ordinary differential calculus for functions of real variables.

Quaternionic analysisEdit

In quaternionic analysis, derivatives can be defined in a similar way to real and complex functions. Since the quaternions   are not commutative, the limit of the difference quotient yields two different derivatives: A left derivative


and a right derivative


The existence of these limits are very restrictive conditions. For example, if   has left-derivatives at every point on an open connected set  , then   for  .

Functional analysisEdit

In functional analysis, the functional derivative defines the derivative with respect to a function of a functional on a space of functions. This is an extension of the directional derivative to an infinite dimensional vector space.

The Fréchet derivative allows the extension of the directional derivative to a general Banach space. The Gateaux derivative extends the concept to locally convex topological vector spaces. Fréchet differentiability is a strictly stronger condition than Gateaux differentiability, even in finite dimensions. Between the two extremes is the quasi-derivative.

In measure theory, the Radon–Nikodym derivative generalizes the Jacobian, used for changing variables, to measures. It expresses one measure μ in terms of another measure ν (under certain conditions).

In the theory of abstract Wiener spaces, the H-derivative defines a derivative in certain directions corresponding to the Cameron-Martin Hilbert space.

On a function space, the linear operator which assigns to each function its derivative is an example of a differential operator. General differential operators include higher order derivatives. By means of the Fourier transform, pseudo-differential operators can be defined which allow for fractional calculus.

Analogues of derivatives in fields of positive characteristicEdit

The Carlitz derivative is an operation similar to usual differentiation have been devised with the usual context of real or complex numbers changed to local fields of positive characteristic in the form of formal Laurent series with coefficients in some finite field Fq (it is known that any local field of positive characteristic is isomorphic to a Laurent series field).

Along with suitably defined analogs to the exponential function, logarithms and others the derivative can be used to develop notions of smoothness, analycity, integration, Taylor series as well as a theory of differential equations.[1]

Difference operator, q-analogues and time scalesEdit

  • The q-derivative of a function is defined by the formula
    For x nonzero, if f is a differentiable function of x then in the limit as q → 1 we obtain the ordinary derivative, thus the q-derivative may be viewed as its q-deformation. A large body of results from ordinary differential calculus, such as binomial formula and Taylor expansion, have natural q-analogues that were discovered in the 19th century, but remained relatively obscure for a big part of the 20th century, outside of the theory of special functions. The progress of combinatorics and the discovery of quantum groups have changed the situation dramatically, and the popularity of q-analogues is on the rise.
  • The difference operator of difference equations is another discrete analog of the standard derivative.
  • The q-derivative, the difference operator and the standard derivative can all be viewed as the same thing on different time scales. For example, taking  , we may have
    The q-derivative is a special case of the Hahn difference,[2]
    The Hahn difference is not only a generalization of the q-derivative but also an extension of the forward difference.
  • Also note that the q-derivative is nothing but a special case of the familiar derivative. Take  . Then we have,

Derivatives in algebraEdit

In algebra, generalizations of the derivative can be obtained by imposing the Leibniz rule of differentiation in an algebraic structure, such as a ring or a Lie algebra.


A derivation is a linear map on a ring or algebra which satisfies the Leibniz law (the product rule). Higher derivatives and algebraic differential operators can also be defined. They are studied in a purely algebraic setting in differential Galois theory and the theory of D-modules, but also turn up in many other areas, where they often agree with less algebraic definitions of derivatives.

For example, the formal derivative of a polynomial over a commutative ring R is defined by


The mapping   is then a derivation on the polynomial ring R[X]. This definition can be extended to rational functions as well.

The notion of derivation applies to noncommutative as well as commutative rings, and even to non-associative algebraic structures, such as Lie algebras.

See also Pincherle derivative and Arithmetic derivative.

Commutative algebraEdit

In commutative algebra, Kähler differentials are universal derivations of a commutative ring or module. They can be used to define an analogue of exterior derivative from differential geometry that applies to arbitrary algebraic varieties, instead of just smooth manifolds.

Number theoryEdit

In p-adic analysis, the usual definition of derivative is not quite strong enough, and one requires strict differentiability instead.

Also see arithmetic derivative and Hasse derivative.

Type theoryEdit

Many abstract data types in mathematics and computer science can be described as the algebra generated by a transformation that maps structures based on the type back into the type. For example, the type T of binary trees containing values of type A can be represented as the algebra generated by the transformation 1+A×T2→T. The "1" represents the construction of an empty tree, and the second term represents the construction of a tree from a value and two subtrees. The "+" indicates that a tree can be constructed either way.

The derivative of such a type is the type that describes the context of a particular substructure with respect to its next outer containing structure. Put another way, it is the type representing the "difference" between the two. In the tree example, the derivative is a type that describes the information needed, given a particular subtree, to construct its parent tree. This information is a tuple that contains a binary indicator of whether the child is on the left or right, the value at the parent, and the sibling subtree. This type can be represented as 2×A×T, which looks very much like the derivative of the transformation that generated the tree type.

This concept of a derivative of a type has practical applications, such as the zipper technique used in functional programming languages.

Derivatives in geometryEdit

Main types of derivatives in geometry is Lie derivatives along a vector field, exterior differential, and covariant derivatives.

Differential topologyEdit

In differential topology, a vector field may be defined as a derivation on the ring of smooth functions on a manifold, and a tangent vector may be defined as a derivation at a point. This allows the abstraction of the notion of a directional derivative of a scalar function to general manifolds. For manifolds that are subsets of Rn, this tangent vector will agree with the directional derivative defined above.

The differential or pushforward of a map between manifolds is the induced map between tangent spaces of those maps. It abstracts the Jacobian matrix.

On the exterior algebra of differential forms over a smooth manifold, the exterior derivative is the unique linear map which satisfies a graded version of the Leibniz law and squares to zero. It is a grade 1 derivation on the exterior algebra.

The Lie derivative is the rate of change of a vector or tensor field along the flow of another vector field. On vector fields, it is an example of a Lie bracket (vector fields form the Lie algebra of the diffeomorphism group of the manifold). It is a grade 0 derivation on the algebra.

Together with the interior product (a degree -1 derivation on the exterior algebra defined by contraction with a vector field), the exterior derivative and the Lie derivative form a Lie superalgebra.

Differential geometryEdit

In differential geometry, the covariant derivative makes a choice for taking directional derivatives of vector fields along curves. This extends the directional derivative of scalar functions to sections of vector bundles or principal bundles. In Riemannian geometry, the existence of a metric chooses a unique preferred torsion-free covariant derivative, known as the Levi-Civita connection. See also gauge covariant derivative for a treatment oriented to physics.

The exterior covariant derivative extends the exterior derivative to vector valued forms.

Geometric calculusEdit

In geometric calculus, the geometric derivative satisfies a weaker form of the Leibniz rule. It specializes the Frechet derivative to the objects of geometric algebra. Geometric calculus is a powerful formalism that has been shown to encompass the similar frameworks of differential forms and differential geometry.[3]

Other generalizationsEdit

It may be possible to combine two or more of the above different notions of extension or abstraction of the original derivative. For example, in Finsler geometry, one studies spaces which look locally like Banach spaces. Thus one might want a derivative with some of the features of a functional derivative and the covariant derivative.

The study of stochastic processes requires a form of calculus known as the Malliavin calculus. One notion of derivative in this setting is the H-derivative of a function on an abstract Wiener space.

Multiplicative calculus replaces addition with multiplication, and hence rather than dealing with the limit of a ratio of differences, it deals with the limit of an exponentiation of ratios. This allows the development of the geometric derivative and bigeometric derivative. Moreover, just like the classical differential operator has a discrete analog, the difference operator, there are also discrete analogs of these multiplicative derivatives.

See alsoEdit


  1. ^ Kochubei, Anatoly N. (2009). Analysis in Positive Characteristic. New York: Cambridge University Press. ISBN 978-0-521-50977-0.
  2. ^ Hahn, Wolfgang (1949). "Über Orthogonalpolynome, die q-Differenzengleichungen genügen". Mathematische Nachrichten. 2: 4–34. doi:10.1002/mana.19490020103. ISSN 0025-584X. MR 0030647.
  3. ^ David Hestenes, Garrett Sobczyk: Clifford Algebra to Geometric Calculus, a Unified Language for mathematics and Physics (Dordrecht/Boston:G.Reidel Publ.Co., 1984, ISBN 90-277-2561-6