Nyquist stability criterion: Difference between revisions
en>Keepitfree m →Mathematical Derivation: corrected two typos. u(v) should be v(u) an d Gamma should be Gamma_s |
{{One source | date = December 2010}}
In [[mathematics]], '''abuse of notation''' occurs when an author uses a [[mathematical notation]] in a way that is not formally correct but that seems likely to simplify the exposition or suggest the correct [[Intuition (knowledge)|intuition]] (while being unlikely to introduce errors or cause confusion). Abuse of notation should be contrasted with ''misuse'' of notation, which should be avoided. | |||
{{anchor|Abuse of language}} A related concept is '''abuse of language''' or '''abuse of terminology''', in which a ''term'' rather than a notation is misused. Abuse of language is an almost synonymous expression that is usually used for non-notational abuses. For example, while the word [[group representation|''representation'']] properly designates a [[group homomorphism]] from a group G to GL(V), where V is a [[vector space]], it is common to call V "a representation of G". A common abuse of language consists in identifying two mathematical objects that are different but [[canonical (disambiguation)#Mathematics|canonically isomorphic]], for example identifying a [[constant function]] with its value, or identifying the [[Euclidean space]] of dimension three equipped with a [[Cartesian coordinate system]] with <math>\mathbb R^3</math>.
The latter uses may achieve clarity in the new area in an unexpected way, but they may also borrow arguments from the old area that do not carry over, creating a [[false analogy]].
== Examples ==
Common examples occur when speaking of compound mathematical objects. For example, a [[topological space]] consists of a set <math>X</math> (called the underlying set of the topological space) and a topology <math>\mathcal{T}</math>, and two topological spaces <math>(X, \mathcal{T})</math> and <math>(X, \mathcal{T}')</math>, even with the same underlying set <math>X</math>, can be quite different if they have different topologies. Nevertheless, it is common to refer to such a space simply as <math>X</math> when there is no danger of confusion—that is, when it is implicitly clear what topology is being considered. Similarly, one often refers to a [[group (mathematics)|group]] <math>(G, \star)</math> as simply <math>G</math> when the group operation is clear from context. | |||
=== Equivalence classes === | |||
A very common abuse of notation arises when a set is partitioned into disjoint subsets ([[equivalence classes]]) by an [[equivalence relation]]. Formally, if a set X is partitioned by an equivalence relation ~, then for a given x∈X, the (equivalence) class {y∈X|y~x} would be denoted [x]. But in practice, if the remainder of the discussion is focused on equivalence classes rather than individual elements of the underlying set, it is common to drop the square brackets. For example, in modular arithmetic, a finite group of size n can be formed by partitioning the integers via the equivalence relation x~y iff x≡y (mod n). The elements of that group would then formally be listed as {[0], [1], ..., [n-1]}, but in practice they are usually just called 0, 1, ..., n-1.
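The modular-arithmetic example can be sketched in Python. This is only an illustration: the class name <code>ResidueClass</code> and the modulus <code>N = 5</code> are assumptions made for the example, not part of the article.

```python
from dataclasses import dataclass

N = 5  # modulus, chosen only for illustration


@dataclass(frozen=True)
class ResidueClass:
    """The class [x] = {y : y ≡ x (mod N)}, stored via its representative in 0..N-1."""
    rep: int

    def __post_init__(self):
        # Normalise the representative so that [7] and [2] compare equal when N = 5.
        object.__setattr__(self, "rep", self.rep % N)

    def __add__(self, other):
        return ResidueClass(self.rep + other.rep)


# Formal notation: [3] + [4] = [2]
formal = ResidueClass(3) + ResidueClass(4)

# Abused notation: "3 + 4 = 2", with the brackets silently dropped
informal = (3 + 4) % N
```

The formal version keeps the class and the integer apart as distinct types; the abused version relies on the reader remembering that the integers stand for classes.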
Another type of case is the space of (classes of) measurable functions over a [[measure space]], or classes of [[Lebesgue integral|Lebesgue integrable]] functions, where the equivalence relation is equality "[[almost everywhere]]". | |||
=== Derivative === | |||
In standard [[Mathematical analysis|analysis]], algebraic manipulations of the [[Leibniz notation]] for the [[derivative]] <math>\frac{dy}{dx}</math> are commonly thought{{by whom|date=February 2013}} to be an abuse of notation. It is frequently convenient to treat the expression <math>\frac{dy}{dx}</math> as a fraction. For example, it leads to the correct formula for differentiation of the composition of functions (commonly called the "[[chain rule]]") <math>\frac{dy}{dx}=\frac{dy}{du}\frac{du}{dx}</math>. Another example is the concept of [[separation of variables]] in solving [[differential equations]], in which one can rewrite the equation <math>\frac{dy}{dx}={g(x) \over h(y)}</math> as <math>h(y) dy = {g(x)dx}</math> and then integrate. | |||
A related form of abuse of notation occurs when an integral like <math>\int {1 \over x}\,dx</math> is written as <math>\int {dx \over x}</math>, as if <math>dx</math> were a term that is multiplied into the argument <math>1 \over x</math> of the integral. | |||
=== Del operator ===
The [[del]] operator, denoted by <math>\nabla</math>, is a tuple of partial derivative operators posing as a vector. This suggests notations such as <math>\nabla f</math> for [[gradient]], <math>\nabla \cdot \vec v</math> for [[divergence]] and <math>\nabla \times \vec v</math> for [[curl (mathematics)|curl]]. The notation is extremely convenient because <math>\nabla</math> does behave like a vector most of the time. But it can be regarded as an abuse because <math>\nabla</math> doesn't [[commutative property|commute]] with vectors, and so doesn't satisfy ''all'' properties of vectors. | |||
(A contrary view is that notation is not abused if one does not think of <math>\nabla</math> as a vector. The vector-like notations are simply specially defined uses of the dot and cross.) | |||
===Cross product=== | |||
The determinant of a 3×3 matrix may be computed by "expanding along the first row" as follows: | |||
:: <math>\det \begin{bmatrix} | |||
a_1 & a_2 & a_3 \\ | |||
b_1 & b_2 & b_3 \\ | |||
c_1 & c_2 & c_3 | |||
\end{bmatrix} = a_1 \det \begin{bmatrix} | |||
b_2 & b_3 \\ | |||
c_2 & c_3\end{bmatrix}- a_2 \det \begin{bmatrix} | |||
b_1 & b_3 \\ | |||
c_1 & c_3\end{bmatrix}+ a_3 \det \begin{bmatrix} | |||
b_1 & b_2 \\ | |||
c_1 & c_2\end{bmatrix} </math> | |||
The [[cross product]] of the vectors (''a''<sub>1</sub>, ''a''<sub>2</sub>, ''a''<sub>3</sub>) and (''b''<sub>1</sub>, ''b''<sub>2</sub>, ''b''<sub>3</sub>) is given similarly by | |||
:: <math>\det \begin{bmatrix} | |||
a_2 & a_3 \\ | |||
b_2 & b_3\end{bmatrix}\mathbf{i} - \det \begin{bmatrix} | |||
a_1 & a_3 \\ | |||
b_1 & b_3\end{bmatrix}\mathbf{j}+ \det \begin{bmatrix} | |||
a_1 & a_2 \\ | |||
b_1 & b_2 \end{bmatrix} \mathbf{k}</math> | |||
Thus the cross product may be computed by writing the "symbolic determinant" | |||
:: <math>\mathbf{a}\times\mathbf{b}=\det \begin{bmatrix} | |||
\mathbf{i} & \mathbf{j} & \mathbf{k} \\ | |||
a_1 & a_2 & a_3 \\ | |||
b_1 & b_2 & b_3 \\ | |||
\end{bmatrix}</math> | |||
and expanding along the first row by rote, ignoring the fact that the matrix is not truly a matrix over the real or complex numbers (or whatever [[field (mathematics)|field]] the matrix entries belong to), and that thus the resulting computation does not compute an ordinary determinant. This is technically an abuse of notation, but is useful as a mnemonic to remember the formula for cross product and is very helpful in calculations.<ref>{{cite book |title=Multivariable Calculus |last=Stewart |first=James |authorlink= James Stewart (mathematician) |year=2007 |edition=6th |publisher=Brooks/Cole |isbn= 0-495-01163-0 |pages=822–823}}</ref> | |||
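The rote expansion described above can be written out as a short Python sketch. The helper names <code>det2</code> and <code>cross</code> are illustrative choices; the code simply follows the first-row cofactor pattern, treating '''i''', '''j''', '''k''' as positions in the result tuple.

```python
def det2(a, b, c, d):
    """Determinant of the 2x2 matrix [[a, b], [c, d]]."""
    return a * d - b * c


def cross(a, b):
    """Cross product via first-row expansion of the 'symbolic determinant'."""
    a1, a2, a3 = a
    b1, b2, b3 = b
    return (
        det2(a2, a3, b2, b3),    # coefficient of i
        -det2(a1, a3, b1, b3),   # coefficient of j (note the minus sign)
        det2(a1, a2, b1, b2),    # coefficient of k
    )
```

For instance, <code>cross((1, 0, 0), (0, 1, 0))</code> gives <code>(0, 0, 1)</code>, matching '''i''' × '''j''' = '''k'''.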
=== Function application over set ===
John Harrison (1996)<ref> | |||
{{cite book | last=Harrison | first=John | title=Formalized Mathematics | url=http://www.tucs.fi/publications/techreports/TR36.php | publisher=Turku Centre for Computer Science | series=[http://www.tucs.fi/research/series/serie.php?type=techreport Technical Reports] 36 | year=1996 | isbn=951-650-813-8 | chapter=2.2 Criticism and reconstruction | chapterurl=http://www.rbjones.com/rbjpub/logic/jrh0110.htm }} | |||
</ref> cites "the use of ''f''(''x'') to represent both application of a [[Function (mathematics)|function]] ''f'' to an argument ''x'', and the image under ''f'' of a subset, ''x'', of ''f'''s domain". (Note that the last ''x'' is usually written differently, e.g. as ''X'', in order to distinguish an element ''x'' of the domain from a subset ''X''.) | |||
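In a programming language the two meanings usually have to be separated explicitly. A minimal Python sketch, in which the function <code>f</code> and the helper <code>image</code> are hypothetical names chosen for illustration:

```python
def f(x):
    """An example function on the integers."""
    return x * x


def image(func, subset):
    """Image of a subset under func, i.e. {func(x) for x in subset}."""
    return {func(x) for x in subset}


value = f(-2)                           # application to an element
img = image(f, {-2, -1, 0, 1, 2})       # image of a subset of the domain
```

Here <code>value</code> is a number while <code>img</code> is a set; writing both as ''f''(''x'') hides that difference in type.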
=== Exponentiation as repetition === | |||
[[Exponentiation]] is repeated multiplication, and multiplication is frequently denoted by juxtaposition of operands, with no operator at all. The suggested superscript notation for other [[associative]] operations denoted by juxtaposition follows: | |||
* [[Function application]] is sometimes denoted without parentheses: <math>f\, x = f(x)</math>. This suggests the [[Function_composition#Functional_powers|functional powers]] notation: <math>f^2 x = f f\, x = f(f(x))</math>. This also generalizes nicely to represent [[function inverse]] for a power of −1 and [[functional square root]] for a power of 1/2. | |||
* [[Exponentiation#Over_sets|Exponentiation over sets]]. | |||
* String repetition: "ab<sup>3</sup>c" = "abbbc". | |||
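The functional-power and string-repetition readings can be sketched in Python. The helper <code>power</code> is an illustrative name and handles only non-negative integer exponents; inverses and functional square roots are not covered.

```python
def power(f, n):
    """Return the n-fold composition of f with itself, for an integer n >= 0."""
    def composed(x):
        for _ in range(n):
            x = f(x)
        return x
    return composed


def double(x):
    return 2 * x


eight = power(double, 3)(1)        # double(double(double(1)))
word = "a" + "b" * 3 + "c"         # the string "ab^3 c" written out
```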
=== Cartesian product as associative === | |||
The [[cartesian product]] is often seen as associative, with: | |||
:<math>(E \times F) \times G = E \times (F \times G) = E \times F \times G</math> | |||
This of course cannot be rigorously true: if <math>x \in E</math>, <math>y \in F</math> and <math>z \in G</math>, the identity <math>((x, y), z) = (x, (y, z))</math> would imply that <math>(x, y) = x</math> and <math>z = (y, z)</math>, and <math>((x, y), z) = (x, y, z)</math> would mean nothing. | |||
This notion can be made rigorous in [[category theory]], using the idea of a [[natural isomorphism]]. | |||
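A small Python sketch makes the distinction concrete. The sets <code>E</code>, <code>F</code>, <code>G</code> below are arbitrary examples: the two bracketings produce different sets of nested tuples, but re-bracketing gives a bijection between them.

```python
from itertools import product

E, F, G = [1, 2], ["a"], [True, False]

left = set(product(product(E, F), G))    # elements look like ((x, y), z)
right = set(product(E, product(F, G)))   # elements look like (x, (y, z))
flat = set(product(E, F, G))             # elements look like (x, y, z)

# Re-bracketing map ((x, y), z) -> (x, (y, z))
rebracketed = {(x, (y, z)) for ((x, y), z) in left}
```

The sets <code>left</code> and <code>right</code> are not equal as Python values, yet <code>rebracketed</code> coincides with <code>right</code>, which is the sense in which the product is associative "up to isomorphism".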
===Trigonometric functions=== | |||
In some countries it is common to denote the ''square of the value'' of <math>\sin(x)</math> as <math>\sin^2(x)</math>, and the ''inverse function'' as <math>\sin^{-1}(x)</math>. In his article on ''notation'' in the ''[[Edinburgh Encyclopedia]]'' [[Charles Babbage]] complains at length of this abuse of notation and suggests two alternatives for the notation <math>f^n(x)</math> | |||
* Function composition, i.e. <math>f^2(x) = f(f(x))</math> and <math>f^{-1}(x)</math> is the inverse. | |||
* Powers of the value, i.e. <math>f^2(x) = (f(x))^2</math> and <math>f^{-1}(x)=\frac{1}{f(x)}</math> is the reciprocal. | |||
Babbage argues strongly for the former, and also that the square of the value should be notated as <math>\sin x^2\ </math>, but beware: Babbage intends <math>(\sin x)^2\ </math> even though what he wrote is easily confused with <math>\sin(x^2)\ </math> (the only non-confusing way to avoid this abuse of notation is to ''always'' include the parentheses). | |||
To press his example further, Babbage investigates what the function <math>\sin^2(x) = \sin(\sin x)</math> is like, and also <math>\sin^{\frac{1}{2}}(x),</math> which is the function which, when composed with itself, equals <math>\sin(x)</math>, the [[functional square root]]. | |||
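The three readings that the notation can be confused between give different numbers, as a short Python check illustrates. The function names are chosen only for this example.

```python
import math


def sin_composed(x):
    """sin(sin(x)): the reading of sin^2 as function composition."""
    return math.sin(math.sin(x))


def sin_squared(x):
    """(sin x)^2: the reading of sin^2 as the square of the value."""
    return math.sin(x) ** 2


def sin_of_square(x):
    """sin(x^2): what 'sin x^2' is easily mistaken for."""
    return math.sin(x ** 2)


values = [sin_composed(2.0), sin_squared(2.0), sin_of_square(2.0)]
```

At ''x'' = 2 the three values are pairwise distinct, so the choice of convention matters.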
===Big O notation=== | |||
With [[Big O notation]], we say that some term <math>f(x)</math> "is" <math>O(g(x))</math> (given some function ''g'', where ''x'' is one of ''f''<nowiki>'</nowiki>s parameters). | |||
Example: "Runtime of the algorithm is <math>O(n^2)</math>" or in symbols "<math>T(n) = O(n^2)</math>". | |||
Intuitively this notation groups functions according to their growth respective to some parameter; as such, the notation is abusive in two respects: | |||
It abuses "=", and it invokes terms that are real numbers instead of function terms. | |||
It would be appropriate to use the set membership notation and thus write <math>f(n)\in O(g(n))</math> instead of <math>f(n)=O(g(n))</math>. | |||
This would allow for common set operations like <math>O(n\cdot\log n) \subset O(n^2)</math> and <math>O(2^n) \cup O(n^2)</math>,
and it would make clear that the relation is not symmetric, in contrast to what the "=" symbol suggests.
Some argue for "=", because as an extension of the abuse, | |||
it could be useful to overload relation symbols such as < and ≤, such that, | |||
for example, <math>f < O(g)</math> means that ''f'''s real growth is less than <math>g</math>. | |||
But this further abuse is not necessary if, following [[Donald Knuth|Knuth]] one distinguishes between ''O'' and | |||
the closely related [[Big O notation#Related asymptotic notations|''o'' and ''Θ'' notations]]. | |||
Concerning the use of terms for scalar numbers instead of functions, one encounters the following troubles. | |||
# You cannot cleanly define what <math>f(n)\in O(g(n))</math> may mean, due to the fact the O notation is about growth of functions, but to the left hand and the right hand side of the relation, there are scalar values, and you cannot decide whether the relation holds if you look at particular function values. | |||
# The abused O notation is bound to one variable, and the identity of that variable can be ambiguous: for instance, in <math>O(n^m)</math> one of the variables might be a parameter which is not in scope of the <math>O</math>. | |||
That is, you might mean <math>O(2^m)</math>, since <math>n</math> was the parameter that you assigned 2, or you might mean <math>O(n^3)</math>, since <math>m</math> was the parameter substituted by 3 here. | |||
Even <math>O(c)</math> might be the same as <math>O(1)</math>, since <math>c</math> might be a parameter, not the concerned function variable. | |||
The carelessness regarding the use of function terms might be caused by the rarely-used functional notations, like [[Lambda calculus|Lambda notation]]. | |||
You would have to write <math>(n\mapsto n\cdot\log n) \in O(n\mapsto n^2)</math> and <math>O(n\mapsto n\cdot\log n) \subset O(n\mapsto n^2)</math>. | |||
The correct O notation can be easily extended to complexity functions which map tuples to complexities; this lets you formulate a statement like | |||
"the graph algorithm needs time proportional to the logarithm of the number of edges and to the number of vertices" | |||
by <math>T_{\mbox{graph}}\in O((v,e)\mapsto v\cdot\log e)</math>, | |||
which is not possible with the abused notation. | |||
You can also state theorems like <math>O(f)</math> is a [[convex cone]], and use that for formal reasoning. | |||
===Equality vs. isomorphism=== | |||
Another common abuse of notation is to blur the distinction between [[equality (mathematics)|equality]] and [[isomorphism]]. For instance, in the [[construction of the real numbers]] from [[Dedekind cuts]] of rational numbers, the rational number <math>r</math> is identified with the set of all [[rational numbers]] less than <math>r</math>, even though the two are obviously not the same thing (as one is a rational number and the other is a set of rational numbers). However, this ambiguity is tolerated, because the set of rational numbers and the set of Dedekind cuts of the form {x: x<r} have the same structure. It is through this abuse of notation that '''Q''' is regarded as a subset of '''R'''. | |||
===Dirac delta function=== | |||
The [[Dirac delta function]] cannot be interpreted as a function in classical analysis. However, it is often treated as one, for example when calculating convolutions. Treating the Dirac delta "function" as a function lets the user dispense with the traditional [[Limit (mathematics)|limit]] notation and its visual clutter.
===Values of a random variable=== | |||
In probability theory, indicating probabilities of a [[Random variable]] is abuse of notation in two ways: Writing <math>P(x)</math> instead of <math>P(X = x)</math> leaves out the identity of the random variable (here <math>X</math>), which can be confusing out of context. However, even when writing <math>P(X = x)</math>, there is a mismatch of types: the expression <math>X=x</math> is an equation and from a [[type theory]] point of view has type boolean; that is, it evaluates to either "true" or "false". The domain of the <math>P</math> function here is not <math>\{T,F\}</math>, though; instead <math>P</math> should be logically thought of as taking two arguments: a random variable <math>X</math> and a subset of that random variable <math>X</math>'s sample space <math>\Omega</math>. This is important: if one were to implement <math>P</math> in a computer algebra system one would need to give it two arguments (and not only one boolean one), just like an implementation of the summation symbol <math>\sum_{i=min}^{max}f(i)</math> is really a function of the form <math>F(f,x,min,max)</math>, not <math>F(f,truthvalue,max)</math>. So a logically more appropriate notation could be <math>P(X, \{x\})</math> (the second argument here is the set of values we consider for <math>X</math>) or (borrowing from analysis, since the value set contains only the single element <math>x</math> in this case) <math>P(\left.X\right|_{x})</math>, but everybody writes <math>P(X = x)</math> or (abbreviated) <math>P(x)</math>. | |||
There is a good reason for such widespread so-called abuse: Notational abuse is a matter of perspective. | |||
Despite the arguably suggestive manner in which it is written, the notation <math>P(\phi)</math> does not (and is not meant to) mean applying some [[Function_(mathematics)|function]] to some value. | |||
Instead, the meaning is that <math>P(\phi)</math> takes the ''entire expression'' <math>\phi</math> as input --- ''not evaluated'' --- and expands into a particular, longer, expression in a (nominally) simpler language. | |||
Specifically, the notation can be defined by expanding to [[measure theory]] and [[set-builder notation]] as in (roughly): | |||
:<math>P(\phi) = P_\mu(\phi) = P_{\Omega,\mu}(\phi) = \frac{\mu(\{w \mid \phi(w) \text{ and } w\in\Omega\})}{\mu(\{w \mid w\in\Omega\})}</math> | |||
In words: To compute the probability of a ''formula'' being true, build the set of all possible worlds in which the formula is true, ''measure'' that set, and finally divide that by the measure of the set of all possible worlds. | |||
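For a finite sample space, the expansion above can be sketched in Python. This assumes a finite <math>\Omega</math> and passes the formula <math>\phi</math> as an unevaluated predicate; the names <code>prob</code>, <code>counting_measure</code> and the die example are illustrative assumptions, not part of the article.

```python
from fractions import Fraction


def prob(phi, omega, mu):
    """P(phi) = mu({w in omega : phi(w)}) / mu(omega), for a finite sample space."""
    event = {w for w in omega if phi(w)}
    return mu(event) / mu(omega)


def counting_measure(subset):
    """Measure of a finite set: its number of elements, as an exact fraction."""
    return Fraction(len(subset))


omega = {1, 2, 3, 4, 5, 6}   # outcomes of one roll of a fair die


def X(w):
    """A random variable on omega: here simply the face shown."""
    return w


p_three = prob(lambda w: X(w) == 3, omega, counting_measure)
p_even = prob(lambda w: X(w) % 2 == 0, omega, counting_measure)
```

Note that <code>prob</code> never receives a truth value: it receives the formula itself, as a function of the outcome, which is the point made above.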
There are, naturally, a number of other, better, ways to [[Probability_theory|define the notation]]. | |||
That which matters here is just to recognize that the notation is no more abusive than some abbreviation ultimately resting on top of set-builder notation. | |||
(Whether we consider set-builder notation to be rigorous is [[Russell's_paradox|another matter]] entirely.) | |||
Regarding the computer science perspective: <math>P(\phi)</math> can be --- directly --- implemented on a computer ''as a [[Macro_(computer_science)#Syntactic_macros|macro]]''. | |||
(The abbreviations can be supported by default parameters, fields, closures, environments, global variables, and so forth.) | |||
The fact that this implementation is awkward under [[Evaluation_strategy#Applicative_order|applicative-order evaluation]], as initially sketched, but simple under [[Evaluation_strategy#Normal_order|normal-order evaluation]], as just sketched, directly indicates that the concept is primarily about ''syntax''.
So regarding <math>P(X=x)</math>, while it can be called abusive, it can ''also'' be said to '''exemplify''' proper use of notation: | |||
it is a primitive of the ''language'' of probability theory (so is "notation"), that [[Probability_theory|has been shown]] to rigorously reduce to the language of set theory (so is "proper"). | |||
A perhaps uncontroversial example of abuse in probability theory is to take <math>P(X)</math> as meaning the marginal distribution of random variable <math>X</math>, and, ''at the same time'', to declare that <math>P(X=x)</math> means a number. | |||
At face value this seems legitimate, and it could perhaps be kept that way, but for the fact that probability theorists permit any sort of expression inside the <math>P()</math>. | |||
So, what would <math>P(Z)</math> mean, where <math>Z</math> is a non-basic random variable (deterministically) defined by <math>Z=(X=x)</math>? | |||
That is, <math>Z</math> is true when random variable <math>X</math> equals our favorite value, <math>x</math>, and in all other cases is false. | |||
Given that <math>Z=(X=x)</math> then one concludes that <math>P(Z)=P(X=x)</math> ought to hold. | |||
However, the left-hand side is supposed to mean a distribution, while the right hand side is supposed to mean a number. | |||
Distributions and numbers are not, of course, equal to one another, so contradiction ensues if we try to rigorously support both conventions ''at the same time''. | |||
The resolution is to call one convention the definition and the other the abuse. | |||
If we take <math>P(X=x)</math> meaning a number as the abuse, | |||
then the abuse is more specifically that we implicitly typecast a marginal distribution over a Boolean random variable down to its probability of being true. | |||
If we take <math>P(X)</math> meaning an entire distribution as the abuse, | |||
then the abuse is more specifically that we implicitly surround the expression with quantifiers ranging over all possible values of <math>X</math> | |||
(so as to form its entire marginal distribution one entry at a time). | |||
==Bourbaki== | |||
The term "abuse of language" frequently appears in the writings of [[Nicolas Bourbaki]]:<ref> | |||
{{cite book | last=Bourbaki | first=Nicolas | title=Algebra I: Chapters 1-3 | series=Elements of Mathematics | year=1988 | publisher=Springer }} | |||
</ref> | |||
:''We have made a particular effort always to use rigorously correct language, without sacrificing simplicity. As far as possible we have drawn attention in the text to'' abuses of language, ''without which any mathematical text runs the risk of pedantry, not to say unreadability.'' Bourbaki (1988). | |||
For example: | |||
:''Let E be a set. A mapping f of E × E into E is called a law of composition on E. [...] By an abuse of language, a mapping of a'' subset ''of E × E into E is sometimes called a law of composition'' not everywhere defined ''on E.'' Bourbaki (1988). | |||
In other words, it is an abuse of language to refer to [[partial function]]s from ''E × E'' to ''E'' as "functions from ''E × E'' to ''E'' that are not everywhere defined." To clarify this, it makes sense to compare the following two sentences. | |||
:1. A partial function from ''A'' to ''B'' is a [[function (mathematics)|function]] ''f: A' → B'', where ''<nowiki>A'</nowiki>'' is a [[subset]] of ''A''. | |||
:2. A function not everywhere defined from ''A'' to ''B'' is a function ''f: A' → B'', where ''<nowiki>A'</nowiki>'' is a subset of ''A''. | |||
If one were to be extremely pedantic, one could say that even the term "partial function" could be called an abuse of language, because a partial function is not a function. (Whereas a [[continuous function]] is a function that is continuous.) But the use of [[adjective]]s (and [[adverb]]s) in this way is standard English practice, although it can occasionally be confusing. Some adjectives, such as "generalized", can only be used in this way. (''e.g.'', a [[magma (algebra)|magma]] is a generalized [[Group (mathematics)|group]].) | |||
The words "not everywhere defined", however, form a [[English relative clauses#Zero relative pronoun|relative clause]]. Since in mathematics relative clauses are rarely used to generalize a noun, this might be considered an abuse of language. As mentioned above, this does not imply that such a term should not be used; although in this case perhaps "function not necessarily everywhere defined" would give a better idea of what is meant, and "partial function" is clearly the best option in most contexts. | |||
Using the term "continuous function not everywhere defined" after having defined only "continuous function" and "function not everywhere defined" is not an example of abuse of language. In fact, as there are several reasonable definitions for this term, this would be an example of woolly thinking or a cryptic writing style. | |||
==Subjectivity== | |||
The terms "abuse of language" and "abuse of notation" depend on context. | |||
Writing "''f'': ''A'' → ''B''" for a partial function from ''A'' to ''B'' is almost always an abuse of notation, but not in a [[category (mathematics)|category theoretic]] context, where ''f'' can be seen as a [[morphism]] in the category of partial functions. | |||
==See also== | |||
* [[Mathematical notation]] | |||
==References== | |||
<references/> | |||
==External links== | |||
*[http://www.henning-thielemann.de/Research/notation.pdf "Strong Symbols", by Henning Thielemann (PDF Slides)] Section 5: Common abuse of notation | |||
[[Category:Mathematical notation]] | |||
[[Category:Mathematical terminology]] |
Revision as of 20:50, 29 January 2014
In mathematics, abuse of notation occurs when an author uses a mathematical notation in a way that is not formally correct but that seems likely to simplify the exposition or suggest the correct intuition (while being unlikely to introduce errors or cause confusion). Abuse of notation should be contrasted with misuse of notation, which should be avoided.
A related concept is abuse of language or abuse of terminology, in which a term rather than a notation is misused. Abuse of language is an almost synonymous expression that is usually used for non-notational abuses. For example, while the word representation properly designates a group homomorphism from a group G to GL(V), where V is a vector space, it is common to call V "a representation of G". A common abuse of language consists in identifying two mathematical objects that are different but canonically isomorphic, for example identifying a constant function with its value, or identifying the Euclidean space of dimension three equipped with a Cartesian coordinate system with <math>\mathbb R^3</math>.
The latter uses may achieve clarity in the new area in an unexpected way, but they may also borrow arguments from the old area that do not carry over, creating a false analogy.
Examples
Common examples occur when speaking of compound mathematical objects. For example, a topological space consists of a set <math>X</math> (called the underlying set of the topological space) and a topology <math>\mathcal{T}</math>, and two topological spaces <math>(X, \mathcal{T})</math> and <math>(X, \mathcal{T}')</math>, even with the same underlying set <math>X</math>, can be quite different if they have different topologies. Nevertheless, it is common to refer to such a space simply as <math>X</math> when there is no danger of confusion, that is, when it is implicitly clear what topology is being considered. Similarly, one often refers to a group <math>(G, \star)</math> as simply <math>G</math> when the group operation is clear from context.
Equivalence classes
A very common abuse of notation arises when a set is partitioned into disjoint subsets (equivalence classes) by an equivalence relation. Formally, if a set X is partitioned by an equivalence relation ~, then for a given x∈X, the (equivalence) class {y∈X|y~x} would be denoted [x]. But in practice, if the remainder of the discussion is focused on equivalence classes rather than individual elements of the underlying set, it is common to drop the square brackets. For example, in modular arithmetic, a finite group of size n can be formed by partitioning the integers via the equivalence relation x~y iff x≡y (mod n). The elements of that group would then formally be listed as {[0], [1], ..., [n-1]}, but in practice they are usually just called 0, 1, ..., n-1.
Another type of case is the space of (classes of) measurable functions over a measure space, or classes of Lebesgue integrable functions, where the equivalence relation is equality "almost everywhere".
Derivative
In standard analysis, algebraic manipulations of the Leibniz notation for the derivative <math>\frac{dy}{dx}</math> are commonly thought to be an abuse of notation. It is frequently convenient to treat the expression <math>\frac{dy}{dx}</math> as a fraction. For example, it leads to the correct formula for differentiation of the composition of functions (commonly called the "chain rule") <math>\frac{dy}{dx}=\frac{dy}{du}\frac{du}{dx}</math>. Another example is the concept of separation of variables in solving differential equations, in which one can rewrite the equation <math>\frac{dy}{dx}={g(x) \over h(y)}</math> as <math>h(y)\,dy = g(x)\,dx</math> and then integrate.
A related form of abuse of notation occurs when an integral like <math>\int {1 \over x}\,dx</math> is written as <math>\int {dx \over x}</math>, as if <math>dx</math> were a term that is multiplied into the argument <math>1 \over x</math> of the integral.
Del operator
The del operator, denoted by <math>\nabla</math>, is a tuple of partial derivative operators posing as a vector. This suggests notations such as <math>\nabla f</math> for gradient, <math>\nabla \cdot \vec v</math> for divergence and <math>\nabla \times \vec v</math> for curl. The notation is extremely convenient because <math>\nabla</math> does behave like a vector most of the time. But it can be regarded as an abuse because <math>\nabla</math> doesn't commute with vectors, and so doesn't satisfy all properties of vectors.
(A contrary view is that notation is not abused if one does not think of <math>\nabla</math> as a vector. The vector-like notations are simply specially defined uses of the dot and cross.)
Cross product
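The determinant of a 3×3 matrix may be computed by "expanding along the first row" as follows:
:: <math>\det \begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix} = a_1 \det \begin{bmatrix} b_2 & b_3 \\ c_2 & c_3\end{bmatrix} - a_2 \det \begin{bmatrix} b_1 & b_3 \\ c_1 & c_3\end{bmatrix} + a_3 \det \begin{bmatrix} b_1 & b_2 \\ c_1 & c_2\end{bmatrix}</math>
The cross product of the vectors <math>(a_1, a_2, a_3)</math> and <math>(b_1, b_2, b_3)</math> is given similarly by
:: <math>\det \begin{bmatrix} a_2 & a_3 \\ b_2 & b_3\end{bmatrix}\mathbf{i} - \det \begin{bmatrix} a_1 & a_3 \\ b_1 & b_3\end{bmatrix}\mathbf{j} + \det \begin{bmatrix} a_1 & a_2 \\ b_1 & b_2 \end{bmatrix} \mathbf{k}</math>
Thus the cross product may be computed by writing the "symbolic determinant"
:: <math>\mathbf{a}\times\mathbf{b}=\det \begin{bmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \end{bmatrix}</math>
and expanding along the first row by rote, ignoring the fact that the matrix is not truly a matrix over the real or complex numbers (or whatever field the matrix entries belong to), and that thus the resulting computation does not compute an ordinary determinant. This is technically an abuse of notation, but is useful as a mnemonic to remember the formula for cross product and is very helpful in calculations.[1]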
Function application over set
John Harrison (1996)[2] cites "the use of f(x) to represent both application of a function f to an argument x, and the image under f of a subset, x, of f's domain". (Note that the last x is usually written differently, e.g. as X, in order to distinguish an element x of the domain from a subset X.)
Exponentiation as repetition
Exponentiation is repeated multiplication, and multiplication is frequently denoted by juxtaposition of operands, with no operator at all. The suggested superscript notation for other associative operations denoted by juxtaposition follows:
- Function application is sometimes denoted without parentheses: <math>f\, x = f(x)</math>. This suggests the functional powers notation: <math>f^2 x = f f\, x = f(f(x))</math>. This also generalizes nicely to represent function inverse for a power of −1 and functional square root for a power of 1/2.
- Exponentiation over sets.
- String repetition: "ab<sup>3</sup>c" = "abbbc".
Cartesian product as associative
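The cartesian product is often seen as associative, with:
:<math>(E \times F) \times G = E \times (F \times G) = E \times F \times G</math>
This of course cannot be rigorously true: if <math>x \in E</math>, <math>y \in F</math> and <math>z \in G</math>, the identity <math>((x, y), z) = (x, (y, z))</math> would imply that <math>(x, y) = x</math> and <math>z = (y, z)</math>, and <math>((x, y), z) = (x, y, z)</math> would mean nothing.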
This notion can be made rigorous in category theory, using the idea of a natural isomorphism.
Trigonometric functions
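In some countries it is common to denote the square of the value of <math>\sin(x)</math> as <math>\sin^2(x)</math>, and the inverse function as <math>\sin^{-1}(x)</math>. In his article on notation in the Edinburgh Encyclopedia Charles Babbage complains at length of this abuse of notation and suggests two alternatives for the notation <math>f^n(x)</math>:
- Function composition, i.e. <math>f^2(x) = f(f(x))</math> and <math>f^{-1}(x)</math> is the inverse.
- Powers of the value, i.e. <math>f^2(x) = (f(x))^2</math> and <math>f^{-1}(x)=\frac{1}{f(x)}</math> is the reciprocal.
Babbage argues strongly for the former, and also that the square of the value should be notated as <math>\sin x^2</math>, but beware: Babbage intends <math>(\sin x)^2</math> even though what he wrote is easily confused with <math>\sin(x^2)</math> (the only non-confusing way to avoid this abuse of notation is to always include the parentheses).
To press his example further, Babbage investigates what the function <math>\sin^2(x) = \sin(\sin x)</math> is like, and also <math>\sin^{\frac{1}{2}}(x),</math> which is the function which, when composed with itself, equals <math>\sin(x)</math>, the functional square root.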
Big O notation
With Big O notation, we say that some term "is" (given some function g, where x is one of f's parameters). Example: "Runtime of the algorithm is " or in symbols "". Intuitively this notation groups functions according to their growth respective to some parameter; as such, the notation is abusive in two respects: It abuses "=", and it invokes terms that are real numbers instead of function terms. It would be appropriate to use the set membership notation and thus write instead of . This would allow for common set operations like , , and it would make clear, that the relation is not symmetric in contrast to what the "=" symbol suggests. Some argue for "=", because as an extension of the abuse, it could be useful to overload relation symbols such as < and ≤, such that, for example, means that f's real growth is less than . But this further abuse is not necessary if, following Knuth one distinguishes between O and the closely related o and Θ notations. Concerning the use of terms for scalar numbers instead of functions, one encounters the following troubles.
- You cannot cleanly define what f(x) = O(g(x)) may mean: O notation is about the growth of functions, but both sides of the relation are scalar values, and you cannot decide whether the relation holds by looking at particular function values.
- The abused O notation is bound to one variable, and the identity of that variable can be ambiguous: for instance, in one of the variables might be a parameter which is not in scope of the .
That is, you might mean , since was the parameter that you assigned 2, or you might mean , since was the parameter substituted by 3 here.
Even might be the same as , since might be a parameter, not the concerned function variable.
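The ambiguity about which variable the O is bound to can be illustrated with a hypothetical two-variable term. The sketch below is not the article's own example: the term `a ** b`, the helper `stays_within_bound` and the chosen constants are assumptions made for illustration, and the check is a finite sanity test that can refute a bound but never prove one.

```python
def stays_within_bound(f, g, samples, constant):
    """Finite sanity check: is f(n) <= constant * g(n) at every sample point?

    This can refute a claimed bound, but it can never prove one.
    """
    return all(f(n) <= constant * g(n) for n in samples)

def term(a, b):
    """A hypothetical two-variable term a**b."""
    return a ** b

samples = range(1, 60)

# Reading 1: a is the function variable, b is a parameter fixed to 3.
cubic = lambda a: term(a, 3)
# Reading 2: b is the function variable, a is a parameter fixed to 2.
exponential = lambda b: term(2, b)

# The candidate bound n -> n**3.
bound = lambda n: n ** 3

cubic_ok = stays_within_bound(cubic, bound, samples, constant=1)
exponential_ok = stays_within_bound(exponential, bound, samples, constant=1000)

# Both readings yield the same scalar at (a, b) = (2, 3), so the scalar
# alone cannot tell which function was meant.
print(cubic(2), exponential(3), cubic_ok, exponential_ok)
```

Both readings agree on the single value 8, yet only the first stays within the cubic bound on the sampled range, which is why the statement has to be about functions rather than about values.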
The carelessness regarding the use of function terms might stem from the fact that functional notations, such as lambda notation, are rarely used; with them, one would have to write both sides of the relation as explicit function terms. The correct O notation can easily be extended to complexity functions that map tuples to complexities; this lets you formulate a statement like "the graph algorithm needs time proportional to the logarithm of the number of edges and to the number of vertices" as the membership of a two-argument function in an O class, which is not possible with the abused notation.
You can also state theorems like "O(f) is a convex cone", and use them for formal reasoning.
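As an illustration of the kind of formal reasoning that the set-based reading permits, the convex cone property can be derived in a few lines. The sketch assumes one common definition of the class O(g), for real-valued functions and growth as x tends to infinity; the symbols c, x_0, a and b are introduced here and are not taken from the article.

```latex
% Assumed definition:
\[
  O(g) = \{\, f \mid \exists c > 0,\ \exists x_0,\ \forall x \ge x_0 :\ |f(x)| \le c\,|g(x)| \,\}
\]
% Let f_1, f_2 \in O(g) with witnesses (c_1, x_1) and (c_2, x_2), and let a, b \ge 0.
% Then for all x \ge \max(x_1, x_2):
\[
  |a f_1(x) + b f_2(x)|
  \;\le\; a\,|f_1(x)| + b\,|f_2(x)|
  \;\le\; (a c_1 + b c_2)\,|g(x)|
  \;\le\; (a c_1 + b c_2 + 1)\,|g(x)| .
\]
% Hence a f_1 + b f_2 \in O(g), with witnesses
% c = a c_1 + b c_2 + 1 > 0 and x_0 = \max(x_1, x_2).
```

The statement is about the set O(g) itself, so it has no natural formulation when O appears only on the right-hand side of an "=".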
Equality vs. isomorphism
Another common abuse of notation is to blur the distinction between equality and isomorphism. For instance, in the construction of the real numbers from Dedekind cuts of rational numbers, the rational number r is identified with the set of all rational numbers less than r, even though the two are obviously not the same thing (as one is a rational number and the other is a set of rational numbers). However, this ambiguity is tolerated, because the set of rational numbers and the set of Dedekind cuts of the form {x: x<r} have the same structure. It is through this abuse of notation that Q is regarded as a subset of R.
Dirac delta function
The Dirac delta function cannot be interpreted as a function in classical analysis. However, it is often treated as one, for example when calculating convolutions. Treating the Dirac delta "function" as a function lets the user avoid the traditional limit notation and its visual clutter.
Values of a random variable
In probability theory, the usual way of writing probabilities of a random variable is an abuse of notation in two ways. First, writing P(x) instead of P(X = x) leaves out the identity of the random variable (here X), which can be confusing out of context. Second, even when writing P(X = x), there is a mismatch of types: the expression X = x is an equation and, from a type theory point of view, has type Boolean; that is, it evaluates to either "true" or "false". The domain of the function P here is not the set of Boolean values, though; instead, P should logically be thought of as taking two arguments: a random variable X and a subset of that random variable's sample space. This is important: if one were to implement P in a computer algebra system, one would need to give it two arguments (and not only one Boolean argument), just as an implementation of the summation symbol is really a function of the summand and the index range, not of a single evaluated term. So a logically more appropriate notation could be P(X, {x}) (the second argument here is the set of values we consider for X), possibly shortened by borrowing from analysis, since the value set contains only the single element x in this case, but everybody writes P(X = x) or (abbreviated) P(x).
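The two-argument reading can be sketched in a few lines of Python. Everything named here is an assumption made for illustration: a finite sample space of equally likely die outcomes, a random variable `parity`, and a function `prob` that takes the random variable and a set of values.

```python
from fractions import Fraction

# Hypothetical finite sample space: outcomes of one fair six-sided die.
SAMPLE_SPACE = [1, 2, 3, 4, 5, 6]

def parity(outcome):
    """A random variable: maps each outcome to 'even' or 'odd'."""
    return "even" if outcome % 2 == 0 else "odd"

def prob(random_variable, values):
    """Probability that random_variable takes a value in `values`,
    assuming all outcomes in SAMPLE_SPACE are equally likely."""
    favourable = [w for w in SAMPLE_SPACE if random_variable(w) in values]
    return Fraction(len(favourable), len(SAMPLE_SPACE))

# The counterpart of writing P(parity = 'even'):
p_even = prob(parity, {"even"})
print(p_even)
```

A call such as `prob(parity(3) == "even", ...)` would hand over an already evaluated Boolean, from which neither the random variable nor the value set could be recovered; this is the type mismatch described above.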
There is a good reason for such widespread so-called abuse: notational abuse is a matter of perspective. Despite the arguably suggestive manner in which it is written, the notation P(X = x) does not (and is not meant to) mean applying some function to some value. Instead, P takes the entire expression as input, unevaluated, and expands into a particular, longer expression in a (nominally) simpler language. Specifically, the notation can be defined by expanding to measure theory and set-builder notation as in (roughly):
P(φ) = μ({ω ∈ Ω : φ is true in ω}) / μ(Ω)
where φ is the formula, Ω is the set of all possible worlds and μ is the measure.
In words: to compute the probability of a formula being true, build the set of all possible worlds in which the formula is true, measure that set, and finally divide that by the measure of the set of all possible worlds. There are, naturally, a number of other, better ways to define the notation. What matters here is just to recognize that the notation is no more abusive than an abbreviation that ultimately rests on set-builder notation. (Whether we consider set-builder notation to be rigorous is another matter entirely.)
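The verbal recipe above translates almost word for word into code. This is a minimal sketch under stated assumptions: the worlds are two coin tosses, the measure is given by a hypothetical `weights` table, and the formula is passed as an unevaluated predicate so that it can be tested in every world.

```python
from fractions import Fraction

def measure(event, weights):
    """Measure of a set of worlds: the sum of their weights."""
    return sum(weights[w] for w in event)

def probability(formula, weights):
    """Build the set of worlds where the formula is true, measure it,
    and divide by the measure of the set of all worlds."""
    worlds = set(weights)
    satisfying = {w for w in worlds if formula(w)}
    return Fraction(measure(satisfying, weights), measure(worlds, weights))

# Hypothetical worlds: two tosses of a coin, each world with weight 1.
weights = {"HH": 1, "HT": 1, "TH": 1, "TT": 1}

# The formula is handed over as a function of the world, not as a value.
first_is_heads = lambda w: w[0] == "H"

p = probability(first_is_heads, weights)
print(p)
```

Passing the formula as a function is what keeps it unevaluated until a world is supplied; other encodings of the same idea are possible.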
Regarding the computer science perspective: P can be implemented directly on a computer as a macro. (The abbreviations can be supported by default parameters, fields, closures, environments, global variables, and so forth.) That this implementation is awkward in applicative-order evaluation, as initially sketched, but simple in normal-order evaluation, as just sketched, directly indicates that the concept is primarily about syntax.
So regarding P(X = x), while it can be called abusive, it can also be said to exemplify proper use of notation: it is a primitive of the language of probability theory (so is "notation") that has been shown to rigorously reduce to the language of set theory (so is "proper").
A perhaps uncontroversial example of abuse in probability theory is to take P(X) as meaning the marginal distribution of the random variable X and, at the same time, to declare that P(X = x) means a number. At face value this seems legitimate, and it could perhaps be kept that way, but for the fact that probability theorists permit any sort of expression inside the P. So, what would P(Y) mean, where Y is a non-basic random variable (deterministically) defined by Y = (X = x)? That is, Y is true when the random variable X equals our favorite value, x, and in all other cases Y is false.
Given that Y = (X = x), one concludes that P(Y) = P(X = x) ought to hold. However, the left-hand side is supposed to mean a distribution, while the right-hand side is supposed to mean a number. Distributions and numbers are not, of course, equal to one another, so a contradiction ensues if we try to rigorously support both conventions at the same time.
The resolution is to call one convention the definition and the other the abuse. If we take P(X = x) meaning a number as the abuse, then the abuse is more specifically that we implicitly typecast a marginal distribution over a Boolean random variable down to its probability of being true. If we take P(X) meaning an entire distribution as the abuse, then the abuse is more specifically that we implicitly surround the expression with quantifiers ranging over all possible values of X (so as to form its entire marginal distribution one entry at a time).
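The first resolution, an implicit typecast from a distribution over a Boolean random variable down to a number, can be made explicit in code. The following is a sketch under assumptions: the worlds are equally likely die outcomes, and the names `marginal`, `as_number`, `X` and `Y` are hypothetical.

```python
from collections import defaultdict
from fractions import Fraction

# Hypothetical equally likely worlds: outcomes of one fair die.
WORLDS = [1, 2, 3, 4, 5, 6]

def marginal(random_variable):
    """Marginal distribution of a random variable, as a value -> probability table."""
    dist = defaultdict(Fraction)  # missing entries start at Fraction(0)
    for w in WORLDS:
        dist[random_variable(w)] += Fraction(1, len(WORLDS))
    return dict(dist)

def as_number(boolean_distribution):
    """The explicit typecast: a distribution over {True, False} is
    reduced to its probability of being true."""
    return boolean_distribution.get(True, Fraction(0))

X = lambda w: w          # a basic random variable
Y = lambda w: X(w) == 6  # a Boolean random variable derived from X

dist_Y = marginal(Y)     # a distribution: a table with entries for False and True
p = as_number(dist_Y)    # a number
print(dist_Y, p)
```

Here `dist_Y` and `p` have different types, and the conversion between them is a visible function call rather than a silent convention.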
Bourbaki
The term "abuse of language" frequently appears in the writings of Nicolas Bourbaki:[3]
- We have made a particular effort always to use rigorously correct language, without sacrificing simplicity. As far as possible we have drawn attention in the text to abuses of language, without which any mathematical text runs the risk of pedantry, not to say unreadability. Bourbaki (1988).
For example:
- Let E be a set. A mapping f of E × E into E is called a law of composition on E. [...] By an abuse of language, a mapping of a subset of E × E into E is sometimes called a law of composition not everywhere defined on E. Bourbaki (1988).
In other words, it is an abuse of language to refer to partial functions from E × E to E as "functions from E × E to E that are not everywhere defined." To clarify this, it makes sense to compare the following two sentences.
- 1. A partial function from A to B is a function f: A' → B, where A' is a subset of A.
- 2. A function not everywhere defined from A to B is a function f: A' → B, where A' is a subset of A.
If one were to be extremely pedantic, one could say that even the term "partial function" could be called an abuse of language, because a partial function is not a function. (Whereas a continuous function is a function that is continuous.) But the use of adjectives (and adverbs) in this way is standard English practice, although it can occasionally be confusing. Some adjectives, such as "generalized", can only be used in this way. (e.g., a magma is a generalized group.)
The words "not everywhere defined", however, form a relative clause. Since in mathematics relative clauses are rarely used to generalize a noun, this might be considered an abuse of language. As mentioned above, this does not imply that such a term should not be used; although in this case perhaps "function not necessarily everywhere defined" would give a better idea of what is meant, and "partial function" is clearly the best option in most contexts.
Using the term "continuous function not everywhere defined" after having defined only "continuous function" and "function not everywhere defined" is not an example of abuse of language. In fact, as there are several reasonable definitions for this term, this would be an example of woolly thinking or a cryptic writing style.
Subjectivity
The terms "abuse of language" and "abuse of notation" depend on context. Writing "f: A → B" for a partial function from A to B is almost always an abuse of notation, but not in a category theoretic context, where f can be seen as a morphism in the category of partial functions.
External links
- "Strong Symbols", by Henning Thielemann (PDF Slides) Section 5: Common abuse of notation