Produced by Charles Wells Revised 2017-04-18 Introduction to this website website TOC website index blog head of Understanding Math chapter
An abstraction of a concept $C$ is a concept $C'$ with these properties:
The concept $C'$ may include instances not included in $C$.
In this article, I will give one example of abstraction in detail, describing the way that mathematicians have abstracted the concept of relation. This will be followed by some examples not given in detail, with references.
Mathematicians have made use of relations between math objects since antiquity.
Let's consider how we could give a mathematical definition of "relation".
The logicians' definition of "relation" is to say that a relation is a true-or-false statement that can be made about a pair of math objects of a certain type. That defines "relation" in terms of the concept of "true-or-false statement".
In the late nineteenth century, mathematicians came up with a formal definition of "set" and began to use the theory of sets as a basis for all definitions in math. That resulted in this definition of "relation":
A relation on a set $S$ is a set $\alpha$ of ordered pairs of elements of $S$.
The definition of "relation" as any set of ordered pairs is an abstraction of the concept of relation generated by considering what examples of relations have in common. A consequence of this definition is that any set of ordered pairs is a relation.
Example: Let \[\xi:=\{(2,3),(2,9),(9,1),(9,2)\}\] Then by definition of relation, $\xi$ is a relation on the set $\{1,2,3,9\}$. Before this new definition of "relation" came up, no one would have called this a relation.
Your reaction may be: What relation IS it? Answer: just that set of ordered pairs. You know that $2\xi3$ and $2\xi9$, for example, but $1\xi9$ is false. There is no other definition of $\xi$.
Yes, the relation $\xi$ is weird. It is an arbitrary definition. It does not have any verbal description other than listing the pairs in $\xi$. It is probably useless. Live with it.
The symbol "$\xi$" is a Greek letter. It looks weird, so I used it to name a weird relation. Its upper case version is "$\Xi$", which is even weirder. I pronounce "$\xi$" as "ksee" but most mathematicians call it "si" or "zi" (rhyming with "pie").
The axiomatic method is a technique for studying math objects of some kind by formulating them as a type of math structure.
Typically when using the axiomatic method some of these things may happen:
A continuous function (from the set of real numbers to the set of real numbers) is sometimes described as a function whose graph you can draw without lifting your chalk from the board. This is a physical description, not a mathematical definition.
In the nineteenth century, mathematicians talked about continuous functions but became aware that they needed a rigorous definition. One possibility was functions given by formulas, but that didn’t work: some formulas give discontinuous functions and they couldn’t think of formulas for some continuous functions.
This description of nineteenth century math is an oversimplification.
Cauchy produced the definition we now use (the epsilon-delta definition) which is a rigorous mathematical version of the no-lifting-chalk idea and which included the functions they thought of as continuous.
To their surprise, some clever mathematicians produced examples of some weird continuous functions that you can’t draw, for example the sine blur function. In the terminology in the discussion of abstraction above, the abstraction $C'$ (epsilon-delta continuous functions) had functions in it that were not in $C$ (no-chalk-lifting functions.) On the other hand, their definition now applied to functions between some spaces besides the real numbers, for example the complex numbers, for which drawing the graph without lifting the chalk doesn’t even make sense.
Suppose you are studying the algebraic properties of numbers. You know that addition and multiplication are both associative operations and that they are related by the distributive law: $x(y+z)=xy+xz$. Both addition and multiplication have identity elements ($0$ and $1$) and satisfy some other properties as well: addition forms a commutative group for example, and if $x$ is any number, then $0\cdot x=0$.
One way to approach this problem is to write down some of these laws as axioms on a set with two binary operations without assuming that the elements are numbers. In doing this, you are abstracting some of the properties of numbers.
Certain properties such as those in the first paragraph of this example were chosen to define a type of math structure called a ring. (The precise set of axioms for rings is given in the Wikipedia article.)
You may then prove theorems about rings strictly by logical deduction from the axioms without calling on your familiarity with numbers.
When mathematicians did this, the following events occurred:
So the name "commutative ring" means the multiplication is commutative, because addition in rings is always commutative. Mathematical names are not always transparent.
Nowadays, all math structures are defined by axioms.
Many articles on the web about the axiomatic method emphasize the representation of the axiom system as a formal logical theory (formal system). Mathematicians in practice create and use a particular axiom system as a tool for research and understanding, and state and prove theorems of the system in semi-formal narrative form rather than in formal logic.
To generalize a mathematical concept $C$ is to find a concept $C'$ with the property that all instances of $C$ are also instances of $C'$.
a) $\mathbb{R}^n$, for arbitrary positive integer $n$, is a generalization of $\mathbb{R}^2$ (the set of all ordered pairs of real numbers). One replaces the ordered pairs in $\mathbb{R}^2$ by ordered $n$-tuples, and much of the arithmetic and spatial structure (except for the representation of $\mathbb{R}^2$ using complex numbers) and even some of our intuitions carry over to the more general case.
b) The concept of abstract vector space is a generalization of ${\mathbb{R}}^n$. To get it you forget that the elements of ${\mathbb{R}^n}$ are $n$-tuples of real numbers and your produce a different, more abstract definition obtained by finding properties of $\mathbb{R}^n$ such as the existence of addition, multiplication, scalar multiplication, and so on, and using these properties as axioms for the new concept of vector space.
Example (a) is an example of an expansive generalization, obtained by changing a datum in the definition of a concept (the dimension $2$ in this case) into a parameter.
Example (b) is obtained taking the concept and reconstructing it, producing a very different-looking definition that includes all the original examples and others as well. This is reconstructive generalization.
Both expansive and reconstructive generalizations, if done correctly, take a concept and introduce another concept that includes all the examples of the original concepts and (in general) others as well. These are legitimate generalizations.
It can happen that all the examples you know of a specific concept have a certain property $P$ not required by the definition. If you conclude from this that all the examples of the concept have property $P$ you are engaging in generalization from examples.
Generalization
from examples |
but 1/$n$ is never equal to $0$.
On the other hand,
\[\underset{n\to \infty }{\mathop{\lim }}\,\frac{\sin \left(\frac{\pi n}{4}\right)}{n}=0\]but every fourth term is zero. The sequence starts out like this:
\[\frac{1}{\sqrt{2}},\,\frac{1}{2},\,\frac{1}{3 \sqrt{2}}\, ,\,0,-\frac{1}{5 \sqrt{2}}\, ,\,-\frac{1}{6},\,-\frac{1}{7 \sqrt{2}}\, ,\,0,\,\frac{1}{9 \sqrt{2}}\, ,\,\frac{1}{10},\,\frac{1}{11 \sqrt{2}}\, ,\,0,\,-\frac{1}{13 \sqrt{2}}\, ,\,-\frac{1}{14},\,-\frac{1}{15 \sqrt{2}}\, ,\,0\]The names "expansive" and "reconstructive" are due to Harel and Tall.
Once you acquire an insight, you may not be able to understand how someone else can't understand it. It becomes obvious, or trivial to prove. That is the ratchet effect.
“How did you know that \[{{\left(\frac{{{x}^{3}}-10}{3{{e}^{-x}}+1} \right)}^{6}}\] is never negative?”
“Because it is a sixth power, so it is a square of a number, and the square of a number is never negative.”
(Smiting forehead) “Of course, why didn’t I see that!”
This observation stays in your head. You can’t forget it. Your neurons have been permanently changed and you can’t go back and regain your former state of non-understanding!
This is an example of pattern recognition – of ignoring the details and seeing \[{{\left(\frac{{{x}^{3}}-10}{3{{e}^{-x}}+1} \right)}^{6}}\]as\[\left(\left(\text{some stuff}\right)^3\right)^2\] See also chunking. There are other examples like this here, here and here
Mathematicians can be quite cruel once they know how to do something and they enjoy seeing you squirm as you struggle to catch on. –Marcus du Sautoy
“How could you NOT see that \[{{\left( \frac{{{x}^{3}}-10}{3{{e}^{-x}}+1}\right)}^{6}}\] is never negative?! It’s OBVIOUS that it’s never negative. It’s a SIXTH POWER YOU DUMMY!”
It is distressingly common that a mathematician for whom a concept has become obvious because of the ratchet effect will then tell someone else that the concept is obvious or trivial, leaving them to feel put down. It is one thing for s to do this to each other; that is a kind of normal competitiveness among members of a tribe. I believe behaving this way to non-mathematicians is a major contribution to the dislike and fear of math that many people have.
When you are explaining abstract math to non-mathematicians
DO NOT DUMP on those who are baffled
because they haven’t been RATCHETED UP yet.
This work is licensed under a Creative Commons Attribution-ShareAlike 2.5 License.