![]() |
Information is a concept first used to resolve the paradox of Maxwell's demon. About 150 years ago, the physicist J. C. Maxwell came up with an intriguing idea. He conceived a thought experiment, in which a little demon who operates a friction-free trap door to separate molecules of one type from the other (see Figure 01), and finally arrives at a system with lower entropy. Such organizing entity of Maxwell's seems to violate the second law of thermodynamics as the demon only selects molecules but does no work. This paradox kept physicists in suspense for half a century until Leo Szilard showed that the demon's stunt really isn't free of charge. When he creates the precious commodity called information; it actually produces an amount of entropy (through mental processing* in the brain) exactly offsetting the decrement in the re-arrangement. The unit of this commodity is the bit, and each time the demon chooses a molecule to shuffle, he shells out one bit ("select" or "un-select") of information for this cognitive act, precisely balances the |
Figure 01 Maxwell's Demon [view large image] |
thermodynamics accounts. The new concept has since shown its useful-ness in communication and computer, but perhaps its greatest power lies in biology, for the organizing entities in living beings - the proteins and certain RNAs - are but Maxwell's demons. |
![]() |
The information content in a given sequence of units be they letters in a sentence, or nucleotides in DNA, depends on the minimum number of instructions needed to specify or describe the structure. Many instructions are needed to specify a complex, information-bearing structure such as DNA. Only a few instructions are need to specify an ordered structure such as a crystal. In this case we have a description of the initial sequence or unit arrangement which is then repeated ad infinitum according to the packing instructions (see Figure 02). |
Figure 02 Crystal |
![]() |
|
Figure 03 Equilibrium |
![]() |
|
Figure 04 Non-equilibrium |
subjective connotations - many situations can be met only by an intelligent being or a living organism; and "usefulness" to some may be garbage to others. |
![]() |
Figure 05 shows an example which should be very easy to understand. According to one definition of "information", the meaning of the messages is completely ignored but takes into account only the variations that is possible in the message. Thus the grammatical rules are more or less taken out of the equation. The image on the left portrays Bob as a dummy who can only utter Ba Ba Ba Ba to Alice. There is no variation in the message resulting in no "information content". On the other hand, the picture on the right shows |
Figure 05 Information Content [view large image] |
Bob to possess a wonderful command of vocabulary. Alice is surprise to hear the actual message out of so many possibilities. Therefore, this definition uses the "average amount of surprise" as its criterion. It doesn't take into account the "meaning" of the message. |
-24 (see formula in "Mathematical Formulation").
![]() |
to the color of their bindings (Figure 06). There may be a thousand red books grouped together in one section of the shelves. This arrangement contains a certain amount of order and conveys some information, but not as much as in the first library. Since there are no rules governing the ordering of books on the shelves by title and author within the red section, the number of possible ways of arranging the books there is much greater. If the borrower knows that "War and Peace" has a red binding, he will proceed to the right section, but then he will have to examine each book in turn until he chances upon the one he is looking for. Then imagine a third library, in which all the rules have broken down. The books are strewn at random on any shelf or on a desk (as the case in Figure 03). "War and Peace" could be anywhere in the building. There is no denying that the books are in a certain specific sequence, but the sequence is a "noise," not a message. It is only one of a truly immense number of possible ways of arranging the books, and there is no telling which one, because all are equally probable. A borrower's |
Figure 06 Imperfect Cataloging [view large image] |
ignorance of the actual arrangement is great in proportion to the quantity of these possible, equally probable ways. |
![]() |
of two, and finally of one. This way, we would hit upon the Ace in three steps, regardless where it happens to be. The "yes" or "no" are the constraints (the rules or procedures), which resolve more uncertainties with more questionings and in turn obtaining more information. Thus, 3 is the minimum number of correct binary choices or, by our definition above, the amount of information needed to locate a card in this particular arrangement. In effect, what we have been doing here is taking the binary logarithm of the number of possibilities (N=8), i.e., log2(8) = 3. In other words, the information required to determine the location in a deck of 8 cards is 3 bits. A bit is the smallest unit of information. It is one of the two distinct |
Figure 07 Information and Rules [view large image] |
alternatives, which can be any of the hot/cold, black/white, in/out, up/down, ... pairs. The number of possible alternatives N for a series of K trials is 2K. Using up/down for example with K=3 trials. There are N = 23 = 8 possible alternatives: (up, down, up), (up, down, down), |
![]() |
For ni = 8, all the members are distributed randomly in one partition, finding the member there is certain but difficult without any information; while for ni = 1, the members are distributed orderly in eight equal size partitions, the chance of finding the member in each is 1/8 and 3 bits of information is required to find the member. In other words, a greater amount of uncertainty is resolved (more information) if the target chosen is one of a large number of possibilities and if it is one of the more unlikely of those possibilities as shown in Figure 08. |
Figure 08 Information and Probability [view large image] |
(pi) log2(pi) ---------- (1).
(pi) ln(pi) ---------- (2).
e-Ei/kT is the probability of finding the particle in state i, temperature T, energy Ei, and ln is logarithm with base e = 2.71828.
10-16 erg/K, which is very small comparing to the entropy generated in raising 1 gram of water by 1oC at room temperature (27oC), i.e.,
S = 1.4x10-8 erg/K.![]() |
The entropy in Eq.(3) is related to the information either by the mental (or biological) process in creating the information or via the natural decay from order to disorder. The relationship between information and entropy can be illustrated by the evolution of puffs of smoke, e.g., in the smoke signal as shown in Figure 09. Initially, when the puffs have just been delivered, most of the smoke molecules are concentrated near a point (as shown by Box Diagram 1 for one puff). At that instant, the system has significant order and information as interpreted by certain codes for the signalling. With time, the smoke molecules spread through the space, distributing themselves more and more evenly; the system evolves toward more probable states, states of less order and less information (Box Diagrams 2-3). Finally, the distribution of the molecules is completely even (Box Diagram 4). The system then lacks a definable structure - its molecules merely move about at random; order and information have decayed to zero. This is the state of thermodynamic equilibrium. Thus, the molecular system loses its information with time. Eventually, all the information dissipate into entropy as related by the equation : |
Figure 09 Information and Entropy [view large image] |
S2 = -(k ln2) I1 (see Figure 09). |
, then it is related to the entropy S by the expression: S = k ln
, where k is the Boltzmann's constant. Example b shows the multiplicity (Arrangements) of throwing a pair of distinguishable dice with the probability P of the outcome denoted by P =
/(Total # of Microstates). The multiplicity for each macrostate is designated by
(macrostate) in red.
![]() |
(greater disorder). This tendency toward greater disorder is prescribed by the second law ofthermodynamics and can be used to define the "arrow of time" since the sequence of events always flows one way to greater disorder. It is highly improbable for the reverse to happen naturally. Note that the second law of thermodynamics applies only to a closed system. For an open system, its orderliness can be boosted up at the expense of the surroundings (making it more disorder) with the infusion of energy. |
Figure 11 Arrow of Time [view large image] |
| Complexity As | Definition | Example(s) | Problem |
|---|---|---|---|
| Size | Larger size means higher complexity | Size of body or genome | Some simple organisms have larger genome size than human's |
| Entropy | More variation signifies more complex message | HHH... has no variation and zero entropy, the random sequence DXW... has lot of variation | The most complex object is in between most orderly and complete randomness |
| Algorithmic Content | Shorter computer program to describe the object corresponds to lesser complexity | HHH... requires very short description, garbled message cannot be compressed | Random object leads to high information content |
| Logical Depth |
Complexity is measured by how difficult to construct the object | HHH... is very easy to construct, while a specific message requires more work | It is difficult to measure the difficulty |
| Fractal Dimension | Higher fractal dimension equals to higher complexity | The coastal line is more complex than a straight line | There are other kinds of complexity not defined by fractal dimension |
| Degree of Hierarchy | Complexity is equated to the number of sub-systems | Organ to cells to organelles to macro-molecules to ... | It is difficult to separate the whole into parts |
![]() |
A working definition is taken here from ordinary discourse, which states: "it is a state of intricacy, complication, variety, or involvement, as in the interconnected parts of a structure - a quality of having many interacting, different components". Quantitatively, complexity can be measured roughly in term of the number of components such as the number of parts in a machine, the number of cell types in living organism (see Figure 12), or the vocabulary in a language. It is believed that complexity is created in nature by fluctuations - random deviations from some average, equilibrium value of density, temperature, pressure, etc. - also called "instabilities" or "inhomogeneities". Normally, an open system near equilibrium |
Figure 12 Evolution to Greater Complexity [view large image] |
does not evolve spontaneously to new and interesting structures. But should those fluctuations become too great for the open system to damp, the system will then |
![]() |
departs far from equilibrium and be forced to reorganize. Such reorganization generates a kind of "dynamic steady state" provided the energy flow rate exceeds the thermal relaxation rate. The feedback loops are positive in this kind of process. Complexity itself consequently creates the condition for greater instability, which in turn provides an opportunity for greater reordering.
Another approach to view the development of complexity is through the concept of energy flow per unit mass. Figure 13(a) shows the increase of complexity as the energy flow (per unit mass) into the various systems increases over the age of the universe. Figure 13b depicts qualitatively the departure from equilibrium at each bifurcate point where the energy flow has reached a critical value and thus can promote more complexity in the system. The dotted curves indicate the options that have not been taken by the evolution. The bifurcation is created when the system enters a nonlinear mode beyond some energy threshold. Thus the development of complexity is a |
Figure 13 Complexity and Energy Flow [view large image] |
phenomenon closely related to the chaos theory or nonlinearity with a positive feedback loop. |
Table 02 below compares randomness, order, and complexity in the context of information.
| System | Structure | Alphabetical Arrangement |
Natural System |
Order | Information |
|---|---|---|---|---|---|
| Randomness | random | HSIA TESHO SR I | molecules in the air | none | none |
| Order | periodic | HHHHHHHHHHH | crystal | lot | none |
| Complexity | aperiodic | HORSE THIS A IS | Nucleotides C, T, A, G | some | some |
| Specified Complexity | aperiodic | THIS IS A HORSE | Viable genes to produce proteins | lot | lot |
The difference between randomness and complexity in Table 02 is that while both may look random to an untrained eye, effort has been spent and information has been created to arrange the sequence in complexity similar to the case of the Maxwell's demon. Only man-made and biological systems have evolved to the level of specific complexity, in which the units are arranged in such a way that the sequence is capable of performing a certain function. The examples for specified complexity in the table either conveys a message by following a set of grammatical rules or produces proteins according to the genetic codes. Order looks nice but is incapable of conveying a message or encrypting information.