Track your comments!
[x]


When you register, comments on your articles and replies to your comments appear here. Register Now!

Sign in to your account
[x]

Not a Scientific Blogging member yet?

Register Now for a Free Scientificblogging.com Account

  • Customize your profile with pictures, banner, a blogroll and more.
  • Leave comments on articles, add other members to your friend lists, chat with people on the site.
  • Write blog posts that can be seen by hundreds of thousands of readers.

It's free and it only takes a minute!

Already a Scientific Blogging member?

Sign In Now

Banner
By Mark Changizi | September 17th 2009 10:57 AM | 11 comments | Print | E-mail | Track Comments
About Mark

Mark Changizi is assistant professor in the Department of Cognitive Science at Rensselaer Polytechnic Institute, with expertise in theoretical neurobiology...

View Mark's Profile
Reading pervades every aspect of our daily lives, so much so that one would be hardpressed to find a room in a modern house without words written somewhere inside. Many of us now read more sentences in a day than we listen to. Not only are we highly competent readers, but our brains even appear to have regions devoted to recognizing words. A Martian just beginning to study us humans might be excused for concluding that we had evolved to read. 
 
But, of course, we haven’t. Reading and writing is a recent human invention, going back only several thousand years, and much more recently for many parts of the world. We are reading using the eyes and brains of our illiterate ancestors. Why are we so good at such an unnatural act? 
 
Here I describe recent evidence that, although we have not evolved to be good at reading, writing appears to have culturally evolved to be good for the eye. More specifically, recent research supports the exciting hypothesis that human visual signs look like nature, because that is what we have evolved over millions of years to be good at seeing. This ecological hypothesis for letter shape not only helps explain why we are such good readers, but answers the question, Why are letters and other visual signs shaped the way they are? 

The Variety of Visual Signs

Given the tremendous variety of visual signs over human history, it may at first glance seem that there could be no simple answer to the question, Why are visual signs shaped as they are? After all, we have been making visual signs for at least 40,000 years, starting with tool decorations and cave paintings.

The evolution of ornamentation, art, painting, and other non-linguistic visual signs (i.e., signs not part of language) has gone on unabated, diversifying into millions of non-linguistic symbols used over the ages, and occupying nearly all aspects of our lives, including pottery, body art, religion, politics, folklore, medicine, music, architecture, trademarks and traffic. 

sample logographic signs topography of language
Click image for larger size.

Writing (i.e., visual signs distinguished by use as a means of visually recording the content of spoken language) has also undergone an evolutionary explosion in variety. The earliest writing appeared several thousand years ago, and occurred independently in Sumer, Egypt and China (and much more recently in the Americas). These earliest linguistic visual signs were pictograms, evolving later to logograms (where a character denotes an object, idea or action), and a single logographic writing system (such as Chinese or Linear B) can have many thousands of distinct visual signs.

It wasn’t until about 2000 years ago in Egypt that phonemic writing was invented and used, where each character stands for a constituent of speech rather than having a meaning as in logographic writing. Many hundreds of writing systems have evolved and diversified from this ancestor (e.g., Latin, Arabic, Avestan, Mongolian, Phags-pa), varying widely in geometrical shape and style, and in the aspects of speech the characters represent (e.g., alphabets represent consonants and vowels, abugidas represent just consonants, and syllabaries represent syllables).
 
Amongst both non-linguistic and linguistic signs, some visual signs are representations of the world­e.g., cave paintings and pictograms, respectively­and it is, of course, not surprising that these visual signs look like nature. It would be surprising, however, to find that non-pictorial visual signs look, despite first appearances, like nature. Although writing began with pictograms, there have been so many mutations to writing over the millenia that if writing still looks like nature, it must be because this property has been selectively maintained. For non-linguistic visual signs, there is not necessarily any pictorial origin as there is for writing, because amongst the earliest non-linguistic visual signs were non-pictorial decorative signs. The question we then ask is, Why are non-pictorial visual signs shaped the way they are?
 
Previous efforts at answering this question have primarily concentrated on the differences. In particular, some of the shape differences among different (non-pictorial) visual signs are due to the kind of writing implement used, whether impressions in clay tablets with a blunt reed, rounded writing on leaves, or the physical details of a modified feather-tip point. Little attention has been devoted to uncovering the similarities, however, and as we will see here, there are deeper visual regularities that hold across human visual signs, independent of the writing mechanism (regularities that are also found in nature).

It is as if someone had noticed that throat size causes male and female voices to sound differently, without noticing that male and female speech possesses a critical deeper regularity, namely that they utter the same set of phonemes, morphemes, words and sentences as one another (within a single language speaking community). We will find that, despite superficial differences in their shapes, visual signs appear to possess similar underlying “visual phonemes.”

The Shapes of Visual Signs

Uncovering deeper visual regularities that might govern visual signs is crucial in any attempt to explain why visual signs are shaped as they are, and, in turn, to explain why we are so good at reading. After all, we cannot explain why visual signs are shaped as they are if we do not first determine how visual signs over history are, in fact, shaped! A difficulty in trying to find such regularities is that it is not straightforward to describe how even a single letter is shaped, for a single letter can undergo considerable distortion (e.g., from person to person, or from font to font) without losing its identity.

How can one hope to scientifically address the kinds of shapes found among visual signs, when it is awkward to even rigorously say what the shape of a single letter is?
 
To solve this problem I decided to use a topological notion of shape, where the details of the geometry do not matter, and what matters is only the manner in which strokes intersect, or join, with other strokes. A straight line, a C and an S have the same topological shape because each is topologically just a single stroke. L, T and X are the three distinct kinds of topological shape having two strokes. For example, a V has the same topological shape as an L because each consists of two strokes meeting at their endpoints.

This notion of shape will be helpful later in measuring the shapes of nature, because while geometrical shape can change quickly as a function of a person’s viewpoint, topological shape is more viewpoint invariant, providing a more robust characterization of the shapes in nature. This topological notion of shape is not merely useful, but possesses psychological justification as well: experts in psychology (e.g., Irving Biederman’s work on intermediate-level representations) and computer vision believe that our visual systems may represent shape in a topological manner.

topological shape figure 2 topography of language

In earlier research I had shown (along with my collaborator Dr. Shinsuke Shimojo) that letters have on average three strokes, and this average does not vary as a function of the number of letters in the writing system. Because of this, I considered topological shapes with three or fewer segments, in particular the 36 topological shapes that can be drawn with three straight lines (even though each topological shape covers curvy geometrical shapes as well). In addition to a single-stroke, and L, T and X, there are five three-stroke configurations having a single junction (i.e., a single point of intersection of the strokes) exemplified by Y, K and ψ. There are 11 configuration types having three segments and two junctions, exemplified by characters such as ] (or, equivalently, Z), 1, F, I, Π and ≠. Finally, there are 16 topological shapes with three segments and three junctions, such as Δ and A. 

With these 36 kinds of topological shape primitive in hand, we (Dr. Shinsuke Shimojo, and two Caltech undergraduate students, Qiong Zhang and Hao Ye) set about rigorously measuring how common these shapes occur among visual signs. We began by measuring from three distinct classes of non-pictorial visual sign: phonemic writing systems (non-logographic), Chinese characters (logographic), and non-linguistic symbols. The set of phonemic letters were taken from about one hundred phonemic writing systems over history, and the topological shape of the entire letter was measured (if it was one of the 36 types in our repertoire). For Chinese characters and non-linguistic symbols, the signs typically have more than three strokes, and we measured all the topological shapes that occur as part of the whole sign.

What we discovered is that the shapes across these three very different kinds of visual sign are similar. For example, Ls and Ts are in each case common, but Xs rare. And across the 32 different kinds of topological shape with three segments, these three classes of visual sign highly correlate with one another. For example, Y's tend to be common relative to ∗'s, Z's and F's more common than 1's, and H's more common than Π's. That is, despite the seemingly unrestrained variability in shape among these visual signs, they in fact possess a similar topological shape “signature.” Now we are in a position to more meaningfully ask why visual signs are shaped as they are. Namely, why do visual signs have this signature?

frequency of topological shapes topography Of Language figure 3

They don’t get this signature by chance. For example, if one were to randomly place strokes onto the writing surface, the most common two-segment topological shape would be X. T's would be rarer because they require a coincidental alignment of one endpoint along the edge of another. L's, in turn, are even rarer because they require the double-coincidence of two endpoints touching. Among the topological shapes with two or three junctions, the shapes with more X's will be more common in a randomly generated sign, and the shapes with more L's the rarest; e.g., ≠ is the most common topological shape with two junctions, and Z the least common.

This is not at all the case for the visual sign signature, where Ls and Ts are more common than X, and where, for example, ≠ is actually much rarer than Z. Another mechanism for the random generation of visual signs would be the act of scribbling, which is similar to the random-stroke case just mentioned, except that for scribbles Ls are now common, not rare. That is, for scribbles Ls and Xs are much more common than Ts, leading to a distribution of topological shapes unlike that of human visual signs.

topological shapes in visual signs topography Of Language figure 4

Designed For Reading or Writing?

Thus far, we have seen that human non-pictorial visual signs appear to possess a characteristic signature, and we have seen that this signature is not a result of chance. Before attempting to explain this signature, a natural first question is, Does this signature appear to be good for the eye, or good for the hand (or any other writing mechanism)? 

There are at least two reasons for expecting that visual sign shapes are designed (by cultural selection) for ease of reading, not ease of writing. First, visual signs are written once, but can be read many times. Second, writing speed is typically limited not by the motor system, but by the time taken for the writer to compose the sentence; that is, writing is not like talking, where we can talk effortlessly without feeling as if we are composing our thoughts. 

Shorthand is an example kind of visual sign that violates both of these reasons—it is typically not read more than once, and it is written without the writer having to compose the sentences (instead, the boss is orally dictating). Accordingly, shorthand is designed for the hand at the expense of the eye. We measured the topological shapes across six different shorthand writing systems, and found that their topological shapes are radically different from that found in visual signs more generally. 

In contrast, consider trademark logos, which are designed to be seen at the expense of being written (they are, in fact, typically not written at all). We discovered that trademark logos possess the same shape signature found in visual signs. 

That is, when we look at signs we know are designed for the eye at the expense of the hand, the signature matches the general signature we saw earlier, but when we look at signs we know are designed for the hand at the expense of the eye, the signature is altogether different. 

As further support for this, we found that the visual sign shape signature correlates well with the number of angles in the shape (a measure of visual complexity), but does not correlate at all with the number of hand motions required to write the topological shape (a measure of motor complexity). 

Together this makes a strong argument that the topological shapes of visual signs have been selected for reading, not writing. 

visual shapes designed for reading not writing topography Of Language figure 5
Natural to the Eye

The topological shapes of non-pictorial visual signs are, then, for the eye, not the hand. But we are still left with the question, Why does the eye like these shapes? Here is where the evolutionary, or ecological, hypothesis enters into the story. Because over millions of years of evolution our visual systems have been selected to be good at processing the conglomerations of contours occurring in nature, I reasoned that if visual signs have culturally evolved to be easy to see, then we should expect visual signs to have natural topological shapes. 
topological shapes 3D topography Of Language figure 6
Where are these topological shapes in nature? What were conglomerations of strokes for visual signs are now conglomerations of contours for natural scenes. Contours are the edges of objects (as seen by the eye), not, of course, strokes in the world. For example, an L occurs in the world when exactly two edges of an object meet at their endpoints, like an elbow. A T occurs in the world when the edge of an object goes behind another object in the foreground. A Y occurs, for example, at the inside corner of a rectangular room.

We measured how common these and the other topological shapes occur in natural scenes, and were stunned to find that nature possesses the shape signature we saw earlier for visual signs. That is, visual signs are shaped like nature, confirming our ecological hypothesis for the shapes of visual signs. 


common topological shapes in photographs topography Of Language figure 7

If visual signs look like nature, one might first suppose that the shape signature of nature depends significantly on which natural environment one considers. However, to our surprise, we found that the shape signature is highly robust, differing hardly at all whether we measured images of ancestral environments (e.g., tribal villages, savannas) or urban environments (buildings, walkways).

Because of the robust topological notion of shape we used in our analysis, any environment with opaque objects strewn about will tend to have the same shape signature.

This underlies why the diverse kinds of visual sign have a similar signature despite the diverse environments from which they spring, and one may speculate that aliens might for this reason possess visual signs that look reminiscent of our own.

topological shapes in nature topography Of Language figure 8
We have been considering visual signs generally, but let us now specifically consider letters in phonemic writing system, for there is an additional question one might have about letters. We saw a moment ago that letters look like natural object junctions. Our ecological hypothesis expects letters to look natural, but why natural junctions? Why not have letters shaped like natural single contours? Or, alternatively, why not have letters shaped like whole objects? Instead of one stroke or a dozen strokes, letters in fact tend to have about three strokes (independent of the size of the writing system), and thus are at an intermediate level between edges and objects. 

The answer may lie in the following pair of facts: (i) we wish to read words, not letters; and (ii) we have evolved to see objects, not object-junctions. In this light, we expect culture to select words to look like objects, so that words may be processed by the same area in visual cortex responsible for recognizing objects.

Logographic characters (e.g., Chinese) and non-linguistic symbols do tend to be more object-like, possessing many more than three strokes. For phonemic writing, however, there are severe limits to how closely words can match natural objects, for the manner in which letters combine is determined by speech. However, by having letters shaped like natural object-junctions—rather than natural contours or natural whole objects—written words become combinations of natural junctions, and thus more similar to objects and more easily processed by our visual system. 

heirarchy of visual cortex for recognizing shapes topography Of Language figure 9
Evolution by natural selection is too slow to design our brains for reading, and so cultural selection has come to the rescue, designing (without any designer) visual signs for our brains. Because our visual systems have evolved to be good at perceiving natural objects, cultural evolution has created non-linguistic symbols, logographic symbols, and written words in phonemic writing that tend to be built out of object-junction-like constituents, and are thus object-like.

In particular, this explains why letters tend to have around three strokes and have the topological shapes they do. We expect that these insights will be useful in designing optimal alphabets or visual displays.

Because culture is capable of designing for the eye, the visual signs of our culture are a fingerprint of what our visual systems like. Akin to the linguistic study of the auditory productions humans make, the “visual linguistic” study of the visual productions people make is a currently under-utilized tool for vision research.

There is every reason to believe that the study of visual linguistics will aid traditional lab experiments on vision and brain design as much as linguistics has supplemented lab experiments on cognition. 

Short Bibliography: 
Biederman I&Cooper EE (1991) Priming contour-deleted images: evidence for intermediate representations in visual object recognition. Cognitive Psychology 23: 393–419.

Changizi MA&Shimojo S (2005) Character complexity and redundancy in writing systems over human history. Proceedings of the Royal Society of London B 272: 267-275.

Changizi MA (2006) The optimal human ventral stream from estimates of the complexity of visual objects. Biological Cybernetics 94: 415-426.

Changizi MA, Zhang Q, Ye H & Shimojo S (2006) The structures of letters and symbols throughout human history are selected to match those found in objects in natural scenes. The American Naturalist 167: E117-E139.

Changizi MA (2009) The Vision Revolution (Benbella Books, Dallas).

Daniels PT & Bright B (1996) The World’s Writing Systems. New York: Oxford University Press.

Comments

Interesting. Was any attempt made to correct for the fact that the overwhelming majority of phonemic writing systems in use today all derive from the Phonecian abjad? It would seem that the burden of proof in this case would first lie with showing that similarities in topography were independent of common ancestry. Were any non-Phonecian-derived phonemic systems included? How were the scripts, and the samples thereof, chosen? I can think of a few writing systems that would defy topographical analysis altogether. Likewise, different orthographies even within the same writing system or language could produce very different topological footprints (a medieval blackletter script vs. Helvetica vs. a doctor's cursive!). Some of the correlations shown are very impressive, but I can't help but wonder if they would remain with different sets of data. I'd love to see the topographical footprint of 100 different historical Latin alphabets, for instance.

I assume many of these questions are addressed in the book, of course.

Mark Changizi's picture

Hi Jack,

The data come from non-linguistic signs, trademark logos, logographic writing systems, and phonemic writing systems. The latter group consisted of about 100 pulled from those I had used in a previous paper, prior to having come up with the "natural" hypothesis; the previous paper is here...http://www.changizi.com/char.zip


The "writing looks like nature" paper itself can be freely linked here...http://www.journals.uchicago.edu/doi/pdf/10.1086/502806?cookieSet=1

On shared origins, one could argue that because the first signs were pictographs, perhaps any natural signature found today is due to inheritance from this. The difficulty is that writing systems change substantially over time, and if there had not been selection pressure maintaining the earlier (natural) signature, it would seem that it should have been lost or mostly lost by now. Same point for those writing systems coming from Phonecian.

I'd love to see more work done along the lines you suggest. To look at individual writing systems, one would have to look at a smaller set of "shapes" types; it isn't helpful to have 36 shape types when looking at a writing system with only, say, 26 letters. 

Sincerely,

Mark

I'll take a closer look at the articles once my comps are over. One thing I did notice in the first paper is the statement "Lower-case characters were used unless otherwise noted...." This brings me back to the question of different orthographies. Minuscules are a relatively-recent innovation, and wouldn't necessarily represent the letter-forms that had been actually read during most of the system's history. There is also the distinction between epigraphic and manuscript forms, square and cursive, book and documentary hands, etc. Picking the forms presented on Omniglot, which often represent modern printed typefaces, may have weighted the sample toward designs which have been deliberately tweaked for easier reading at the expense of historical accuracy.

Now, I do think there may be a selection-pressure influencing changes in characters over time, and quite possibly one rooted in our cognitive biology, especially as technology placed fewer limits on orthography. I certainly would rather read Greek set in a modern minuscule than a late-antique uncial. I just think that picking one orthography from a single point in time as the "official" form of a writing system actually obscures the process of development. Perhaps using samples from real documents and inscriptions, rather than from Omniglot, would provide more illuminating results.

Mark, really interesting work. I'm ABD in linguistics and found this piqued my curiosity. I never studied sign language, but it seems plausible that one could make an argument that sign language gestures were selected for (over time) based on some natural requirement for those gestures that our brain is naturally good at parsing. I know that sign language scholars do use the term "phoneme" to refer to sub-parts of gestures within sign language (meaning something like feature sets composed of minimal velocity and maximal change of directional angle. Just thinking out loud...

Mark Changizi's picture

Hi Chris,

I've thought about that.  The goal would be to somehow catalog the set of natural gestures, or rate gestures according to their frequency of use in non-linguistic settings, and then see if sign language uses the more common of these gestures. Problem might be that there's relatively few sign languages, and they're relatively young, and potentially not as well-optimized.

Sincerely,

Mark

FYI, the article link returns an error.

From reading this, it sounds suspiciously like you've bought into the myth that Chinese type scripts encode meaning without any reference to sound.

Mark Changizi's picture

I'm vaguely aware of this, although don't know much about it. For the visual appearance of letters and characters, though, it doesn't matter.  Thanks for the comment. -Mark

The article proposes 'three distinct classes' of 'very different kinds of visual signs', but confuse the issue by twice lumping logograms in with non-linguistic symbols (have >3 strokes, are 'object-like'), and introduce trademark logos as if it were a fourth category, and then add categories of shorthands and scribbles. The three categories are not so distinct.

One is described as logograms that denote a meaning--as opposed to other scripts that stand for a constituent of speech--a notion that is thoroughly discredited, along with Gelb's evolutionary model of script development that it also implicitly endorses.

One source of logographic signs is Linear B, which is widely considered to be a syllabary; the few examples of non-linguistic signs include a Syllabic from Cree and arguably a Chinese character and a Letter from Gurmukhi (upside down); and the examples of 'phonemic signs' include Chinese characters for 'one, two, three, ten, t-shaped', and 'work'.
Even if one could classify scripts as logographic or phonemic, this wouldn't hold for individual symbols as they often serve different functions. Japanese writes a number of symbols as both Kanji and syllabics, even within the same script; the English “I” is famously at once alphabetic, syllabic, and 'logographic' or morphemic.

The response to my comment above implies that the letters and characters are visually distinct based only on their visual appearance, without reference to the phonology they encode. It's not obvious that this is possible. The point of the research seems to be contrasting signs designed to be read with those designed to be written, which needn't involve any of the these problems. The graph would make the point more forcefully if it conflated Chinese and non-logographic categories.

The other problem is with topology. The forms |, L, T, X, Y, K, and phi are all homotopic, and | and L are homeomorphic as well. It's hard to imagine any kind of topology, in the mathematical sense, that would classify | and L as different forms. If the authors have devised their own classification scheme, as seems to be the case, then the article needs to make this clear.

Mark Changizi's picture

Thanks for your comment.

Your main point seems to be that it is inappropriate to, for example, keep Chinese characters in a distinct data set from writing systems like the one you're reading now. This would be a non-starter, however, for any analysis of the shape of characters. The average number strokes per character for non-logographic writing systems tends to about three, and their shape therefore has to be handled in a different quantitative fashion than that for characters having much more than three. 

And although logographic writing has phonemic aspects to it, one can't ignore the huge qualitative differences that nevertheless exist between them: that Chinese characters often have meanings at the word level, whereas phonemic writing systems do not. ...and that this is the primary reason for the size difference in their characters. This is not in any way "discredited".

Some other points:

Trademark logos are introduced as a special case of non-linguistic symbols, as a case that is clearly selected for the eye at the expense of the hand.

Linear B had both syllabic and logographic signs. I displayed in one of the figures some of the latter, but I actually used the syllabic characters from Linear B within the non-logographic data set.

The examples here from non-linguistic signs are sampled from books of symbols, some symbols which could have visual similarities to linguistic signs.

The phonemic signs are pulled from a figure within a Proc Roc Soc Lond B paper, where Chinese numerals were part of the data set, to compare to other numeral systems. They ended up accidentally being piled into that figure, but were not actually part of the data set for non-logographic signs.

On topology, the point was not to remain true to mathematics, but to have a topologically inspired notion of shape that is a useful one for characterizing both shapes of characters and shapes of contour-configurations in natural scenes.  It is a useful one for natural scenes because it is robust to viewpoint change. And it is a useful one for writing systems because it is robust to the larger variability in geometric shape across users and fonts.

-Mark

An excelent article although there's a small confusion "(e.g., alphabets represent consonants and vowels, abugidas represent just consonants, and syllabaries represent syllables)" Abugidas represent consonants with an inherent vowel and have representations for vowels in the initial position (like devanagari), abjads are the ones that represent only consonants and some of them indicate vowels with matres lectonis when they do.

Add a comment

The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <sup> <sub> <a> <em> <strong> <center> <cite> <code> <TH><ul> <ol> <li> <dl> <dt> <dd> <img> <br> <p> <blockquote> <strike> <object> <param> <embed> <del> <pre> <b> <i> <table> <tbody> <div> <tr> <td> <h1> <h2> <h3> <h4> <h5> <h6> <hr> <iframe>
  • Lines and paragraphs break automatically.
  • Web page addresses and e-mail addresses turn into links automatically.
CAPTCHA
If you register, you will never be bothered to prove you are human again. And you get a real editor toolbar to use instead of this HTML thing that wards off spam bots.