Interword separation

From Wikipedia, the free encyclopedia

Interword separation is the act and the effect of mutually separating the written representations of words.

According to Spaces between Words[1], the early Semitic languages—which had no vowel signs—had interword separation, but languages with vowels (principally Greek and Latin) lost the separation, not regaining it until much later.

In modern languages, though punctuation marks used for other reasons (such as commas or semicolons) may have the side-effect of separating consecutive words, the issue of separating distinct consecutively written terms exists in general. Depending on the language and the epochs, interword separation may be achieved by means of special symbols or conventions, or by means of "blank zones" called spaces.

Contents

None Alphabetic writing without interword separation is sometimes known as scripta continua. This is or was typical for Ancient Egyptian, Ancient Greek language, Ancient Latin (after the interpunct period, until 600AD-800AD), Chinese, and Japanese.
Spaces English, Latin after 600AD-800AD, Romance languages, Korean
Vertical lines The ancient Anatolian hieroglyphs frequently (but not always) used vertical lines to separate words. Similarly, Linear B used short vertical lines. However, this technical advance mostly died out. In Biblical Hebrew, a vertical line between words called a Pasek indicates a small pause.
Slashes and dots One reference implies that Phoenician originally used slashes and dots to mark word boundaries. It continues to say that Hebrew and Aramaic scribes borrowed the slash and dot advance, and in Aramaic used a space.
Vertical lines/dots Ethiopic inscriptions used a vertical line, but on paper was written as two dots, resembling a colon (in Unicode, "ethiopic wordspace", at U+1361: ፡). This double-dot symbol also appears in ancient Turkic.
Interpunct Older Latin writing used the interpunct, a small dot, to separate words for a while before abandoning it (as in ALEA·IACTA·EST‬).
Different letter shapes Because Hebrew script and Arabic script do not have vowels, it is particularly important to recognize word boundaries. While Hebrew and Arabic have always used spaces between words, some letters also have different shapes depending upon their position.

Five Hebrew letters take a different shape when they are at the end of a word. Arabic characters have up to three different shapes, depending upon whether they are at the beginning, middle, or end of a word. Additionally, characters can have yet another shape when they stand alone as headings in an index.

Vertical space The Nasta'liq version of the Arabic script also uses vertical space to separate words. The beginning of each word is written high up above the baseline, while the end of the word is low, near the baseline; the line of text ends up looking a little bit like the teeth of a saw. While Nastaliq script is sometimes used to write Arabic, it is more often used for Persian, Uyghur, Pashto, and Urdu.

The Irish appear to have been the first to consistently use blank spaces to delimit word boundaries in the Latin alphabet, sometime between 600 AD and 800 AD. As Irish is from a different branch of the Indo-European language family than Latin, the Irish would have had much more difficulty reading Latin than people with, for example, Spanish or Italian (which descended from Latin and are still quite close to it) as their first language. Thus they would have had greater incentive to make reading Latin easier.

  1. ^ Saenger, Paul (2000). Spaces between Words. Stanford University Press. ISBN 0-8047-4016-X. 
Advanced Search
Included Web Search Engines


Safe Search

close

Top Matching Results

Occasionally Search.com will highlight specialized results that are based on the context of your query. Examples of specialized results include specific links to news, images, or video.

Top Matching Results may highlight information from other Search.com pages, content from the CNET Network of sites, or third party content. The listings are based purely on relevance. Search.com does not receive payment for listings in this section but our partners that provide this data may get paid for listing these products.

Sponsored Links

This section contains paid listings which have been purchased by companies that want to have their sites appear for specific search terms and related content. These listings are administered, sorted and maintained by a third party and are not endorsed by Search.com.

Search Results

Search.com sends your search query to several search engines at one time and integrates the results into one list which has been sorted by relevance using Search.com's proprietary algorithm. You can customize the list of search engines included in your metasearch from the preferences.

The search engines that are used in your metasearch may allow companies to pay to have their Web sites included within the results. To view the Paid Inclusion policy for a specific search engine, please visit their Web site. Search.com does not accept payment or share revenue with any search engine partner for listings in this section.