International Components for Unicode

From Wikipedia, the free encyclopedia

International Components for Unicode (ICU) is an open source project of mature C/C++ and Java libraries for Unicode support, software internationalization and software globalization. ICU is widely portable to many operating systems and environments. It gives applications the same results on all platforms and between C/C++ and Java software. The ICU project is an open source development project that is sponsored, supported and used by IBM and many other companies.

Some of the services that it provides are the following.

  • Text: Unicode text handling, full character properties and character set conversions
  • Analysis: Unicode regular expressions; full Unicode sets; character, word and line boundaries
  • Comparison: Language sensitive collation and searching
  • Transformations: normalization, upper/lowercase, script transliterations
  • Locales: Comprehensive locale data and resource bundle architecture
  • Complex Text Layout: Arabic, Hebrew, Indic and Thai
  • Time: Multi-calendar and time zone
  • Formatting and Parsing: dates, times, numbers, currencies, messages and rule based

ICU provides much richer internationalization facilities than the standard libraries for C or C++, and most operating systems.

Contents

ICU is descended from C++ frameworks produced by Taligent in the mid 1990s. Soon after Taligent became part of IBM in early 1996, Sun Microsystems decided that Java, then in its infancy, "was missing international support. Taligent had great international technology, talented engineers, and a location about 100 meters from Sun's JavaSoft division in Cupertino, California. IBM arranged for Taligent's Text and International group to contribute international classes to Sun's Java Development Kit."[1] Some of code for text processing, date formatting, etc was rewritten in Java and became the JDK 1.1 internationalization APIs. A large portion of this still exists in the java.text and java.util packages. Further internationalization features were added with each later release of Java.

IBM programmers then rewrote the Java internalization classes in C++ and later ported some classes to C. The C++/C version of ICU is known as ICU4C. The ICU project also provides ICU4J ("ICU for Java"), which adds features not present in the standard Java libraries. ICU4C and ICU4J are kept as similar as possible (though not identical — for example, ICU4C includes a character converter API). Both have been enhanced over time to support new facilities and new features of Unicode. ICU was released as an open source project in 1999 under the name "IBM Classes for Unicode". It was later renamed to "International Components For Unicode".

  1. ^ Laura Werner (1999). Getting Java ready for the world: A brief history of IBM and Sun's internationalization efforts.

Advanced Search
Included Web Search Engines


Safe Search

close

Top Matching Results

Occasionally Search.com will highlight specialized results that are based on the context of your query. Examples of specialized results include specific links to news, images, or video.

Top Matching Results may highlight information from other Search.com pages, content from the CNET Network of sites, or third party content. The listings are based purely on relevance. Search.com does not receive payment for listings in this section but our partners that provide this data may get paid for listing these products.

Sponsored Links

This section contains paid listings which have been purchased by companies that want to have their sites appear for specific search terms and related content. These listings are administered, sorted and maintained by a third party and are not endorsed by Search.com.

Search Results

Search.com sends your search query to several search engines at one time and integrates the results into one list which has been sorted by relevance using Search.com's proprietary algorithm. You can customize the list of search engines included in your metasearch from the preferences.

The search engines that are used in your metasearch may allow companies to pay to have their Web sites included within the results. To view the Paid Inclusion policy for a specific search engine, please visit their Web site. Search.com does not accept payment or share revenue with any search engine partner for listings in this section.