|
|
| Character Encoding |
|
| Listing: |
- A
Brief History of Character Codes
A concise history of the development of character
encoding in Western and East Asian languages,
including ASCII, EBCDIC, Unicode and TRON.
- czyborra.com
Information on the ISO 8859 alphabet soup,
Cyrillic encoding, PC codepages, East Asian
encoding, ASCII and variants, EBCDIC and Unicode
- by Roman Czyborra.
- ASCII
and EBCDIC Compared
A comparison of two of these two basic encoding
systems, with tables.
- Basis
Technology: Presentations and Papers
A wide range of articles on Unicode, East
Asian localization and Internationalization
issues.
- Character
Set Issues beyond HTML3.2
Internationalization issues beyond HTML3.2
and ISO-8859-1. Includes information on Baltic
encodings.
- Characters
and Encodings
A tutorial on character code issues in digital
processing and transfer of text data, on the
Internet or otherwise. Includes tables and
a detailed listing of control codes. In English
and Finnish.
- Chilkat
Charset Conversion Component
A character set conversion component for Unicode,
Japanese, Chinese, Korean, Cyrillic, Arabic,
Hebrew, Thai, Vietnamese and all Western languages.
- Dan's
Web Tips: Characters and Fonts
Hints and tips about character sets and fonts
in web development. Includes links to related
resources.
- Diffuse
Project: Character Set Standards
An overview of different character set encoding
standards.
- An
Early History of Character Set Standardization
Covers the beginnings of the ASCII standards
from ASCII-1963 onwards and information on
Cyrillic, Japanese, Korean, Thai and Vietnamese
encoding systems, including various localized
versions of EBCDIC. With tables and links
to other resources.
- ECMA:
Character Code Structure and Extension Techniques
Specifies the structure of ECMA-35, for 8-bit
codes and 7-bit codes which provide for the
coding of character sets, with a detailed
PDF document.
- eGrannie:
ASCII-EBCDIC chart
A side-by-side comparision of ASCII and EBCDIC
encoding.
- EKI
Letter Database
Query character sets, encoding, codepages
and Unicode information in an easy-to-use
web form. Held at the Institute of the Estonian
Language.
- HTML
Document Representation
Chapter covering document character sets and
encodings in HTML from the World Wide Web
Consortium's HTML 4.0 Specification.
- HTML
Validation: Using Character Encodings
How to validate HTML documents in various
character encodings.
- IANA:
Character Sets
The official names for character sets that
may be used in the Internet and referred to
in Internet documentation - held at the Internet
Assigned Number Authority.
- ISO
639 Language Names
The standard names for use in SGML and XML,
including a complete list of language name
codes.
- LangBox
International
Codetables for ISO 8859-6, ASMO 449 plus,
ASMO 708 (Arabic) and ISO 8859-8 (Hebrew)
and further information about the company's
work in multilingual UNIX.
- MS
Windows characters in HTML
A review of the HTML authoring problems caused
by some special characters which belong to
MS Windows character set but not to ISO Latin
1. Includes technical details and substitution
tables. In English and Finnish.
- 3rdpageSearch
Front end to several search engines and portals
that allows you to enter queries in various
character sets.
- Tips
& Techniques for Foreign Content on the
Web
Pennsylvania State University's guide to reading
and publishing different languages on the
web. Includes details of various encoding
systems and links.
- Tutorial:
Shady Characters
A tutorial that explains HTML character sets,
character encodings and character references
from Webreference.com.
- World
Wide Web Consortium
Covers code tables, Unicode, HTML and XML
and links to other resources and discusses
internationalization and localization issues
relating to character sets.
- Xceed
Binary Encoding Library
A library for Windows developers that allows
applications to encode binary data and files
into text and vice-versa.
|
|
|