summaryrefslogtreecommitdiff
path: root/autodoc.pl
diff options
context:
space:
mode:
authorSteve Peters <steve@fisharerojo.org>2008-12-19 11:38:31 -0600
committerSteve Peters <steve@fisharerojo.org>2008-12-19 11:38:31 -0600
commit2bbc8d558d247c6ef91207a12a4650c0bc292dd6 (patch)
treef56c82008dc643d8e799b8e21fb9a3c36b64b3b4 /autodoc.pl
parent7df2e4bc09d8ad053532c5f9232b2d713856c938 (diff)
downloadperl-2bbc8d558d247c6ef91207a12a4650c0bc292dd6.tar.gz
Subject: PATCH 5.10 documentation
From: karl williamson <public@khwilliamson.com> Date: Tue, 16 Dec 2008 16:00:34 -0700 Message-ID: <49483312.80804@khwilliamson.com>
Diffstat (limited to 'autodoc.pl')
-rw-r--r--autodoc.pl25
1 files changed, 24 insertions, 1 deletions
diff --git a/autodoc.pl b/autodoc.pl
index f97af93acf..25fabf0ca4 100644
--- a/autodoc.pl
+++ b/autodoc.pl
@@ -238,7 +238,30 @@ Note that all Perl API global variables must be referenced with the C<PL_>
prefix. Some macros are provided for compatibility with the older,
unadorned names, but this support may be disabled in a future release.
-The listing is alphabetical, case insensitive.
+Perl was originally written to handle US-ASCII only (that is characters
+whose ordinal numbers are in the range 0 - 127).
+And documentation and comments may still use the term ASCII, when
+sometimes in fact the entire range from 0 - 255 is meant.
+
+Note that Perl can be compiled and run under EBCDIC (See L<perlebcdic>)
+or ASCII. Most of the documentation (and even comments in the code)
+ignore the EBCDIC possibility.
+For almost all purposes the differences are transparent.
+As an example, under EBCDIC,
+instead of UTF-8, UTF-EBCDIC is used to encode Unicode strings, and so
+whenever this documentation refers to C<utf8>
+(and variants of that name, including in function names),
+it also (essentially transparently) means C<UTF-EBCDIC>.
+But the ordinals of characters differ between ASCII, EBCDIC, and
+the UTF- encodings, and a string encoded in UTF-EBCDIC may occupy more bytes
+than in UTF-8.
+
+Also, on some EBCDIC machines, functions that are documented as operating on
+US-ASCII (or Basic Latin in Unicode terminology) may in fact operate on all
+256 characters in the EBCDIC range, not just the subset corresponding to
+US-ASCII.
+
+The listing below is alphabetical, case insensitive.
_EOB_