summaryrefslogtreecommitdiff
path: root/utf8.c
Commit message (Collapse)AuthorAgeFilesLines
* Clean-up some warnings when compiling on Win32 with VC++Steve Hay2005-03-291-1/+1
| | | p4raw-id: //depot/perl@24096
* [perl #33185] UTF-8 string substitution corrupts memoryDave Mitchell2005-03-261-5/+5
| | | | | | | The implicit call of 'require utf8' triggered by code like "\x{100}" =~ /[[:print:]]/ wasn't saving state correctly first. p4raw-id: //depot/perl@24084
* Consting part 4Andy Lester2005-03-251-4/+4
| | | | | Message-ID: <20050325204331.GA16388@petdance.com> p4raw-id: //depot/perl@24079
* Third consting batchAndy Lester2005-03-241-22/+23
| | | | | Message-Id: <2f14220e7101a03f7659dbe79a03b115@petdance.com> p4raw-id: //depot/perl@24074
* More const parmsAndy Lester2005-03-181-51/+51
| | | | | Message-ID: <20050314195954.GB7141@petdance.com> p4raw-id: //depot/perl@24042
* Adding const qualifiersAndy Lester2005-03-141-11/+10
| | | | | Message-ID: <20050313202005.GA23535@petdance.com> p4raw-id: //depot/perl@24037
* Update copyright years and add editor blocksNicholas Clark2005-01-221-1/+2
| | | p4raw-id: //depot/perl@23861
* Re: uc($long_utf8_string) exhausts memoryJarkko Hietaniemi2005-01-221-35/+32
| | | | | | | | Message-Id: <41F1801C.3080201@iki.fi> Make buffer size estimates for utf8 case conversion less maximally pessimistic p4raw-id: //depot/perl@23857
* Further follow-up to 23767Robin Barker2005-01-191-1/+1
| | | | | Message-ID: <533D273D4014D411AB1D00062938C4D90849C57D@hotel.npl.co.uk> p4raw-id: //depot/perl@23824
* A terser way to write the \-ing code in pv_uni_displayNicholas Clark2005-01-141-11/+27
| | | | | Plus drive-by insert of a more correct editor block. (thanks Dave) p4raw-id: //depot/perl@23801
* sv_catpvf holds no advantage over sv_catpv when the "pattern" isn't.Nicholas Clark2005-01-141-8/+8
| | | p4raw-id: //depot/perl@23798
* sv_2mortal(NEWSV(0,0)) better written as sv_newmortal()Nicholas Clark2005-01-131-1/+1
| | | p4raw-id: //depot/perl@23796
* Re: Smoke [5.9.2] 23634 FAIL(F) openbsd 3.6 (i386/1 cpu) [PATCH]Gisle Aas2004-12-131-1/+1
| | | | | Message-ID: <lrr7lxje5s.fsf_-_@caliper.activestate.com> p4raw-id: //depot/perl@23640
* Fix a couple of typos.Rafael Garcia-Suarez2004-11-171-1/+1
| | | p4raw-id: //depot/perl@23508
* Make the perl interpreter more tolerant of UTF-16-encoded scriptRafael Garcia-Suarez2004-10-081-1/+7
| | | | | (patch by Jarkko Hietaniemi) p4raw-id: //depot/perl@23351
* fix minor nit in file description, to keep Jarkko happyDave Mitchell2004-08-091-1/+2
| | | p4raw-id: //depot/perl@23207
* Add comment to top of reentr.c and fix typos in other filesDave Mitchell2004-08-011-2/+2
| | | p4raw-id: //depot/perl@23180
* Add comment to the top of most .c files explaining their purposeDave Mitchell2004-07-311-0/+5
| | | p4raw-id: //depot/perl@23176
* Perl_is_utf8_string(pTHX_ U8 *s, STRLEN len)Nick Ing-Simmons2004-07-121-2/+2
| | | | | | | | Can now handle the case is_utf8_string(NULL,0) again. Why do we have code dup for Perl_is_utf8_string_loc() ? p4raw-id: //depot/perl@23083
* Save some repeated strlen()s in Perl_swash_initNicholas Clark2004-05-021-4/+7
| | | p4raw-id: //depot/perl@22764
* Update copyright noticesRafael Garcia-Suarez2004-03-161-1/+1
| | | p4raw-id: //depot/perl@22509
* Speed up the unicode case mappings (for /i, lc, uc, etc).Jarkko Hietaniemi2004-03-031-12/+10
| | | | | | Subject: [PATCH] [perl #24826] Message-Id: <9B5CBF96-6CE5-11D8-83B0-00039362CB92@iki.fi> p4raw-id: //depot/perl@22427
* Macrofy the compile/runtime test.Jarkko Hietaniemi2003-09-211-3/+3
| | | p4raw-id: //depot/perl@21297
* It's UTF-8, not UTF8. (Note: not s/UTF-8/UTF8/,Jarkko Hietaniemi2003-09-121-20/+20
| | | | | | since that would break a lot of code.) Also few stray UTF16s, UTF32s, and "encoded in Unicode". p4raw-id: //depot/perl@21198
* A new UTF-8 API, Perl_is_utf8_string_loc(), a variantJarkko Hietaniemi2003-09-101-0/+49
| | | | | of Perl_utf8_is_string(). p4raw-id: //depot/perl@21152
* [perl #22946] Bug in Unicode surrogate pair conversion in Perl_utf16_to_utf8 Dinger, Tom2003-07-241-1/+2
| | | | | | From: "Dinger, Tom" (via RT) <perlbug-followup@perl.org> Message-ID: <rt-22946-60715.1.00007189884266@rt.perl.org> p4raw-id: //depot/perl@20211
* Fix up Larry's copyright statements to my best knowledge.Jarkko Hietaniemi2003-04-161-1/+1
| | | | | | | (Lots of Perl 5 source code archaeology was involved.) Larry didn't make strangled noises when I showed him the patch, either :-) p4raw-id: //depot/perl@19242
* Synchronize the specifications of the POSIX characterJarkko Hietaniemi2003-04-161-2/+2
| | | | | | classes alnum, graph, and print closer to the planned Unicode proposal. p4raw-id: //depot/perl@19231
* Update all copyrights to 2003, from JarkkoHugo van der Sanden2003-03-021-1/+1
| | | p4raw-id: //depot/perl@18801
* API doc tweaks.Jarkko Hietaniemi2003-02-221-4/+7
| | | p4raw-id: //depot/perl@18760
* %_ (was Re: [PATCH] operation on `PL_na' may be undefined)Nicholas Clark2003-01-071-2/+2
| | | | | Message-ID: <20021226211626.GD284@Bagpuss.unfortu.net> p4raw-id: //depot/perl@18456
* [PATCH] bug in utf8.c(?)Marty Pauley2002-09-261-0/+1
| | | | | Subject: [PATCH] bug in utf8.c(?) p4raw-id: //depot/perl@17928
* Small speedup by inlining the easy bits of is_utf8_char()Jarkko Hietaniemi2002-07-011-3/+11
| | | | | into is_utf8_string(). p4raw-id: //depot/perl@17392
* good day for WinCE port of perl.Vadim Konovalov2002-05-161-9/+0
| | | | | Message-ID: <001301c1fc68$e808e560$a95cc3d9@vad> p4raw-id: //depot/perl@16628
* WinCE several touchesVadim Konovalov2002-05-131-1/+1
| | | | | Message-ID: <007b01c1fabe$cc8cbbf0$785cc3d9@vad> p4raw-id: //depot/perl@16582
* WinCE many fixesVadim Konovalov2002-04-281-0/+9
| | | | | Message-ID: <00bf01c1eedd$c0c62a00$d25cc3d9@vad> p4raw-id: //depot/perl@16251
* fixes for all the warnings reported by Visual C (most of thisGurusamy Sarathy2002-04-211-82/+82
| | | | | | change is from change#12026) p4raw-link: @12026 on //depot/maint-5.6/perl: ff42b73b40f5a895aef4bed81c794f468e0609bc p4raw-id: //depot/perl@16048
* my $utf8here, our $utf8here, and package variable $utf8here.Jarkko Hietaniemi2002-04-161-8/+27
| | | | | | | | | | | | | | | The actual minimal fix is in utf8.c and from NI-S, the rest are the tests (in fresh_perl since I couldn't get them easily to work elsewhere) and a slight behaviour change: previously UTF-8 identifiers had to start with an alphabetic character. No more so, now they can start with an (Unicode) ID_Continue character (which however is not a (Unicode) digit). (Limiting the first character to ID_Start would be rather restrictive, since ID_Start allows only alphabetic letters.) TODO: use vars qw($utf8here). This I don't find to be a showstopper. p4raw-id: //depot/perl@15943
* Re: Change 15762: As noted by Philip Newton: nothing wrong with BOM,Philip Newton2002-04-071-1/+1
| | | | | Message-ID: <1dnvau4j684hke2igk990f01nit8r2811s@4ax.com> p4raw-id: //depot/perl@15777
* As noted by Philip Newton: nothing wrong with BOM,Jarkko Hietaniemi2002-04-061-14/+3
| | | | | but 0xFFFE quite wrong. p4raw-id: //depot/perl@15762
* What started as a small nit (the charnames test, nit foundJarkko Hietaniemi2002-04-021-6/+10
| | | | | | | | | be Hugo), ballooned a bit... the goal is Larry's wish that illegal Unicode (such as U+FFFF) by default doesn't warn, since what if somebody WANTS to create illegal Unicode? Now getting close to this in the regex runtime. (Also, fix more of my fixation that BOM would be U+FFFE.) p4raw-id: //depot/perl@15689
* A little bit better error message for \pq, stillJarkko Hietaniemi2002-03-281-1/+5
| | | | | not good because the script context is not shown. p4raw-id: //depot/perl@15581
* Warn instead of croak.Jarkko Hietaniemi2002-03-271-8/+42
| | | p4raw-id: //depot/perl@15556
* B::perlstring and unicodeRafael Garcia-Suarez2002-03-181-1/+1
| | | | | Message-ID: <20020318231431.A699@rafael> p4raw-id: //depot/perl@15308
* more warnings tidyupPaul Marquess2002-03-111-4/+4
| | | | | | From: "Paul Marquess" <paul_marquess@yahoo.co.uk> Message-ID: <AIEAJICLCBDNAAOLLOKLMEEGDPAA.paul_marquess@yahoo.co.uk> p4raw-id: //depot/perl@15155
* EBCDIC: this seems to calm the last of theJarkko Hietaniemi2002-02-241-12/+8
| | | | | Malformed UTF-8 warnings. p4raw-id: //depot/perl@14850
* In EBCDIC the UNI_TO_NATIVE() macro evaluates its argumentJarkko Hietaniemi2002-02-201-2/+4
| | | | | twice, causing the loop to skip every other character. p4raw-id: //depot/perl@14800
* Misplaced block end.Jarkko Hietaniemi2002-02-191-1/+1
| | | p4raw-id: //depot/perl@14766
* Oops.Jarkko Hietaniemi2002-02-191-1/+1
| | | p4raw-id: //depot/perl@14762
* Try special casing first.Jarkko Hietaniemi2002-02-191-27/+33
| | | p4raw-id: //depot/perl@14759