diff options
author | Yves Orton <demerphq@gmail.com> | 2016-06-20 22:51:38 +0200 |
---|---|---|
committer | Yves Orton <demerphq@gmail.com> | 2016-06-22 18:21:32 +0200 |
commit | 8bf4c4010cc474d4000c2a8c78f6890fa5f1e577 (patch) | |
tree | b12d25aa70138f2dbc13bf1bb49a93fab7e7a4b7 /pod/perldata.pod | |
parent | 6c50b67b99a3df9486896d14dc294825a148a673 (diff) | |
download | perl-8bf4c4010cc474d4000c2a8c78f6890fa5f1e577.tar.gz |
Change scalar(%hash) to be the same as 0+keys(%hash)
This subject has a long history see [perl #114576] for more discussion.
https://rt.perl.org/Public/Bug/Display.html?id=114576
There are a variety of reasons we want to change the return signature of
scalar(%hash). One is that it leaks implementation details about our
associative array structure. Another is that it requires us to keep track
of the used buckets in the hash, which we use for no other purpose but
for scalar(%hash). Another is that it is just odd. Almost nothing needs to
know these values. Perhaps debugging, but we have several much better
functions for introspecting the internals of a hash.
By changing the return signature we can remove all the logic related
to maintaining and updating xhv_fill_lazy. This should make hot code
paths a little faster, and maybe save some memory for traversed hashes.
In order to provide some form of backwards compatibility we adds three
new functions to the Hash::Util namespace: bucket_ratio(), num_buckets()
and used_buckets(). These functions are actually implemented in
universal.c, and thus always available even if Hash::Util is not loaded.
This simplifies testing. At the same time Hash::Util contains backwards
compatible code so that the new functions are available from it should
they be needed in older perls.
There are many tests in t/op/hash.t that are more or less obsolete after
this patch as they test that xhv_fill_lazy is correctly set in various
situations. However since we have a backwards compat layer we can just
switch them to use bucket_ratio(%hash) instead of scalar(%hash) and keep
the tests, just in case they are actually testing something not tested
elsewhere.
Diffstat (limited to 'pod/perldata.pod')
-rw-r--r-- | pod/perldata.pod | 17 |
1 files changed, 12 insertions, 5 deletions
diff --git a/pod/perldata.pod b/pod/perldata.pod index 66bb206133..0ff6534572 100644 --- a/pod/perldata.pod +++ b/pod/perldata.pod @@ -400,17 +400,24 @@ leave nothing to doubt: $element_count = scalar(@whatever); If you evaluate a hash in scalar context, it returns false if the -hash is empty. If there are any key/value pairs, it returns true; -more precisely, the value returned is a string consisting of the +hash is empty. If there are any key/value pairs, it returns true. +A more precise definition is version dependent. + +Prior to Perl 5.25 the value returned was a string consisting of the number of used buckets and the number of allocated buckets, separated by a slash. This is pretty much useful only to find out whether Perl's internal hashing algorithm is performing poorly on your data set. For example, you stick 10,000 things in a hash, but evaluating %HASH in scalar context reveals C<"1/16">, which means only one out of sixteen buckets has been touched, and presumably contains all -10,000 of your items. This isn't supposed to happen. If a tied hash -is evaluated in scalar context, the C<SCALAR> method is called (with a -fallback to C<FIRSTKEY>). +10,000 of your items. This isn't supposed to happen. + +As of Perl 5.25 the return was changed to be the count of keys in the +hash. If you need access to the old behavior you can use +C<Hash::Util::bucket_ratio()> instead. + +If a tied hash is evaluated in scalar context, the C<SCALAR> method is +called (with a fallback to C<FIRSTKEY>). X<hash, scalar context> X<hash, bucket> X<bucket> You can preallocate space for a hash by assigning to the keys() function. |