Why does Perl use the empty string to represent the boolean false value?
Asked Answered
P

5

18

When evaluating an expression in a scalar (boolean) context, Perl uses the explicit value 1 as a result if the expression evaluates to true and the empty string if the expression evaluates to false. I'm curious why Perl uses the empty string to represent boolean false value and not 0 which seems more intuitive.

Note that I'm not concerned with Perl treating the empty string as a false in scalar (boolean) context.

EDIT

How would using string which is true ("false" for instance) as a string representation of false values change the meaning of existing code? Could we say that code that changes semantics after such a change is less robust/correct than it could have been? I guess string context is so pervasive in Perl that the only option leading to sane semantics is if boolean value preserve its value after round tripping to and from a string...

Phasis answered 12/10, 2010 at 11:27 Comment(1)
Possible duplicate of Why does !1 give me nothing in Perl?Bernardinabernardine
R
32

The various logical operators don't return an empty string, they return a false or true value in all three simple scalar types. It just looks like it returns an empty string because print forces a string context on its arguments:

#!/usr/bin/perl

use strict;
use warnings;

use Devel::Peek;

my $t = 5 > 4;
my $f = 5 < 4;

Dump $t;
Dump $f;

Output:

SV = PVNV(0x100802c20) at 0x100827348
  REFCNT = 1
  FLAGS = (PADMY,IOK,NOK,POK,pIOK,pNOK,pPOK)
  IV = 1
  NV = 1
  PV = 0x100201e60 "1"\0
  CUR = 1
  LEN = 16
SV = PVNV(0x100802c40) at 0x100827360
  REFCNT = 1
  FLAGS = (PADMY,IOK,NOK,POK,pIOK,pNOK,pPOK)
  IV = 0
  NV = 0
  PV = 0x100208ca0 ""\0
  CUR = 0
  LEN = 16

For those not familiar with the Perl 5 internals, a PVNV is a scalar structure that holds all three simple scalar types (integer IV, double precision float NV, and string PV). The flags IOK, NOK, and POK mean that the integer, double, and string values are all in sync (for some definition of in sync) so any one of them may be used (i.e. no conversions need to take place if you use it as an integer, double, or string).

I assume the empty string was chosen for the false string because it is smaller and is more in keeping with the idea of a false string than "0". Ignore my statement about it being smaller, both "" and "1" are the same size: sixteen characters. It says so right in the dump. Perl 5 adds extra space to strings to allow them to grow quickly.

Oh, and I hate you. In researching this I have found that I have lied in perlopquick and will now have to find a way to fix it. If only you had been like all of the other sheep and just accepted Perl 5's surface weirdness as fact, I would have less work to do.

Answers to the questions in the EDIT section:

How would using string which is true ("false" for instance) as a string representation of false values change the meaning of existing code?

The only special things about about PL_sv_yes and PL_sv_no (the canonically true and false values returned by comparison operators) are that they are read only and are created by perl not the program that is running. If you change them, it does not change the truthiness test, so a PL_sv_no that is set to "false" will be treated as true. You can even do this yourself (this code stops working at some point between Perl 5.18 and the latest Perl) using undocumented features of perl:

#!/usr/bin/perl

use strict;
use warnings;
use Scalar::Util qw/dualvar/;

BEGIN {
        # use the undocumented SvREADONLY function from Internals to
        # modify a reference to PL_sv_no's readonly flag
        # note the use of & to make the compiler not use SvREADONLY's
        # prototype, yet another reason prototypes are bad and shouldn't
        # be used
        &Internals::SvREADONLY(\!!0, 0);

        # set PL_sv_no to a dualvar containing 0 and "false"
        ${\!!0} = dualvar 0, "false";
}

if (5 < 4) {
        print "oops\n";
}

outputs

opps

This is because the truthiness test looks at strings first.

Could we say that code that changes semantics after such a change is less robust/correct than it could have been?

It will be straight up broken. Even if you restrict yourself to setting it to an int 0 or a string "0" (both of which are false), it will break some valid code.

I guess string context is so pervasive in Perl that the only option leading to sane semantics is if boolean value preserve its value after round tripping to and from a string...

Yes.

Redo answered 12/10, 2010 at 12:9 Comment(5)
I hate/love both of you -- this question caused me to send a docpatch to p5p just now: perlguts mistakenly refers to PL_sv_no (the "false" scalar in question) as PL_sv_false. :)Immune
@Piotr Dobrogost See what you have done! See what comes from asking questions? You are making things in Perl 5 better! How can you live with yourself?Redo
@Immune Could you care to elaborate?Phasis
@Piotr Dobrogost The perldoc perlguts document incorrectly stated that there was a PL_sv_false C function that returned the same value as 1 < 0. The function is really named PL_sv_no. Read the p5p post for more information.Redo
What do you think about questions I added in EDIT section?Phasis
M
5

You can overload the stringification of true, false and undef, like this:

&Internals::SvREADONLY( \ !!1, 0);    # make !!1 writable
${ \ !!1 } = 'true';                  # change the string value of true
&Internals::SvREADONLY( \ !!1, 1);    # make !!1 readonly again
print 42 == (6*7);                    # prints 'true'

&Internals::SvREADONLY( \ !!0, 0);    # make !!0 writable
${ \ !!0 } = 'false';                 # change the string value of false
&Internals::SvREADONLY( \ !!0, 1);    # make !!0 readonly again
print 42 == (6*6);                    # prints 'false'
Mercuri answered 12/10, 2010 at 16:58 Comment(3)
Very nice customization. I had no idea Perl allows for such a thing.Phasis
The Internals package and functions or variables in it are not for public consumption (hence the name). Globally changing the value of the true and false values returned by operators is inadvisable in the extreme. Circumventing SvREADONLY's prototype is just icing on the cake. That said, nifty.Redo
Sadly, Perl 5.22 seems to have fixed this.Redo
O
3

It's not just "" that's false in Perl. As for why... it's either because Perl is awesome or terrible -- depending on your personal preferences :)

Offing answered 12/10, 2010 at 11:31 Comment(5)
Perl is the only language which is equally readable before and after RSA encryption.Percolator
@Percolator No, APL is. Perl is slightly more readable in unencrypted form.Redo
@Chas - I think you meant to say "in encrypted form".Dardanus
@Dardanus Well played sir, well played.Redo
This is not an answer to the question. The question was not about what's true or false in Perl.Phasis
S
3

Both number 0 and empty string ultimately evaluate as false in Perl. I think this is a matter of language design. When writing your own code, you can of course assume any which one false encoding convention.

For further details, check out "How do I use boolean variables in Perl?".

Scat answered 12/10, 2010 at 11:33 Comment(0)
K
2

Here is how I got around the problem:

my $res = ($a eq $b) *1;

The *1 converts the boolean resulting from ($a eq $b) into a scalar.

Kammerer answered 3/12, 2014 at 16:45 Comment(1)
Actually previously it was also a scalar value. Now it's additionally a numeric (IV) value. But using *1 is quite clever...Incision

© 2022 - 2024 — McMap. All rights reserved.