digitalmars.D - Issue 3789, stucts equality

bearophile (33/33) Mar 26 2012 Issue 3789 is an enhancement request, I think it fixes one small but qui...

Alvaro (4/15) Mar 26 2012 Maybe it makes more sense that struct==struct applies == to each of its

Nathan M. Swan (2/6) Mar 27 2012 +1
Steven Schveighoffer (5/8) Mar 27 2012 That's exactly what bug 3789 advocates for. Please vote up!

Andrej Mitrovic (8/10) Mar 29 2012 Unrelated, but I don't understand bugzilla's stupid 10-vote

Steven Schveighoffer (14/21) Mar 29 2012 The reasoning behind it was that we didn't want everyone voting for ever...

Andrej Mitrovic (28/31) Mar 29 2012 Well that took 2 seconds.
Andrej Mitrovic (18/19) Mar 29 2012 Correction, makes things easier. No need to pass the type of 'this':
Andrej Mitrovic (28/29) Mar 29 2012 Crap, lockstep doesn't work on different types, it was iterating over
Artur Skawina (21/58) Mar 29 2012 // D does make things easy:
Alvaro (9/18) Mar 29 2012 BTW, today I encountered a problem that is probably related to this bug.

bearophile (90/102) Mar 29 2012 Thank you for your example.

bearophile <bearophileHUGS lycos.com> writes:

Issue 3789 is an enhancement request, I think it fixes one small but quite
important problem in D design. The situation is shown by this simple code:


struct String {
    char[] data;
}
void main () {
    auto foo = String("foo".dup);
    auto bar = String("foo".dup);
    assert(bar !is foo, "structs aren't the same bit-wise");
    assert(bar == foo, "oops structs aren't equal");
}



The D Zen says D is designed to be safe on default and to perform unsafe (and
faster) things on request. Not comparing the strings as strings in the
following code breaks the Principle of least astonishment, so breaks that rule.

An acceptable alternative to fixing Bug 3789 is statically disallowing the
equal operator (==) in such cases (or even in all cases).

D has the "is" for the situations where you want to perform bitwise comparison,
for structs too. For the other situations where I use "==" among struts, I want
it do the right thing, like comparing its contained strings correctly instead
of arbitrarily deciding to use bitwise comparison of the sub-struct that
represents the string.

There is already a patch for this, from the extra-good Kenji Hara:
https://github.com/D-Programming-Language/dmd/pull/387

Making "==" work as "is" for structs means using an operator for the purpose of
the other operator, and it has caused some bugs in my code. And it will cause
bugs in D code to come.


Another example, reduced/modified from a real bug in a program of mine:


import std.stdio;
struct Foo {
    int x;
    string s;
}
void main () {
    int[Foo] aa;
    aa[Foo(10, "hello")] = 1;
    string hel = "hel";
    aa[Foo(10, hel ~ "lo")] = 2;
    writeln(aa);
}


Here D defines a hashing for the Foo struct, but it uses the standard "==" to
compare the struct keys. So the output is this, that I believe is what almost
no one will ever want:

[Foo(10, "hello"):1, Foo(10, "hello"):2]

Bye,
bearophile

Mar 26 2012

Alvaro <alvaroDotSegura gmail.com> writes:

El 26/03/2012 13:58, bearophile escribió:
 Issue 3789 is an enhancement request, I think it fixes one small but quite
important problem in D design. The situation is shown by this simple code:


 struct String {
      char[] data;
 }
 void main () {
      auto foo = String("foo".dup);
      auto bar = String("foo".dup);
      assert(bar !is foo, "structs aren't the same bit-wise");
      assert(bar == foo, "oops structs aren't equal");
 }



 The D Zen says D is designed to be safe on default and to perform unsafe (and
faster) things on request. Not comparing the strings as strings in the
following code breaks the Principle of least astonishment, so breaks that rule.


Maybe it makes more sense that struct==struct applies == to each of its 
fields. It would be the same as bitwise comparison for simple primitive 
types, but would be more useful with other types such as strings.

Mar 26 2012

"Nathan M. Swan" <nathanmswan gmail.com> writes:

On Monday, 26 March 2012 at 21:25:06 UTC, Alvaro wrote:
 Maybe it makes more sense that struct==struct applies == to 
 each of its fields. It would be the same as bitwise comparison 
 for simple primitive types, but would be more useful with other 
 types such as strings.

+1

Mar 27 2012

"Steven Schveighoffer" <schveiguy yahoo.com> writes:

On Mon, 26 Mar 2012 17:25:07 -0400, Alvaro <alvaroDotSegura gmail.com>
wrote:

 Maybe it makes more sense that struct==struct applies == to each of its  
 fields. It would be the same as bitwise comparison for simple primitive  
 types, but would be more useful with other types such as strings.

That's exactly what bug 3789 advocates for.  Please vote up!

http://d.puremagic.com/issues/show_bug.cgi?id=3789

-Steve

Mar 27 2012

Andrej Mitrovic <andrej.mitrovich gmail.com> writes:

On 3/27/12, Steven Schveighoffer <schveiguy yahoo.com> wrote:
 That's exactly what bug 3789 advocates for.  Please vote up!

 http://d.puremagic.com/issues/show_bug.cgi?id=3789

Unrelated, but I don't understand bugzilla's stupid 10-vote
limitation. As if I can only care about 10 bugs. I agree with 3789, ==
should do the safe thing by default.

Speaking of which, does anyone have some sort of template to
autogenerate an opEquals that compares all fields of a struct via
'=='? I could write it (maybe via traits & getAllMembers) but surely
someone has done it before.

Mar 29 2012

"Steven Schveighoffer" <schveiguy yahoo.com> writes:

On Thu, 29 Mar 2012 07:49:28 -0400, Andrej Mitrovic  
<andrej.mitrovich gmail.com> wrote:

 On 3/27/12, Steven Schveighoffer <schveiguy yahoo.com> wrote:
 That's exactly what bug 3789 advocates for.  Please vote up!

 http://d.puremagic.com/issues/show_bug.cgi?id=3789

 Unrelated, but I don't understand bugzilla's stupid 10-vote
 limitation. As if I can only care about 10 bugs. I agree with 3789, ==
 should do the safe thing by default.

The reasoning behind it was that we didn't want everyone voting for every  
bug.  10 votes may be too restrictive, but some restriction should be in  
place IMO.  The 10 votes was an arbitrary choice.  When you have unlimited  
votes, you are not as responsible with them.

I personally have never run out of votes, because every time I go to vote  
for a bug, I have some voted-for bugs which have been fixed.

The voting system was put in place a few years ago, to see how it might  
help focus attention on more severe bugs.  I'm not sure it had the  
dramatic effect we were hoping, but some bugs did get more attention  
because they had more votes.  It at least gives you a good compelling  
argument if you want to argue that a high-vote bug should be fixed :)

-Steve

Mar 29 2012

Andrej Mitrovic <andrej.mitrovich gmail.com> writes:

On 3/29/12, Andrej Mitrovic <andrej.mitrovich gmail.com> wrote:
 Speaking of which, does anyone have some sort of template to
 autogenerate an opEquals that compares all fields of a struct via
 '=='?

Well that took 2 seconds.

import std.range;

template safeOpEquals(T)
{
    bool opEquals(T t)
    {
        foreach (lhsField, rhsField; lockstep(this.tupleof, t.tupleof))
        {
            if (lhsField != rhsField)
                return false;
        }

        return true;
    }
}

struct Foo
{
    int[] arr;
    mixin safeOpEquals!Foo;
}

void main()
{
    Foo a, b;
    a.arr = [1, 2, 3].dup;
    b.arr = [1, 2, 3].dup;
    assert(a == b);
}

D makes things so damn easy.

Mar 29 2012

Andrej Mitrovic <andrej.mitrovich gmail.com> writes:

On 3/29/12, Andrej Mitrovic <andrej.mitrovich gmail.com> wrote:
 D makes things so damn easy.

Correction, makes things easier. No need to pass the type of 'this':

template safeOpEquals()
{
    bool opEquals(typeof(this) t)
    {
        foreach (lhsField, rhsField; lockstep(this.tupleof, t.tupleof)) {
            if (lhsField != rhsField)
                return false;
        }
        return true;
    }
}
struct Foo
{
    int[] arr;
    mixin safeOpEquals;
}

Mar 29 2012

Andrej Mitrovic <andrej.mitrovich gmail.com> writes:

 3/29/12, Andrej Mitrovic <andrej.mitrovich gmail.com> wrote:
 snip

Crap, lockstep doesn't work on different types, it was iterating over
the arrays. That went over my head. A slightly complicated version
that actually works:

import std.stdio;
import std.conv;

string tupleCompare(int len)()
{
    string result;
    foreach (i; 0 .. len)
    {
        result ~= "
            if (this.tupleof[" ~ to!string(i) ~ "] != t.tupleof[" ~
to!string(i) ~ "])
                return false;
        ";
    }
    return result;
}

template safeOpEquals()
{
    bool opEquals(typeof(this) t)
    {
        enum len = typeof(this).tupleof.length;
        mixin(tupleCompare!len);
        return true;
    }
}

Mar 29 2012

Artur Skawina <art.08.09 gmail.com> writes:

On 03/29/12 14:07, Andrej Mitrovic wrote:
  3/29/12, Andrej Mitrovic <andrej.mitrovich gmail.com> wrote:
 snip

 
 Crap, lockstep doesn't work on different types, it was iterating over
 the arrays. That went over my head. A slightly complicated version
 that actually works:
 
 import std.stdio;
 import std.conv;
 
 string tupleCompare(int len)()
 {
     string result;
     foreach (i; 0 .. len)
     {
         result ~= "
             if (this.tupleof[" ~ to!string(i) ~ "] != t.tupleof[" ~
 to!string(i) ~ "])
                 return false;
         ";
     }
     return result;
 }
 
 template safeOpEquals()
 {
     bool opEquals(typeof(this) t)
     {
         enum len = typeof(this).tupleof.length;
         mixin(tupleCompare!len);
         return true;
     }
 }

// D does make things easy:
template safeOpEquals()
{
   bool opEquals(T)(T t) {
      foreach (i, v; this.tupleof )
         if (v != t.tupleof[i])
            return 0;
      return 1;
   }
}

struct Foo
{
    int[] arr;
    string s;
    mixin safeOpEquals;
}


You may also want to include something like

   static if (!is(typeof(this)==T))
      return 0;
   else
      foreach etc...

to make it return false, instead of failing at compiletime when
comparing different types.

artur

Mar 29 2012

Alvaro <alvaroDotSegura gmail.com> writes:

El 27/03/2012 22:20, Steven Schveighoffer escribió:
 On Mon, 26 Mar 2012 17:25:07 -0400, Alvaro <alvaroDotSegura gmail.com>
 wrote:

 Maybe it makes more sense that struct==struct applies == to each of
 its fields. It would be the same as bitwise comparison for simple
 primitive types, but would be more useful with other types such as
 strings.

 That's exactly what bug 3789 advocates for. Please vote up!

 http://d.puremagic.com/issues/show_bug.cgi?id=3789

 -Steve

BTW, today I encountered a problem that is probably related to this bug.

if you have a map with BigInt as key, it duplicates keys:

int[BigInt] a;
a[BigInt(3)] = 1;
a[BigInt(3)] = 2;
writeln(a);

prints

[3:2, 3:1]

Mar 29 2012

bearophile <bearophileHUGS lycos.com> writes:

Alvaro:

 BTW, today I encountered a problem that is probably related to this bug.
 
 if you have a map with BigInt as key, it duplicates keys:
 
 int[BigInt] a;
 a[BigInt(3)] = 1;
 a[BigInt(3)] = 2;
 writeln(a);
 
 prints
 
 [3:2, 3:1]

Thank you for your example.

BigInt usage has several other problems, this is only a partial sample of them:

-----------------------------

import std.bigint;
void main() {
    BigInt[10] a;
    a[0] = 1; // OK
    a[] = 1; // Error
}


test.d(5): Error: cannot implicitly convert expression (1) of type int to
BigInt[]

-----------------------------

import std.bigint;
void main() {
    BigInt b;
    switch (b) {
        case BigInt(0): break;
        default: break;
    }
}


test2.d(4): Error: 'b' is not of integral type, it is a BigInt
test2.d(5): Error: case must be a string or an integral constant, not
BigInt(BigUint([0u]),false)

-----------------------------

import std.bigint;
struct Count(T) {
    T n;
    this(T n_) { this.n = n_; }
    const bool empty = false;
     property T front() { return n; }
    void popFront() { n++; } // line 7
}
void main() {
    auto co = Count!BigInt(BigInt(0));
    foreach (b; co) {}
}


test2.d(7): Error: var has no effect in expression (__pitmp877)
test2.d(10): Error: template instance test.Count!(BigInt) error instantiating

-----------------------------

You can't compile this program:


import std.stdio, std.bigint;

BigInt positiveIntegerPow(in BigInt inBase, in BigInt inExp) pure nothrow
in {
    assert(inBase >= 0 && inExp >= 0);
} out(result) {
    assert(result >= 0);
} body {
    BigInt base = inBase;
    BigInt exp = inExp;
    auto result = BigInt(1);
    while (exp) {
        if (exp & 1)
            result *= base;
        exp >>= 1;
        base *= base;
    }

    return result;
}

void main() {
    writeln(positiveIntegerPow(BigInt(5), BigInt(6)));
}


You have to change it to:


import std.stdio, std.bigint;

BigInt positiveIntegerPow(in BigInt inBase, in BigInt inExp)
in {
    assert(cast()inBase >= 0 && cast()inExp >= 0);
} out(result) {
    assert(cast()result >= 0);
} body {
    BigInt base = cast()inBase;
    BigInt exp = cast()inExp;
    auto result = BigInt(1);
    while (exp != 0) {
        if (exp % 2)
            result *= base;
        exp >>= 1;
        base *= base;
    }

    return result;
}

void main() {
    writeln(positiveIntegerPow(BigInt(5), BigInt(6)));
}


The problems in that little program are:
- You can't assign a const BigInt to a nonconst one.
- BigInt operations aren't pure or nothrow
- while(exp) is not supported
- if if(exp & 1) is not supported
- assert(result>=0); where result is const is not supported.

Bye,
bearophile

Mar 29 2012

D Programming

C/C++ Programming

Other

digitalmars.D - Issue 3789, stucts equality