digitalmars.D - delete and references?

Aaron Watters (8/8) Aug 07 2009 Hi guys. D looks cool. A couple of things confuse me.

Jarrett Billingsley (6/11) Aug 07 2009 If you delete something and attempt to use another reference to the

Jeremie Pelletier (4/18) Aug 07 2009 If you're interested in how this behaves in the runtime, "delete cl;" te...

Andrei Alexandrescu (8/29) Aug 08 2009 Great description.

Jarrett Billingsley (8/11) Aug 08 2009 Why? Instead of dangling pointers, you'd end up with pointers to

Andrei Alexandrescu (4/16) Aug 08 2009 Why? It's not a frequent need.

Leandro Lucarella (13/24) Aug 08 2009 You could easily get a segfault *if* the memory is big enough (> 1 page)

Leandro Lucarella (10/29) Aug 08 2009 But, well, that has nothing to do with putting that memory in the free l...

Jarrett Billingsley (9/26) Aug 08 2009 ut

Jeremie Pelletier (6/37) Aug 08 2009 I just had an idea for new semantics for delete. With proper use it coul...
Andrei Alexandrescu (6/30) Aug 08 2009 I believe it is.

Daniel Keep (5/12) Aug 08 2009 I use it quite a bit, mostly because scope doesn't work for arrays.

Michel Fortin (15/20) Aug 08 2009 I don't see how this changes anything. Instead of accessing a

Andrei Alexandrescu (7/25) Aug 08 2009 The former case has you using memory that was possibly allocated to an

Jeremie Pelletier (3/33) Aug 08 2009 I agree, scope arguments can easily be casted away and live beyond the s...

bearophile (4/5) Aug 08 2009 Are you saying this after reading Bartoz blog posts?

Jeremie Pelletier (2/9) Aug 08 2009 No I haven't, do you have a link?

bearophile (5/6) Aug 08 2009 With the power of Google this is the blog of Bartosz Milewski, you may r...

Aaron Watters <arw1961 yahoo.com> writes:

Hi guys.  D looks cool.  A couple of things confuse me.
The first is: what happens if I do a

delete cl;

after storing a reference to cl somewhere?  Can I use the stored
reference?  Is it invalid?  Please elucidate, thanks.

   -- Aaron Watters

ps: sorry if this is a repeat, but if it is it might be a good addition
  to the FAQ...

Aug 07 2009

Jarrett Billingsley <jarrett.billingsley gmail.com> writes:

On Fri, Aug 7, 2009 at 1:20 PM, Aaron Watters<arw1961 yahoo.com> wrote:
 Hi guys. =A0D looks cool. =A0A couple of things confuse me.
 The first is: what happens if I do a

 delete cl;

 after storing a reference to cl somewhere? =A0Can I use the stored
 reference? =A0Is it invalid? =A0Please elucidate, thanks.

If you delete something and attempt to use another reference to the
deleted object, the behavior's undefined.  Basically the same thing as
accessing a dangling pointer in C or C++.  'delete' is really just
there for when you *know* that there are no other references and you
want to ensure that it gets cleaned up immediately.

Aug 07 2009

Jeremie Pelletier <jeremiep gmail.com> writes:

Jarrett Billingsley Wrote:

 On Fri, Aug 7, 2009 at 1:20 PM, Aaron Watters<arw1961 yahoo.com> wrote:
 Hi guys. �D looks cool. �A couple of things confuse me.
 The first is: what happens if I do a

 delete cl;

 after storing a reference to cl somewhere? �Can I use the stored
 reference? �Is it invalid? �Please elucidate, thanks.

 
 If you delete something and attempt to use another reference to the
 deleted object, the behavior's undefined.  Basically the same thing as
 accessing a dangling pointer in C or C++.  'delete' is really just
 there for when you *know* that there are no other references and you
 want to ensure that it gets cleaned up immediately.

If you're interested in how this behaves in the runtime, "delete cl;" tells the
memory manager to reclaim the memory at 'cl'. It does so by putting its block
of data back onto its internal freelist. At that point the first 4 bytes are
used in a linked list. If you still have references to that memory and then
modify it, you break the memory manager internal state, which is gonna be hard
to track back. After the memory has been reallocated, your old reference and
the new one are now fighting for the same memory block. In either cases, your
old reference will cause trouble that will be very hard to isolate.

A good rule of thumb in D is that if you're unsure whether references are still
alive, don't delete the object, the GC will automatically reclaim the memory
once it detects there are no more references to it. It is also faster to go
that way, lots of delete calls are more expensive than one GC sweep, and the
sweep is going to happen sooner or later anyways.

However, delete is most useful when you need large blocks of data for short
periods of time.

Aug 07 2009

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> writes:

Jeremie Pelletier wrote:
 Jarrett Billingsley Wrote:
 
 On Fri, Aug 7, 2009 at 1:20 PM, Aaron Watters<arw1961 yahoo.com> wrote:
 Hi guys.  D looks cool.  A couple of things confuse me.
 The first is: what happens if I do a

 delete cl;

 after storing a reference to cl somewhere?  Can I use the stored
 reference?  Is it invalid?  Please elucidate, thanks.

 If you delete something and attempt to use another reference to the
 deleted object, the behavior's undefined.  Basically the same thing as
 accessing a dangling pointer in C or C++.  'delete' is really just
 there for when you *know* that there are no other references and you
 want to ensure that it gets cleaned up immediately.

 
 If you're interested in how this behaves in the runtime, "delete cl;" tells
the memory manager to reclaim the memory at 'cl'. It does so by putting its
block of data back onto its internal freelist. At that point the first 4 bytes
are used in a linked list. If you still have references to that memory and then
modify it, you break the memory manager internal state, which is gonna be hard
to track back. After the memory has been reallocated, your old reference and
the new one are now fighting for the same memory block. In either cases, your
old reference will cause trouble that will be very hard to isolate.
 
 A good rule of thumb in D is that if you're unsure whether references are
still alive, don't delete the object, the GC will automatically reclaim the
memory once it detects there are no more references to it. It is also faster to
go that way, lots of delete calls are more expensive than one GC sweep, and the
sweep is going to happen sooner or later anyways.
 
 However, delete is most useful when you need large blocks of data for short
periods of time.

Great description.

FWIW, I am trying to convince Walter to not reclaim memory in delete, 
but instead only call destructors. D continues C++'s mistake of 
conflating lifetime termination with memory reclamation.

As there will always be a need for reclaiming memory, malloc can always 
supplant that need.


Andrei

Aug 08 2009

Jarrett Billingsley <jarrett.billingsley gmail.com> writes:

On Sat, Aug 8, 2009 at 9:17 AM, Andrei
Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
 FWIW, I am trying to convince Walter to not reclaim memory in delete, but
 instead only call destructors. D continues C++'s mistake of conflating
 lifetime termination with memory reclamation.

Why?  Instead of dangling pointers, you'd end up with pointers to
finalized objects, which would probably lead to harder-to-catch bugs
(since then you wouldn't even get a segfault).

And what if you used it to try to delete a temporary array?  Yes, you
could use malloc/free for the temp array, but.. it just seems silly to
have to dip into C functions to do that.

Aug 08 2009

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> writes:

Jarrett Billingsley wrote:
 On Sat, Aug 8, 2009 at 9:17 AM, Andrei
 Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
 FWIW, I am trying to convince Walter to not reclaim memory in delete, but
 instead only call destructors. D continues C++'s mistake of conflating
 lifetime termination with memory reclamation.

 
 Why?  Instead of dangling pointers, you'd end up with pointers to
 finalized objects, which would probably lead to harder-to-catch bugs
 (since then you wouldn't even get a segfault).

Getting a segfault == lucky

 And what if you used it to try to delete a temporary array?  Yes, you
 could use malloc/free for the temp array, but.. it just seems silly to
 have to dip into C functions to do that.

Why? It's not a frequent need.


Andrei

Aug 08 2009

Leandro Lucarella <llucax gmail.com> writes:

Andrei Alexandrescu, el  8 de agosto a las 08:42 me escribiste:
 Jarrett Billingsley wrote:
On Sat, Aug 8, 2009 at 9:17 AM, Andrei
Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
FWIW, I am trying to convince Walter to not reclaim memory in delete, but
instead only call destructors. D continues C++'s mistake of conflating
lifetime termination with memory reclamation.

Why?  Instead of dangling pointers, you'd end up with pointers to
finalized objects, which would probably lead to harder-to-catch bugs
(since then you wouldn't even get a segfault).

 
 Getting a segfault == lucky

You could easily get a segfault *if* the memory is big enough (> 1 page)
*and* if there is some support from the OS. Remove the RW permissions from
the page and when somebody wants to access it or write it... BOOM! You'll
get your segfault.

So it's not only lucky, so can choose your own destiny ;)

-- 
Leandro Lucarella (luca) | Blog colectivo: http://www.mazziblog.com.ar/blog/
----------------------------------------------------------------------------
GPG Key: 5F5A8D05 (F8CD F9A7 BF00 5431 4145  104C 949E BFB6 5F5A 8D05)
----------------------------------------------------------------------------
CAMPAÑA POR LA PAZ: APLASTARON JUGUETES BÉLICOS
	-- Crónica TV

Aug 08 2009

Leandro Lucarella <llucax gmail.com> writes:

Leandro Lucarella, el  8 de agosto a las 11:22 me escribiste:
 Andrei Alexandrescu, el  8 de agosto a las 08:42 me escribiste:
 Jarrett Billingsley wrote:
On Sat, Aug 8, 2009 at 9:17 AM, Andrei
Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
FWIW, I am trying to convince Walter to not reclaim memory in delete, but
instead only call destructors. D continues C++'s mistake of conflating
lifetime termination with memory reclamation.

Why?  Instead of dangling pointers, you'd end up with pointers to
finalized objects, which would probably lead to harder-to-catch bugs
(since then you wouldn't even get a segfault).

 
 Getting a segfault == lucky

 
 You could easily get a segfault *if* the memory is big enough (> 1 page)
 *and* if there is some support from the OS. Remove the RW permissions from
 the page and when somebody wants to access it or write it... BOOM! You'll
 get your segfault.
 
 So it's not only lucky, so can choose your own destiny ;)

But, well, that has nothing to do with putting that memory in the free list
or not, it can be implemented for both approaches...

-- 
Leandro Lucarella (luca) | Blog colectivo: http://www.mazziblog.com.ar/blog/
----------------------------------------------------------------------------
GPG Key: 5F5A8D05 (F8CD F9A7 BF00 5431 4145  104C 949E BFB6 5F5A 8D05)
----------------------------------------------------------------------------
JUGAR COMPULSIVAMENTE ES PERJUDICIAL PARA LA SALUD.
	-- Casino de Mar del Plata

Aug 08 2009

Jarrett Billingsley <jarrett.billingsley gmail.com> writes:

On Sat, Aug 8, 2009 at 9:42 AM, Andrei
Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
 Jarrett Billingsley wrote:
 On Sat, Aug 8, 2009 at 9:17 AM, Andrei
 Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
 FWIW, I am trying to convince Walter to not reclaim memory in delete, b=



ut
 instead only call destructors. D continues C++'s mistake of conflating
 lifetime termination with memory reclamation.

 Why? =A0Instead of dangling pointers, you'd end up with pointers to
 finalized objects, which would probably lead to harder-to-catch bugs
 (since then you wouldn't even get a segfault).

 Getting a segfault =3D=3D lucky

And the alternative - using a "dead" object - is any better?

 And what if you used it to try to delete a temporary array? =A0Yes, you
 could use malloc/free for the temp array, but.. it just seems silly to
 have to dip into C functions to do that.

 Why? It's not a frequent need.

That's enough of a justification to change delete's behavior?

I just don't know why it should be changed.  It's a blunt tool for
blunt uses.  When you delete something, you're telling the GC "I don't
need this anymore, *trust me*", so *not* freeing memory associated
with it seems silly.

Aug 08 2009

Jeremie Pelletier <jeremiep gmail.com> writes:

Jarrett Billingsley Wrote:

 On Sat, Aug 8, 2009 at 9:42 AM, Andrei
 Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
 Jarrett Billingsley wrote:
 On Sat, Aug 8, 2009 at 9:17 AM, Andrei
 Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
 FWIW, I am trying to convince Walter to not reclaim memory in delete, but
 instead only call destructors. D continues C++'s mistake of conflating
 lifetime termination with memory reclamation.

 Why? �Instead of dangling pointers, you'd end up with pointers to
 finalized objects, which would probably lead to harder-to-catch bugs
 (since then you wouldn't even get a segfault).

 Getting a segfault == lucky

 
 And the alternative - using a "dead" object - is any better?
 
 And what if you used it to try to delete a temporary array? �Yes, you
 could use malloc/free for the temp array, but.. it just seems silly to
 have to dip into C functions to do that.

 Why? It's not a frequent need.

 
 That's enough of a justification to change delete's behavior?
 
 I just don't know why it should be changed.  It's a blunt tool for
 blunt uses.  When you delete something, you're telling the GC "I don't
 need this anymore, *trust me*", so *not* freeing memory associated
 with it seems silly.

I just had an idea for new semantics for delete. With proper use it could speed
up the GC a lot, and with misuse it could make it worse.

When deleting memory, that block goes on a special linked list with an internal
entry wrapper as to not alter the memory. It is not finalized, nor is its GC
state changed (has pointers, no scan, etc). Once the GC is asked to do a sweep,
that linked list is checked first for unused memory, and free blocks are put
back on the freelist. If enough can be reclaimed there for the allocation
request which triggered the sweep, the entire set of allocations doesn't have
to be scanned.

Well thats the idea in a nutshell, it popped in my mind as I read Andrei's
comment about delete doing only a finalize. Since we don't have a moving GC
with multiple heaps (short lived heap moving up to long lived heap, and only
the short lived one is scanned often), we could optimize how the sweep is being
done.

That would however require new ABI routines to use with do-it-now deletes, such
as scoped allocations as their usage may rely on the fact that the finalizer is
executed when the scope exits.

Please give me some input on that idea.

Aug 08 2009

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> writes:

Jarrett Billingsley wrote:
 On Sat, Aug 8, 2009 at 9:42 AM, Andrei
 Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
 Jarrett Billingsley wrote:
 On Sat, Aug 8, 2009 at 9:17 AM, Andrei
 Alexandrescu<SeeWebsiteForEmail erdani.org> wrote:
 FWIW, I am trying to convince Walter to not reclaim memory in delete, but
 instead only call destructors. D continues C++'s mistake of conflating
 lifetime termination with memory reclamation.

 Why?  Instead of dangling pointers, you'd end up with pointers to
 finalized objects, which would probably lead to harder-to-catch bugs
 (since then you wouldn't even get a segfault).

 Getting a segfault == lucky

 
 And the alternative - using a "dead" object - is any better?

Yes. I've internalized this so deeply, I won't even debate it.

 And what if you used it to try to delete a temporary array?  Yes, you
 could use malloc/free for the temp array, but.. it just seems silly to
 have to dip into C functions to do that.

 Why? It's not a frequent need.

 
 That's enough of a justification to change delete's behavior?

I believe it is.

 I just don't know why it should be changed.  It's a blunt tool for
 blunt uses.  When you delete something, you're telling the GC "I don't
 need this anymore, *trust me*", so *not* freeing memory associated
 with it seems silly.

We do have blunt tools for blunt uses: malloc and free. We don't need 
more of those.


Andrei

Aug 08 2009

Daniel Keep <daniel.keep.lists gmail.com> writes:

Andrei Alexandrescu wrote:
 ...
 
 And what if you used it to try to delete a temporary array?  Yes, you
 could use malloc/free for the temp array, but.. it just seems silly to
 have to dip into C functions to do that.

 
 Why? It's not a frequent need.

I use it quite a bit, mostly because scope doesn't work for arrays.

Also, how would you use malloc on class instances?

Also also, I think D's current memory leakage problems should be fixed
BEFORE you start taking away our ability to compensate for the GC.

Aug 08 2009

Michel Fortin <michel.fortin michelf.com> writes:

On 2009-08-08 09:17:28 -0400, Andrei Alexandrescu 
<SeeWebsiteForEmail erdani.org> said:

 Great description.
 
 FWIW, I am trying to convince Walter to not reclaim memory in delete, 
 but instead only call destructors. D continues C++'s mistake of 
 conflating lifetime termination with memory reclamation.

I don't see how this changes anything. Instead of accessing a 
deallocated object, you'll access a finaized but not yet deallocated 
object. In both cases, it's a bug.

Wouldn't it be better to have a system to track unique pointers? If you 
knew that no other pointer points to a given object or memory block, 
you can finalize and deallocate it safely. In fact, the current 
semantics of a scope object assume that the programmer will not leave 
any dangling pointers at the end of the scope, so it's already assuming 
uniqueness, just not enforcing it.

-- 
Michel Fortin
michel.fortin michelf.com
http://michelf.com/

Aug 08 2009

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> writes:

Michel Fortin wrote:
 On 2009-08-08 09:17:28 -0400, Andrei Alexandrescu 
 <SeeWebsiteForEmail erdani.org> said:
 
 Great description.

 FWIW, I am trying to convince Walter to not reclaim memory in delete, 
 but instead only call destructors. D continues C++'s mistake of 
 conflating lifetime termination with memory reclamation.

 
 I don't see how this changes anything. Instead of accessing a 
 deallocated object, you'll access a finaized but not yet deallocated 
 object. In both cases, it's a bug.

The former case has you using memory that was possibly allocated to an 
object of different type. This makes for intractable errors.

 Wouldn't it be better to have a system to track unique pointers? If you 
 knew that no other pointer points to a given object or memory block, you 
 can finalize and deallocate it safely. In fact, the current semantics of 
 a scope object assume that the programmer will not leave any dangling 
 pointers at the end of the scope, so it's already assuming uniqueness, 
 just not enforcing it.

Unique pops in an out as a topic for future directions. It is quite 
difficult to make it at the same time correct, inconspicuous, and 
useful. We have firmly decided to do away with unique for D2.


Andrei

Aug 08 2009

Jeremie Pelletier <jeremiep gmail.com> writes:

Andrei Alexandrescu Wrote:

 Michel Fortin wrote:
 On 2009-08-08 09:17:28 -0400, Andrei Alexandrescu 
 <SeeWebsiteForEmail erdani.org> said:
 
 Great description.

 FWIW, I am trying to convince Walter to not reclaim memory in delete, 
 but instead only call destructors. D continues C++'s mistake of 
 conflating lifetime termination with memory reclamation.

 
 I don't see how this changes anything. Instead of accessing a 
 deallocated object, you'll access a finaized but not yet deallocated 
 object. In both cases, it's a bug.

 
 The former case has you using memory that was possibly allocated to an 
 object of different type. This makes for intractable errors.
 
 Wouldn't it be better to have a system to track unique pointers? If you 
 knew that no other pointer points to a given object or memory block, you 
 can finalize and deallocate it safely. In fact, the current semantics of 
 a scope object assume that the programmer will not leave any dangling 
 pointers at the end of the scope, so it's already assuming uniqueness, 
 just not enforcing it.

 
 Unique pops in an out as a topic for future directions. It is quite 
 difficult to make it at the same time correct, inconspicuous, and 
 useful. We have firmly decided to do away with unique for D2.
 
 
 Andrei

I agree, scope arguments can easily be casted away and live beyond the scope.
It's impossible to know if a pointer is unique without scanning the entire heap
and pausing all threads to scan their stacks. And adding reference counts to
allocations would break the need to have a GC in the first place.

The best thing to do is to not use delete when uncertain about the uniqueness
of a pointer.

Aug 08 2009

bearophile <bearophileHUGS lycos.com> writes:

Jeremie Pelletier:
It's impossible to know if a pointer is unique without scanning the entire heap
and pausing all threads to scan their stacks.<

Are you saying this after reading Bartoz blog posts?

Bye,
bearophile

Aug 08 2009

Jeremie Pelletier <jeremiep gmail.com> writes:

bearophile Wrote:

 Jeremie Pelletier:
It's impossible to know if a pointer is unique without scanning the entire heap
and pausing all threads to scan their stacks.<

 
 Are you saying this after reading Bartoz blog posts?
 
 Bye,
 bearophile

No I haven't, do you have a link?

Aug 08 2009

bearophile <bearophileHUGS lycos.com> writes:

Jeremie Pelletier:
No I haven't, do you have a link?<

With the power of Google this is the blog of Bartosz Milewski, you may read
about the last ten posts or so:
http://bartoszmilewski.wordpress.com/

Bye,
bearophile

Aug 08 2009

D Programming

C/C++ Programming

Other

digitalmars.D - delete and references?