digitalmars.D - Re: Ranges and/versus iterators

Steven Schveighoffer <schveiguy yahoo.com> Mar 23 2010

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> Mar 23 2010
Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> Mar 23 2010

Fawzi Mohamed <fawzi gmx.ch> Mar 24 2010

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> Mar 24 2010

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> Mar 25 2010

Fawzi Mohamed <fawzi gmx.ch> Mar 24 2010
Fawzi Mohamed <fawzi gmx.ch> Mar 24 2010
Fawzi Mohamed <fawzi gmx.ch> Mar 24 2010
Fawzi Mohamed <fawzi gmx.ch> Mar 24 2010
Fawzi Mohamed <fawzi gmx.ch> Mar 25 2010

Steven Schveighoffer <schveiguy yahoo.com> writes:

Andrei Alexandrescu Wrote:

 On 03/23/2010 03:46 PM, Steven Schveighoffer wrote:
 A while back, you identified one of the best interfaces for input ranges:

 E* getNext();

 Which allows for null returns when no data is left. The drawback is that
 E must be either referenced or allocated on the heap (providing storage
 to the function is an option). But the killer issue was that safeD would
 not allow it. However, in recent times, you have hinted that safeD may
 allow pointers, but disallow bad pointer operations. In light of this,
 can we reconsider this interface, or other alternatives using pointers?

 I've always felt that if we were to define ranges for streams in a
 non-awkward way, we would need an "all in one" operation, since not only
 does getting data from the range move the range, but checking for empty
 might also move the range (empty on a stream means you tried to read and
 got nothing).


 I'd gladly reconsider E* getNext(), and I like it a lot, but that 
 doesn't accommodate ranges that want to return rvalues without storing 
 them (e.g. a range using getchar() as a back-end, and generally streams 
 that don't correspond to stuff stored in memory). If it's not in memory, 
 there's no pointer to it.


First, a range backed by getchar is about as useful as functional qsort ;)

Second, you *have* to read data into memory.  Even with the ranges as they
currently are, you have to read into memory.  At least this is less awkward.

Take for instance a line iterator.  You have to read enough to see the line
terminator, but you most likely do not read *exactly* to the line terminator,
so you just read in chunks until you get a line, then return the pointer to the
data.  It works actually quite elegantly.

Third, the memory could be supplied by the caller.  For instance, if you wrote
the function like this:

E* getNext(E* buf = null);

Then foreach could do something like this:

foreach(e; streamrange)

=>

E _e;
while(auto e = streamrange.getNext(&_e))

To avoid heap allocation.  Of course, heap allocation would be the default if
buf is null.

Tango does this sort of trick quite often, and it makes the I/O code extremely
fast.

Also, another thing to think about is we can generalize the return type to
satisfying the condition:

iff range is empty then cast(bool)range.getNext == false.

This means as long as your range cannot return a null element for a non-empty
return, it is OK not to use a pointer.  For example, the line iterator again...
it can be written like:

const(char)[] getNext()

because you will only ever return a null const(char)[] when there is no data
left.

I don't think we should give up on trying to make a stream range that is not
awkward, I really dislike the way today's input ranges map to streams.

-Steve

Mar 23 2010

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> writes:

On 03/23/2010 09:12 PM, Steven Schveighoffer wrote:
 Andrei Alexandrescu Wrote:

 On 03/23/2010 03:46 PM, Steven Schveighoffer wrote:
 A while back, you identified one of the best interfaces for input ranges:

 E* getNext();

 Which allows for null returns when no data is left. The drawback is that
 E must be either referenced or allocated on the heap (providing storage
 to the function is an option). But the killer issue was that safeD would
 not allow it. However, in recent times, you have hinted that safeD may
 allow pointers, but disallow bad pointer operations. In light of this,
 can we reconsider this interface, or other alternatives using pointers?

 I've always felt that if we were to define ranges for streams in a
 non-awkward way, we would need an "all in one" operation, since not only
 does getting data from the range move the range, but checking for empty
 might also move the range (empty on a stream means you tried to read and
 got nothing).


 I'd gladly reconsider E* getNext(), and I like it a lot, but that
 doesn't accommodate ranges that want to return rvalues without storing
 them (e.g. a range using getchar() as a back-end, and generally streams
 that don't correspond to stuff stored in memory). If it's not in memory,
 there's no pointer to it.


 First, a range backed by getchar is about as useful as functional qsort ;)


Actually I need one. Think fscanf, i.e. unformat() for streams.

Andrei

Mar 23 2010

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> writes:

On 03/23/2010 09:12 PM, Steven Schveighoffer wrote:
 Andrei Alexandrescu Wrote:
 I'd gladly reconsider E* getNext(), and I like it a lot, but that
 doesn't accommodate ranges that want to return rvalues without
 storing them (e.g. a range using getchar() as a back-end, and
 generally streams that don't correspond to stuff stored in memory).
 If it's not in memory, there's no pointer to it.


 Second, you *have* to read data into memory.  Even with the ranges as
 they currently are, you have to read into memory.  At least this is
 less awkward.


I agree. But it's one thing to read and pass along, and a different 
thing to read and keep in a buffer inside the range.

 Take for instance a line iterator.  You have to read enough to see
 the line terminator, but you most likely do not read *exactly* to the
 line terminator, so you just read in chunks until you get a line,
 then return the pointer to the data.  It works actually quite
 elegantly.


I disagree about the elegance part. If the range arrogates the right to 
use its own buffering, then when you decide you're done with that range 
and try to read some more from the stream, you discover data has been lost.

The Phobos file I/O functions all avoid doing any more buffering than 
the backing FILE* does. They achieve performance by locking the file 
once with flockfile/funlockfile and then using fgetc_unlocked().

This puts me in real trouble with the formatted reading functions (a la 
fscanf but generalized to all input ranges), which I'm gestating about. 
The problem with the current API is that if you call input.front(), it 
will call fgetc(). But then say I decide I'm done with the range, as is 
the case with e.g. reading an integer and stopping at the first 
non-digit. That non-digit character will be lost. So there's a need to 
say, hey, put this guy back because whoever reads after me will need to 
look at it. So I need a putBackFront() or something (which would call 
fungetc()). I wish things were simpler.

 Third, the memory could be supplied by the caller.  For instance, if
 you wrote the function like this:

 E* getNext(E* buf = null);

 Then foreach could do something like this:

 foreach(e; streamrange)

 =>

 E _e; while(auto e = streamrange.getNext(&_e))

 To avoid heap allocation.  Of course, heap allocation would be the
 default if buf is null.

 Tango does this sort of trick quite often, and it makes the I/O code
 extremely fast.


The problem is that that speed doesn't translate very well to in-memory 
containers. For containers it's preferable to pass null so you get a 
pointer to the actual element; for streams it's preferable to not pass 
null. So it's difficult to write code that works well for both.

 Also, another thing to think about is we can generalize the return
 type to satisfying the condition:

 iff range is empty then cast(bool)range.getNext == false.

 This means as long as your range cannot return a null element for a
 non-empty return, it is OK not to use a pointer.  For example, the
 line iterator again... it can be written like:

 const(char)[] getNext()

 because you will only ever return a null const(char)[] when there is
 no data left.


I see, but if I'm looking for ints? I'll have to return a pointer - or a 
nullable or something.

 I don't think we should give up on trying to make a stream range that
 is not awkward, I really dislike the way today's input ranges map to
 streams.


Me too. Let's keep on looking, I have the feeling something good is 
right behind the corner. But then I felt that way for a year :o).


Andrei

Mar 23 2010

Fawzi Mohamed <fawzi gmx.ch> writes:

On 24-mar-10, at 03:51, Andrei Alexandrescu wrote:

 The Phobos file I/O functions all avoid doing any more buffering  
 than the backing FILE* does. They achieve performance by locking the  
 file once with flockfile/funlockfile and then using fgetc_unlocked().

 This puts me in real trouble with the formatted reading functions (a  
 la fscanf but generalized to all input ranges), which I'm gestating  
 about. The problem with the current API is that if you call  
 input.front(), it will call fgetc(). But then say I decide I'm done  
 with the range, as is the case with e.g. reading an integer and  
 stopping at the first non-digit. That non-digit character will be  
 lost. So there's a need to say, hey, put this guy back because  
 whoever reads after me will need to look at it. So I need a  
 putBackFront() or something (which would call fungetc()). I wish  
 things were simpler.


I had a pushBack (I called that unget) in
http://github.com/fawzi/blip/blob/master/blip/text/TextParser.d 
  , but I recently removed that in favor of a peek function that I  
think is much more flexible.
What I did is to base most parsing on CharReaders (for example the  
char based ones from BasicIO):
{{{
/// extent of a slice of a buffer
enum SliceExtent{ Partial, Maximal, ToEnd }

/// a delegate that reads in from a character source
alias size_t delegate(char[]buf, SliceExtent slice,out bool iterate)  
CharReader;

/// a handler of CharReader, returns true if something was read
alias bool delegate(CharReader)CharReaderHandler;
}}}

a char reader reads from the given buffer buf, and can either request  
more (by returning EOF), or eat some characters out of it. If it sets  
iterate to true it wants to iterate with the eaten buffer (useful to  
for example skip undefined amount of whitespace that might overflow  
the buffer).

Once you have that you can easily create a Peeker structure that wraps  
a CharReader, and exposes a CharReaded that tries to match it, but  
always eats 0 characters, even if the match was successful.
With it you can have a peek method that returns true if the CharReader  
that you pass in matches, false if it does not match, and what you  
want if the buffer is too small to resolve the issue.

Most of these things are templates that work for any type T. Then I  
built buffered types that using a size_t delegate(T[]) give a Reader  
based interface.

All this is not based on single elements anymore, but on arrays  
(ranges? :), but I think that is what is needed for efficient i/o.
 On 03/23/2010 09:12 PM, Steven Schveighoffer wrote:
 I don't think we should give up on trying to make a stream range that
 is not awkward, I really dislike the way today's input ranges map to
 streams.


 Me too. Let's keep on looking, I have the feeling something good is  
 right behind the corner. But then I felt that way for a year :o).


give a try to
	bool popFront(ref T) ( or next, or another name, or even just a  
delegate with that signature)
I was surprised how well it works, not perfect but better than the  
other alternatives I had tried.

loop on a T[] array:
	bool popFront(ref T* el);

mapped to
	opApply(int delegate(ref T x) loopBody);

loop on a source of elements T:
	bool popFront(ref T el);

mapped to
	opApply(int delegate(ref T x) loopBody);
(well the ref there does not make much sense, but that is how opApply  
works to avoid the explosion of opApply).
All it takes is a check for pointers in the templates, and dereference  
the type of opApply.

also direct loops often look reasonable thanks to with D automatically  
dereferencing with ".":
while (it.popFront(el)){
	el.doSomething;
}

yes assigning stuff directly to el, and not its components you need a  
T* iterator and you have to write
	*el=...
and
	x= *el
but that is not so terrible.

filter applied on an iterator it is just
bool popNext(ref T el){
   while (it.popNext(el)){
     if (acceptable(el)){
       return true;
     }
   }
   return false;
}

combiners of iterators are likewise quite simple to write.

Mar 24 2010

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> writes:

On 03/24/2010 09:00 AM, Fawzi Mohamed wrote:
 On 24-mar-10, at 03:51, Andrei Alexandrescu wrote:

 The Phobos file I/O functions all avoid doing any more buffering than
 the backing FILE* does. They achieve performance by locking the file
 once with flockfile/funlockfile and then using fgetc_unlocked().

 This puts me in real trouble with the formatted reading functions (a
 la fscanf but generalized to all input ranges), which I'm gestating
 about. The problem with the current API is that if you call
 input.front(), it will call fgetc(). But then say I decide I'm done
 with the range, as is the case with e.g. reading an integer and
 stopping at the first non-digit. That non-digit character will be
 lost. So there's a need to say, hey, put this guy back because whoever
 reads after me will need to look at it. So I need a putBackFront() or
 something (which would call fungetc()). I wish things were simpler.


 I had a pushBack (I called that unget) in
 http://github.com/fawzi/blip/blob/master/blip/text/TextParser.d , but I
 recently removed that in favor of a peek function that I think is much
 more flexible.


Thanks for sharing your design with me. Yes, peek() is more flexible 
than get/unget, but I'm under the stdio tyranny.

In fact I just realized something - I could call

setvbuf(_handle, null, _IONBF, 0)

whenever I bind a File to a FILE*. That way File can do its own 
buffering and can implement peek() etc. I wonder if we need to worry 
about sharing, because e.g. several threads would want to write to stdout.

 What I did is to base most parsing on CharReaders (for example the char
 based ones from BasicIO):
 {{{
 /// extent of a slice of a buffer
 enum SliceExtent{ Partial, Maximal, ToEnd }

 /// a delegate that reads in from a character source
 alias size_t delegate(char[]buf, SliceExtent slice,out bool iterate)
 CharReader;

 /// a handler of CharReader, returns true if something was read
 alias bool delegate(CharReader)CharReaderHandler;
 }}}

 a char reader reads from the given buffer buf, and can either request
 more (by returning EOF), or eat some characters out of it. If it sets
 iterate to true it wants to iterate with the eaten buffer (useful to for
 example skip undefined amount of whitespace that might overflow the
 buffer).

 Once you have that you can easily create a Peeker structure that wraps a
 CharReader, and exposes a CharReaded that tries to match it, but always
 eats 0 characters, even if the match was successful.
 With it you can have a peek method that returns true if the CharReader
 that you pass in matches, false if it does not match, and what you want
 if the buffer is too small to resolve the issue.

 Most of these things are templates that work for any type T.


Wait, if you called it CharReader, how come it works with any type T? Or 
are you referring to T as the parsed type?

 Then I
 built buffered types that using a size_t delegate(T[]) give a Reader
 based interface.

 All this is not based on single elements anymore, but on arrays (ranges?
 :), but I think that is what is needed for efficient i/o.


Sounds good, but I wonder why you use delegates instead of classes. Is 
that for simplicity?

I confess it's not 100% clear to me how the delegates are supposed to be 
used in concert, particularly why there's a need for both CharReader and 
CharReaderHandler.

 On 03/23/2010 09:12 PM, Steven Schveighoffer wrote:
 I don't think we should give up on trying to make a stream range that
 is not awkward, I really dislike the way today's input ranges map to
 streams.


 Me too. Let's keep on looking, I have the feeling something good is
 right behind the corner. But then I felt that way for a year :o).


 give a try to
 bool popFront(ref T) ( or next, or another name, or even just a delegate
 with that signature)
 I was surprised how well it works, not perfect but better than the other
 alternatives I had tried.

 loop on a T[] array:
 bool popFront(ref T* el);


So arrays have a different interface than streams. It looks like you 
can't write code that works uniformly for both, because for some you 
need the * and for some you don't. Did I understand that correctly?

Andrei

Mar 24 2010

Andrei Alexandrescu <SeeWebsiteForEmail erdani.org> writes:

On 03/25/2010 05:32 AM, Fawzi Mohamed wrote:
 thinking more about this, you are right something that returns a ref can
 be used exactly the same way as something that returns a value if one
 takes the value with
 auto val=returnRefOrVal;
 can be used as value exactly in the same way.
 Whereas something that returns a T or a T* , need an explicit conversion.
 That is easy to do, and one can even easily wrap the delegate in place
 with something that returns T instead of T*, but the conversion has to
 be explicit (before feeding it to the code), or explicitly tested for in
 the code.
 In practice I hadn't real problems due to this, but it is something that
 is uglier than ref return.
 On the other hand it is easier to know if you might modify the value
 that you received expecting to modify the underlying structure.


I see. So what you're saying is that maybe the entire idea of iterating 
streams and arrays in a unified way may be problematic, because you can 
do different things with the elements of the two. While I partially 
agree with that, there are a lot of things that one can do the same way 
over a stream or a collection, and I wasn't able to find a way to do 
that that's reasonably efficient for both.

Andrei

Mar 25 2010

Fawzi Mohamed <fawzi gmx.ch> writes:

On 24-mar-10, at 15:00, Fawzi Mohamed wrote:

 [...]
 give a try to
 	bool popFront(ref T) ( or next, or another name, or even just a  
 delegate with that signature)
 I was surprised how well it works, not perfect but better than the  
 other alternatives I had tried.


I forgot to say, that one of the main pita with that approach is  
having to declare the arguments before using them, but should you  
decide that it is indeed a good alternative I have no doubt that you  
could find a good syntactic sugar that Walter could implement... :)

Mar 24 2010

Fawzi Mohamed <fawzi gmx.ch> writes:

On 24-mar-10, at 15:11, Fawzi Mohamed wrote:

 On 24-mar-10, at 15:00, Fawzi Mohamed wrote:

 [...]
 give a try to
 	bool popFront(ref T) ( or next, or another name, or even just a  
 delegate with that signature)
 I was surprised how well it works, not perfect but better than the  
 other alternatives I had tried.


 I forgot to say, that one of the main pita with that approach is  
 having to declare the arguments before using them, but should you  
 decide that it is indeed a good alternative I have no doubt that you  
 could find a good syntactic sugar that Walter could implement... :)


if one would have methods
	bool f(ref T)
as valid iterators
then syntactic sugar replacing
	expr(f(auto a));
with
	static if(is(f S==function)){
	 S args;
	 expr(f(args));
	} else {static assert(0);}
would be nice.

Mar 24 2010

Fawzi Mohamed <fawzi gmx.ch> writes:

On 24-mar-10, at 15:11, Fawzi Mohamed wrote:

 On 24-mar-10, at 15:00, Fawzi Mohamed wrote:

 [...]
 give a try to
 	bool popFront(ref T) ( or next, or another name, or even just a  
 delegate with that signature)
 I was surprised how well it works, not perfect but better than the  
 other alternatives I had tried.


 I forgot to say, that one of the main pita with that approach is  
 having to declare the arguments before using them, but should you  
 decide that it is indeed a good alternative I have no doubt that you  
 could find a good syntactic sugar that Walter could implement... :)


if one would have methods
	bool f(ref T)
as valid iterators
then syntactic sugar replacing
	expr(f(auto a));
with
	static if(is(f S==function)){
	 S args;
	 expr(f(args));
	} else {static assert(0);}
would be nice.

Mar 24 2010

Fawzi Mohamed <fawzi gmx.ch> writes:

On 24-mar-10, at 23:29, Andrei Alexandrescu wrote:

 On 03/24/2010 09:00 AM, Fawzi Mohamed wrote:
 On 24-mar-10, at 03:51, Andrei Alexandrescu wrote:

 The Phobos file I/O functions all avoid doing any more buffering  
 than
 the backing FILE* does. They achieve performance by locking the file
 once with flockfile/funlockfile and then using fgetc_unlocked().

 This puts me in real trouble with the formatted reading functions (a
 la fscanf but generalized to all input ranges), which I'm gestating
 about. The problem with the current API is that if you call
 input.front(), it will call fgetc(). But then say I decide I'm done
 with the range, as is the case with e.g. reading an integer and
 stopping at the first non-digit. That non-digit character will be
 lost. So there's a need to say, hey, put this guy back because  
 whoever
 reads after me will need to look at it. So I need a putBackFront()  
 or
 something (which would call fungetc()). I wish things were simpler.


 I had a pushBack (I called that unget) in
 http://github.com/fawzi/blip/blob/master/blip/text/TextParser.d ,  
 but I
 recently removed that in favor of a peek function that I think is  
 much
 more flexible.


 Thanks for sharing your design with me. Yes, peek() is more flexible  
 than get/unget, but I'm under the stdio tyranny.

 In fact I just realized something - I could call

 setvbuf(_handle, null, _IONBF, 0)

 whenever I bind a File to a FILE*. That way File can do its own  
 buffering and can implement peek() etc. I wonder if we need to worry  
 about sharing, because e.g. several threads would want to write to  
 stdout.


well for stdout by default I use locking to ensure writing writes  
chunks atomically.
I would say that by default streams imply sequence, so can safely be  
non threadsafe.
stdout, stderr and logging are exceptions, there at least chunks  
should be written atomically.

 What I did is to base most parsing on CharReaders (for example the  
 char
 based ones from BasicIO):
 {{{
 /// extent of a slice of a buffer
 enum SliceExtent{ Partial, Maximal, ToEnd }

 /// a delegate that reads in from a character source
 alias size_t delegate(char[]buf, SliceExtent slice,out bool iterate)
 CharReader;

 /// a handler of CharReader, returns true if something was read
 alias bool delegate(CharReader)CharReaderHandler;
 }}}

 a char reader reads from the given buffer buf, and can either request
 more (by returning EOF), or eat some characters out of it. If it sets
 iterate to true it wants to iterate with the eaten buffer (useful  
 to for
 example skip undefined amount of whitespace that might overflow the
 buffer).

 Once you have that you can easily create a Peeker structure that  
 wraps a
 CharReader, and exposes a CharReaded that tries to match it, but  
 always
 eats 0 characters, even if the match was successful.
 With it you can have a peek method that returns true if the  
 CharReader
 that you pass in matches, false if it does not match, and what you  
 want
 if the buffer is too small to resolve the issue.

 Most of these things are templates that work for any type T.


 Wait, if you called it CharReader, how come it works with any type  
 T? Or are you referring to T as the parsed type?


Well I presented the CharReader for simplicity, and that is indeed  
only for chars, but most things can be generalized, and indeed if you  
look at

the Reader(T) interface in
http://github.com/fawzi/blip/blob/master/blip/io/BasicIO.d 
  or blip.text.TextParser or similar they are templated with a generic  
type T.
For TextParser I was thinking T=char,wchar or dchar, whereas others  
cases are even more generic.

 Then I
 built buffered types that using a size_t delegate(T[]) give a Reader
 based interface.

 All this is not based on single elements anymore, but on arrays  
 (ranges?
 :), but I think that is what is needed for efficient i/o.


 Sounds good, but I wonder why you use delegates instead of classes.  
 Is that for simplicity?


there are both, and both have their place.
delegates are very simple and can be easily built on the fly, I like  
that very much, they reduce the code footprint of various things.
More complex behaviour is better captured by classes, and indeed there  
are (also in BasicIO) the following interfaces:

interface OutStreamI{
     void rawWriteStr(char[]);
     void rawWriteStr(wchar[]);
     void rawWriteStr(dchar[]);
     void rawWrite(void[]);
     CharSink charSink();
     BinSink binSink();
     void flush();
     void close();
}

/// a reader of elements of type T
interface Reader(T){
     /// read some data into the given buffer
     size_t readSome(T[]);
     /// character reader handler
     bool handleReader(size_t delegate(T[], SliceExtent slice,out bool  
iterate) r);
     /// shutdown the input source
     void shutdownInput();
}

/// one or more readers
interface MultiReader{
     enum Mode{ Binary=1, Char=2, Wchar=4, Dchar=8 }
     /// returns the modes this reader supports
     uint modes();
     /// returns the native modes of this reader (less overhead)
     uint nativeModes();
     Reader!(char) readerChar();
     Reader!(wchar) readerWchar();
     Reader!(dchar) readerDchar();
     Reader!(void) readerBin();
     void shutdownInput();
}
there are classes that can create the more full fledged objects out of  
delegates.

 I confess it's not 100% clear to me how the delegates are supposed  
 to be used in concert, particularly why there's a need for both  
 CharReader and CharReaderHandler.


mainly one needs CharReader, which is a method that reads something.

CharReaderHandler is there just for completeness, it is a delegate of  
a method that actually reads, but normally one simply uses that  
method, i.e. it uses a Reader!(T).handleReader method...

 On 03/23/2010 09:12 PM, Steven Schveighoffer wrote:
 I don't think we should give up on trying to make a stream range  
 that
 is not awkward, I really dislike the way today's input ranges map  
 to
 streams.


 Me too. Let's keep on looking, I have the feeling something good is
 right behind the corner. But then I felt that way for a year :o).


 give a try to
 bool popFront(ref T) ( or next, or another name, or even just a  
 delegate
 with that signature)
 I was surprised how well it works, not perfect but better than the  
 other
 alternatives I had tried.

 loop on a T[] array:
 bool popFront(ref T* el);


 So arrays have a different interface than streams. It looks like you  
 can't write code that works uniformly for both, because for some you  
 need the * and for some you don't. Did I understand that correctly?


well the foreach loop is the same, but the iteration loop is indeed  
different in the sense that one uses a pointer to an element and the  
other the element itself.
one can write code that removes the pointer that is there  
(dereferencing it, or doing and inline function with subsequent call  
which allows you to reuse the same variable name):
void myF(ref x){
  // code
}
myF(*x);

(that is a nice trick that I used several times).

But yes there *is* a difference and the difference is that with arrays  
you might modify the element, modifying the stored value, whereas with  
streams you can't.
This conceptual difference and if reflected in the interface.
One can then discuss if immutable arrays should be iterated with  
immutable pointers or with values (i.e. copying) just as streams are.


 Andrei

Mar 24 2010

Fawzi Mohamed <fawzi gmx.ch> writes:

--Apple-Mail-13--851818124
Content-Type: text/plain;
	charset=US-ASCII;
	format=flowed;
	delsp=yes
Content-Transfer-Encoding: 7bit


On 25-mar-10, at 00:09, Fawzi Mohamed wrote:

 On 24-mar-10, at 23:29, Andrei Alexandrescu wrote:

 [...]
 So arrays have a different interface than streams. It looks like  
 you can't write code that works uniformly for both, because for  
 some you need the * and for some you don't. Did I understand that  
 correctly?


 well the foreach loop is the same, but the iteration loop is indeed  
 different in the sense that one uses a pointer to an element and the  
 other the element itself.
 one can write code that removes the pointer that is there  
 (dereferencing it, or doing and inline function with subsequent call  
 which allows you to reuse the same variable name):
 void myF(ref x){
 // code
 }
 myF(*x);

 (that is a nice trick that I used several times).

 But yes there *is* a difference and the difference is that with  
 arrays you might modify the element, modifying the stored value,  
 whereas with streams you can't.
 This conceptual difference and if reflected in the interface.
 One can then discuss if immutable arrays should be iterated with  
 immutable pointers or with values (i.e. copying) just as streams are.


thinking more about this, you are right something that returns a ref  
can be used exactly the same way as something that returns a value if  
one takes the value with
	auto val=returnRefOrVal;
can be used as value exactly in the same way.
Whereas something that returns a T or a T* , need an explicit  
conversion.
That is easy to do, and one can even easily wrap the delegate in place  
with something that returns T instead of T*, but the conversion has to  
be explicit (before feeding it to the code), or explicitly tested for  
in the code.
In practice I hadn't real problems due to this, but it is something  
that is uglier than ref return.
On the other hand it is easier to know if you might modify the value  
that you received expecting to modify the underlying structure.


--Apple-Mail-13--851818124
Content-Type: text/html;
	charset=US-ASCII
Content-Transfer-Encoding: quoted-printable

<html><body style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; "><br><div><div>On 25-mar-10, at =
00:09, Fawzi Mohamed wrote:</div><br =
class=3D"Apple-interchange-newline"><blockquote type=3D"cite"><div><br>On =
24-mar-10, at 23:29, Andrei Alexandrescu wrote:<br><br><blockquote =
type=3D"cite">[...]</blockquote><blockquote type=3D"cite">So arrays have =
a different interface than streams. It looks like you can't write code =
that works uniformly for both, because for some you need the * and for =
some you don't. Did I understand that =
correctly?<br></blockquote><br>well the foreach loop is the same, but =
the iteration loop is indeed different in the sense that one uses a =
pointer to an element and the other the element itself.<br>one can write =
code that removes the pointer that is there (dereferencing it, or doing =
and inline function with subsequent call which allows you to reuse the =
same variable name):<br>void myF(ref x){<br> // =
code<br>}<br>myF(*x);<br><br>(that is a nice trick that I used several =
times).<br><br>But yes there *is* a difference and the difference is =
that with arrays you might modify the element, modifying the stored =
value, whereas with streams you can't.<br>This conceptual difference and =
if reflected in the interface.<br>One can then discuss if immutable =
arrays should be iterated with immutable pointers or with values (i.e. =
copying) just as streams are.<font class=3D"Apple-style-span" =
color=3D"#000000"><font class=3D"Apple-style-span" =
color=3D"#144FAE"><br></font></font></div></blockquote><br></div><div>thin=
king more about this, you are right something that returns a ref can be =
used exactly the same way as something that returns a value if one takes =
the value with</div><div><span class=3D"Apple-tab-span" =
style=3D"white-space:pre">	</span>auto =
val=3DreturnRefOrVal;</div><div>can be used as value exactly in the same =
way.</div><div>Whereas something that returns a T or a T* , need an =
explicit conversion.</div><div>That is easy to do, and one can even =
easily wrap the delegate in place with something that returns T instead =
of T*, but the conversion has to be explicit (before feeding it to the =
code), or explicitly tested for in the code.</div><div>In practice I =
hadn't real problems due to this, but it is something that is uglier =
than ref return.</div><div>On the other hand it is easier to know if you =
might modify the value that you received expecting to modify the =
underlying structure.</div><div><br></div></body></html>=

--Apple-Mail-13--851818124--

Mar 25 2010

D Programming

C/C++ Programming

Other

digitalmars.D - Re: Ranges and/versus iterators