digitalmars.D.learn - Why isn't skipOver(string, string) nothrow?

Per =?UTF-8?B?Tm9yZGzDtnc=?= (12/12) Oct 22 2019 Why isn't a call to

Adam D. Ruppe (9/12) Oct 22 2019 without really looking, probably because of invalid utf sequences

Per =?UTF-8?B?Tm9yZGzDtnc=?= (3/17) Oct 22 2019 But startsWith(string, string) is nothrow so skipOver should be

Jonathan M Davis (7/26) Oct 22 2019 That's only true with startsWith, because it avoids decoding in the case

Jonathan M Davis (26/38) Oct 22 2019 Almost anything involving strings isn't going to be nothrow, because fro...

Per =?UTF-8?B?Tm9yZGzDtnc=?= <per.nordlow gmail.com> writes:

Why isn't a call to

     skipOver(string, string)

nothrow?

I see no reason why it shouldn't be.

Further, this test should be qualifyable as nothrow:

 safe pure /* TODO nothrow  nogc */ unittest
{
     import std.algorithm.searching : skipOver;
     auto x = "beta version";
     assert(x.skipOver("beta"));
     assert(x == " version");
}

Oct 22 2019

Adam D. Ruppe <destructionator gmail.com> writes:

On Tuesday, 22 October 2019 at 15:33:05 UTC, Per Nordlöw wrote:
 Why isn't a call to

     skipOver(string, string)

 nothrow?

without really looking, probably because of invalid utf sequences 
potentially throwing. Using the .representation thingy might help 
if im right about this.

A good way to try this is to edit your copy of the Phobos file 
and add nothrow to it. That should give an error inside that is 
more descriptive. (I would love if the compiler would do this 
automatically, we were talking about maybe making that mod on irc 
yesterday).

Oct 22 2019

Per =?UTF-8?B?Tm9yZGzDtnc=?= <per.nordlow gmail.com> writes:

On Tuesday, 22 October 2019 at 15:39:17 UTC, Adam D. Ruppe wrote:
 On Tuesday, 22 October 2019 at 15:33:05 UTC, Per Nordlöw wrote:
 Why isn't a call to

     skipOver(string, string)

 nothrow?

 without really looking, probably because of invalid utf 
 sequences potentially throwing. Using the .representation 
 thingy might help if im right about this.

 A good way to try this is to edit your copy of the Phobos file 
 and add nothrow to it. That should give an error inside that is 
 more descriptive. (I would love if the compiler would do this 
 automatically, we were talking about maybe making that mod on 
 irc yesterday).

But startsWith(string, string) is nothrow so skipOver should be 
that too.

Oct 22 2019

Jonathan M Davis <newsgroup.d jmdavisprog.com> writes:

On Tuesday, October 22, 2019 4:27:59 PM MDT Per Nordl�w via Digitalmars-d-
learn wrote:
 On Tuesday, 22 October 2019 at 15:39:17 UTC, Adam D. Ruppe wrote:
 On Tuesday, 22 October 2019 at 15:33:05 UTC, Per Nordl�w wrote:
 Why isn't a call to

     skipOver(string, string)

 nothrow?

 without really looking, probably because of invalid utf
 sequences potentially throwing. Using the .representation
 thingy might help if im right about this.

 A good way to try this is to edit your copy of the Phobos file
 and add nothrow to it. That should give an error inside that is
 more descriptive. (I would love if the compiler would do this
 automatically, we were talking about maybe making that mod on
 irc yesterday).

 But startsWith(string, string) is nothrow so skipOver should be
 that too.

That's only true with startsWith, because it avoids decoding in the case
where the two strings have the same encoding (e.g. it's not nothrow if you
compare a dstring and a string). Presumably, skipOver could be made to do
the same, but no one has done so.

- Jonathan M Davis

Oct 22 2019

Jonathan M Davis <newsgroup.d jmdavisprog.com> writes:

On Tuesday, October 22, 2019 9:33:05 AM MDT Per Nordl�w via Digitalmars-d-
learn wrote:
 Why isn't a call to

      skipOver(string, string)

 nothrow?

 I see no reason why it shouldn't be.

 Further, this test should be qualifyable as nothrow:

  safe pure /* TODO nothrow  nogc */ unittest
 {
      import std.algorithm.searching : skipOver;
      auto x = "beta version";
      assert(x.skipOver("beta"));
      assert(x == " version");
 }

Almost anything involving strings isn't going to be nothrow, because front
and popFront throw on invalid UTF. To really fix that, we'd need to get rid
of auto-decoding. You can use std.utf.byDchar to wrap the string in a range
of dchar which replaces invalid Unicode with the replacement character
instead, which means that no exception gets thrown, but it also means that
if you hadn't previously validated the Unicode, you could end up processing
invalid Unicode without realizing it. How much that matters depends on what
you're doing. Ideally, all strings would just be validated when they were
created, and then it wouldn't be an issue, but any code that decodes the
code points would still have to deal with invalid Unicode in some manner
(though if we decided that it was the responsibility of the caller to always
validate the Unicode first, then we could use assertions). For better or
worse, the chosen solution when ranges were first put together was to throw
on invalid Unicode, which basically makes it impossible for functions that
process strings to be nothrow unless they go to the extra effort working
around auto-decoding. If we're ever able to remove auto-decoding, then
that's no longer an issue for all string processing functions, but it's
still going to be an issue for any code that calls functions like decode or
stride. They're either going to throw or replace invalid Unicode with the
replacement character. Which approach is better depends on the code.

In any case, as long as auto-decoding is a thing, you'll have to use
wrappers like byDchar or byCodeUnit if you want much of anything involving
strings to be nothrow.

- Jonathan M Davis

Oct 22 2019

D Programming

C/C++ Programming

Other

digitalmars.D.learn - Why isn't skipOver(string, string) nothrow?