digitalmars.D - Imports, packages, modules and symbol name scopes

Bruno Medeiros (57/57) Aug 25 2005 Concerning in D the whole aspect of imports, packages, modules and

Chris Sauls (26/39) Aug 26 2005 I see your point, but rather than your suggested changes, I'd rather som...

=?ISO-8859-1?Q?Thomas_K=FChne?= (6/20) Aug 26 2005 Something along the lines of:
Bruno Medeiros (15/56) Aug 27 2005 Yes, in the interests of compatibility, it would be best to do that, or

Chris Sauls (11/72) Aug 27 2005 Maybe something like:

Bruno Medeiros (11/80) Aug 30 2005 There are some things in D I find less-than-ideal, but while on those

Derek Parnell (27/97) Aug 30 2005 I haven't responded because I'm not too sure about what the issues are. ...

Bruno Medeiros (7/12) Aug 30 2005 Hum.. I was under the impression that the post was clear. I will

Derek Parnell (8/17) Aug 30 2005 Just a little under 1% ;-)

Bruno Medeiros (7/21) Aug 31 2005 Oh, I thought so, that explains things. And it is also somewhat

Derek Parnell (26/44) Aug 31 2005 I can hardly wait ... :D

Bruno Medeiros (86/155) Aug 31 2005 Ok, best if I clarify the whole thing at once then. Suppose we have the

Derek Parnell (22/66) Aug 31 2005 So another way of saying it is that public symbols in other modules shou...

Bruno Medeiros (73/104) Sep 01 2005 There was something ahead in your post that prompted me to change some

Derek Parnell (15/85) Sep 01 2005 I mean by that ... "referencing an entity in another module by using its

Bruno Medeiros (26/93) Sep 03 2005 Ok, now I understand. And now I can say that no, that isn't quite the

Ben Hinkle (18/28) Aug 30 2005 [snip]

Bruno Medeiros (15/54) Aug 31 2005 Yes, there is that difference (if you're not using the /using/

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

Concerning in D the whole aspect of imports, packages, modules and 
symbol name scopes, etc.(what to call this whole thing?) there are some 
things I find ill-conceived, so much that I think this aspect of D is in 
need of some rethinking.


First: more often than not, I like to access a symbol using its "fully 
qualified name" (what to call it?), like this:
package1.module1.func(module2.var1);

Currently, if I want to do this I have to import the respective modules, 
which unfortunately also brings the imported modules scope to the local 
module scope, which I do not want. I want to keep the local scope clean 
and access these imported modules only by its FQN.
The first alternative that came to mind was to change import so that it 
would do so: make the symbols accessible only by FQN, and then we would 
have a /using/ statement, same as C++, that would bring packages or 
modules to the local scope (and that would replace the /with/ statement).

But then I realized, why do we then even need the imports at all? Why 
not automatically import all available modules (into FQN scope), and 
just use the /using/ keyword when we want to bring external scopes 
closer? (Are we still thinking C++ legacy-like here?...) It's the simple 

do it this way, which further assures me this is a better behaviour.

This feature requires an import path, i.e. a list of directories where 
to search and import the symbol names (in Java this is the CLASSPATH). 
In D this would be the already existant path defined by the -I 
command-line options, plus now an aditional path element for the project 
root dir itself.

I'm fairly convict D should have this "automatic global 
import/availability of symbol names" (what to call it?).



Second: the module statement. The module statement specifies the name of 
the module, and the hierarchy of parent packages. The module name 
however must match the file name, and the packages must match the 
underlying directory structure, otherwise the source file will be 
unusable (un-importable actually).
This makes the module name declaration pretty much redundant, as well as 
the parent packages declaration (when given a common root dir for a set 
of source files), thus making the whole module statement redundant. I 
find this redundancy unnecessary and undesirable.
So, what possibilities then to this issue of redundancy in the naming of 
packages/modules? There are at first two options:

The current D way (no changes):
* Based upon directory+file structure AND in-file package+module 
declarations, thus having redundancy. (This is like Java)

Alternative:
* Based upon directory+file sctucture only: no need to specify 
package+module in-file, they are inferred.

I was originaly inclined to this second option, however, the 
introduction of automatic global import/availability of symbol names 
possibilitates a third option:

* Based upon in-file declarations only, the file pathnames do not matter 

search the first statement of all project files, which would be the 
module definition, to create the table of packages+modules.

Honestly, on this issue I'm not totally sure what would be best (if any 
is), but now I'm marginally inclined torwards the third option.

-- 
Bruno Medeiros
Computer Science/Engineering student

Aug 25 2005

Chris Sauls <ibisbasenji gmail.com> writes:

Bruno Medeiros wrote:
 First: more often than not, I like to access a symbol using its "fully 
 qualified name" (what to call it?), like this:
 package1.module1.func(module2.var1);
 
 Currently, if I want to do this I have to import the respective modules, 
 which unfortunately also brings the imported modules scope to the local 
 module scope, which I do not want. I want to keep the local scope clean 
 and access these imported modules only by its FQN.

I see your point, but rather than your suggested changes, I'd rather some
decorator for 
the existant 'import' statement, or maybe a companion statement that behaves as
you wish.

 Second: the module statement. The module statement specifies the name of 
 the module, and the hierarchy of parent packages. The module name 
 however must match the file name, and the packages must match the 
 underlying directory structure, otherwise the source file will be 
 unusable (un-importable actually).

Actually this isn't precisely true.  For example, say I have a given package
structure:








And say I expect this program to be compiled with one of "-I./lib/ver1/" or 
"-I./lib/ver2/" on the command line.  Then within the program one would use
"import 
foo.stuff;"... Now, say one of the "stuff" modules is for win32 and the other
for linux. 
They're respective module declerations might then be:






Its entirely do-able.  Its also useful for a project that /will/ be importing
modules with 
  same or similar file names from different places in the package hierarchy.

Its also worth noting that the module decleration is /optional/.  So if you
don't want to 
use it, then you don't use it.  It defaults to the filename.  (Although, again,
it can be 
useful for casing the module name while keeping the filename lo-cased for
filesystem 
compatabilities.)

-- Chris Sauls

Aug 26 2005

=?ISO-8859-1?Q?Thomas_K=FChne?= <thomas-dloop kuehne.cn> writes:

Chris Sauls schrieb:
 Bruno Medeiros wrote:
 
 First: more often than not, I like to access a symbol using its "fully
 qualified name" (what to call it?), like this:
 package1.module1.func(module2.var1);

 Currently, if I want to do this I have to import the respective
 modules, which unfortunately also brings the imported modules scope to
 the local module scope, which I do not want. I want to keep the local
 scope clean and access these imported modules only by its FQN.

 
 I see your point, but rather than your suggested changes, I'd rather
 some decorator for the existant 'import' statement, or maybe a companion
 statement that behaves as you wish.

Something along the lines of:

pragam(FQN_ONLY_EXCEPT_FOR_THE_CURRENT_MODULE);

would be interresting.

Granted the usefullnes depends on your coding style.


Thomas

Aug 26 2005

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

Chris Sauls wrote:
 Bruno Medeiros wrote:
 Currently, if I want to do this I have to import the respective 
 modules, which unfortunately also brings the imported modules scope to 
 the local module scope, which I do not want. I want to keep the local 
 scope clean and access these imported modules only by its FQN.

 
 I see your point, but rather than your suggested changes, I'd rather 
 some decorator for the existant 'import' statement, or maybe a companion 
 statement that behaves as you wish.
 

Yes, in the interests of compatibility, it would be best to do that, or 
use a different keyword ("simport" or whatever). Altough this is only 
second-best to having automatic FQN imports.

 Second: the module statement. The module statement specifies the name 
 of the module, and the hierarchy of parent packages. The module name 
 however must match the file name, and the packages must match the 
 underlying directory structure, otherwise the source file will be 
 unusable (un-importable actually).

 
 Actually this isn't precisely true.  For example, say I have a given 
 package structure:







 
 And say I expect this program to be compiled with one of "-I./lib/ver1/" 
 or "-I./lib/ver2/" on the command line.  Then within the program one 
 would use "import foo.stuff;"... Now, say one of the "stuff" modules is 
 for win32 and the other for linux. They're respective module 
 declerations might then be:


 


 
 Its entirely do-able.  Its also useful for a project that /will/ be 
 importing modules with  same or similar file names from different places 
 in the package hierarchy.
 

No, it is not do-able (have you actually tried it?). If you import a 
module ("import foo.stuff;") that is defined  in-file with a different 
module name ("module foo.win32Stuff;" or "module foo.linuxStuff;") You 
will get the following error:

file.d(4): module foo.win32stuff is in multiple packages foo.win32stuff

Which, BTW, I think now, is a somewhat incorrect compiler error message: 
it should state the two names by which the same module was refered to ( 
foo.stuff and foo.win32stuff).

-- 
Bruno Medeiros
Computer Science/Engineering student

Aug 27 2005

Chris Sauls <ibisbasenji gmail.com> writes:

Bruno Medeiros wrote:
 Chris Sauls wrote:
 
 Bruno Medeiros wrote:

 Currently, if I want to do this I have to import the respective 
 modules, which unfortunately also brings the imported modules scope 
 to the local module scope, which I do not want. I want to keep the 
 local scope clean and access these imported modules only by its FQN.


 I see your point, but rather than your suggested changes, I'd rather 
 some decorator for the existant 'import' statement, or maybe a 
 companion statement that behaves as you wish.

 
 Yes, in the interests of compatibility, it would be best to do that, or 
 use a different keyword ("simport" or whatever). Altough this is only 
 second-best to having automatic FQN imports.

Maybe something like:



Only it might confuse people coming from environments where 'using' does what
our 'import' 
does... but so be it.

 Second: the module statement. The module statement specifies the name 
 of the module, and the hierarchy of parent packages. The module name 
 however must match the file name, and the packages must match the 
 underlying directory structure, otherwise the source file will be 
 unusable (un-importable actually).


 Actually this isn't precisely true.  For example, say I have a given 
 package structure:








 And say I expect this program to be compiled with one of 
 "-I./lib/ver1/" or "-I./lib/ver2/" on the command line.  Then within 
 the program one would use "import foo.stuff;"... Now, say one of the 
 "stuff" modules is for win32 and the other for linux. They're 
 respective module declerations might then be:






 Its entirely do-able.  Its also useful for a project that /will/ be 
 importing modules with  same or similar file names from different 
 places in the package hierarchy.

 No, it is not do-able (have you actually tried it?). If you import a 
 module ("import foo.stuff;") that is defined  in-file with a different 
 module name ("module foo.win32Stuff;" or "module foo.linuxStuff;") You 
 will get the following error:
 
 file.d(4): module foo.win32stuff is in multiple packages foo.win32stuff
 
 Which, BTW, I think now, is a somewhat incorrect compiler error message: 
 it should state the two names by which the same module was refered to ( 
 foo.stuff and foo.win32stuff).
 

Hmm... I did just try it, and you're right.  To me it seems either a bug or a
misfeature, 
and if the module name really is supposed to be the same as the file name, then
its almost 
useless, other than for declaring package-hierarchies.  (And maybe for
documentation if 
the new proposal for that flies, which I hope it does.)

-- Chris Sauls

Aug 27 2005

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

There are some things in D I find less-than-ideal, but while on those 
issues I do agree my opinion (and other's) lies on a somewhat subjective 
realm, in this particular aspect I'm pretty sure D needs rethinking. And 
getting generalized indifference (got no comments) towards a change 
proposal is not a good sign :/ (people not caring?)
So could I please get an official comment (i.e. Walter's) on this issue, 
(even if just a one-liner)? This is too important to give up.

Bruno Medeiros wrote:
 Concerning in D the whole aspect of imports, packages, modules and 
 symbol name scopes, etc.(what to call this whole thing?) there are some 
 things I find ill-conceived, so much that I think this aspect of D is in 
 need of some rethinking.
 
 
 First: more often than not, I like to access a symbol using its "fully 
 qualified name" (what to call it?), like this:
 package1.module1.func(module2.var1);
 
 Currently, if I want to do this I have to import the respective modules, 
 which unfortunately also brings the imported modules scope to the local 
 module scope, which I do not want. I want to keep the local scope clean 
 and access these imported modules only by its FQN.
 The first alternative that came to mind was to change import so that it 
 would do so: make the symbols accessible only by FQN, and then we would 
 have a /using/ statement, same as C++, that would bring packages or 
 modules to the local scope (and that would replace the /with/ statement).
 
 But then I realized, why do we then even need the imports at all? Why 
 not automatically import all available modules (into FQN scope), and 
 just use the /using/ keyword when we want to bring external scopes 
 closer? (Are we still thinking C++ legacy-like here?...) It's the simple 

 do it this way, which further assures me this is a better behaviour.
 
 This feature requires an import path, i.e. a list of directories where 
 to search and import the symbol names (in Java this is the CLASSPATH). 
 In D this would be the already existant path defined by the -I 
 command-line options, plus now an aditional path element for the project 
 root dir itself.
 
 I'm fairly convict D should have this "automatic global 
 import/availability of symbol names" (what to call it?).
 
 
 
 Second: the module statement. The module statement specifies the name of 
 the module, and the hierarchy of parent packages. The module name 
 however must match the file name, and the packages must match the 
 underlying directory structure, otherwise the source file will be 
 unusable (un-importable actually).
 This makes the module name declaration pretty much redundant, as well as 
 the parent packages declaration (when given a common root dir for a set 
 of source files), thus making the whole module statement redundant. I 
 find this redundancy unnecessary and undesirable.
 So, what possibilities then to this issue of redundancy in the naming of 
 packages/modules? There are at first two options:
 
 The current D way (no changes):
 * Based upon directory+file structure AND in-file package+module 
 declarations, thus having redundancy. (This is like Java)
 
 Alternative:
 * Based upon directory+file sctucture only: no need to specify 
 package+module in-file, they are inferred.
 
 I was originaly inclined to this second option, however, the 
 introduction of automatic global import/availability of symbol names 
 possibilitates a third option:
 
 * Based upon in-file declarations only, the file pathnames do not matter 

 search the first statement of all project files, which would be the 
 module definition, to create the table of packages+modules.
 
 Honestly, on this issue I'm not totally sure what would be best (if any 
 is), but now I'm marginally inclined torwards the third option.
 


-- 
Bruno Medeiros
Computer Science/Engineering student

Aug 30 2005

Derek Parnell <derek psych.ward> writes:

On Tue, 30 Aug 2005 23:44:43 +0000, Bruno Medeiros wrote:

 There are some things in D I find less-than-ideal, but while on those 
 issues I do agree my opinion (and other's) lies on a somewhat subjective 
 realm, in this particular aspect I'm pretty sure D needs rethinking. And 
 getting generalized indifference (got no comments) towards a change 
 proposal is not a good sign :/ (people not caring?)

I haven't responded because I'm not too sure about what the issues are. I
don't really understand what you are asking for.

 So could I please get an official comment (i.e. Walter's) on this issue, 
 (even if just a one-liner)? This is too important to give up.
 
 Bruno Medeiros wrote:
 Concerning in D the whole aspect of imports, packages, modules and 
 symbol name scopes, etc.(what to call this whole thing?) there are some 
 things I find ill-conceived, so much that I think this aspect of D is in 
 need of some rethinking.
 
 First: more often than not, I like to access a symbol using its "fully 
 qualified name" (what to call it?), like this:
 package1.module1.func(module2.var1);
 
 Currently, if I want to do this I have to import the respective modules, 
 which unfortunately also brings the imported modules scope to the local 
 module scope, which I do not want. I want to keep the local scope clean 
 and access these imported modules only by its FQN.


I haven't had a problem with this. What problems is this behaviour giving
you?

 The first alternative that came to mind was to change import so that it 
 would do so: make the symbols accessible only by FQN, and then we would 
 have a /using/ statement, same as C++, that would bring packages or 
 modules to the local scope (and that would replace the /with/ statement).
 
 But then I realized, why do we then even need the imports at all? Why 
 not automatically import all available modules (into FQN scope), and 
 just use the /using/ keyword when we want to bring external scopes 
 closer? (Are we still thinking C++ legacy-like here?...) It's the simple 

 do it this way, which further assures me this is a better behaviour.
 
 This feature requires an import path, i.e. a list of directories where 
 to search and import the symbol names (in Java this is the CLASSPATH). 
 In D this would be the already existant path defined by the -I 
 command-line options, plus now an aditional path element for the project 
 root dir itself.
 
 I'm fairly convict D should have this "automatic global 
 import/availability of symbol names" (what to call it?).


Are you saying that if somewhere in my code I have a full qualified name
for an external item, that the D compiler should assume that I had meant to
also code the import statement for that module? If so, should it assume a
private or public import?

For example:

     result = std.string.strip(theField);

would cause DMD to assume that I had also coded ...

  private import std.string;

 Second: the module statement. The module statement specifies the name of 
 the module, and the hierarchy of parent packages. The module name 
 however must match the file name, and the packages must match the 
 underlying directory structure, otherwise the source file will be 
 unusable (un-importable actually).
 This makes the module name declaration pretty much redundant, as well as 
 the parent packages declaration (when given a common root dir for a set 
 of source files), thus making the whole module statement redundant. I 
 find this redundancy unnecessary and undesirable.


I agree that the module statement is basically useless and an
inconvenience. The only thing is seems to do is enforce which package a
module is supposed to be in. 

 So, what possibilities then to this issue of redundancy in the naming of 
 packages/modules? There are at first two options:
 
 The current D way (no changes):
 * Based upon directory+file structure AND in-file package+module 
 declarations, thus having redundancy. (This is like Java)
 
 Alternative:
 * Based upon directory+file sctucture only: no need to specify 
 package+module in-file, they are inferred.

 I was originaly inclined to this second option, however, the 
 introduction of automatic global import/availability of symbol names 
 possibilitates a third option:
 
 * Based upon in-file declarations only, the file pathnames do not matter 

 search the first statement of all project files, which would be the 
 module definition, to create the table of packages+modules.


What's a "project file"? Do just mean a source code file? How would DMD
find all project files, that is, all files that belong to a project? One
can't assume that all files in a specific directory belong to the same
application, and some projects generate more than one application program.
So I guess I'm not sure what you are asking for here. Can you give
examples?

-- 
Derek
(skype: derek.j.parnell)
Melbourne, Australia
31/08/2005 9:48:34 AM

Aug 30 2005

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

Derek Parnell wrote:
 On Tue, 30 Aug 2005 23:44:43 +0000, Bruno Medeiros wrote:
 
 I haven't responded because I'm not too sure about what the issues are. I
 don't really understand what you are asking for.
 

Hum.. I was under the impression that the post was clear. I will 
clarify, but before that let know this: are you familiar with (as in 



-- 
Bruno Medeiros
Computer Science/Engineering student

Aug 30 2005

Derek Parnell <derek psych.ward> writes:

On Wed, 31 Aug 2005 00:56:32 +0000, Bruno Medeiros wrote:

 Derek Parnell wrote:
 On Tue, 30 Aug 2005 23:44:43 +0000, Bruno Medeiros wrote:
 
 I haven't responded because I'm not too sure about what the issues are. I
 don't really understand what you are asking for.
 

 Hum.. I was under the impression that the post was clear. I will 
 clarify, but before that let know this: are you familiar with (as in 


Just a little under 1% ;-) 

Could you use plain English instead?

-- 
Derek
(skype: derek.j.parnell)
Melbourne, Australia
31/08/2005 11:04:30 AM

Aug 30 2005

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

Derek Parnell wrote:
 On Wed, 31 Aug 2005 00:56:32 +0000, Bruno Medeiros wrote:
 
 
Derek Parnell wrote:

Hum.. I was under the impression that the post was clear. I will 
clarify, but before that let know this: are you familiar with (as in 


 
 
 Just a little under 1% ;-) 
 
 Could you use plain English instead?
 

Oh, I thought so, that explains things.  And it is also somewhat 
disturbing. [I can explain this last phrase but it would take another 
thread, maybe later]

-- 
Bruno Medeiros
Computer Science/Engineering student

Aug 31 2005

Derek Parnell <derek psych.ward> writes:

On Wed, 31 Aug 2005 21:32:24 +0000, Bruno Medeiros wrote:

 Derek Parnell wrote:
 On Wed, 31 Aug 2005 00:56:32 +0000, Bruno Medeiros wrote:
 
 
Derek Parnell wrote:

Hum.. I was under the impression that the post was clear. I will 
clarify, but before that let know this: are you familiar with (as in 


 
 
 Just a little under 1% ;-) 
 
 Could you use plain English instead?
 

 Oh, I thought so, that explains things.  And it is also somewhat 
 disturbing. [I can explain this last phrase but it would take another 
 thread, maybe later]

I can hardly wait ... :D

BTW, my programming background (in Chronological order) is 

IBM COBOL
IBM 360/370 Assembler
PL/I
IBM 1440 Autocoder
MS BASIC v1.0
C
Intel Assembler
Motorola Assembler
Forth
UFO (a 4GL used to make CICS/VS coding easier)
RPG
Progress 4GL
C++
Pascal
E (on the beloved Amiga)
Visual Basic v3/4/5/6
IBM UniVerse
Euphoria
D


-- 
Derek Parnell
Melbourne, Australia
1/09/2005 8:01:56 AM

Aug 31 2005

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

Ok, best if I clarify the whole thing at once then. Suppose we have the 
following source files:

pack1/foobar.d
	Class Whatever
	function DoStuff()
	function DoStuff2()
pack2/xpto.d
	var xpto1
	var xpto2
pack2/asdf.d
	class Alpha
	class Beta


Now suppose that in another D file we want to use doStuff, we then would 
have to issue a:

   import pack1.foobar;

Currently, D import statement such as this do two things: it makes 
DoStuff (and other symbols of pack1.foobar) available in the current 
module with two names: fully-qualified (pack1.foobar.DoStuff) and 
stripped (just DoStuff). The symbols of public imports of pack1.foobar 
are also made available in these (both) ways.


Now let's examine the same situation in Java, let's suppose all those D 
files are Java source files. (technically, then the modules would have 
to be classes, but it wouldn't make any relevant difference for this 
discussion). Now suppose that in another Java file, we want to use 
DoStuff. Things are different, if we want to access DoStuff we can 
already do it by its FQN:

   pack2.Foobar.DoStuff();

It is only if we want to access it with a shorter name, that we issue a 
Java import statement;

   import pack2.Foobar;	// or import pack2.*;
   ...
   Foobar.DoStuff();

In Java you can only import the Foobar symbol (a module-like class) 
into scope, not the contents of Foobar. Thus you cannot access DoStuff() 
directly, or any external funcion for that matter (the wildcard "*" can 
only be used after packages).



   pack2.Foobar.DoStuff();

With shorter names it is:

   using pack2;
   ...
   Foobar.DoStuff();


include statements and namespaces can only contain Classes, Structs or 
Enums, if I recall correctly.


Now with this in mind, let's look at your questions (you may also want 
to reread my original post):


Derek Parnell wrote:
Bruno Medeiros wrote:

Concerning in D the whole aspect of imports, packages, modules and 
symbol name scopes, etc.(what to call this whole thing?) there are some 
things I find ill-conceived, so much that I think this aspect of D is in 
need of some rethinking.

First: more often than not, I like to access a symbol using its "fully 
qualified name" (what to call it?), like this:
package1.module1.func(module2.var1);

Currently, if I want to do this I have to import the respective modules, 
which unfortunately also brings the imported modules scope to the local 
module scope, which I do not want. I want to keep the local scope clean 
and access these imported modules only by its FQN.


 
 
 I haven't had a problem with this. What problems is this behaviour giving
 you?
 

In terms of just being possible, there is no problem. The problem is the 
"side-effect" of bringing the stripped name (DoStuff) into local scope 
as well, not just the FQ name. Altough it may be hard for this to 
actually cause a bug, or code-inconsistency, it is conceptually an 
important behaviour. For example, down the road, if we have a D IDE that 
does code autocompletion (aka intellisense by MS), it becomes especially 
relevant, and problematic!

[...]
 
 Are you saying that if somewhere in my code I have a full qualified name
 for an external item, that the D compiler should assume that I had meant to
 also code the import statement for that module? If so, should it assume a
 private or public import?
 
 For example:
 
      result = std.string.strip(theField);
 
 would cause DMD to assume that I had also coded ...
 
   private import std.string;
 
 

Yes that would be it, *but* the import would be different, it would only 
import FQ names (let's call it fqimport) like I described before. And 
with such import there would be no difference in being public or 
private. (do you see why?)

Second: the module statement. The module statement specifies the name of 
the module, and the hierarchy of parent packages. The module name 
however must match the file name, and the packages must match the 
underlying directory structure, otherwise the source file will be 
unusable (un-importable actually).
This makes the module name declaration pretty much redundant, as well as 
the parent packages declaration (when given a common root dir for a set 
of source files), thus making the whole module statement redundant. I 
find this redundancy unnecessary and undesirable.


 
 
 I agree that the module statement is basically useless and an
 inconvenience. The only thing is seems to do is enforce which package a
 module is supposed to be in. 
 
 

The module statement is redundant, but that doesn't mean that it is the 
statement that should go away. See my reply to Ben Hinkle.

I was originaly inclined to this second option, however, the 
introduction of automatic global import/availability of symbol names 
possibilitates a third option:

* Based upon in-file declarations only, the file pathnames do not matter 

search the first statement of all project files, which would be the 
module definition, to create the table of packages+modules.


 
 
 What's a "project file"? Do just mean a source code file? How would DMD
 find all project files, that is, all files that belong to a project? One
 can't assume that all files in a specific directory belong to the same
 application, and some projects generate more than one application program.
 So I guess I'm not sure what you are asking for here. Can you give
 examples?
 

By project file I meant source code file belonging to a project, 
"project" being something that would produce a *single* output 
executable or library (yes, likely not a clear term).
The method would indeed be to search all D files in a directory 
(recursively). So in the example in the beggining, supose those files 
are in a "C:/projects/test" dir. We would have:

   dmd ... -IC:/projects/test -IC:/projects/lib

then dmd would search all files in "C:/projects/test", which would be:
   pack1/foobar.d
   pack2/xpto.d
   pack2/asdf.d

and they could now have different packages+modules names other than 
pack1.foobar, pack2.xpto or pack2.asdf respectively. The files in 
"C:/projects/lib" would be searched in the same way. So this is basicly 
the behaviour of the third option, and so far I don't see any problem 
with it.
If you have the source code of several applications mingled in the same 
dir, you should have them named with apropriate and unconflicting 
modules, or possibly better, you should separate them in different 
source dirs. (Frankly I think one should do this regardless of whether 
the compiler searches for files or not, it is basic best-pratices to 
have a good, clear code structure in terms of files too)


-- 
Bruno Medeiros
Computer Science/Engineering student

Aug 31 2005

Derek Parnell <derek psych.ward> writes:

On Wed, 31 Aug 2005 16:14:59 +0000, Bruno Medeiros wrote:

 Ok, best if I clarify the whole thing at once then.

[snipped a good clarification]

 I haven't had a problem with this. What problems is this behaviour giving
 you?
 

 
 In terms of just being possible, there is no problem. The problem is the 
 "side-effect" of bringing the stripped name (DoStuff) into local scope 
 as well, not just the FQ name. Altough it may be hard for this to 
 actually cause a bug, or code-inconsistency, it is conceptually an 
 important behaviour. For example, down the road, if we have a D IDE that 
 does code autocompletion (aka intellisense by MS), it becomes especially 
 relevant, and problematic!

So another way of saying it is that public symbols in other modules should
only become visible if I explicitly name them. Or ... I can only see them
if I say I want to see them.

I would have thought that an IDE would want to show me symbols that I
haven't named yet so I could possibly choose them.
 
[...]
 
 Are you saying that if somewhere in my code I have a full qualified name
 for an external item, that the D compiler should assume that I had meant to
 also code the import statement for that module? If so, should it assume a
 private or public import?
 
 For example:
 
      result = std.string.strip(theField);
 
 would cause DMD to assume that I had also coded ...
 
   private import std.string;
 
 

 
 Yes that would be it, *but* the import would be different, it would only 
 import FQ names (let's call it fqimport) like I described before. And 
 with such import there would be no difference in being public or 
 private. (do you see why?)

Got it.
 

[snip]
 
 What's a "project file"?


 

 By project file I meant source code file belonging to a project, 
 "project" being something that would produce a *single* output 
 executable or library (yes, likely not a clear term).

[snip]

 If you have the source code of several applications mingled in the same 
 dir, you should have them named with apropriate and unconflicting 
 modules, or possibly better, you should separate them in different 
 source dirs. (Frankly I think one should do this regardless of whether 
 the compiler searches for files or not, it is basic best-pratices to 
 have a good, clear code structure in terms of files too)

It's not always that practical. My company has a retail banking system that
has over 5,000 applications in it. Many have three files or fewer in them.
To have each application in its own folder would cause other problems to
arise. We have then organized along functional areas, which keeps them down
to about 100 folders; this has worked for us over the last twelve years.

-- 
Derek Parnell
Melbourne, Australia
1/09/2005 7:37:26 AM

Aug 31 2005

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

There was something ahead in your post that prompted me to change some 
of the terminology I used, and clear some more things. I realized that 
calling "symbols" to this that I have so far been calling "symbols", may 
not be very apropriate since it is a somewhat ambiguous term.
So I have come up with a better nomenclature for this subject: it is 
best to call these things entities, and to define them as: a code entity 
that can be defined and named (this is not a recursive definition btw).
That would be: variables, classes (and template instances), structs, 
typedefs, functions, enums, enum literals, modules. (technicaly labels 
too but nevermind them) Aliases are a special entity case. As for 
packages I right now don't consider them proper entities, they are 
instead part of the module base name.
A scoping entity is an entity that introduces a new scope, in which new 
entities can be defined. Most entities are scoping entities.
An entity can be refered by it's base name, if available in the current 
scope, or (recursively) trough it's parent entity with the "." operator, 
if the parent entity is available in the current scope (and if security 
and semantic restrictions allow it). Example:

   module packA.foobar;

   class Foo{
     static int var1 = 42;
     this() {
       // packA.foobar.Foo.var1 entity accessed by base name;
       int var2 = var1;
     }
   }

   void func() {
     // packA.foobar.Foo.var1 entity accessed by parent Foo;
     Foo.var1++;
   }

Thus an entity can be access/used/refered by several names.
The Fully Qualified Name of an entity is it's base name plus all parent 
entities names in its hierarchy. Example: packA.foobar.Foo.var1;

I originally called these symbols because I was thinking of the compiler 
symbol table, however, such symbols are more akin to names, than to 
entities themselves.

Derek Parnell wrote:
In terms of just being possible, there is no problem. The problem is the 
"side-effect" of bringing the stripped name (DoStuff) into local scope 
as well, not just the FQ name. Altough it may be hard for this to 
actually cause a bug, or code-inconsistency, it is conceptually an 
important behaviour. For example, down the road, if we have a D IDE that 
does code autocompletion (aka intellisense by MS), it becomes especially 
relevant, and problematic!

 
 
 So another way of saying it is that public symbols in other modules should
 only become visible if I explicitly name them. Or ... I can only see them
 if I say I want to see them.
 
 I would have thought that an IDE would want to show me symbols that I
 haven't named yet so I could possibly choose them.
  
 

Hum..I didn't understand this you just said, so no, I don't now if it is 
another way of saying it. I'm not sure if it even makes sense. What is 
"naming a symbol"? Or "seeing a symbol"? (this was what prompted me to 
change the terminology)
I should note that in this particular paragraph of mine you just replied 
I didn't say how things "should be", only what was the current situation 
with D, and the shortcomings of that.

As for the IDE code-completion, to be more clear let me give an example 
of what would happen with D currently:

   import pack1.foobar;

   void DoThings() { ... }

   int func(){
	pack1.foobar.DoStuff();
	...
	pack1.foobar.DoStuff2();
	...
	Do| <- press ctrl-space here for code-completion
   }

In this example you want to access entities from pack1.foobar by FQN, 
such as pack1.foobar.DoStuff() . However, the base name (DoStuff, 
DoStuff2, etc.) of the entities of pack1.foobar is also made availabe in 
the current module scope by the import. So when you want to use 
code-completion, with the cursor right after Do where the "|" symbol is, 
you will be presented the the local options (DoThings), plus DoStuff and 
DoStuff2, which you do not want to be presented.

If you have the source code of several applications mingled in the same 
dir, you should have them named with apropriate and unconflicting 
modules, or possibly better, you should separate them in different 
source dirs. (Frankly I think one should do this regardless of whether 
the compiler searches for files or not, it is basic best-pratices to 
have a good, clear code structure in terms of files too)

 
 
 It's not always that practical. My company has a retail banking system that
 has over 5,000 applications in it. Many have three files or fewer in them.
 To have each application in its own folder would cause other problems to
 arise. We have then organized along functional areas, which keeps them down
 to about 100 folders; this has worked for us over the last twelve years.
 

5000 programs ?! Whoa. I am tempted to ask why so many programs, and why 
would having each in its own source folder would cause other problems to 
arise. However I sense the answer may be too long/complicated to be of 
interest right now.
Still, two things at least I would like to know: in what language are 
those aplications coded? And would the other solution proposed ("naming 
them with apropriate and unconflicting modules") be fine, if the 
application was coded in D?

-- 
Bruno Medeiros
Computer Science/Engineering student

Sep 01 2005

Derek Parnell <derek psych.ward> writes:

On Thu, 01 Sep 2005 15:21:56 +0000, Bruno Medeiros wrote:

 There was something ahead in your post that prompted me to change some 
 of the terminology I used, and clear some more things. I realized that 
 calling "symbols" to this that I have so far been calling "symbols", may 
 not be very apropriate since it is a somewhat ambiguous term.
 So I have come up with a better nomenclature for this subject: it is 
 best to call these things entities, ...

Your definition of entity is the same as I was thinking of too. 

 
 So another way of saying it is that public symbols in other modules should
 only become visible if I explicitly name them. Or ... I can only see them
 if I say I want to see them.
 
 I would have thought that an IDE would want to show me symbols that I
 haven't named yet so I could possibly choose them.
  
 

 Hum..I didn't understand this you just said, so no, I don't now if it is 
 another way of saying it. I'm not sure if it even makes sense. 
 What is "naming a symbol"? 

I mean by that ... "referencing an entity in another module by using its
fully qualified name".

 Or "seeing a symbol"? 

I mean by "I can only see them if I say I want to see them." ... "the
compiler can find the entity in scope if the coder has explicitly told the
compiler the entity's fully qualified name"

(this was what prompted me to 
 change the terminology)
 I should note that in this particular paragraph of mine you just replied 
 I didn't say how things "should be", only what was the current situation 
 with D, and the shortcomings of that.
 
 As for the IDE code-completion, to be more clear let me give an example 
 of what would happen with D currently:
 
    import pack1.foobar;
 
    void DoThings() { ... }
 
    int func(){
 	pack1.foobar.DoStuff();
 	...
 	pack1.foobar.DoStuff2();
 	...
 	Do| <- press ctrl-space here for code-completion
    }
 
 In this example you want to access entities from pack1.foobar by FQN, 
 such as pack1.foobar.DoStuff() . However, the base name (DoStuff, 
 DoStuff2, etc.) of the entities of pack1.foobar is also made availabe in 
 the current module scope by the import. So when you want to use 
 code-completion, with the cursor right after Do where the "|" symbol is, 
 you will be presented the the local options (DoThings), plus DoStuff and 
 DoStuff2, which you do not want to be presented.

How do you know that I don't want them presented? 

If you have the source code of several applications mingled in the same 
dir, you should have them named with apropriate and unconflicting 
modules, or possibly better, you should separate them in different 
source dirs. (Frankly I think one should do this regardless of whether 
the compiler searches for files or not, it is basic best-pratices to 
have a good, clear code structure in terms of files too)

 
 
 It's not always that practical. My company has a retail banking system that
 has over 5,000 applications in it. Many have three files or fewer in them.
 To have each application in its own folder would cause other problems to
 arise. We have then organized along functional areas, which keeps them down
 to about 100 folders; this has worked for us over the last twelve years.
 

 5000 programs ?! Whoa. I am tempted to ask why so many programs, and why 
 would having each in its own source folder would cause other problems to 
 arise. However I sense the answer may be too long/complicated to be of 
 interest right now.

A retail banking system does a *lot* of things. 

 Still, two things at least I would like to know: in what language are 
 those aplications coded? 

Progress 4GL

 And would the other solution proposed ("naming 
 them with apropriate and unconflicting modules") be fine, if the 
 application was coded in D?

Not really. It would not make any difference.

-- 
Derek Parnell
Melbourne, Australia
2/09/2005 7:23:47 AM

Sep 01 2005

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

Derek Parnell wrote:
So another way of saying it is that public symbols in other modules should
only become visible if I explicitly name them. Or ... I can only see them
if I say I want to see them.

I would have thought that an IDE would want to show me symbols that I
haven't named yet so I could possibly choose them.
 

Hum..I didn't understand this you just said, so no, I don't now if it is 
another way of saying it. I'm not sure if it even makes sense. 
What is "naming a symbol"? 

 
 
 I mean by that ... "referencing an entity in another module by using its
 fully qualified name".
 
 
Or "seeing a symbol"? 

 
 
 I mean by "I can only see them if I say I want to see them." ... "the
 compiler can find the entity in scope if the coder has explicitly told the
 compiler the entity's fully qualified name"
 
 

Ok, now I understand. And now I can say that no, that isn't quite the
same. It is not just a matter of "finding the entity in scope", it is
also *how* it is available, since it can be available directly (base
name in scope) or hierarchly (through one or more parent entities). So
it would be: "the compiler can find the entity in scope (by FQN, not
directly) if the coder has explicitly told the compiler the entity's
fully qualified name"
This is now correctly (altough I would say it in another way) the basis
of the "automatic availability of entities by FQN" feature (was:
"automatic global import/availability of symbol names")

BTW, I'm in the process of rewriting and formalizing my proposal.

As for the IDE code-completion, to be more clear let me give an example 
of what would happen with D currently:

   import pack1.foobar;

   void DoThings() { ... }

   int func(){
	pack1.foobar.DoStuff();
	...
	pack1.foobar.DoStuff2();
	...
	Do| <- press ctrl-space here for code-completion
   }

In this example you want to access entities from pack1.foobar by FQN, 
such as pack1.foobar.DoStuff() . However, the base name (DoStuff, 
DoStuff2, etc.) of the entities of pack1.foobar is also made availabe in 
the current module scope by the import. So when you want to use 
code-completion, with the cursor right after Do where the "|" symbol is, 
you will be presented the the local options (DoThings), plus DoStuff and 
DoStuff2, which you do not want to be presented.

 
 
 How do you know that I don't want them presented? 
 
 

It was a premise of the example: "In this example you want to access
entities from pack1.foobar by FQN". Should have added an "only" there at
the end, but it was somewhat implicit.

Still, two things at least I would like to know: in what language are 
those aplications coded? 

 
 
 Progress 4GL
 
 
And would the other solution proposed ("naming 
them with apropriate and unconflicting modules") be fine, if the 
application was coded in D?

 
 
 Not really. It would not make any difference.
 

Well, I still have my doubts that a D modern application/system (that
is, one that produces several programs or libraries) could not be
structured in some (good) way, that the automatic processing of all D
source files in a dir would not be a problem.
But if I would be wrong it doesn't matter. It would be trivial, if the
need be, to have the compiler have an option to process a specified list
of files (instead of all in a dir) as an alternate search method to
implement the "automatic availability of entities by FQN" feature.

-- 
Bruno Medeiros
Computer Science/Engineering student

Sep 03 2005

"Ben Hinkle" <bhinkle mathworks.com> writes:

"Bruno Medeiros" <daiphoenixNO SPAMlycos.com> wrote in message 
news:delf3c$2p1d$1 digitaldaemon.com...
 Concerning in D the whole aspect of imports, packages, modules and symbol 
 name scopes, etc.(what to call this whole thing?) there are some things I 
 find ill-conceived, so much that I think this aspect of D is in need of 
 some rethinking.

[snip]
 But then I realized, why do we then even need the imports at all?

One benefit is that it becomes much easier to figure out the dependencies 
between modules. One only has to look at import statements to decide if 
module foo depends on module bar. Without import statements one has to not 
only find all "bar." strings one also has to make sure bar isn't a local 
variable that's a structure or object where "bar." doesn't refer to the 
module bar at all.

 Second: the module statement. The module statement specifies the name of 
 the module, and the hierarchy of parent packages. The module name however 
 must match the file name, and the packages must match the underlying 
 directory structure, otherwise the source file will be unusable 
 (un-importable actually).

[snip]

There are two points to consider:
1) like Java D source codes doesn't have to be stored in a file system with 
directories etc. See 

2) by storing the package and module name in the file(s) the entire 
hierarchy can be recreated from the source code. That is, the contents of 
the source code defines the program completely, not the source 
code+directory structure.

Aug 30 2005

Bruno Medeiros <daiphoenixNO SPAMlycos.com> writes:

Ben Hinkle wrote:
 "Bruno Medeiros" <daiphoenixNO SPAMlycos.com> wrote in message 
 news:delf3c$2p1d$1 digitaldaemon.com...
 
Concerning in D the whole aspect of imports, packages, modules and symbol 
name scopes, etc.(what to call this whole thing?) there are some things I 
find ill-conceived, so much that I think this aspect of D is in need of 
some rethinking.

 
 [snip]
 
But then I realized, why do we then even need the imports at all?

 
 
 One benefit is that it becomes much easier to figure out the dependencies 
 between modules. One only has to look at import statements to decide if 
 module foo depends on module bar. Without import statements one has to not 
 only find all "bar." strings one also has to make sure bar isn't a local 
 variable that's a structure or object where "bar." doesn't refer to the 
 module bar at all.
 
 

Yes, there is that difference (if you're not using the /using/ 
statement), but what you ask can be done easily by a tool. In fact it 
*should* be done by a tool: because you can have unused imports, or you 
can be using a module that is not imported directly, but instead by 
other imported modules, if one is not careful with private/public 
imports and such.

Second: the module statement. The module statement specifies the name of 
the module, and the hierarchy of parent packages. The module name however 
must match the file name, and the packages must match the underlying 
directory structure, otherwise the source file will be unusable 
(un-importable actually).

 
 [snip]
 
 There are two points to consider:
 1) like Java D source codes doesn't have to be stored in a file system with 
 directories etc. See 

 2) by storing the package and module name in the file(s) the entire 
 hierarchy can be recreated from the source code. That is, the contents of 
 the source code defines the program completely, not the source 
 code+directory structure. 
 
 

Hum, yes, upon further thinking I too agree: the source code should 
define the program completely. Thus option 2 should be de-favored, and 
we are left with option 1(no change) or option 3(dropping the 
restriction that dir+filename should match package+module). I'm now more 
inclined towards the third option.

-- 
Bruno Medeiros
Computer Science/Engineering student

Aug 31 2005

D Programming

C/C++ Programming

Other

digitalmars.D - Imports, packages, modules and symbol name scopes