digitalmars.D - Support for feeding data to engine threads?

Jerry Quinn (2/2) Sep 12 2011 I'm looking at porting an app that maintains a work queue to be processe...

Timon Gehr (3/5) Sep 12 2011 I don't know if I get you right, but std.parallelism uses a task pool.

Jerry Quinn (6/13) Sep 12 2011 OK I guess what I'm looking for is WorkerLocalStorage. I can create an ...

Timon Gehr (6/19) Sep 12 2011 Calling workerLocalStorage once suffices.

Jerry Quinn (2/32) Sep 12 2011 datafile is just a set of parameters for configuring the engine, i.e loc...

dsimcha (23/36) Sep 12 2011 one of N engines and written out in order. At first glance, std.paralle...

Jerry Quinn (8/50) Sep 12 2011 An engine is simply a complex class that takes a lot of memory and setup...

dsimcha (5/6) Sep 12 2011 Sorry for the misunderstanding. When workerLocalStorage() is called,
dsimcha (19/21) Sep 12 2011 This can easily be done by creating null engines in the

dsimcha (10/12) Sep 12 2011 of N engines and written out in order. At first glance, std.parallelism...

Jerry Quinn (2/15) Sep 12 2011 The question I was asking was how to execute a huge amount of per-thread...

dsimcha (6/21) Sep 12 2011 initialization once per thread in the TaskPool framework.

Jerry Quinn (9/31) Sep 12 2011 I'd probably naturally want to do something like:

dsimcha (10/22) Sep 12 2011 each thread as thread-local data. Each Engine object has a process() ca...

David Nadlinger (4/5) Sep 13 2011 A concurrent queue (SharedQueue?) would be a nice addition to Phobos

dsimcha (3/8) Sep 13 2011 A lock-free queue is one of the examples in TDPL if I recall correctly. ...

Jerry Quinn <jlquinn optonline.net> writes:

I'm looking at porting an app that maintains a work queue to be processed by
one of N engines and written out in order.  At first glance, std.parallelism
already provides the queue, but the Task concept appears to assume that there's
no startup cost per thread.

Am I missing something or do I need to roll a shared queue object?

Sep 12 2011

Timon Gehr <timon.gehr gmx.ch> writes:

On 09/12/2011 07:23 PM, Jerry Quinn wrote:
 I'm looking at porting an app that maintains a work queue to be processed by
one of N engines and written out in order.  At first glance, std.parallelism
already provides the queue, but the Task concept appears to assume that there's
no startup cost per thread.

 Am I missing something or do I need to roll a shared queue object?

I don't know if I get you right, but std.parallelism uses a task pool. 
Usually no threads are started or stopped during processing.
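
A minimal sketch of what the pool does, assuming only the documented
std.parallelism API: each loop iteration records which thread ran it, and
the distinct threads are counted afterwards. The helper name distinctThreads
is made up for illustration.

```d
import core.thread : Thread;
import std.parallelism : parallel, taskPool;

// Hypothetical helper: returns how many distinct threads executed
// nItems iterations of a parallel foreach on the default task pool.
size_t distinctThreads(size_t nItems)
{
    // One slot per iteration, so no two threads write the same element.
    auto ranBy = new Thread[nItems];
    foreach (ref slot; parallel(ranBy))
        slot = Thread.getThis();

    // Count distinct Thread objects, on the calling thread only.
    bool[void*] seen;
    foreach (t; ranBy)
        seen[cast(void*) t] = true;
    return seen.length;
}

void main()
{
    // The pool's workers plus the calling thread, which also does work.
    assert(distinctThreads(10_000) <= taskPool.size + 1);
}
```

However many iterations are submitted, the count should never exceed
taskPool.size + 1, since the same worker threads are reused.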

Sep 12 2011

Jerry Quinn <jlquinn optonline.net> writes:

Timon Gehr Wrote:

 On 09/12/2011 07:23 PM, Jerry Quinn wrote:
 I'm looking at porting an app that maintains a work queue to be processed by
one of N engines and written out in order.  At first glance, std.parallelism
already provides the queue, but the Task concept appears to assume that there's
no startup cost per thread.

 Am I missing something or do I need to roll a shared queue object?

 
 I don't know if I get you right, but std.parallelism uses a task pool. 
 Usually no threads are started or stopped during processing.

OK, I guess what I'm looking for is WorkerLocalStorage.  I can create an engine
per thread.  However, I probably need to have each thread do the initialization
work.  If I create the engine on the main thread, it won't be properly accessed
by the worker thread, right?  I.e. I need each thread to run engine.init()
which will do a whole pile of loading and setup first before I can start
feeding data to the pool.

The impression I get is that

for (int i=0; i < nthreads; i++)
  taskPool.workerLocalStorage(new engine(datafile))

will not get me what I want.

Sep 12 2011

Timon Gehr <timon.gehr gmx.ch> writes:

On 09/12/2011 08:01 PM, Jerry Quinn wrote:
 Timon Gehr Wrote:

 On 09/12/2011 07:23 PM, Jerry Quinn wrote:
 I'm looking at porting an app that maintains a work queue to be processed by
one of N engines and written out in order.  At first glance, std.parallelism
already provides the queue, but the Task concept appears to assume that there's
no startup cost per thread.

 Am I missing something or do I need to roll a shared queue object?

 I don't know if I get you right, but std.parallelism uses a task pool.
 Usually no threads are started or stopped during processing.

 OK I guess what I'm looking for is WorkerLocalStorage.  I can create an engine
per thread.  However, I probably need to have each thread do the initialization
work.  If I create the engine on the main thread, it won't be properly accessed
by the worker thread, right?  I.e. I need each thread to run engine.init()
which will do a whole pile of loading and setup first before I can start
feeding data to the pool.

 The impression I get is that

 for (int i=0; i<  nthreads; i++)
    taskPool.workerLocalStorage(new engine(datafile))

 will not get me what I want.

Calling workerLocalStorage once suffices.

auto engine=taskPool.workerLocalStorage(new Engine(datafile));

This will create one engine per worker thread, each with the same datafile.

You can access the engine from each thread with engine.get. What is the 
exact role of datafile? Does it have to be distinct for each engine?
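
A minimal sketch of that approach, under some assumptions: the Engine class
below is a hypothetical stand-in (its constructor, process() method and the
constructed counter are invented for the demo), and "engine.cfg" is a
made-up file name. As far as I understand it, the argument to
workerLocalStorage is lazy and is evaluated once per slot on the calling
thread, so the constructors run there and not on the workers.

```d
import std.parallelism : parallel, taskPool;

// Hypothetical stand-in for the expensive engine discussed in this thread.
class Engine
{
    __gshared int constructed; // counts constructor calls (demo only)
    private string datafile;

    this(string datafile)
    {
        this.datafile = datafile; // real code would do its heavy setup here
        ++constructed;
    }

    int process(int x) { return x * 2; }
}

int[] processAll(int[] inputs, string datafile)
{
    // The lazy argument is evaluated once per slot, on this thread.
    auto engines = taskPool.workerLocalStorage(new Engine(datafile));

    auto results = new int[inputs.length];
    foreach (i, x; parallel(inputs))
        results[i] = engines.get.process(x); // the current thread's Engine
    return results;
}

void main()
{
    auto results = processAll([1, 2, 3, 4], "engine.cfg");
    assert(results == [2, 4, 6, 8]); // written by index, so order is kept
    // One Engine per worker thread plus one for the calling thread.
    assert(Engine.constructed == taskPool.size + 1);
}
```

If the setup really has to run on the worker thread itself, this sketch does
not cover that case.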

Sep 12 2011

Jerry Quinn <jlquinn optonline.net> writes:

Timon Gehr Wrote:

 On 09/12/2011 08:01 PM, Jerry Quinn wrote:
 Timon Gehr Wrote:

 On 09/12/2011 07:23 PM, Jerry Quinn wrote:
 I'm looking at porting an app that maintains a work queue to be processed by
one of N engines and written out in order.  At first glance, std.parallelism
already provides the queue, but the Task concept appears to assume that there's
no startup cost per thread.

 Am I missing something or do I need to roll a shared queue object?

 I don't know if I get you right, but std.parallelism uses a task pool.
 Usually no threads are started or stopped during processing.

 OK I guess what I'm looking for is WorkerLocalStorage.  I can create an engine
per thread.  However, I probably need to have each thread do the initialization
work.  If I create the engine on the main thread, it won't be properly accessed
by the worker thread, right?  I.e. I need each thread to run engine.init()
which will do a whole pile of loading and setup first before I can start
feeding data to the pool.

 The impression I get is that

 for (int i=0; i<  nthreads; i++)
    taskPool.workerLocalStorage(new engine(datafile))

 will not get me what I want.

 
 Calling workerLocalStorage once suffices.
 
 auto engine=taskPool.workerLocalStorage(new Engine(datafile));
 
 This will create one engine per working thread and the same datafile.
 
 You can access the engine from each thread with engine.get. What is the 
 exact role of datafile? Does it have to be distinct for each engine?

datafile is just a set of parameters for configuring the engine, i.e. location
of data, parameter values, etc.  In this setting it would be the same for each
engine.

Sep 12 2011