digitalmars.D - SIMD Progress?

Trevor Parscal <trevorparscal hotmail.com> writes:

Well, I've been reading everything I can about everything related to the issue
of SIMD using D's inline ASM.

For whatever reason, I've been able to get SIMD operations working on just
about every type of data - EXCEPT the ones I would find most useful... :(

Perhaps someone could look at this program (attached) and get the other 3
working...

Current Status

ASM Parallel Operations
- Dynamic Array (OK)
- Static Array (Not Working)
- Struct->Array on Heap (OK)
- Struct->Fields on Heap (OK)
- Struct->Array on Stack (Not Working)
- Struct->Fields on Stack (Not Working)
D Serial Operations
- Dynamic Array (OK)
- Static Array (OK)
- Struct->Array on Heap (OK)
- Struct->Fields on Heap (OK)
- Struct->Array on Stack (OK)
- Struct->Fields on Stack (OK)

It all has to do with alignment: the aligned SSE instructions need their memory
operands on 16-byte boundaries, and D doesn't align data on the stack that way.
I don't know of a work-around, but perhaps there is one out there yet to
surface.
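
One possible work-around, sketched here rather than taken from the attached program, is to over-allocate the stack buffer and round the pointer up to the next 16-byte boundary by hand. The helper names `alignUp16` and `isAligned16` are hypothetical, and the sketch assumes a current D compiler with Phobos:

```d
import std.stdio : writeln;

/// Rounds a pointer up to the next 16-byte boundary (hypothetical helper).
float* alignUp16(void* p)
{
    return cast(float*)((cast(size_t) p + 15) & ~cast(size_t) 15);
}

/// True when p sits on a 16-byte boundary.
bool isAligned16(void* p)
{
    return (cast(size_t) p & 15) == 0;
}

void main()
{
    // Over-allocate by 15 bytes so an aligned 16-byte window always fits
    // somewhere inside the buffer, wherever the stack happens to place it.
    ubyte[4 * float.sizeof + 15] raw;
    float* v = alignUp16(raw.ptr);

    // v now addresses 4 floats on the stack that aligned SSE loads can use.
    v[0] = 1.0f;
    v[1] = 2.0f;
    v[2] = 3.0f;
    v[3] = 4.0f;

    writeln("aligned: ", isAligned16(v));
}
```

The cost is 15 wasted bytes per buffer. Another option is to use the unaligned load/store instruction (movups) in the ASM, which accepts any address at some cost in speed.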

- Trevor

Mar 06 2008

bearophile <bearophileHUGS lycos.com> writes:

Trevor Parscal Wrote:
> For whatever reason, I've been able to get simd operations working on just
> about every type of data - EXCEPT the ones I would find most useful... :(

I have to read your code still, but in the meantime, how many times does it run
compared to normal code? (Sometimes running speed comes first).

Bye,
bearophile

Mar 06 2008

Trevor Parscal <trevorparscal hotmail.com> writes:

bearophile Wrote:

> Trevor Parscal Wrote:
>> For whatever reason, I've been able to get simd operations working on just
>> about every type of data - EXCEPT the ones I would find most useful... :(
>
> I have to read your code still, but in the meantime, how many times does it
> run compared to normal code? (Sometimes running speed comes first).
>
> Bye,
> bearophile

It runs 1 time for each method...

I was hoping to get it to work before I did any benchmarking. However, in one
benchmark I did, the ASM version with Dynamic Arrays performed vector addition
on 4 floats, repeated a very large number of times, 300% faster. That's
probably not the final word on performance, as I am sure the benchmark was
highly flawed, but it seemed evident there was a performance gain to be had.
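
For anyone wanting to repeat the measurement, a timing harness could look roughly like the sketch below. It is not the benchmark described above: `addSerial` and `timeIt` are hypothetical names, only the serial D baseline is shown, and the ASM version would be passed to `timeIt` the same way. It assumes a current Phobos with `std.datetime.stopwatch`.

```d
import std.datetime.stopwatch : AutoStart, StopWatch;
import std.stdio : writefln;

/// Serial baseline: element-by-element addition of two 4-float vectors.
void addSerial(float[] a, float[] b, float[] result)
{
    foreach (i; 0 .. 4)
        result[i] = a[i] + b[i];
}

/// Calls `op` `iterations` times and returns the elapsed microseconds.
long timeIt(void delegate() op, size_t iterations)
{
    auto sw = StopWatch(AutoStart.yes);
    foreach (_; 0 .. iterations)
        op();
    return sw.peek.total!"usecs";
}

void main()
{
    float[] a = [1.0f, 2.0f, 3.0f, 4.0f];
    float[] b = [10.0f, 20.0f, 30.0f, 40.0f];
    auto r = new float[4];

    long serial = timeIt(() { addSerial(a, b, r); }, 10_000_000);

    // Printing the result keeps the work observable, so the loop is less
    // likely to be optimized away entirely.
    writefln("serial: %s usecs, result %s", serial, r);
}
```

Running each method through the same harness, with the same iteration count and compiler flags, should at least make the comparison like-for-like, though a loop this small is still dominated by call overhead.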

- Trevor

Mar 06 2008
