Scalar + &array and &array + scalar performance improvements by bluss · Pull Request #890 · rust-ndarray/ndarray

bluss · 2021-01-10T16:04:36Z

This incorporates the benchmark and performance improvements from #782 by @jturner314, without
the applicable types changes (since they run into some rustc bugs at the moment). Jim Turner is
the author of this, I've only edited the commits to the changes we can integrate now.

@jturner314 writes:

The new implementation avoids cloning the elements twice, and it
avoids iterating over the elements twice. (The old implementation
called .to_owned() followed by the arithmetic operation, while the
new implementation clones the elements and performs the arithmetic
operation in the same iteration.)

On my machine, this change improves the performance for both
contiguous and discontiguous arrays. (scalar_add_1/2 go from ~530
ns/iter to ~380 ns/iter, and scalar_add_strided_1/2 go from ~1540
ns/iter to ~1420 ns/iter.)

* The new implementation avoids cloning the elements twice, and it avoids iterating over the elements twice. (The old implementation called `.to_owned()` followed by the arithmetic operation, while the new implementation clones the elements and performs the arithmetic operation in the same iteration.) On my machine, this change improves the performance for both contiguous and discontiguous arrays. (`scalar_add_1/2` go from ~530 ns/iter to ~380 ns/iter, and `scalar_add_strided_1/2` go from ~1540 ns/iter to ~1420 ns/iter.) (Other changes to impl applicability removed from this commit.) Co-authored-by: bluss <bluss@users.noreply.github.com>

jturner314 and others added 2 commits December 29, 2020 19:59

Add benches for op with scalar and strided array

52ca234

bluss mentioned this pull request Jan 10, 2021

Generalize arithmetic ops to more combinations of scalars and arrays #782

Open

bluss changed the title ~~Scalar + &Array and &Array + scalar performance improvements~~ Scalar + &array and &array + scalar performance improvements Jan 10, 2021

bluss merged commit 6483ef5 into master Jan 10, 2021

bluss deleted the jt-scalar-ops-improvement branch January 10, 2021 16:21

bluss added the performance label Jan 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scalar + &array and &array + scalar performance improvements#890

Scalar + &array and &array + scalar performance improvements#890
bluss merged 2 commits intomasterfrom
jt-scalar-ops-improvement

bluss commented Jan 10, 2021 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bluss commented Jan 10, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bluss commented Jan 10, 2021 •

edited

Loading