Perl tweaks, also PDL version of benchmark #1

mohawk2 · 2021-09-28T20:50:43Z

Apart from whitespace changes (to ignore them, view https://github.com/Fourmilab/floating_point_benchmarks/pull/1/files?diff=unified&w=1), this uses Time::HiRes to measure the run time itself, rather than just telling the user when to stop a timer, and splits the paraxial and non-paraxial surface code into separate functions.
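For readers unfamiliar with Time::HiRes, here is a minimal sketch of the timing approach described above. It is not the code from this PR: `run_workload` and `time_it` are hypothetical names, and the square-root loop is only a stand-in for the ray-tracing work.

```perl
use strict;
use warnings;
use Time::HiRes ();

# Hypothetical stand-in for the benchmark's ray-tracing loop.
sub run_workload {
    my ($iterations) = @_;
    my $sum = 0;
    $sum += sqrt($_) for 1 .. $iterations;
    return $sum;
}

# Run a code reference and return its wall-clock duration in seconds,
# measured with sub-second resolution by Time::HiRes::time().
sub time_it {
    my ($code) = @_;
    my $start = Time::HiRes::time();
    $code->();
    return Time::HiRes::time() - $start;
}

my $iterations = 100_000;
my $elapsed = time_it(sub { run_workload($iterations) });
printf "Time taken: %.6f\n", $elapsed;
```

Calling `Time::HiRes::time()` by its full name avoids any doubt about whether the built-in, whole-second `time` is the one being used.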

mohawk2 · 2021-09-28T21:05:23Z

By the way, the comparative benchmarks in https://www.fourmilab.ch/scanalyzer/archives/2021/09/floating-point-benchmark-raku-perl-6-language-added.html show Perl results for 5.8. Having just tried both that and 5.32 with the high-res timer, 5.32 seems to be up to 10% faster on average than 5.8. Worth a re-run?

mohawk2 · 2021-09-29T01:47:30Z

Notes for posterity: the telescope article is from Amateur Telescope Making Vol 3, available in DjVu format from https://b-ok.cc/book/449242/42c54c?id=449242; the J.H. Wyld chapter is on p581 (p588 of the PDF).

mohawk2 · 2021-09-30T17:55:15Z

I have now added a second script to the Perl directory, with a PDL "PP" function. It's about twice as slow as pure Perl, but such a small input size is not where PDL shines. PDL would pay off when tracing rays through many more than 4 surfaces, or with a large number of different ray heights. That could then also benefit from automatically using multiple cores, aka "pthreading" (as documented at https://metacpan.org/dist/PDL/view/Basic/Pod/ParallelCPU.pod).

mohawk2 · 2021-10-02T13:40:52Z

With the latest commit, the for-loop is eliminated, and a "dummy dimension" is added to the 3rd and 4th parameters, sized so that the calculations are done the given number of times. The resulting speed seems comparable to C, which isn't surprising as that is where most of the effort happens. This version also speeds up with the parallel-processing feature:

$ perl src/perl/fbench-pdl.pl 10000000
Name "PDL::BIGPDL" used only once: possible typo at src/perl/fbench-pdl.pl line 227.
Ready to begin John Walker's floating point accuracy
and performance benchmark.  10000000 iterations will be made.

Measured run time in seconds should be divided by 10000
to normalise for reporting results.  For archival results,
adjust iteration count so the benchmark runs about five minutes.

Time taken: 26.041648
Divided by 10000 = 0.0026041648

No errors in results.
$ PDL_AUTOPTHREAD_TARG=4 PDL_AUTOPTHREAD_SIZE=0 perl src/perl/fbench-pdl.pl 10000000
Name "PDL::BIGPDL" used only once: possible typo at src/perl/fbench-pdl.pl line 227.
Ready to begin John Walker's floating point accuracy
and performance benchmark.  10000000 iterations will be made.
[snip]
Time taken: 14.400384
Divided by 10000 = 0.0014400384

No errors in results.
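To put the two runs above side by side, here is a small sketch of the arithmetic. The helper names are hypothetical, and it assumes that the normalising divisor is iterations/1000 (as the banner's 10000 for 10000000 iterations suggests) and that `PDL_AUTOPTHREAD_TARG=4` really resulted in 4 threads doing the work.

```perl
use strict;
use warnings;

# Seconds per 1000 iterations: run time divided by (iterations / 1000).
# Assumption: this is the normalisation the benchmark banner describes.
sub normalised_time {
    my ($seconds, $iterations) = @_;
    return $seconds / ($iterations / 1000);
}

# How many times faster the parallel run was than the single-threaded run.
sub speedup {
    my ($serial, $parallel) = @_;
    return $serial / $parallel;
}

# Fraction of ideal linear scaling achieved with the given thread count.
sub efficiency {
    my ($serial, $parallel, $threads) = @_;
    return speedup($serial, $parallel) / $threads;
}

# Timings copied from the two runs above.
my ($serial, $parallel, $threads) = (26.041648, 14.400384, 4);

printf "Normalised (1 thread): %.10f\n", normalised_time($serial, 10_000_000);
printf "Speedup: %.2fx\n", speedup($serial, $parallel);
printf "Efficiency: %.0f%%\n", 100 * efficiency($serial, $parallel, $threads);
```

On these figures the threaded run is roughly 1.8 times faster, well short of the 4x that perfect scaling would give.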


mohawk2 added 8 commits September 28, 2021 21:43

use Time::Hires to actually calculate timings

8863fa7

zero-based indexing as not Fortran

b161abe

zap unused var

ab8dbe6

global indentation start column zero not 4

802ba9c

replace for-loop with array assignment and map

b1077a8

perl arrays know own size

d0ae637

use += etc

9f3c624

declare quasi-constant on setting

b4d5ea1

mohawk2 force-pushed the perl-tweaks branch from a8a122d to ccb2f6c Compare September 29, 2021 00:18

mohawk2 added 14 commits September 29, 2021 03:01

declare "my" vars in most local scope

34262f6

paraxial param not global

70974f2

split transit_surface into paraxial and non-paraxial versions

19fc6f0

use named var not array member

7ec860b

last surface has last elt of 0 so no need special-case adding

b7b3f56

more idiomatic assignment of array-ref

85f09e2

name all elts of the "surface" tuples

f98310f

move quasi-global declarations closer to where used

628791a

more idiomatic iterate over $paraxial values

0147096

switch * cot to / tan since cot not in PDL

f810a48

fix off-by-one in error reporting

5b0ca86

zap whitespace end of lines

ac083df

perl/fbench-pdl.pl start as straight copy of perl/fbench.pl

bf59ebe

naive conversion to use PDL funcs

4ae296f

mohawk2 force-pushed the perl-tweaks branch from 94cd843 to ac083df Compare September 30, 2021 15:19

rewrite as PP function

c998070

use dummy dim to do all calculation in one go, not in for-loop

6cf2901

mohawk2 force-pushed the perl-tweaks branch from 54e708f to 6cf2901 Compare October 2, 2021 13:37

mohawk2 changed the title ~~Perl tweaks~~ Perl tweaks, also PDL version of benchmark Oct 2, 2021

zap misleading comment about inputs (instead controlled by $paraxial(…

12bbd4a

…) flag)
