How can I speed up my Perl program?
This is really two questions, but they are so similar, and to keep it simple, I figured I'd just roll them together:

  • Firstly: Given an established Perl project, what are some decent ways to speed it up beyond just plain in-code optimization?

  • Secondly: When writing a program from scratch in Perl, what are some good ways to greatly improve performance?

For the first question, imagine you are handed a decently written project and you need to improve performance, but you can't seem to get much of a gain through refactoring/optimization. What would you do to speed it up in this case short of rewriting it in something like C?

Please stay away from general optimization techniques unless they are Perl specific.

I asked this about Python earlier, and I figured it might be good to do it for other languages (I'm especially curious whether there are corollaries to Psyco and Pyrex for Perl).

Ayurveda answered 7/10, 2008 at 3:23 Comment(0)
156

Please remember the rules of Optimization Club:

  1. The first rule of Optimization Club is, you do not Optimize.
  2. The second rule of Optimization Club is, you do not Optimize without measuring.
  3. If your app is running faster than the underlying transport protocol, the optimization is over.
  4. One factor at a time.
  5. No marketroids, no marketroid schedules.
  6. Testing will go on as long as it has to.
  7. If this is your first night at Optimization Club, you have to write a test case.

So, assuming you actually have working code, run your program under Devel::NYTProf.
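If you haven't used it before, the basic invocation looks something like this (your_program.pl is just a stand-in for your script):

perl -d:NYTProf your_program.pl    # run under the profiler; writes nytprof.out
nytprofhtml                        # turn nytprof.out into a browsable HTML report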

Find the bottlenecks. Then come back here to tell us what they are.

If you don't have working code, get it working first. The single biggest optimization you will ever make is going from non-working to working.

Dogear answered 7/10, 2008 at 3:29 Comment(3)
Suggested edit: After "So, assuming..." add "If you don't have working code, get it working first. The single biggest optimization you will ever make is going from non-working to working."Rath
Rhetorical question: what is Perl specific for this optimization technique?Ingeringersoll
Like the "If this is your first night at Optimization Club, you have to write a test case." fight club modified quote.Repand
35

Andy has already mentioned Devel::NYTProf. It's awesome. Really, really awesome. Use it.

If for some reason you can't use Devel::NYTProf, you can fall back to good old Devel::DProf, which has come standard with Perl for a long time now.

If you have true functions (in the mathematical sense) which take a long time to calculate (e.g., Fibonacci numbers), then you may find Memoize provides some speed improvement.
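A minimal Memoize sketch, using the classic naive Fibonacci (unmemoized, it recomputes the same values exponentially many times):

use Memoize;

sub fib {
    my $n = shift;
    return $n if $n < 2;
    return fib($n - 1) + fib($n - 2);
}

memoize('fib');         # replace fib() with a caching wrapper
print fib(40), "\n";    # near-instant; the uncached version takes ages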

A lot of poor performance comes from inappropriate data structures and algorithms. A good course in computer science can help immensely here. If you have two ways of doing things, and would like to compare their performance, the Benchmark module can also prove useful.
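For instance, a quick (made-up) comparison of two ways of building one big string, using Benchmark's cmpthese():

use Benchmark qw(cmpthese);

my @words = ('perl') x 10_000;

# A negative count means "run each sub for at least 2 CPU seconds".
cmpthese(-2, {
    concat => sub { my $s = ''; $s .= $_ for @words; },
    join   => sub { my $s = join '', @words; },
});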

The following Perl Tips may also prove useful here:

Disclaimer: I wrote some of the resources above, so I may be biased towards them.

Fizzle answered 7/10, 2008 at 4:43 Comment(0)
32

There are many things that you might improve on, so you first have to figure out what's slow. Others have already answered that question. I talk about this a bit in Mastering Perl too.

An incomplete list of things to think about as you are writing new code:

  • Profile with something like Devel::NYTProf to see where you are spending most of your time in the code. Sometimes that's surprising and easy to fix. Mastering Perl has a lot of advice about that.

  • Perl has to compile the source every time and compilation can be slow. It has to find all the files and so on. See, for instance, "A Timely Start", by Jean-Louis Leroy, where he speeds everything up just by optimizing module locations in @INC. If your start-up costs are expensive and unavoidable, you might also look at persistent perls, like pperl, mod_perl, and so on.

  • Look at some of the modules you use. Do they have long chains of dependencies just to do simple things? Sure, we don't like re-invention, but if the wheel you want to put on your car also comes with three boats, five goats, and a cheeseburger, maybe you want to build your own wheel (or find a different one).

  • Method calls can be expensive. In the Perl::Critic test suite, for instance, the calls to isa() slow things down. It's not something that you can really avoid in all cases, but it is something to keep in mind. Someone had a great quote that went something like "No one minds giving up a factor of 2; it's when you have ten people doing it that it's bad." :) Perl v5.22 has some performance improvements for this.

  • If you're calling the same expensive methods over and over again but getting the same answers, something like Memoize might be for you. It's a proxy for the method call. If it's really a function (meaning, same input gives same output with no side effects), you don't really need to call it repeatedly.

  • Modules such as Apache::DBI can reuse database handles for you to avoid the expensive opening of database connections. It's really simple code, so looking inside can show you how to do that even if you aren't using Apache.

  • Perl doesn't do tail recursion optimization for you, so don't come over from Lisp thinking you're going to write super-fast recursive algorithms. You can turn those into iterative solutions easily (and we talk about that in Intermediate Perl); there's a small sketch after this list.

  • Look at your regexes. Lots of open-ended quantifiers (e.g. .*) can lead to lots of backtracking. Check out Jeffrey Friedl's Mastering Regular Expressions for all the gory details (and across several languages). Also check out his regex website.

  • Know how your perl is compiled. Do you really need threading and -DDEBUGGING? Those slow you down a bit. Check out the perlbench utility to compare different perl binaries.

  • Benchmark your application against different Perls. Some newer versions have speedups, but also some older versions can be faster for limited sets of operations. I don't have particular advice since I don't know what you are doing.

  • Spread out the work. Can you do some asynchronous work in other processes or on remote computers? Let your program work on other things while someone else figures out some subproblems. Perl has several asynchronous and load-shifting modules. Beware, though, that the scaffolding needed to do that well might cost more than the benefit.
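On the tail-recursion point, here's a minimal sketch (a toy sum, not from any particular codebase) of turning a recursive solution into an iterative one:

# Perl keeps a stack frame for every call, so deep recursion is
# slow and can exhaust memory. A tail-recursive sum:
sub sum_rec {
    my ($n, $acc) = @_;
    $acc //= 0;
    return $acc if $n == 0;
    return sum_rec($n - 1, $acc + $n);   # tail call, but perl won't optimize it
}

# The equivalent iterative version has no call overhead at all:
sub sum_iter {
    my $n   = shift;
    my $acc = 0;
    $acc += $_ for 1 .. $n;
    return $acc;
}

print sum_rec(10_000), " ", sum_iter(10_000), "\n";   # both print 50005000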

Sic answered 7/10, 2008 at 8:44 Comment(0)
14

Without rewriting large chunks, you can use Inline::C to convert any single slow subroutine to C, or use XS directly. It's also possible to convert subs to XS incrementally; PPI/PPI::XS does this, for example.
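A minimal Inline::C sketch (the function is made up) - the C is compiled on first run, then cached:

use Inline C => <<'END_C';
/* A tight numeric loop is a classic candidate for moving to C. */
double sum_to(int n) {
    double total = 0.0;
    int i;
    for (i = 1; i <= n; i++)
        total += i;
    return total;
}
END_C

print sum_to(1_000_000), "\n";   # calls straight into the compiled C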

But moving to another language is always a last resort. Maybe you should get an expert Perl programmer to look at your code? More likely than not, (s)he'd spot some peculiarity that's seriously hurting your performance. Other than that, profile your code. Remember, there is no silver bullet.

With regard to Psyco and Pyrex: no, there's no equivalent for Perl.

Roundly answered 7/10, 2008 at 7:59 Comment(0)
9

This only half relates to your question - but in the interest of documentation I'll post it here.

A recent CentOS/Perl bugfix increased the speed of our application more than two-fold. This is a must for anyone running Perl on CentOS and using the bless/overload functions.

Raucous answered 7/10, 2008 at 3:30 Comment(1)
Oh, good point. For more on the bug and a test program to see if it affects your code, see perlbuzz.com/2008/08/…Dogear
9

Profile your application using, for example, the profiler mentioned above. You will then see where the time is going.

If the time is being spent on things other than CPU, you need to reduce those first - CPU is easy to scale, other things aren't.

A few operations are particularly slow, I have found:

  • keys() on a large hash is very bad: in list context it builds the entire list of keys at once, whereas each() iterates one pair at a time
  • Use of Data::Dumper for debugging. Calling Dumper() on a large structure is very slow; avoid it if you can. We've seen code which does the following (a guarded version appears after this list):

    use Data::Dumper;
    # Dumper() runs on every pass, even though the result is
    # almost never logged:
    $debugstr = Dumper(\%bighash);
    if ($debugflag_mostlyoff) { log($debugstr); }

  • Most modules have alternatives with different performance characteristics - some perform incredibly badly.

  • Some regular expressions can be very slow (lots of .*, etc.) and can be replaced by equivalent ones that are faster. Regular expressions are quite easy to unit test and performance test (just write a program which runs them in a loop against a big simulated data set). The best regular expressions start with something that can be tested very quickly, such as a literal string. Sometimes it's better not to look for the thing you're after directly, but to find a cheap anchor first and then check whether what surrounds it really is the thing you want. Optimising regexps really is a bit of a black art at which I'm not very good. (A small example follows this list.)
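Here is the Data::Dumper anti-pattern from above with the obvious fix applied - a minimal sketch in which log_message() stands in for whatever logging function the real code uses:

use Data::Dumper;

# Only pay for Dumper() when debugging is actually enabled:
if ($debugflag_mostlyoff) {
    log_message(Dumper(\%bighash));
}

And a small made-up example of the regex point - both patterns capture the same thing, but the second gives the engine a literal string it can find quickly instead of an open-ended .* to backtrack through:

my $line = 'time=2008-10-07 level=ERROR msg="disk full"';

# Backtrack-happy: the leading .* makes the engine try many
# split points before settling on a match.
my ($slow) = $line =~ /^.*level=(\w+)/;

# Equivalent capture, anchored on a literal that can be found fast:
my ($fast) = $line =~ /level=(\w+)/;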

Don't consider rewriting something in C except as a last resort. Calling C from Perl (or vice versa) has a relatively large overhead. If you can get a quick Perl implementation, that's better.

If you do rewrite something in C, try to do it in a way which minimises the calling overhead and the calls into the perl runtime (the SV* functions, for example, mostly copy strings around). One way of achieving this is to make a C function which does more work and to call it fewer times. Copying strings around in memory is not cool.

On the other hand, rewriting something in C carries big risk because you can introduce new failure modes, e.g. memory leaks, crashes, security problems.

Solvent answered 8/10, 2008 at 6:12 Comment(0)
8

An essay well worth reading on the subject is Nicholas Clark's talk When perl is not quite fast enough (PDF). Some of the points are slightly dated, such as the reference to Devel::DProf, but keep in mind that it was written in 2002.

Nevertheless, much of the material covered remains relevant.

Inflexible answered 8/10, 2008 at 7:36 Comment(1)
link is dead...Disannul
8

Method and subroutine calls aren't free in Perl. They're relatively expensive. So, if your profiling turns out that you're spending a reasonably large chunk of the run time in small accessor methods, that might be a micro-optimization worth looking at.

However, what you should not do is replace accessors such as get_color() here:

package Car;
# sub new {...}

sub get_color {
   my $self = shift;
   return $self->{color};
}

package main;
#...
my $color = $car->get_color();

with encapsulation-breaking direct accesses:

my $color = $car->{color};

One would think that this goes without saying, but one also sees this done all over the place. Here's what you can do instead, using Class::XSAccessor:

package Car;
# sub new {...}
use Class::XSAccessor
  getters => {
    get_color => 'color',
  },
  setters => {
    set_color => 'color',
  };

This creates new methods get_color() and set_color(), which are implemented in XS and thus about twice as fast as your hand-rolled version. Mutators (i.e. "$car->color('red')") are also available, as are chained methods.

Depending on your application, this may give you a very tiny (but essentially free) boost. Do not expect more than 1-2% unless you're doing something peculiar.
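For completeness, a usage sketch (assuming a new() constructor along the lines hinted at above):

my $car = Car->new();
$car->set_color('red');
print $car->get_color(), "\n";   # prints "red", via the XS accessor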

Roundly answered 8/10, 2008 at 8:16 Comment(0)
6

The best way to make your program run faster is to make your program do less work. Pick the right algorithm for the job. I've seen lots of slow applications because they pick a dumb algorithm in some area of the code that gets called millions of times. When you are doing a million * a million operations instead of just a million operations, your program is going to run a million times slower. Literally.

For example, here's some code I saw that inserts an element into a sorted list:

while (my $new_item = <>) {
    push @list, $new_item;
    @list = sort @list;    # re-sorts the entire list on every single insert!
    # ... use sorted @list
}

sort is O(n log n). An insertion into a sorted list is O(log n).

Fix the algorithm.
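A minimal sketch of that fix, using string comparison as in the original loop. (As the comment below points out, the splice itself is still O(n) for an array - but that still beats a full re-sort on every insert.)

my @list;

# Binary-search for the insertion point, then splice the item in.
sub insert_sorted {
    my ($list, $item) = @_;
    my ($lo, $hi) = (0, scalar @$list);
    while ($lo < $hi) {
        my $mid = int(($lo + $hi) / 2);
        if ($list->[$mid] lt $item) { $lo = $mid + 1 }
        else                        { $hi = $mid }
    }
    splice @$list, $lo, 0, $item;
}

while (my $new_item = <>) {
    chomp $new_item;
    insert_sorted(\@list, $new_item);
    # ... use sorted @list
}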

Legendary answered 12/1, 2009 at 12:39 Comment(1)
The final assertion that insertion into a sorted list is O (log n) is inaccurate unless by "list" you mean eg. "tree"Cardew
2

The most cost-effective method might be to consider faster hardware (that is, an appropriate hardware architecture). I am not talking about faster CPUs, but rather faster disks, faster networking... anything, really, that speeds up I/O.

I experienced this many years ago, when we moved an XML-parsing-based application (bleeding-edge technology at the time <g>) from a (fast and reliable!) Windows server to a dedicated, albeit somewhat outdated, Sun platform with faster I/O all around.

As always, consider

  • developer performance (how long does it take to code, how complex is the problem, is the result maintainable),
  • hardware performance,
  • software performance

and improve where it is most (cost-)effective for the problem at hand...

Waisted answered 27/10, 2008 at 9:22 Comment(1)
I like this idea, but it should be said that sometimes spending a little more on developer time up front may lead to far lower hardware costs in the long run. It really depends on how long the project is intended to run.Lectionary
1

If your code needs speeding up, then chances are that your test suite does too. This talk touches on the key points:

Turbo Charged Test Suites

Jepum answered 10/10, 2008 at 12:46 Comment(0)
