Thanks to guidance from @ikegami, I have found that the best choice for interactively reading from and writing to another process in Perl is IPC::Run. However, it requires that the program you are reading from and writing to have a known output when it is done writing to its STDOUT, such as a prompt. Here's an example that executes bash, has it run ls -l, and then prints that output:

    use v5.14;
    use IPC::Run qw(start timeout new_appender new_chunker);

    my @command = qw(bash);

    # Connect to the other program. new_appender() adds an "echo __END__"
    # after every chunk of input; new_chunker splits the output into lines.
    my ($in, @out);
    my $ipc = start \@command,
        '<' => new_appender("echo __END__\n"), \$in,
        '>' => new_chunker, sub { push @out, @_ },
        timeout(10) or die "Error: $?\n";

    # Send it a command and wait until it has received it.
    $in .= "ls -l\n";
    $ipc->pump while length $in;

    # Wait until our end-of-output string appears.
    $ipc->pump until @out && $out[-1] =~ /__END__\n/m;

    # Drop the marker line and print the rest.
    pop @out;
    say @out;

Because it is being driven over pipes rather than a terminal (I assume), bash does not emit a prompt when it is done writing to its STDOUT. So I use the new_appender() function to have it emit something I can match to find the end of the output (by calling echo __END__). I've also used an anonymous subroutine after a call to new_chunker to collect the output into an array, rather than a scalar (just pass a reference to a scalar to '>' if you want that).
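For comparison, here is a minimal sketch of the scalar variant mentioned above. It assumes bash is on the PATH, and shell_output is a hypothetical helper name of my own, not something IPC::Run provides:

```perl
use v5.14;
use warnings;
use IPC::Run qw(start timeout new_appender);

# Hypothetical helper: run one command in bash and return its output
# as a single string instead of an array of lines.
sub shell_output {
    my ($command) = @_;
    my ($in, $out) = ('', '');

    my $ipc = start ['bash'],
        '<', new_appender("echo __END__\n"), \$in,
        '>', \$out,          # a scalar reference collects everything
        timeout(10);

    $in .= "$command\n";

    # Pump until the marker shows up at the start of a line.
    $ipc->pump until $out =~ /^__END__\n/m;

    # Strip the marker (and anything after it), then shut bash down.
    $out =~ s/^__END__\n.*//ms;
    $ipc->finish;

    return $out;
}

print shell_output('echo hello');
```

The marker is only matched at the start of a line, so a command whose output does not end in a newline would never be recognized as finished; the timeout is the only safety net in that case.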
So this works, but it sucks for a whole host of reasons, in my opinion:
- There is no generally useful way to know that an IPC-controlled program is done printing to its STDOUT. Instead, you have to use a regular expression on its output to search for a string that usually means it's done.
- If it doesn't emit one, you have to trick it into emitting one (as I have done here; god forbid I should have a file named __END__, though). If I were controlling a database client, I might have to send something like SELECT 'IM OUTTA HERE'; and different applications would require different new_appender hacks.
- The writing to the magic $in and $out scalars feels weird and action-at-a-distance-y. I dislike it.
- One cannot do line-oriented processing on the scalars as one could if they were file handles. They are therefore less efficient.
- The ability to use new_chunker to get line-oriented output is nice, if still a bit weird. That regains a bit of the efficiency on reading output from a program, though, assuming it is buffered efficiently by IPC::Run.
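One way to make the end-of-output marker less fragile than a fixed __END__ is to generate a fresh one for each run. This is only a sketch of that idea in plain Perl; make_sentinel and take_output are hypothetical names, not part of IPC::Run, and you would still have to get the program to echo the marker:

```perl
use strict;
use warnings;

# Hypothetical helper: build a marker that is unlikely to collide with
# real output (process ID plus a random hex suffix).
sub make_sentinel {
    return sprintf '__END_%d_%08x__', $$, int rand 0xFFFFFFFF;
}

# Hypothetical helper: return everything before the sentinel line, or
# nothing at all if the sentinel has not arrived yet.
sub take_output {
    my ($buffer, $sentinel) = @_;
    return unless $buffer =~ /^\Q$sentinel\E\n/m;
    return substr $buffer, 0, $-[0];    # $-[0] is where the match starts
}

my $sentinel = make_sentinel();
my $partial  = "file1\nfile2\n";
my $complete = $partial . "$sentinel\n";

print defined(take_output($partial, $sentinel)) ? "done\n" : "still waiting\n";
print take_output($complete, $sentinel);
```

This lowers the odds of a collision but does not remove the underlying problem: the controlling code still has to know how to make each particular program print the marker.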
I now realize that, although the interface for IPC::Run could potentially be a bit nicer, overall the weaknesses of the IPC model in particular make it tricky to deal with at all. There is no generally useful IPC interface, because one has to know too much about the specifics of the particular program being run to get it to work. This is okay, maybe, if you know exactly how it will react to inputs, and can reliably recognize when it is done emitting output, and don't need to worry much about cross-platform compatibility. But that was far from sufficient for my need for a generally useful way to interact with various database command-line clients in a CPAN module that could be distributed to a whole host of operating systems.
In the end, thanks to packaging suggestions in comments on a blog post, I decided to abandon the use of IPC for controlling those clients, and to use the DBI, instead. It provides an excellent API, robust, stable, and mature, and suffers none of the drawbacks of IPC.
My recommendation for those who come after me is this:
- If you just need to execute another program and wait for it to finish, or collect its output when it is done running, use IPC::System::Simple.
- Otherwise, if what you need to do is to interactively interface with something else, use an API whenever possible.
- And if it's not possible, then use something like IPC::Run and try to make the best of it, and be prepared to give up quite a bit of your time to get it "just right."
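For the simple run-and-collect case, a minimal sketch with IPC::System::Simple might look like this. It uses capturex, which bypasses the shell, and runs the current perl ($^X) as a stand-in command so the example does not depend on any particular external program:

```perl
use strict;
use warnings;
use IPC::System::Simple qw(capturex);

# Run a command without involving the shell and collect its output.
# capturex() throws an exception if the command cannot be started or
# exits with a non-zero status, so wrap it in eval to handle failure.
my @lines = eval { capturex($^X, '-e', 'print "one\ntwo\n"') };
die "Command failed: $@" if $@;

# In list context the output comes back already split into lines.
print "Got ", scalar(@lines), " lines\n";
print for @lines;
```

There is no marker string and no pumping here, because the module simply waits for the child process to exit; that only works when no back-and-forth with the program is needed.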
[...] system and back ticks. – Car
[...] open |- file handle works. – Car
[...] select!!). So while IPC::Run does give you the option of doing that, it should be a last resort. The plus of IPC::Run over the others you mention is that it can hide the pipes from you. – Tambac
[...] <$fh> or $fh->getline to iterate over lines… – Car
[...] <$fh> aka $fh->getline safely if you have more than one pipe. It can lead to a deadlock. Doing IPC with file handles is really hard if you have more than one file handle (select!!). – Tambac