How to get the PID of a process that is piped to another process in Bash?

E

14

72

I am trying to implement a simple log server in Bash. It should take a file as a parameter and serve it on a port with netcat.

( tail -f $1 & ) | nc -l -p 9977

But the problem is that when the netcat terminates, tail is left behind running. (Clarification: If I don't fork the tail process it will continue to run forever even the netcat terminates.)

If I somehow know the PID of the tail then I could kill it afterwards.
Obviously, using $! will return the PID of netcat.

How can I get the PID of the tail process?

Exocrine answered 30/10, 2009 at 22:43 Comment(3)

What happens if you don't use &? tail -f is supposed to just wait there. I don't get what the & is for, though it does look like this is part of a bigger script. Anyway, if you kill the pipe I would think that tail would then die (so long as you didn't background it). – Undershorts 22/5, 2013 at 5:11

from my understanding of shells, & in this case is used with its meaning as "start to background". At least, that's the behavior I'm seeing right now, trying to apply the solution to my own, similar issue. – Seek 7/7, 2016 at 11:2

related: unix.stackexchange.com/questions/103472/… – Stenographer 9/4, 2018 at 10:20

R

56

Write tail's PID to file descriptor 3, and then capture it from there.

( tail -f $1 & echo $! >&3 ) 3>pid | nc -l -p 9977
kill $(<pid)

Ruminate answered 24/9, 2010 at 12:15 Comment(2)

I used a variant: ( tail -f $1 & echo $! >pid ) | nc -l -p 9977 (not sure why using file descriptor 3 would help when finally redirecting to a file) – Darcie 6/2, 2012 at 16:19

Not sure why but my solution fails after a couple of log lines are output. Probably when the pipe buffer is full. Then the initial process seems to be waiting for the pipe to be processed. – Darcie 7/2, 2012 at 10:54

M

65

Another option: use a redirect to subshell. This changes the order in which background processes are started, so $! gives PID of the tail process.

tail -f $1 > >(nc -l -p 9977) &
wait $!

Myceto answered 8/11, 2011 at 9:47 Comment(5)

The advantage of this approach is that after the wait, $? also holds the exit status – Fanchie 13/12, 2013 at 18:0

This seems like the cleanest approach by a long shot. Does it change anything (compared to standard piping) from the perspective of either process other than the start order? – Claudetteclaudia 7/9, 2014 at 5:54

Actually, the syntax on that redirection is wrong (though the principle is correct). >(foo) is substituted for the name of the new file descriptor, whereas > >(foo) actually redirects output to it. You want the first line to be tail -f $1 > >(nc -l -p 9977) &. – Claudetteclaudia 7/9, 2014 at 6:27

exactly what I needed! Thanks! Just redirecting the pid didn't work in my case. – Weaks 13/2, 2019 at 7:16

If you want to capture both stdout and stderr in the subshell use &> for the redirection. Example: tail -f $1 &> >(nc -l -p 9977) &. – Strapper 26/6, 2019 at 12:54

R

56

Write tail's PID to file descriptor 3, and then capture it from there.

( tail -f $1 & echo $! >&3 ) 3>pid | nc -l -p 9977
kill $(<pid)

Ruminate answered 24/9, 2010 at 12:15 Comment(2)

I used a variant: ( tail -f $1 & echo $! >pid ) | nc -l -p 9977 (not sure why using file descriptor 3 would help when finally redirecting to a file) – Darcie 6/2, 2012 at 16:19

Not sure why but my solution fails after a couple of log lines are output. Probably when the pipe buffer is full. Then the initial process seems to be waiting for the pipe to be processed. – Darcie 7/2, 2012 at 10:54

D

18

how about this:

jobs -x echo %1

%1 is for first job in chain, %2 for second, etc. jobs -x replaces job specifier with PID.

Duron answered 18/1, 2012 at 9:38 Comment(2)

This must be the cleanest + shortest solution to 'how to get pid of any process earlier in chain' Thank You! It also works with '&' getting the pid of process earlier in the chain that is running in background! E.g. dd if=/dev/urandom bs=1M count=1024 | sha1sum & pid=$(jobs - x echo %1) kill -USR1 $pid – Kevon 29/11, 2014 at 12:5

The answer is wrong. Only one job ID is assigned, for the entire pipeline: shopt -s lastpipe; sleep 1 | cat | jobs -> [1] Running sleep 1 | cat. It has the PID of the 1st command on the pipeline but jobs -l prints all the PIDs: shopt -s lastpipe; sleep 1 | cat | jobs -l; jobs -p -> [1] 5394 Running sleep 1 5395 | cat 5394. – Klara 23/4, 2020 at 14:0

T

14

This works for me (SLES Linux):

tail -F xxxx | tee -a yyyy &
export TAIL_PID=`jobs -p`
# export TEE_PID="$!"

The ps|grep|kill trick mentioned in this thread would not work if a user can run the script for two "instances" on the same machine.

jobs -x echo %1 did not work for me (man page not having the -x flag) but gave me the idea to try jobs -p.

Tojo answered 20/8, 2012 at 16:19 Comment(2)

Note that you can only get the PID of the first command on the pipeline this way. jobs -l would print PIDs of all commands but you'll have to parse its output to extract them. – Klara 23/4, 2020 at 14:7

Unlike many of the other comments, this also works in /bin/sh to obtain the PID of the first command (process group leader), e.g. to be able to kill the entire pipeline later on. – Hygiene 18/8, 2020 at 9:52

T

7

Maybe you could use a fifo, so that you can capture the pid of the first process, e.g.:

FIFO=my_fifo

rm -f $FIFO
mkfifo $FIFO

tail -f $1 > $FIFO &
TAIL_PID=$!

cat $FIFO | nc -l -p 9977

kill $TAIL_PID

rm -f $FIFO

Trapper answered 30/10, 2009 at 22:59 Comment(4)

Yes I have tried It before. The problem about using fifo is the same: pipe never gets terminated so cat stays running even netcat terminates. Also the control stays in the cat line so it never executes kill. – Cuthbertson 31/10, 2009 at 9:25

That's odd - the script above worked perfectly for me on Mac OS X. Only slight difference was that I omitted the '-p' flag for nc. – Trapper 31/10, 2009 at 10:6

Maybe its a platform issue (about how to handle pipes). I'm trying it on a linux machine. thanks for your answer anyway! – Cuthbertson 31/10, 2009 at 10:17

What if you remove the cat and use nc -l -p 9977 < $FIFO? – Northumbria 30/3, 2012 at 20:23

E

2

Finally, I have managed to find the tail process using ps. Thanks to the idea from ennuikiller.

I have used the ps to grep tail from the args and kill it. It is kind of a hack but it worked. :)

If you can find a better way please share.

Here is the complete script:
(Latest version can be found here: http://docs.karamatli.com/dotfiles/bin/logserver)

if [ -z "$1" ]; then
    echo Usage: $0 LOGFILE [PORT]
    exit -1
fi
if [ -n "$2" ]; then
    PORT=$2
else
    PORT=9977
fi

TAIL_CMD="tail -f $1"

function kill_tail {
    # find and kill the tail process that is detached from the current process
    TAIL_PID=$(/bin/ps -eo pid,args | grep "$TAIL_CMD" | grep -v grep | awk '{ print $1 }')
    kill $TAIL_PID
}
trap "kill_tail; exit 0" SIGINT SIGTERM

while true; do
    ( $TAIL_CMD & ) | nc -l -p $PORT -vvv
    kill_tail
done

Exocrine answered 31/10, 2009 at 12:8 Comment(1)

Shouldn't the PID of the tail command be available in $! so that you could simply do kill $! instead of kill_tail? – Menticide 19/9, 2011 at 9:30

O

2

ncat automatically terminates tail -f on exit (on Mac OS X 10.6.7)!

# simple log server in Bash using ncat
# cf. http://nmap.org/ncat/
touch file.log
ncat -l 9977 -c "tail -f file.log" </dev/null   # terminal window 1
ncat localhost 9977 </dev/null                  # terminal window 2
echo hello > file.log                           # terminal window 3

Office answered 7/7, 2011 at 8:59 Comment(0)

E

1

One way would be to simply do a ps -ef and grep for tail with your script ppid

Engaged answered 30/10, 2009 at 22:49 Comment(1)

I couldn't get it using the ppid because it is detached when I fork it in a subshell. But I managed to grep it using the args param of the ps program. – Cuthbertson 31/10, 2009 at 12:2

T

1

Have you tried:

nc -l -p 9977 -c "tail -f $1"

(untested)

Or -e with a scriptfile if your nc doesn't have -c. You may have to have an nc that was compiled with the GAPING_SECURITY_HOLE option. Yes, you should infer appropriate caveats from that option name.

Tsimshian answered 30/10, 2009 at 23:50 Comment(2)

Oh, I have never thought of it. I wish it worked :) But it's still not terminating because of the nature of "tail -f" – Cuthbertson 31/10, 2009 at 10:19

without the -f, its OK. But I want to serve it in real time. – Cuthbertson 31/10, 2009 at 10:36

G

1

You may store the pid of the tail command in a variable using Bash I/O redirections only (see How to get the PID of a process in a pipeline).

# terminal window 1
# using nc on Mac OS X (FreeBSD nc)
: > /tmp/foo
PID=$( { { tail -f /tmp/foo 0<&4 & echo $! >&3 ; } 4<&0 | { nc -l 9977 ;} & } 3>&1 | head -1 )
kill $PID

# terminal window 2
nc localhost 9977

# terminal window 3
echo line > /tmp/foo

Gurnard answered 18/9, 2011 at 17:44 Comment(0)

C

1

Not an ideal answer, but I found a workaround for a logger daemon I worked on:

#!/bin/sh
tail -f /etc/service/rt4/log/main/current --pid=$$ | grep error

from $info tail:

--pid=PID
          with -f, terminate after process ID, PID dies

Catachresis answered 24/7, 2012 at 1:48 Comment(0)

D

1

You could use the coproc command twice.

The given example translates to:

coproc TAIL { tail -f $1; }; exec {TAIL[1]}<&-
coproc NC { nc -v -l -p 9977; } <&"${TAIL[0]}" >&1
wait $NC_PID; echo "nc exit code: $!"
kill $TAIL_PID; echo "done"

(I've thrown a -v and a couple echo in there for troubleshooting.)

Using coproc feels a lot like using Popen() in various other scripting languages.

Deference answered 13/11, 2018 at 17:29 Comment(2)

Closing stdin of tail with eg. </dev/null or exec as in the example is nice to do, so that no streams are left dangling around unconnected, but not strictly necessary for the tail command in particular, since tail never reads from stdin when given a filename. Could be left out when the first coproc command is tail. – Deference 13/11, 2018 at 17:41

Note that coproc is new in Bash 4 (e.g. OSX still uses 3.2). It is available in OSX's zsh though. – Klara 23/4, 2020 at 15:58

J

1

bobbogo answer works but requires a intermediary pid file.

You can leverage the process substitution feature >() which works like| but without waiting for tail to finish

tail -f $1 > >(nc -l -p 9977) & pid=!

Jermayne answered 13/5, 2020 at 11:53 Comment(1)

Correction: pipe | also doesn't wait for tail to finish. The differences are a bit more interesting. – Balustrade 30/12, 2021 at 2:36

H

0

The --pid option to tail is your best friend here. It will allow you total control of the pipeline running in background. read the tail command options for more resilience in case your file is actively rotated by another process which might leave you tailing a inactive inode. The example below, though not used to process the data demonstrate the "imposed" restriction on the tail and the ability to tell it to exit at any time. This is used for measuring the service pressure on httpd .

  # Set the tail to die in 100 second even if we die unexpectedlly.
sleep 100 & ;  ctlpid=$!
tail -q -n 0 --follow=name --retry --max-unchanged-stats=1 --pid=$ctlpid -f  /var/log/httpd/access_log 2>/dev/null | wc –l > /tmp/thisSampleRate &
…. Do some other work
….  Can kill the pipe at any time by killing $ctlpid 
…. Calculate preassure if /tmp/thisSampleRate is ready

Higgledypiggledy answered 9/1, 2015 at 13:29 Comment(0)

Recommended topics

Hot tags