Grouping Output of SSH via xargs

November 19, 2021

To make a long story short, I have a list of servers where I need to execute a command and get back the output. Using a for loop to run SSH with key authentication is the usual approach, except in this case accessing one server at a time was taking too long.

In a situation like this, I would usually opt for PDSH but for other reasons I needed to use SSH and xargs with a list of target hosts – one per line. The basic syntax would look something like this:

cat host_list.txt | \
xargs -d $'\n' -P$(grep -c proc /proc/cpuinfo) -n1 -I{} \
/usr/bin/ssh -qtT -o UserKnownHostsFile=/dev/null -o StrictHostKeyChecking=no -i "${ssh_key}" ${ssh_user}@{} \
"sudo su - root -c 'wc -l /etc/{passwd,shadow}'" 2>/dev/null

So I am passing my list of target hosts to xargs; kicking off a number of parallel SSH connections (based on the number of CPU cores); logging in with some SSH user name and key; and executing a command as root on the remote systems.
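Before pointing this at real servers, it can help to see what xargs will actually run. The following is only a dry-run sketch: the host names and the `ssh_user` value are placeholders, `echo` stands in for the real ssh call, and `nproc` (GNU coreutils) is assumed as a shorter way to get the CPU count.

```shell
#!/usr/bin/env bash
# Dry run: show the command line xargs would build for each host.
# The host names and ssh_user value are placeholders, and "echo" stands in
# for the real ssh call so nothing is contacted.
ssh_user="admin"
parallel_jobs="$(nproc)"   # assumption: GNU coreutils nproc is available

dry_run="$(printf '%s\n' ncc1701 ncc1711 |
  xargs -d $'\n' -P"${parallel_jobs}" -I{} \
    echo ssh "${ssh_user}@{}" "wc -l /etc/passwd" |
  sort)"   # sort only makes this demo's output order stable

printf '%s\n' "${dry_run}"
```

Each input line replaces `{}` wherever it appears in the arguments, so one ssh command line is built per host.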

The issue here is this: the output of the command I am running will have three lines. Because the SSH connections are launched in parallel, the responses will arrive in a random sequence and I will have no way of knowing which line of the output came from which remote host.

 62 /etc/passwd
 61 /etc/shadow
123 total
 48 /etc/passwd
 48 /etc/shadow
 96 total

People usually take care of this problem by having the remote command print the target's hostname ahead of its output:

c="wc -l /etc/{passwd,shadow}"
cat host_list.txt | \
xargs -d $'\n' -P$(grep -c proc /proc/cpuinfo) -n1 -I{} \
/usr/bin/ssh -qtT -o UserKnownHostsFile=/dev/null -o StrictHostKeyChecking=no -i "${ssh_key}" ${ssh_user}@{} \
"sudo su - root -c 'echo "\${HOSTNAME}:"; ${c}; echo'" 2>/dev/null

Now the output is a lot easier to interpret:

ncc1701:
  62 /etc/passwd
  61 /etc/shadow
 123 total

ncc1711:
  48 /etc/passwd
  48 /etc/shadow
  96 total
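One detail worth checking in commands like this is where a variable such as ${HOSTNAME} gets expanded: on the machine running xargs, or on the remote host. A minimal sketch of the difference, using `bash -c` as a stand-in for the remote shell and a hypothetical variable named `WHERE`:

```shell
#!/usr/bin/env bash
# "bash -c" stands in for the remote shell that ssh would start.
# WHERE is a hypothetical variable used only for this illustration.
WHERE="local"

unescaped="echo ${WHERE}"    # expanded right here: the string is "echo local"
escaped="echo \${WHERE}"     # stays literal: the string is "echo ${WHERE}"

# The stand-in "remote" shell sees its own value of WHERE.
out_unescaped="$(WHERE=remote bash -c "${unescaped}")"
out_escaped="$(WHERE=remote bash -c "${escaped}")"

printf 'unescaped: %s\n' "${out_unescaped}"   # prints "unescaped: local"
printf 'escaped:   %s\n' "${out_escaped}"     # prints "escaped:   remote"
```

The same rule applies to anything inside a double-quoted ssh command: a backslash before the dollar sign defers the expansion to the far side.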

But, because xargs runs the same command on a bunch of remote hosts simultaneously and the amount of time it takes to generate output would differ from host to host, it is entirely possible that the output will arrive out of order.

To illustrate this point, I will change the remote command to, first, count the number of lines in /etc/passwd, then sleep for a random number of seconds between 1 and 10, and only then count the number of lines in /etc/shadow:

c="wc -l /etc/passwd; sleep \$(( ( RANDOM % 10 ) + 1 )); wc -l /etc/shadow"
cat host_list.txt | \
xargs -d $'\n' -P$(grep -c proc /proc/cpuinfo) -n1 -I{} \
/usr/bin/ssh -qtT -o UserKnownHostsFile=/dev/null -o StrictHostKeyChecking=no -i "${ssh_key}" ${ssh_user}@{} \
"sudo su - root -c 'echo "\${HOSTNAME}:"; ${c}; echo'" 2>/dev/null

ncc1701:
62 /etc/passwd
ncc1711:
48 /etc/passwd
48 /etc/shadow

61 /etc/shadow
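The interleaving can be reproduced without any servers. This is a rough local sketch, not the setup above: `fake_host` is a hypothetical helper that mimics a host answering with delays, and the two background jobs share one pipe much as parallel ssh sessions do under xargs -P.

```shell
#!/usr/bin/env bash
# Local stand-in for two hosts answering at different speeds; no SSH involved.
# fake_host is a hypothetical helper: it waits, prints a header and one line,
# waits again, then prints a second line.
fake_host() {
  local name="$1" start_delay="$2" mid_delay="$3"
  sleep "${start_delay}"
  echo "${name}:"
  echo "first line from ${name}"
  sleep "${mid_delay}"
  echo "second line from ${name}"
}

# Both jobs write to the same pipe, as parallel ssh sessions under xargs -P do.
interleaved="$(
  fake_host alpha 0 2 &
  fake_host beta 1 0 &
  wait
)"

printf '%s\n' "${interleaved}"
```

Because beta starts and finishes while alpha is still sleeping, beta's lines land between alpha's first and second lines.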

Even with a simple task like counting the number of lines in /etc/passwd and /etc/shadow, making sense of the unordered output is impossible. One way around this is to prepend each line of the output with a random value generated once per host. Sorting on that value groups the output for each host, after which the value can be stripped:

c="wc -l /etc/passwd; sleep \$(( ( RANDOM % 10 ) + 1 )); wc -l /etc/shadow"
cat host_list.txt | \
xargs -d $'\n' -P$(grep -c proc /proc/cpuinfo) -n1 -I{} \
/usr/bin/ssh -qtT -o UserKnownHostsFile=/dev/null -o StrictHostKeyChecking=no -i "${ssh_key}" ${ssh_user}@{} \
"sudo su - root -c 'echo "\${HOSTNAME}:"; ${c}; echo' | awk -v var="\${RANDOM}" '{print var,\$0}'" 2>/dev/null | \
sort -s -k1,1n | awk '{first = $1; $1 = ""; print $0; }'

Now the output will always be grouped by host:

ncc1711:
48 /etc/passwd
48 /etc/shadow

ncc1701:
62 /etc/passwd
61 /etc/shadow
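The tag, sort and strip idea can be tried locally before running it against real hosts. The sketch below makes several assumptions: `fake_host` is a hypothetical stand-in for the remote command, fixed tags replace ${RANDOM} so the result is reproducible, and a stable sort (`sort -s`) is used so that lines sharing a tag keep their arrival order.

```shell
#!/usr/bin/env bash
# Local simulation of tag -> stable sort -> strip; no SSH involved.
# fake_host is a hypothetical stand-in for the remote command.
fake_host() {
  local name="$1" delay="$2"
  echo "${name}:"
  echo "first line from ${name}"
  sleep "${delay}"
  echo "second line from ${name}"
}

# Prefix every line with a per-host tag. The post uses ${RANDOM}; fixed tags
# are used here so the result is reproducible. fflush() pushes each line out
# immediately so the two streams really can interleave.
tagged="$(
  fake_host alpha 1 | awk -v var=200 '{ print var, $0; fflush() }' &
  fake_host beta 0 | awk -v var=100 '{ print var, $0; fflush() }' &
  wait
)"

# Stable numeric sort on the tag keeps each host's lines in arrival order,
# then the tag column is dropped.
grouped="$(printf '%s\n' "${tagged}" |
  sort -s -k1,1n |
  awk '{ $1 = ""; print substr($0, 2) }')"

printf '%s\n' "${grouped}"
```

Since ${RANDOM} in bash only ranges from 0 to 32767, two hosts can occasionally draw the same tag and end up mixed together; tagging with the hostname instead would likely avoid that.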

Igor

Experienced Unix/Linux System Administrator with 20-year background in Systems Analysis, Problem Resolution and Engineering Application Support in a large distributed Unix and Windows server environment. Strong problem determination skills. Good knowledge of networking, remote diagnostic techniques, firewalls and network security. Extensive experience with engineering application and database servers, high-availability systems, high-performance computing clusters, and process automation.

