You can use tr
to print only the printable characters from a string like below. Just use the below command on your input file.
tr -cd "[:print:]\n" < file1
The flag -d
is meant to the delete the character sets defined in the arguments on the input stream, and -c
is for complementing those (invert what's provided). So without -c
the command would delete all printable characters from the input stream and using it complements it by removing the non-printable characters. We also keep the newline character \n
to preserve the line endings in the input file. Removing it would just produce the final output in one big line.
The [:print:]
is just a POSIX bracket expression which is a combination of expressions [:alnum:]
, [:punct:]
and space. The [:alnum:]
is same as [0-9A-Za-z]
and [:punct:]
includes characters !
"
#
$
%
&
'
(
)
*
+
,
-
.
/
:
;
<
=
>
?
@
[
\
]
^
_
`
{
|
}
~