I am receiving the following error: <code>awk: cmd. line:1: (FILENAME=- FNR=798) warning: Invalid multibyte data detected. There may be a mismatch between your data and your locale.</code> The command I'm running is: <code>cat file.txt | awk 'length($0)<10000' > output-file.txt</code> The odd part is that if I pipe the file to other commands, such as <code>awk '{ sub("\r$", ""); print }'</code>, it works fine without any warning. Can anyone see why I get this warning? Or should I just ignore it?

Set the locale to <code>C</code>, which uses only the ASCII character set with a single-byte encoding, by passing <code>LC_ALL=C</code> in <code>awk</code>'s environment: <pre class="prettyprint"><code>LC_ALL=C awk 'length($0)<10000' file.txt >output-file.txt </code></pre> Also, you don't need <code>cat</code>, because <code>awk</code> takes filenames as arguments.
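To see the effect of <code>LC_ALL=C</code> on a small scale, here is a minimal sketch. The sample file, its contents, and the threshold of 20 are all made up for illustration; the second line contains a lone 0xE9 byte (Latin-1 "é"), which is not a valid UTF-8 sequence on its own and is the kind of data that can trigger the warning in a UTF-8 locale.

```shell
# Hypothetical sample data: line 2 contains a lone 0xE9 byte (octal 351),
# which is not valid UTF-8 by itself.
sample=$(mktemp)
printf 'plain ascii\ncaf\351 latin1\nthis line is longer than twenty bytes\n' > "$sample"

# With LC_ALL=C, awk treats every byte as one character, so length()
# counts bytes and there are no multibyte sequences to decode.
LC_ALL=C awk '{ print FNR ": " length($0) }' "$sample"

# Same filter as in the answer, with a small threshold (20 instead of 10000)
# so the effect is visible on the sample data.
LC_ALL=C awk 'length($0) < 20' "$sample" > "$sample.out"
```

Note that in the <code>C</code> locale <code>length($0)</code> is a byte count, so for UTF-8 input the 10000 limit applies to bytes rather than characters.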

Fix Mismatch Between Data And Locale In Awk Command

1 Answer

196

answered Oct 13 '22 01:10

heemayl

Tags:

linux

bash

unix

awk

locale
