Powershell find non-ASCII characters in text file

Tags:

2 Answers

Define a character set that describes all ASCII characters (code points 32 through 127 == [\x20-\x7F]), then negate it with ^ to match any non-ASCII character!

Let's test it against my (non-ASCII) name:

PS C:\> 'Mathias R. Jessen' -cmatch '[^\x20-\x7F]'
False
PS C:\> 'Mathias Rørbo Jessen' -cmatch '[^\x20-\x7F]'
True

To filter a list of strings, simply use the -cmatch operator in filter mode:

$strings = 'குழந்தைகளுக்கான பெயர்கள்', 'Boring John Doe', 'Léna Rémi'

$nonASCIIstrings = @($strings) -cmatch '[^\x20-\x7F]'

Or if you want to filter along a pipeline, use Where-Object:

$strings |Where-Object {$_ -cmatch '[^\x20-\x7F]'}

159

answered Oct 21 '22 02:10

Here's a script I have to remove non-ascii characters from an xml file. Maybe you can use it as a starting point. I'm removing characters that are not between space and tilde in the ascii table, and also not tab. To me, ascii is in the range 0-127. Get-content takes out the carriage returns and linefeeds.

(get-content $args[0]) -replace '[^ -~\t]' | set-content $args[0]

answered Oct 21 '22 04:10

js2010

Related questions
                            
                                Printing messages from SQL in Azure Automation powershell script doesn't work
                            
                                Pass PowerShell variables to Docker commands
                            
                                PowerShell stderr redirect to file inserts newlines
                            
                                Find array elements which values are not part of another array PowerShell
                            
                                Automatically assign command output to a variable in Powershell
                            
                                Join two hashtables to make one
                            
                                How to get an error log in Azure AD B2C tenant with correlation ID?
                            
                                Enable/Disable AppInsights Availability Tests with Powershell Azure ARM
                            
                                Creating a scheduled task on a remote machine with powershell
                            
                                `more.com` returns "Not enough memory."
                            
                                Can't get all excel processes to stop when closing through Powershell
                            
                                Powershell equivalent of ".Single()" For a C#/Java programmer
                            
                                Why can't I use PowerShell's Start-Process with both -Credential and -Verb parameters?
                            
                                Property passed to Invoke-Command changes type from IDictionary to HashTable
                            
                                'testcafe' is not recognized as the name of a cmdlet, function, script file, or operable program
                            
                                How to get sha256 hash output as binary data instead of hex using powershell?
                            
                                Is there a way to escape quotes in ripgrep for MS Windows (Powershell or CMD)?
                            
                                Powershell iwr fails when attempting -SkipCertificateCheck
                            
                                How to change the voice used for SAPI.SPVoice
                            
                                How to change tab width when converting to JSON in Powershell

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Powershell find non-ASCII characters in text file

Tags:

powershell

non-ascii-characters

Arolix

People also ask

2 Answers

Mathias R. Jessen

js2010

Recent Activity

Donate For Us