Why does Powershell file concatenation convert UTF8 to UTF16?

Tags:

I am running the following Powershell script to concatenate a series of output files into a single CSV file. whidataXX.htm (where xx is a two digit sequential number) and the number of files created varies from run to run.

$metadataPath = "\\ServerPath\foo" 

function concatenateMetadata {
    $cFile = $metadataPath + "whiconcat.csv"
    Clear-Content $cFile
    $metadataFiles = gci $metadataPath
    $iterations = $metadataFiles.Count
    for ($i=0;$i -le $iterations-1;$i++) {
        $iFile = "whidata"+$i+".htm"
        $FileExists = (Test-Path $metadataPath$iFile -PathType Leaf)
        if (!($FileExists))
        {
            break
        }
        elseif ($FileExists)
        {
            Write-Host "Adding " $metadataPath$iFile
            Get-Content $metadataPath$iFile | Out-File $cFile -append
            Write-Host "to" $cfile
        }
    }
}

The whidataXX.htm files are encoded UTF8, but my output file is encoded UTF16. When I view the file in Notepad, it appears correct, but when I view it in a Hex Editor, the Hex value 00 appears between each character, and when I pull the file into a Java program for processing, the file prints to the console with extra spaces between c h a r a c t e r s.

First, is this normal for PowerShell? or is there something in the source files that would cause this?

Second, how would I fix this encoding problem in the code noted above?

995

asked Oct 15 '13 18:10

dwwilson66

1 Answers

The Out-* cmdlets (like Out-File) format the data, and the default format is unicode.

You can add an -Encoding parameter to Out-file:

Get-Content $metadataPath$iFile | Out-File $cFile -Encoding UTF8 -append

or switch to Add-Content, which doesn't re-format

Get-Content $metadataPath$iFile | Add-Content $cFile

151

answered Oct 21 '22 03:10

mjolinor

Related questions
                            
                                Getting the arguments of the last invoked command in powershell?
                            
                                What does "- <" mean when you run powershell?
                            
                                How to pull physical path of a Windows Service using Get-Service command
                            
                                Automating remote desktop connection
                            
                                Get-ChildItem and Copy-Item explanation
                            
                                New-Item recursive registry keys
                            
                                Start vNext build from Powershell and get artifacts
                            
                                Attach EBS Volume to Windows EC2 with Powershell
                            
                                Can I run PowerShell in version 3 or 4 mode when PowerShell 5 is installed?
                            
                                How do I create an empty array of arrays in Powershell?
                            
                                How to start a console-based process and apply a custom title using Powershell
                            
                                WebAdministration powershell module not found on windows server data center
                            
                                Windows PowerShell ISE doesn't promt for input
                            
                                How to set the default ToString() on a locally created PSObject?
                            
                                how to output debugging messages from install.ps1 in NuGet
                            
                                How can I get nuget (powershell) to insert <DependentUpon> elements in the target csproj file
                            
                                Preserve newlines when concatenating or using heredoc in PowerShell
                            
                                Setting Powershell colors with hex values in profile script
                            
                                Powershell: Update current output line
                            
                                ArgumentList parameter in Invoke-Command don't send all array

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does Powershell file concatenation convert UTF8 to UTF16?

Tags:

powershell

data-conversion

utf-8

utf-16

dwwilson66

People also ask

1 Answers

mjolinor

Recent Activity

Donate For Us