I am trying to point iconv to a directory and all files will be converted UTF-8 regardless of the current encoding I am using this script but you have to specify what encoding you are going FROM. How can I make it autdetect the current encoding? dir_iconv.sh <pre class="prettyprint lang-sh prettyprint-override"><code>#!/bin/bash ICONVBIN='/usr/bin/iconv' # path to iconv binary if [ $# -lt 3 ] then echo "$0 dir from_charset to_charset" exit fi for f in $1/* do if test -f $f then echo -e "\nConverting $f" /bin/mv $f $f.old $ICONVBIN -f $2 -t $3 $f.old > $f else echo -e "\nSkipping $f - not a regular file"; fi done </code></pre> terminal line <pre class="prettyprint"><code>sudo convert/dir_iconv.sh convert/books CURRENT_ENCODING utf8 </code></pre>

You can get what you need using standard gnu utils file and awk. Example: <code>file -bi .xsession-errors</code> gives me: "text/plain; charset=us-ascii" so <code>file -bi .xsession-errors |awk -F "=" '{print $2}'</code> gives me "us-ascii" I use it in scripts like so: <pre class="prettyprint lang-sh prettyprint-override"><code>CHARSET="$(file -bi "$i"|awk -F "=" '{print $2}')" if [ "$CHARSET" != utf-8 ]; then iconv -f "$CHARSET" -t utf8 "$i" -o outfile fi </code></pre>

iconv any encoding to UTF-8

Tags:

I am trying to point iconv to a directory and all files will be converted UTF-8 regardless of the current encoding

I am using this script but you have to specify what encoding you are going FROM. How can I make it autdetect the current encoding?

dir_iconv.sh

#!/bin/bash  ICONVBIN='/usr/bin/iconv' # path to iconv binary  if [ $# -lt 3 ] then   echo "$0 dir from_charset to_charset"   exit fi  for f in $1/* do   if test -f $f   then     echo -e "\nConverting $f"     /bin/mv $f $f.old     $ICONVBIN -f $2 -t $3 $f.old > $f   else     echo -e "\nSkipping $f - not a regular file";   fi done

terminal line

sudo convert/dir_iconv.sh convert/books CURRENT_ENCODING utf8

382

asked Mar 22 '12 15:03

Blainer

2 Answers

Michal Kottman

You can get what you need using standard gnu utils file and awk. Example:

file -bi .xsession-errors gives me: "text/plain; charset=us-ascii"

so file -bi .xsession-errors |awk -F "=" '{print $2}' gives me "us-ascii"

I use it in scripts like so:

CHARSET="$(file -bi "$i"|awk -F "=" '{print $2}')"  if [ "$CHARSET" != utf-8 ]; then   iconv -f "$CHARSET" -t utf8 "$i" -o outfile fi

answered Sep 24 '22 21:09

Julian Hughes

Related questions
                            
                                C++ system() not working when there are spaces in two different parameters
                            
                                Make an <a> tag move onto a new line, without using "display:block"
                            
                                Keyboard Column Selection for Sublime Text 2 on Windows
                            
                                git revert commit/push but keep changes
                            
                                Android: how do I create File object from asset file?
                            
                                Injecting C++ DLL
                            
                                Mobile site - force landscape only / no auto-rotate
                            
                                Header inside Link or Link inside Header in HTML markup? [duplicate]
                            
                                Accessing Devise Config Variables
                            
                                How to plot arrow with data coordinates in Matlab?
                            
                                ssh-config by host subnet
                            
                                How to proceed with NLP task for recognizing intent and slots

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With