Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

MS Access *.MDB Conversion to MySQL or SQLite, Problem in data Encoding

Greetings, i'll present my case:

  • I'm in linux UBUNTU
  • i have several Jet3 .MDB (MS Acess Database) about 500MB each, in which the datas encoded in cp1256/WINDOWS-1256
  • i have made the sqlite databases by following this article to do the conversion http://cltb.ojuba.org/en/articles/mdb2sqlite.

Here is the bash script that i made to convert the database. Assuming i have MS Access x.MDB

mdb-schema "x.mdb" | perl -wpe 's%^DROP TABLE %DROP TABLE IF EXISTS %i;
  s%(Memo/Hyperlink|DateTime( \(Short\))?)%TEXT%i;
  s%(Boolean|Byte|Byte|Numeric|Replication ID|(\w+ )?Integer)%INTEGER%i;
  s%(BINARY|OLE|Unknown ([0-9a-fx]+)?)%BLOB%i;
  s%\s*\(\d+\)\s*(,?[ \t]*)$%${1}%;' | sqlite3 > x.db 

for i in $(mdb-tables "x.mdb"); do echo $i; (
echo "BEGIN TRANSACTION;";
MDB_JET3_CHARSET="WINDOWS-1256" mdb-export -R ";\n" -I "x.mdb" $i;
echo "END TRANSACTION;" ) | sqlite3 "x.db"; done

I've tried to change the MDB_JET3_CHARSET to WINDOWS-1256, cp1256, WINDOWS-1251, cp1251, UTF-8. some produce different results in the data when i browse it, but still make no sense at all.

thanks before, and sorry for my bad English

like image 864
Indra Lukmana Avatar asked Oct 12 '22 00:10

Indra Lukmana


1 Answers

Okay then after playing around many sites, i stumbled on this http://git.ojuba.org/cgit/thawab/tree/ and found a script that give me an idea (it's the bok2ki.py, if anyone is curious), I LOVE OPEN SOURCE!! :)

I add MDB_ICONV parameter with "UTF-8" as it's value, and change the MDB_JET3_CHARSET parameter value to "cp1256"

acctually i don't really know what those parameter really is, but i'm guessing MDB_JET3 CHARSET is to define the charset/encoding/codepages (i really don't know the difference, i should research more) and the MDB_ICONV is to define the encoding of target database. well those are just my assumption anyway.

then here is my new script:

mdb-schema "x.mdb" | perl -wpe 's%^DROP TABLE %DROP TABLE IF EXISTS %i;
  s%(Memo/Hyperlink|DateTime( \(Short\))?)%TEXT%i;
  s%(Boolean|Byte|Byte|Numeric|Replication ID|(\w+ )?Integer)%INTEGER%i;
  s%(BINARY|OLE|Unknown ([0-9a-fx]+)?)%BLOB%i;
  s%\s*\(\d+\)\s*(,?[ \t]*)$%${1}%;' | sqlite3 x.db 

for i in $(mdb-tables "x.mdb"); do echo $i; (
echo "BEGIN TRANSACTION;";
MDB_JET3_CHARSET="cp1256" MDB_ICONV="UTF-8" mdb-export -R ";\n" -I "x.mdb" $i;
echo "END TRANSACTION;" ) | sqlite3 "x.db"; done
like image 157
Indra Lukmana Avatar answered Oct 14 '22 03:10

Indra Lukmana