I have multiple, large, csv files, each of which has missing values in many places. When I import the csv file into SQLite, I would like to have the missing values recorded as NULL for the reason that another application expects missing data to be indicated by NULL. My current method does not produce the desired result. An example CSV file (test.csv) is: <pre class="prettyprint"><code>12|gamma|17|delta 67||19|zeta 96|eta||theta 98|iota|29| </code></pre> The first line is complete; each of the other lines has (or is meant to show!) a single missing item. When I import using: <pre class="prettyprint"><code>.headers on .mode column .nullvalue NULL CREATE TABLE t ( id1 INTEGER PRIMARY KEY, a1 TEXT, n1 INTEGER, a2 TEXT ); .import test.csv t SELECT id1, typeof(id1), a1, typeof(a1), n1, typeof(n1), a2, typeof(a2) FROM t; </code></pre> the result is <pre class="prettyprint"><code>id1 typeof(id1) a1 typeof(a1) n1 typeof(n1) a2 typeof(a2) ---- ----------- ------ ---------- -- ---------- ------ ---------- 12 integer gamma text 17 integer delta text 67 integer text 19 integer zeta text 96 integer eta text text theta text 98 integer iota text 29 integer text </code></pre> so the missing values have become text. I would appreciate some guidance on how to ensure that all missing values become NULL.

sqlite3 imports values as text and there does not seem to be a way to make it treat empty values as nulls. However, you can update the tables yourself after import, setting empty strings to nulls, like <pre class="prettyprint"><code>UPDATE t SET a1=NULL WHERE a1=''; </code></pre> Repeat for each column. You can also create a trigger for such updates: <pre class="prettyprint"><code>CREATE TRIGGER trig_a1 AFTER INSERT ON t WHEN new.a1='' BEGIN UPDATE t SET a1=NULL WHERE rowid=new.rowid; END; </code></pre>

How can I get missing values recorded as NULL when importing from csv

Tags:

null

sqlite

csv

missing-data

I have multiple, large, csv files, each of which has missing values in many places. When I import the csv file into SQLite, I would like to have the missing values recorded as NULL for the reason that another application expects missing data to be indicated by NULL. My current method does not produce the desired result.

An example CSV file (test.csv) is:

12|gamma|17|delta
67||19|zeta
96|eta||theta
98|iota|29|

The first line is complete; each of the other lines has (or is meant to show!) a single missing item. When I import using:

.headers on
.mode column
.nullvalue NULL
CREATE TABLE t (
  id1     INTEGER  PRIMARY KEY,
  a1      TEXT,
  n1      INTEGER,
  a2      TEXT
);
.import test.csv t
SELECT
  id1, typeof(id1),
  a1,  typeof(a1),
  n1,  typeof(n1),
  a2,  typeof(a2)
FROM t;

the result is

id1   typeof(id1)  a1      typeof(a1)  n1  typeof(n1)  a2      typeof(a2)
----  -----------  ------  ----------  --  ----------  ------  ----------
12    integer      gamma     text      17  integer     delta   text                      
67    integer                text      19  integer     zeta    text                      
96    integer      eta       text          text        theta   text                      
98    integer      iota      text      29  integer             text

so the missing values have become text. I would appreciate some guidance on how to ensure that all missing values become NULL.

577

asked Mar 13 '14 05:03

user02814

1 Answers

sqlite3 imports values as text and there does not seem to be a way to make it treat empty values as nulls.

However, you can update the tables yourself after import, setting empty strings to nulls, like

UPDATE t SET a1=NULL WHERE a1='';

Repeat for each column.

You can also create a trigger for such updates:

CREATE TRIGGER trig_a1 AFTER INSERT ON t WHEN new.a1='' BEGIN
  UPDATE t SET a1=NULL WHERE rowid=new.rowid;
END;

158

answered Sep 28 '22 05:09

laalto

Related questions
                            
                                Setting a buffer of 0 in Pandas dataframe.to_csv
                            
                                Open CSV and copy
                            
                                How can I optimize my PowerShell - LDAP Query?
                            
                                Python in memory table data structures for analysis (dict, list, combo)
                            
                                How do I export a SSRS matrix to CSV without losing the structure?
                            
                                Join two dataframes before exporting as .csv files
                            
                                casting column from csv as date powershell
                            
                                Read Large File line by line in R without header
                            
                                Copy in Postgres from a tab delimited file to table
                            
                                Unix uniq command to CSV file
                            
                                powershell merge csv's
                            
                                DataTable memory huge consumption
                            
                                Postgresql: CSV export with escaped linebreaks
                            
                                exporting 20k or more records from MY SQL to CSV using php [duplicate]
                            
                                CSV resize columns depending on the content
                            
                                Semi-Colon Delimiter in Mongoimport
                            
                                Scrapy - Getting duplicated items using JOBDIR
                            
                                RStudio not picking the encoding I'm telling it to use when reading a file
                            
                                ConvertTo-Csv Output without quotes
                            
                                How to Return CSV Data in Browser From Spring Controller

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With