Fastest way to import CSV files in MATLAB

Tags:

I've written a script that saves its output to a CSV file for later reference, but the second script for importing the data takes an ungainly amount of time to read it back in.

The data is in the following format:

Item1,val1,val2,val3
Item2,val4,val5,val6,val7
Item3,val8,val9

where the headers are on the left-most column, and the data values take up the remainder of the row. One major difficulty is that the arrays of data values can be different lengths for each test item. I'd save it as a structure, but I need to be able to edit it outside the MATLAB environment, since sometimes I have to delete rows of bad data on a computer that doesn't have MATLAB installed. So really, part one of my question is: Should I save the data in a different format?

Second part of the question: I've tried importdata, csvread, and dlmread, but I'm not sure which is best, or if there's a better solution. Right now I'm using my own script using a loop and fgetl, which is horribly slow for large files. Any suggestions?

function [data,headers]=csvreader(filename); %V1_1
 fid=fopen(filename,'r');
 data={};
 headers={};
 count=1;
 while 1
      textline=fgetl(fid);
      if ~ischar(textline),   break,   end
      nextchar=textline(1);
      idx=1;
      while nextchar~=','
        headers{count}(idx)=textline(1);
        idx=idx+1;
        textline(1)=[];
        nextchar=textline(1);
      end
      textline(1)=[];
      data{count}=str2num(textline);
      count=count+1;
 end
 fclose(fid);

(I know this is probably terribly written code - I'm an engineer, not a programmer, please don't yell at me - any suggestions for improvement would be welcome, though.)

973

asked Jan 11 '10 17:01

Doresoom

1 Answers

It would probably make the data easier to read if you could pad the file with NaN values when your first script creates it:

Item1,1,2,3,NaN
Item2,4,5,6,7
Item3,8,9,NaN,NaN

or you could even just print empty fields:

Item1,1,2,3,
Item2,4,5,6,7
Item3,8,9,,

Of course, in order to pad properly you would need to know what the maximum number of values across all the items is before hand. With either format above, you could then use one of the standard file reading functions, like TEXTSCAN for example:

>> fid = fopen('uneven_data.txt','rt');
>> C = textscan(fid,'%s %f %f %f %f','Delimiter',',','CollectOutput',1);
>> fclose(fid);
>> C{1}

ans = 

    'Item1'
    'Item2'
    'Item3'

>> C{2}

ans =

     1     2     3   NaN  %# TEXTSCAN sets empty fields to NaN anyway
     4     5     6     7
     8     9   NaN   NaN

145

answered Oct 01 '22 15:10

gnovice

Related questions
                            
                                Contatenation of binary operators like "3 + + 2" in Matlab does not give errors
                            
                                Storing MATLAB structs in Java objects
                            
                                How do I make a surf plot in MATLAB with irregularly spaced data?
                            
                                How can I loop indefinitely, but stop on some condition(s)?
                            
                                Find outlines/ borders of label image in MATLAB
                            
                                What's the best reserved seat sorting algorithm?
                            
                                How to plot hist with log scale
                            
                                How to overcome singularities in numerical integration (in Matlab or Mathematica)
                            
                                Jet colormap to grayscale
                            
                                OpenCV function similar to matlab's "find"
                            
                                Apply plot properties to all MATLAB subplots simultaneously
                            
                                Eigen boolean array slicing
                            
                                Fastest way to find unique values in an array
                            
                                How do detect a QR code pattern in an image?
                            
                                MatLab - Shifting an image using FFT
                            
                                Mapping values of a matrix?
                            
                                How can I determine disk space in MATLAB
                            
                                Find unique rows of a cell array considering all possible permutations on each row
                            
                                Matlab how to make smooth contour plot?
                            
                                Faster version of dec2bin function for converting many elements?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fastest way to import CSV files in MATLAB

Tags:

file-io

csv

matlab

data-import

Doresoom

People also ask

1 Answers

gnovice

Recent Activity

Donate For Us