Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How do I import from Excel to a DataSet using Microsoft.Office.Interop.Excel?

Tags:

What I want to do

I'm trying to use the Microsoft.Office.Interop.Excel namespace to open an Excel file (XSL or CSV, but sadly not XSLX) and import it into a DataSet. I don't have control over the worksheet or column names, so I need to allow for changes to them.

What I've tried

I've tried the OLEDB method of this in the past, and had a lot of problems with it (buggy, slow, and required prior knowledge of the Excel file's schema), so I want to avoid doing that again. What I'd like to do is use Microsoft.Office.Interop.Excel to import the workbook directly to a DataSet, or loop through the worksheets and load each one into a DataTable.

Believe it or not, I've had trouble finding resources for this. A few searches on StackOverflow have found mostly people trying to do the reverse (DataSet => Excel), or the OLEDB technique. Google hasn't been much more helpful.

What I've got so far

    public void Load(string filename, Excel.XlFileFormat format = Excel.XlFileFormat.xlCSV)     {         app = new Excel.Application();         book = app.Workbooks.Open(Filename: filename, Format: format);          DataSet ds = new DataSet();          foreach (Excel.Worksheet sheet in book.Sheets)         {             DataTable dt = new DataTable(sheet.Name);             ds.Tables.Add(dt);              //??? Fill dt from sheet          }          this.Data = ds;     } 

I'm fine with either importing the entire book at once, or looping through one sheet at a time. Can I do this with Interop.Excel?

like image 771
Justin Morgan Avatar asked Aug 30 '11 14:08

Justin Morgan


People also ask

What is the use of Microsoft Office Interop Excel?

Microsoft Office Interop (Excel Automation) is an option when creating/reading Excel files (XLS, XLSX, CSV) from C# or VB.NET application, but it has many drawbacks.

How read Excel file and insert into database in C#?

You can use OLEDB classes to read directly from Excel file using the Excel drivers in OleDbConnection. Get the data in a datatable and save it to database. string connectString = "Provider=Microsoft. ACE.


2 Answers

What about using Excel Data Reader (previously hosted here) an open source project on codeplex? Its works really well for me to export data from excel sheets.

The sample code given on the link specified:

FileStream stream = File.Open(filePath, FileMode.Open, FileAccess.Read);  //1. Reading from a binary Excel file ('97-2003 format; *.xls) IExcelDataReader excelReader = ExcelReaderFactory.CreateBinaryReader(stream); //... //2. Reading from a OpenXml Excel file (2007 format; *.xlsx) IExcelDataReader excelReader = ExcelReaderFactory.CreateOpenXmlReader(stream); //... //3. DataSet - The result of each spreadsheet will be created in the result.Tables DataSet result = excelReader.AsDataSet(); //... //4. DataSet - Create column names from first row excelReader.IsFirstRowAsColumnNames = true; DataSet result = excelReader.AsDataSet();  //5. Data Reader methods while (excelReader.Read()) { //excelReader.GetInt32(0); }  //6. Free resources (IExcelDataReader is IDisposable) excelReader.Close(); 

UPDATE

After some search around, I came across this article: Faster MS Excel Reading using Office Interop Assemblies. The article only uses Office Interop Assemblies to read data from a given Excel Sheet. The source code is of the project is there too. I guess this article can be a starting point on what you trying to achieve. See if that helps

UPDATE 2

The code below takes an excel workbook and reads all values found, for each excel worksheet inside the excel workbook.

private static void TestExcel()     {         ApplicationClass app = new ApplicationClass();         Workbook book = null;         Range range = null;          try         {             app.Visible = false;             app.ScreenUpdating = false;             app.DisplayAlerts = false;              string execPath = Path.GetDirectoryName(Assembly.GetExecutingAssembly().CodeBase);              book = app.Workbooks.Open(@"C:\data.xls", Missing.Value, Missing.Value, Missing.Value                                               , Missing.Value, Missing.Value, Missing.Value, Missing.Value                                              , Missing.Value, Missing.Value, Missing.Value, Missing.Value                                             , Missing.Value, Missing.Value, Missing.Value);             foreach (Worksheet sheet in book.Worksheets)             {                  Console.WriteLine(@"Values for Sheet "+sheet.Index);                  // get a range to work with                 range = sheet.get_Range("A1", Missing.Value);                 // get the end of values to the right (will stop at the first empty cell)                 range = range.get_End(XlDirection.xlToRight);                 // get the end of values toward the bottom, looking in the last column (will stop at first empty cell)                 range = range.get_End(XlDirection.xlDown);                  // get the address of the bottom, right cell                 string downAddress = range.get_Address(                     false, false, XlReferenceStyle.xlA1,                     Type.Missing, Type.Missing);                  // Get the range, then values from a1                 range = sheet.get_Range("A1", downAddress);                 object[,] values = (object[,]) range.Value2;                  // View the values                 Console.Write("\t");                 Console.WriteLine();                 for (int i = 1; i <= values.GetLength(0); i++)                 {                     for (int j = 1; j <= values.GetLength(1); j++)                     {                         Console.Write("{0}\t", values[i, j]);                     }                     Console.WriteLine();                 }             }          }         catch (Exception e)         {             Console.WriteLine(e);         }         finally         {             range = null;             if (book != null)                 book.Close(false, Missing.Value, Missing.Value);             book = null;             if (app != null)                 app.Quit();             app = null;         }     } 

In the above code, values[i, j] is the value that you need to be added to the dataset. i denotes the row, whereas, j denotes the column.

like image 90
reggie Avatar answered Oct 26 '22 23:10

reggie


Have you seen this one? From http://www.aspspider.com/resources/Resource510.aspx:

public DataTable Import(String path) {     Microsoft.Office.Interop.Excel.Application app = new Microsoft.Office.Interop.Excel.Application();     Microsoft.Office.Interop.Excel.Workbook workBook = app.Workbooks.Open(path, 0, true, 5, "", "", true, Microsoft.Office.Interop.Excel.XlPlatform.xlWindows, "\t", false, false, 0, true, 1, 0);      Microsoft.Office.Interop.Excel.Worksheet workSheet = (Microsoft.Office.Interop.Excel.Worksheet)workBook.ActiveSheet;      int index = 0;     object rowIndex = 2;      DataTable dt = new DataTable();     dt.Columns.Add("FirstName");     dt.Columns.Add("LastName");     dt.Columns.Add("Mobile");     dt.Columns.Add("Landline");     dt.Columns.Add("Email");     dt.Columns.Add("ID");      DataRow row;      while (((Microsoft.Office.Interop.Excel.Range)workSheet.Cells[rowIndex, 1]).Value2 != null)     {          row = dt.NewRow();         row[0] = Convert.ToString(((Microsoft.Office.Interop.Excel.Range)workSheet.Cells[rowIndex, 1]).Value2);         row[1] = Convert.ToString(((Microsoft.Office.Interop.Excel.Range)workSheet.Cells[rowIndex, 2]).Value2);         row[2] = Convert.ToString(((Microsoft.Office.Interop.Excel.Range)workSheet.Cells[rowIndex, 3]).Value2);         row[3] = Convert.ToString(((Microsoft.Office.Interop.Excel.Range)workSheet.Cells[rowIndex, 4]).Value2);         row[4] = Convert.ToString(((Microsoft.Office.Interop.Excel.Range)workSheet.Cells[rowIndex, 5]).Value2);         index++;          rowIndex = 2 + index;         dt.Rows.Add(row);     }     app.Workbooks.Close();     return dt; } 
like image 33
Juliusz Avatar answered Oct 27 '22 00:10

Juliusz