csv to sparse matrix in python

Tags:

I have a big csv file which lists connections between nodes in a graph. example:

0001,95784
0001,98743
0002,00082
0002,00091

So this means that node id 0001 is connected to node 95784 and 98743 and so on. I need to read this into a sparse matrix in numpy. How can i do this? I am new to python so tutorials on this would also help.

427

asked Dec 21 '09 08:12

Ankur Chauhan

2 Answers

Example using lil_matrix (list of list matrix) of scipy.

Row-based linked list matrix.

This contains a list (self.rows) of rows, each of which is a sorted list of column indices of non-zero elements. It also contains a list (self.data) of lists of these elements.

$ cat 1938894-simplified.csv
0,32
1,21
1,23
1,32
2,23
2,53
2,82
3,82
4,46
5,75
7,86
8,28

Code:

#!/usr/bin/env python

import csv
from scipy import sparse

rows, columns = 10, 100
matrix = sparse.lil_matrix( (rows, columns) )

csvreader = csv.reader(open('1938894-simplified.csv'))
for line in csvreader:
    row, column = map(int, line)
    matrix.data[row].append(column)

print matrix.data

Output:

[[32] [21, 23, 32] [23, 53, 82] [82] [46] [75] [] [86] [28] []]

answered Sep 21 '22 23:09

miku

If you want an adjacency matrix, you can do something like:

from scipy.sparse import *
from scipy import *
from numpy import *
import csv
S = dok_matrix((10000,10000), dtype=bool)
f = open("your_file_name")
reader = csv.reader(f)
for line in reader:
    S[int(line[0]),int(line[1])] = True

answered Sep 17 '22 23:09

tkerwin

Related questions
                            
                                Django - taking values from POST request, JavaScript fetch API
                            
                                How to use the AWS Python SDK while connecting via SSO credentials
                            
                                How do I annotate a callable with *args and **kwargs?
                            
                                Pairwise Distances Between Two "islands"/"connected components" in Numpy Array
                            
                                How to iterate over multiple lists of different lengths, but repeat the last value of a shorter list until the longest list is done?
                            
                                How to update pandas DataFrame.drop() for Future Warning - all arguments of DataFrame.drop except for the argument 'labels' will be keyword-only
                            
                                Ruby "is" equivalent
                            
                                'from X import a' versus 'import X; X.a'
                            
                                Running a Django site under mod_wsgi
                            
                                Does PyS60 produce sis files that are native?
                            
                                Optimizing Jinja2 Environment creation
                            
                                Writing binary data to a socket (or file) with Python
                            
                                Missing datetime.timedelta.to_seconds() -> float in Python?
                            
                                A python based PowerShell?
                            
                                How to do Obj-C Categories in Python?
                            
                                Pyserial problem with Arduino - works with the Python shell but not in a program
                            
                                Scrapy SgmlLinkExtractor question
                            
                                Plotting vector fields in python (matplotlib)
                            
                                How could I get a Frame with a scrollbar in Tkinter?
                            
                                Code bacteria: evolving mathematical behavior

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

csv to sparse matrix in python

Tags:

python

data-structures

sparse-matrix

Ankur Chauhan

People also ask

2 Answers

miku

tkerwin

Recent Activity

Donate For Us