Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Serialize MD5 computation-state and resume later?

I want to serialize/deserialize md5 context. But I don't know how to do it in Python. Pseudocode of what I want to do.

import md5
# Start hash generation
m = md5.new()
m.update("Content")

# Serialize m
serialized_m = serialize(m)

# In another function/machine, deserialize m
# and continue hash generation
m2 = deserialize(serialized_m)
m2.update("More content")
m2.digest()    

There are C++ libraries for this. Is there one for Python? Why doesn't the md5 library support it? Are there security concerns? Thanks.

Edited: I want to do this because for example, an HTTP server wants to accept streaming data in different HTTP requests. It would be convenient to serialize md5 context somehow between requests.

like image 756
Yey Avatar asked Sep 25 '12 23:09

Yey


2 Answers

HASH objects are not serializable: How to serialize hash objects in Python

Assuming you can pass around the unhashed data:

from Crypto.Hash import MD5

# generate hash
m = MD5.new()
s = "foo"
m.update(s)

# serialize m
serialized = s

# deserialize and continue hash generation
m2 = MD5.new(serialized)
if m2.hexdigest() == m.hexdigest():
    print "success"
m2.update("bar")
like image 163
Ryan Shea Avatar answered Nov 13 '22 09:11

Ryan Shea


I asked Mr Guido V Rossum. He replied that "I don't think there's a way. However it might make a decent feature request. You could submit one to bugs.python.org." So I did.

http://bugs.python.org/issue16059

like image 27
Yey Avatar answered Nov 13 '22 11:11

Yey