Google Datastore - Blob or Text

1 Answers

There is no significant performance difference between the two - just use whichever one best fits your data. BlobProperty should be used to store binary data (e.g., str objects) while TextProperty should be used to store any textual data (e.g., unicode or str objects). Note that if you store a str in a TextProperty, it must only contain ASCII bytes (less than hex 80 or decimal 128) (unlike BlobProperty).

Both of these properties are derived from UnindexedProperty as you can see in the source.

Here is a sample app which demonstrates that there is no difference in storage overhead for these ASCII or UTF-8 strings:

import struct

from google.appengine.ext import db, webapp
from google.appengine.ext.webapp.util import run_wsgi_app

class TestB(db.Model):
    v = db.BlobProperty(required=False)

class TestT(db.Model):
    v = db.TextProperty(required=False)

class MainPage(webapp.RequestHandler):
    def get(self):
        self.response.headers['Content-Type'] = 'text/plain'

        # try simple ASCII data and a bytestring with non-ASCII bytes
        ascii_str = ''.join([struct.pack('>B', i) for i in xrange(128)])
        arbitrary_str = ''.join([struct.pack('>2B', 0xC2, 0x80+i) for i in xrange(64)])
        u = unicode(arbitrary_str, 'utf-8')

        t = [TestT(v=ascii_str), TestT(v=ascii_str*1000), TestT(v=u*1000)]
        b = [TestB(v=ascii_str), TestB(v=ascii_str*1000), TestB(v=arbitrary_str*1000)]

        # demonstrate error cases
        try:
            err = TestT(v=arbitrary_str)
            assert False, "should have caused an error: can't store non-ascii bytes in a Text"
        except UnicodeDecodeError:
            pass
        try:
            err = TestB(v=u)
            assert False, "should have caused an error: can't store unicode in a Blob"
        except db.BadValueError:
            pass

        # determine the serialized size of each model (note: no keys assigned)
        fEncodedSz = lambda o : len(db.model_to_protobuf(o).Encode())
        sz_t = tuple([fEncodedSz(x) for x in t])
        sz_b = tuple([fEncodedSz(x) for x in b])

        # output the results
        self.response.out.write("text:   1=>%dB  2=>%dB  3=>%dB\n" % sz_t)
        self.response.out.write("blob:   1=>%dB  2=>%dB  3=>%dB\n" % sz_b)

application = webapp.WSGIApplication([('/', MainPage)])
def main(): run_wsgi_app(application)
if __name__ == '__main__': main()

And here is the output:

text:   1=>172B  2=>128047B  3=>128047B
blob:   1=>172B  2=>128047B  3=>128047B

answered Sep 22 '22 05:09

David Underhill

Related questions
                            
                                java.security.AccessControlException: access denied ("java.lang.RuntimePermission" "accessClassInPackage.sun.reflect.annotation") Spring
                            
                                Removing header (User-agent) from make_fetch_call while requesting from GAE
                            
                                Unable to use utf8mb4 character set with CloudSQL on AppEngine Python
                            
                                java.lang.reflect.InvocationTargetException at com.twilio.sdk.AppEngineClientConnection.flush(AppEngineClientConnection.java:204)
                            
                                Is the Sockets API getting turned down?
                            
                                How to map only subdomain to App Engine without the naked domain?
                            
                                Connection refused to running Google Cloud SQL instance via proxy or from App Engine
                            
                                Google Cloud Shell is temporarily unavailable [closed]
                            
                                Google AppEngine Standard Local Server (Java8) + Spring Boot + WebFlux doesn't see my @RestController
                            
                                Google Datastore filter with OR condition
                            
                                ImportError: Importing the devappserver sandbox module from user application code is not permitted
                            
                                404 error when using Google App Engine with flask and flask-restplus
                            
                                How to log Stackdriver log messages correlated by trace id using stdout Go 1.11
                            
                                tiny random-looking ID generation
                            
                                Bulk upload data into data store for GAE Java project
                            
                                Converting an Eclipse Java project to a Google AppEngine one
                            
                                What is a good SOAP client library for python on App Engine?
                            
                                Best full-text search for google-app-engine
                            
                                How do I configure multiple Ubuntu Python installations to avoid App Engine's SSL error?
                            
                                tipfy for Google App Engine: Is it stable? Can auth/session components of tipfy be used with webapp?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Google Datastore - Blob or Text

Tags:

google-app-engine

google-cloud-datastore

Keyur

People also ask

1 Answers

David Underhill

Recent Activity

Donate For Us