
Is there a Node module for an async JSON parser that does not load the entire JSON string into memory?

I realize that there are a ton of Node modules that provide an async API for parsing JSON, but many of them seem to read the entire file or stream into memory, construct a giant string, and then pass it to JSON.parse(). This is what the second answer to "How to parse JSON using NodeJS?" suggests, and is exactly what the jsonfile module does.
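
For reference, the pattern I want to avoid looks roughly like this (the path is just a placeholder):

var fs = require('fs');

// Buffer the whole file into one giant string, then hand it to JSON.parse()
// in a single synchronous call. This is the memory profile I want to avoid.
fs.readFile('/path/to/huge.json', 'utf8', function (err, jsonString) {
  if (err) throw err;
  var data = JSON.parse(jsonString); // entire document held in memory as a string
  // ...
});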

Constructing a giant string is exactly what I want to avoid. I want an API like:

parseJsonFile(pathToJsonFile): Promise

where the Promise that is returned resolves to the parsed JSON object. This implementation should use a constant amount of memory. I'm not interested in any sort of SAX-like thing that broadcasts events as various pieces are parsed: just the end result.

I think jsonparse may do what I want (it clearly includes logic for parsing JSON without using JSON.parse()), but there is no simple example in the README.md, and the one file in the examples directory seems overly complicated.
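
For what it's worth, here's a rough sketch of how I imagine wrapping jsonparse to get that API, based on my reading of its onValue/write interface (the onError hook and the stack check are my assumptions, not something I've verified):

var fs = require('fs');
var Parser = require('jsonparse');

function parseJsonFile(pathToJsonFile) {
  return new Promise(function (resolve, reject) {
    var parser = new Parser();

    parser.onValue = function (value) {
      // Assumption: an empty stack means this value is the root,
      // i.e. the fully parsed document.
      if (this.stack.length === 0) {
        resolve(value);
      }
    };

    // Assumption: jsonparse reports malformed input through onError.
    parser.onError = reject;

    fs.createReadStream(pathToJsonFile)
      .on('data', function (chunk) { parser.write(chunk); })
      .on('error', reject);
  });
}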

bolinfest asked Oct 17 '14

1 Answer

I've written a module that does this: BFJ (Big-Friendly JSON). It exports a bunch of functions that operate at different levels of abstraction, but are all asynchronous and streaming at their core.

At the highest level are two functions for reading from and writing to the file system, bfj.read and bfj.write. They each return a promise, so you call them like this:

var bfj = require('bfj');

// Asynchronously read from a JSON file on disk
bfj.read(path)
  .then(data => {
    // :)
  })
  .catch(error => {
    // :(
  });

// Asynchronously write to a JSON file on disk
bfj.write(path, data)
  .then(data => {
    // :)
  })
  .catch(error => {
    // :(
  });

Also at this level is a function for serializing data to a JSON string, called bfj.stringify:

// Asynchronously serialize data to a JSON string
bfj.stringify(data)
  .then(json => {
    // :)
  })
  .catch(error => {
    // :(
  });

Beneath those are two more generic functions for reading from and writing to streams, bfj.parse and bfj.streamify. These serve as foundations for the higher level functions, but you can also call them directly:

// Asynchronously parse JSON from a readable stream
bfj.parse(readableStream)
  .then(data => {
    // :)
  })
  .catch(error => {
    // :(
  });

// Asynchronously serialize data to a writable stream of JSON
bfj.streamify(data)
  .pipe(writableStream);

At the lowest level there are two functions analogous to SAX parsers/serializers, bfj.walk and bfj.eventify. It's unlikely you'd want to call these directly; they're just the guts of the implementation for the higher levels.

It's open-source and MIT-licensed. For more information, check the readme.

Phil Booth answered Nov 02 '22