How to Process PostgreSQL Triggers in a Distributed Environment

Tags:

We're in the process of implementing PostgreSQL Triggers to monitor for inserts/updates/deletes on several tables so that another that app that is listening for these events can keep our relational database in sync with our full-text search database.

Here's what the trigger function looks like:

CREATE FUNCTION notification() RETURNS trigger AS $$
BEGIN
  PERFORM pg_notify('search', TG_TABLE_NAME || ',id,' || NEW.id);
  RETURN NULL;
END;
$$ LANGUAGE plpgsql;

And here's how we're adding the trigger to each table:

CREATE TRIGGER foo_trigger AFTER INSERT OR UPDATE or DELETE ON foo
FOR EACH ROW EXECUTE PROCEDURE notification();

And here is a very basic example of how we would have a node app (worker) listening for these trigger events:

var pg  = require('pg');

var connString = "postgres://user@localhost/foo_local";

pg.connect(connString, function(err, client, done) {

  client.on('notification', function(msg) {
    //get the added / updated / deleted record
    //sync it with the search database
  });

  var query = client.query('LISTEN search');
});

Here's my three part question:

Part 1 Our app is load balanced across several instances. What happens when the node / worker app, which is also distributed, receives an event? Will all instances of the worker app that are listening receive the triggered event?

If so, that's bad - we don't want all instances of the worker app to process every event because they'd all be doing the same work and that would negate the benefits of having multiple listeners to distribute the load. How do we mitigate this?

Part 2 What happens if the worker receives a trigger event, but it is long running? Will PostgreSQL queue the events that have been triggered until the listeners receive them?

Part 3 We've got about 5 tables that we want to fire triggers on INSERT / UPDATE / DELETE. We've got a lot of requests, so this would fire a lot of events in a short period of time. We need a worker to listen to these events and process the changed records so that it can send them along to the full-text search database. Is there a better way to architect this to handle the volume?

The other solution our team is considering is abandoning SQL Triggers and just using a message queuing system to shove messages in a data store (SQS or Redis) and then just have workers pick off messages from the queue. We want to avoid this route if we can as it adds more architecture to our platform; however, we're prepared to do it if it's our only option.

Your thoughts would be much appreciated.

280

asked May 05 '15 23:05

doremi

1 Answers

First of all, in your trigger function, you might want to make life easier for your listeners, by providing more specific details of exactly what changed (e.g. in an UPDATE).

You could do something like this:

CREATE OR REPLACE FUNCTION notification() RETURNS trigger AS $$
DECLARE
  id bigint;
BEGIN
  IF TG_OP = 'INSERT' OR TG_OP = 'UPDATE' THEN
    id = NEW.id;
  ELSE
    id = OLD.id;
  END IF;

  IF TG_OP = 'UPDATE' THEN
    PERFORM pg_notify('table_update', json_build_object('schema', TG_TABLE_SCHEMA, 'table', TG_TABLE_NAME, 'id', id, 'type', TG_OP, 'changes', hstore_to_json(hstore(NEW) - hstore(OLD)))::text);
    RETURN NEW;
  END IF;

  IF TG_OP = 'INSERT' THEN
    PERFORM pg_notify('table_update', json_build_object('schema', TG_TABLE_SCHEMA, 'table', TG_TABLE_NAME, 'id', id, 'type', TG_OP, 'row', row_to_json(NEW))::text);
    RETURN NEW;
  END IF;

  IF TG_OP = 'DELETE' THEN
    PERFORM pg_notify('table_update', json_build_object('schema', TG_TABLE_SCHEMA, 'table', TG_TABLE_NAME, 'id', id, 'type', TG_OP, 'row', row_to_json(OLD))::text);
    RETURN OLD;
  END IF;

END;
$$ LANGUAGE plpgsql;

Now for your questions, or at least: Part 1: I believe all the instances of the worker apps that are listening will receive the triggered event. This can be useful for pub/sub style real-time notification to multiple listeners. For your use case, it sounds like you would need to add some kind of queue package on top of the basic PostgreSQL LISTEN/NOTIFY, such as queue_classic (for Ruby) or perhaps pg-jobs for node.js.

Anyway, since it's several months since you asked this, I'm wondering what path you took in the end and how it worked out? Can you share your experience and insights?

129

answered Oct 20 '22 01:10

Yoni Rabinovitch

Related questions
                            
                                How to detect older version of browser and redirect to browser support page
                            
                                Web speech API: Consistently get the supported speech synthesis voices on iOS safari
                            
                                Javascript Add Row to HTML Table & Increment ID
                            
                                Can Angular-UI-Grid-Edit be used with "controller as" syntax?
                            
                                jquery on trigger keypress event twice
                            
                                Failed expectation: "Expected [ ] to be empty array."
                            
                                If 'if' condition is false, statements do not execute in chrome, but execute in Firefox
                            
                                How to turn on Pause On Uncaught Exceptions in Google Chrome Canary?
                            
                                setState not triggering a re-render when data has been modified
                            
                                Timing with javascript performance.now()
                            
                                How to wrap multiple middleware functions into one?
                            
                                How to dynamically update jquery datatable using js array as data source
                            
                                SVG Path Overlay and Animate Out Another Path
                            
                                html2canvas screenshot capturing current window, not entire body
                            
                                Audio tag, how to handle it from Angular
                            
                                Arrow shape using FabricJS
                            
                                Deep understanding: How code structure affects the content of date arrays created with loops
                            
                                AngularJS ng-cloak does not prevent code blinking in Mean.js
                            
                                Update AngularJS scope from 3rd party library aynchronous callback
                            
                                CSS: Performance wise, better to use calc or position absolute

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to Process PostgreSQL Triggers in a Distributed Environment

Tags:

javascript

node.js

postgresql

events

triggers

doremi

People also ask

1 Answers

Yoni Rabinovitch

Recent Activity

Donate For Us