RESTful web services vs Socket Programming for a Data Intensive Application

Tags:

I'm building a web application with Ruby on Rails which needs to be highly scalable. In this application, data is produced by a mobile client (approximately 20 bytes) every second. All of this data must be transferred to a server at some point, preferably as soon as possible.

To accomplish this task, I want the server to act as a RESTful service. The client could buffer locations (say every 5 to 30 seconds) and then shoot them off as a HTTP put request, where the server could then store them. I believe this model is simpler to implement, and better handles high volume traffic, as the clients could keep buffering data until they hear a response from the server.

My boss, on the other hand, wants to implement the server using socket programming. He believes socket programming will result in less data being transferred, which will increase the total efficiency of the system. I can't disagree on this point, but I think given modern bandwidth the extra overhead with HTTP is worth it. Plus, I think trying to maintain thousands (or millions) of simultaneous connects with users will cause its own problems and greatly increase the complexity of the server.

Honestly, I don't know the right approach to this problem, so I thought I'd post it here and get the opinions of much smarter people than myself. I'd appreciate it if any answers included the pros and cons of the proposed solution.

Thanks.

Update

We now have a few additional requirements flushed out. First, the mobile client cannot upload more than 5 GB of data per month. In this case, we're talking one message a second for eight hours a day per month. Second is we want's to combine messages as little as possible. This is to ensure if something happens to the mobile client (say a car crash), we lose as little data as possible.

759

asked May 20 '11 04:05

LandonSchropp

3 Answers

Your boss appears to be optimizing prematurely, which is not really a good idea.

Instead of trying to fight an imaginary performance bogeyman before you've even started writing your code, you should examine your application's requirements and design to them. Don't let perceived problems drive your design.

If it comes to it, have your boss outline exactly how he'd marshal data across his socket connection and then do some quick calculations to see if you could match or beat them with HTTP. Will he use something like Google's Protocol Buffers, or write his own marshaling protocol? If so, will it be self-describing? How about application "verbs" like what you'd get for free in HTTP? Will his connections be persistent? There's a lot more to "sockets" than just opening a connection and spewing bytes down it.

You've also correctly noted that your boss seems to be favoring raw speed of sockets over everything else: scalability, maintainability, availability of development and testing tools, protocol sniffers, the helpful semantics of the HTTPS verbs, and so on. HTTP is well understood by load balancers and firewalls and the like. Your proprietary socket protocol will not be so lucky.

What I'd suggest is you look into all the options out there and evaluate them from a performance perspective through testing, prototyping and benchmarking. Then weigh those numbers against the difficulty of building and maintaining the application with that technology.

124

answered Oct 19 '22 02:10

Brian Kelly

Stick to HTTP.

It's far easier to create a park of HTTP servers and put them behind a load balancer than to try do the same thing with for your own protocol. Why? Everything already exists for HTTP.

Update

What you need to reimplement yourself:

Buffer management (important if your load is high)
Making sure that you've received an entire message (A simple Receive/BeginReceive is not enough)
Asynchronous socket handling
Authentication
A load balancer (this part is tricky and you need to design it carefully)
Your own protocol (you need a way to identify when you've received an entire message)

If you use ASP.NET MVC + JSON (the steps for merb or rails is similar):

Create a new website
Activate digest authentication in IIS
Create a new controller, tag it with the [Authorize] attribute
Add an action

What is cheapest? A server or having you spend a month on something that already have been done?

answered Oct 19 '22 02:10

jgauffin

HTTP was designed to scale based on the assumption that the vast majority of requests are GETs. It sounds like the most of your interactions are the client sending data to the server. I think it is quite probable that there exists a better architectural style than REST to achieve what you are trying to do.

The question is, can you afford the time to start from scratch, or is HTTP good enough for your needs. Without knowing more details about your app, I think it is difficult to give good advice.

answered Oct 19 '22 02:10

Darrel Miller

Related questions
                            
                                Disabling Rails logging from script/runner
                            
                                Can memcache tell you how much memory it is using?
                            
                                Rails 3: Uniqueness validation for nested fields_for
                            
                                What permissions are needed for apache Passenger
                            
                                i18n: error message localization for particular model
                            
                                ActiveAdmin -- How to access instance variables from partials?
                            
                                How can I get file extension from URI for an rails application
                            
                                How to sort by created_at column of association in rails?
                            
                                uninitialized constant ActionView::CompiledTemplates::Category
                            
                                Inline HAML Links but with content
                            
                                Capybara webkit invalid response error, how to debug?
                            
                                RSpec stub helper method in controller spec
                            
                                /config/initializers/secret_token.rb not being generated. Why not?
                            
                                I can't run "bundle update" because of "mysql2" gem
                            
                                Redirect with turbolinks from javascript
                            
                                Pry like debugger for Sidekiq?
                            
                                Getting Started with IronRuby on Rails [closed]
                            
                                Ordering of has_and_belongs_to_many associations
                            
                                storing more than 255 characters in a PostgreSQL DB on heroku
                            
                                Cannot get factory_girl running under rails 3.0.5,unexpected tCONSTANT

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

RESTful web services vs Socket Programming for a Data Intensive Application

Tags:

rest

architecture

networking

ruby-on-rails

sockets

LandonSchropp

People also ask

3 Answers

Brian Kelly

Update

jgauffin

Darrel Miller

Recent Activity

Donate For Us