algorithms to evaluate user responses

Tags:

I'm working on a web application which will be used for classifying photos of automobiles. The users will be presented with photos of various vehicles, and will be asked to answer a series of questions about what they see. The results will be recorded to a database, averaged, and displayed.

I'm looking for algorithms to help me identify users which frequently don't vote with the group, indicating that they're probably either not paying attention to the photos, or that they're lying about what they see. I then want to exclude these users, and recalculate the results, such that I can say, with a known amount of confidence, that this particular photo shows a vehicle that is this and that.

This question goes out to all you computer science guys, where to find such algorithms or to give myself the theoretical background to design such algorithms. I'm assuming I'm going to have to learn some probability and statics, maybe some data mining. Some book recommendations would be great. Thanks!

P.S. These are multiple choice questions.

All of these are good suggestions. Thank you! I wish there was a way on stack overflow to select multiple correct answers so more of you could be acknowledged for your contributions!!

202

asked Nov 01 '09 19:11

Ralph

2 Answers

I believe what you described is solved using outlier/anomaly detection. A number of techniques exist:

statistical-based methods
distance-based methods
model-based methods

I suggest you take a look at these slides from the excellent book Introduction to Data Mining

answered Sep 26 '22 03:09

Amro

Read The Elements of Statistical Learning, it is a great compendium on data mining.

You can be interested especially in unsupervised algorithms, for example clustering. Assuming that most people do not lie, the biggest cluster is right and the rest is wrong. Mark people accordingly, then apply some bayesian statistics and you'll be done.

Of course, most data mining technologies are pretty experimentative, so don't count on that they will be always right... or even in most cases.

114

answered Sep 26 '22 03:09

liori

Related questions
                            
                                StAX - how to set XMLInputFactory.IS_VALIDATING to true?
                            
                                Can Python's distutils compile .S (assembly)?
                            
                                Any issues with large numbers of critical sections?
                            
                                How to deploy a Spring Integration app in Tomcat?
                            
                                Including libcurl in project
                            
                                Trouble with visual studio file extensions (.vdproj)
                            
                                Show a character's Unicode codepoint value in Eclipse
                            
                                Visual Studio pre-build scripts can't find exe files in windows/system32 [closed]
                            
                                match array against string in java
                            
                                Editing an SMTP header with an Exchange 2007 Transport Agent
                            
                                Using the new ( since Linux Kernel 2.6.20 ) workqueue interface
                            
                                Web Based Stack Dump Tool for ASP.NET Using Mdbg?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With