Brute-Force language detection

I need an algorithm (in any programming language) to compute the fitness score for a hill-climbing attack on a cipher in a crypto challenge. The algorithm should estimate how likely it is that a candidate decryption (which contains no spaces) is English text rather than a random sequence of characters. It should also give credit for words that are still incomplete.

I tried several algorithms that I developed myself, but they did not work well.

My research:

An Enigma M4 crypto project ( http://www.bytereef.org/m4_project.html ) uses Sinkov statistics, which I would like to use as well.

The only thing I found was a document about «quebra-pedra», a Java framework that includes the Sinkov log-weight analysis I am looking for.

http://www.google.com/m?client=ms-android-samsung&source=android-home#q=Quebra-pedra+framework+java

However, I could not find where to download the framework, nor could I find any implementation or description of the Sinkov test.

I would be glad for any hints. Thanks.

asked Oct 17 '11 23:10

Daniel Marschall

1 Answer

I don't know about Sinkov statistics, but language models from natural language processing can do exactly what you want, scoring text by how similar it is to English.

I wrote a simple character-bigram model here; it should be reasonably easy to follow.

https://github.com/rrenaud/Gibberish-Detector
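To give a rough idea of what such a model looks like, here is a minimal sketch of a character-bigram scorer. It is not the code from the repository above; the function names (`normalize`, `train_bigram_model`, `score`) and the tiny `SAMPLE_CORPUS` are assumptions made up for illustration, and a usable model would need to be trained on a large English text. Spaces are stripped before counting, so bigrams that span word boundaries are learned too, which matches candidate decryptions that have no spaces.

```python
import math
from collections import Counter

ALPHABET = "abcdefghijklmnopqrstuvwxyz"

def normalize(text):
    """Lowercase and keep only a-z (candidate decryptions have no spaces)."""
    return "".join(c for c in text.lower() if c in ALPHABET)

def train_bigram_model(corpus):
    """Return log P(b | a) for every letter pair, with add-one smoothing."""
    text = normalize(corpus)
    counts = Counter(zip(text, text[1:]))
    model = {}
    for a in ALPHABET:
        # Add-one smoothing: every pair gets a pseudo-count of 1,
        # so unseen pairs never get probability zero.
        row_total = sum(counts[(a, b)] for b in ALPHABET) + len(ALPHABET)
        for b in ALPHABET:
            model[(a, b)] = math.log((counts[(a, b)] + 1) / row_total)
    return model

def score(candidate, model):
    """Average log-probability per bigram; higher means more English-like."""
    text = normalize(candidate)
    if len(text) < 2:
        return float("-inf")
    total = sum(model[pair] for pair in zip(text, text[1:]))
    return total / (len(text) - 1)

# Placeholder training text (an assumption for this sketch only);
# replace it with a large English corpus for real use.
SAMPLE_CORPUS = (
    "the quick brown fox jumps over the lazy dog while the other hounds "
    "rest in the shade near the old stone wall and then the rain starts"
)

model = train_bigram_model(SAMPLE_CORPUS)

if __name__ == "__main__":
    for candidate in ("thequickbrown", "zzqqjjxxkkvv"):
        print(candidate, score(candidate, model))
```

In a hill-climbing attack, `score` could serve as the fitness function: keep a key change only if the score of the resulting decryption goes up. Because the score is built from letter pairs rather than whole words, partially correct decryptions with incomplete words still score better than random text.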

answered Sep 29 '22 22:09

Rob Neuhaus

Tags: java, algorithm, cryptography, nlp
