Is there a dataset available to fully test a base64 encoder/decoder?

Tags:

unit-testing

base64

I see that there are many open-source base64 implementations available, and I found multiple internal implementations in a product that I am maintaining.

I'm trying to factor out the duplicates, but I am not 100% certain that all these implementations give identical output. Therefore I need a dataset that tests all possible combinations of input.

Is one available somewhere? A Google search did not turn anything up.

I saw a similar question on Stack Overflow, but it has not been fully answered, and it only asks for one phrase (in ASCII) that would exercise all 64 characters. It does not handle padding with =, for example, so a single test string will certainly not fit the bill for a 100% test.


asked Aug 22 '12 09:08

David Nouls

1 Answer

Perhaps something like Base64Test in Bouncy Castle would do what you want? The tricky part in base64 is handling the padding correctly, so it's certainly important to cover that, as you mentioned. Accordingly, RFC 4648 specifies these test vectors:

   BASE64("") = ""
   BASE64("f") = "Zg=="
   BASE64("fo") = "Zm8="
   BASE64("foo") = "Zm9v"
   BASE64("foob") = "Zm9vYg=="
   BASE64("fooba") = "Zm9vYmE="
   BASE64("foobar") = "Zm9vYmFy"
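
As a rough sketch of how these vectors could be used, the Python snippet below runs them against an encoder/decoder pair and collects any mismatches. The standard library's `base64.b64encode` and `base64.b64decode` stand in for the implementations under test, and `check_vectors` is a hypothetical helper name, not part of any library.

```python
import base64

# Test vectors from RFC 4648, section 10 (raw bytes, expected base64).
RFC4648_VECTORS = [
    (b"", b""),
    (b"f", b"Zg=="),
    (b"fo", b"Zm8="),
    (b"foo", b"Zm9v"),
    (b"foob", b"Zm9vYg=="),
    (b"fooba", b"Zm9vYmE="),
    (b"foobar", b"Zm9vYmFy"),
]


def check_vectors(encode, decode):
    """Return one (direction, input, expected, actual) tuple per mismatch."""
    failures = []
    for raw, encoded in RFC4648_VECTORS:
        actual_encoded = encode(raw)
        if actual_encoded != encoded:
            failures.append(("encode", raw, encoded, actual_encoded))
        actual_raw = decode(encoded)
        if actual_raw != raw:
            failures.append(("decode", encoded, raw, actual_raw))
    return failures


# Assumption: the stdlib functions play the role of an implementation under test.
failures = check_vectors(base64.b64encode, base64.b64decode)
print(failures)  # an empty list means every vector passed
```

To test one of your own implementations, pass its encode and decode functions to `check_vectors` instead. The seven vectors cover each padding case (no padding, `=`, `==`), but they do not exercise all 64 output characters.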

Some of your implementations may produce base64 output that differs only in line breaking: whether line breaks are inserted at all, where they are inserted, and which line terminator is used. You would have to do additional testing to determine whether you can safely replace an implementation that uses one style with a different one. In particular, a decoder might make assumptions about line length or termination.
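
One possible shape for that additional testing is a differential test: feed the same inputs to two encoders and compare the results after stripping line breaks. The Python sketch below does this for every input of 0 to 2 bytes plus one input long enough to force wrapping. It uses the standard library's `base64.b64encode` (no line breaks) and `base64.encodebytes` (MIME-style wrapping) as stand-ins for two implementations; `strip_line_breaks`, `short_inputs`, and `first_difference` are hypothetical helper names. It is a hedge against differences, not a proof of equivalence.

```python
import base64
import binascii
import itertools


def strip_line_breaks(encoded: bytes) -> bytes:
    """Remove CR and LF so wrapped and unwrapped output can be compared."""
    return encoded.replace(b"\r", b"").replace(b"\n", b"")


def short_inputs(max_len: int = 2):
    """Yield every byte string of length 0..max_len."""
    for length in range(max_len + 1):
        for combo in itertools.product(range(256), repeat=length):
            yield bytes(combo)


def first_difference(encode_a, encode_b, inputs):
    """Return the first input on which the encoders disagree, or None."""
    for raw in inputs:
        if strip_line_breaks(encode_a(raw)) != strip_line_breaks(encode_b(raw)):
            return raw
    return None


# 256 bytes encode to 344 characters, enough to force line wrapping.
long_input = bytes(range(256))
inputs = itertools.chain(short_inputs(2), [long_input])
print(first_difference(base64.b64encode, base64.encodebytes, inputs))

# The wrapped output is not byte-identical to the unwrapped output.
wrapped = base64.encodebytes(long_input)
print(wrapped == base64.b64encode(long_input))

# A lenient decoder accepts the line breaks; a strict one rejects them.
lenient_result = base64.b64decode(wrapped)
try:
    base64.b64decode(wrapped, validate=True)
    strict_accepts_wrapped = True
except binascii.Error:
    strict_accepts_wrapped = False
print(lenient_result == long_input, strict_accepts_wrapped)
```

If the two encoders agree only after normalization, the remaining question is whether every decoder that consumes their output tolerates the other line-breaking style, which the strict versus lenient decode at the end illustrates.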


answered Nov 13 '22 13:11

Hugh Brackett
