Tips for refactoring a 20K lines library [closed]

Tags:

I've already awarded a 100 point bounty to mario's answer, but might start a second 100 point bounty if I see new good answers coming in. This is why I'm keeping the question open and will not choose a final answer, despite having awarded the bounty to mario.

This might seem like a simple question (study the code and refactor) but I'm hoping those with lots more experience can give me some solid advice.

The library is an open source 20,000 line library that's all in a single file and which I haven't written myself. The code looks badly written and the single file is even a bigger problem, because it freezes eclipse for half a minute at least every time I want to make a change, which is one of the reasons I think it's worth it to refactor this library into smaller classes.

So aside from reading the code and trying to understand it, are there common (or not so common) tips when refactoring a library such as this? What do you advise to make my life a little easier?

Thanks to everyone for your comments.

970

asked Dec 29 '10 20:12

jblue

2 Answers

A few generic principles apply:

Divide and conquer. Split the file into smaller, logical libraries and function groupings. You will learn more about the library this way, and make it easier to understand and test incrementally.
Remove duplication. Look for repeated functions and concepts, and replace them with standard library functions, or centralized functions within the library.
Add consistency. Smooth out parameters and naming.
Add unit tests. This is the most important part of refactoring a library. Use jUnit (or similar), and add tests that you can use to verify that the functions are both correct, and that they have not changed.
Add docs. Document your understanding of the consistent, improved library as you write your tests.

198

answered Sep 24 '22 01:09

Bruce Alderson

If the code is badly written, it is likely that it has a lot of cloning. Finding and getting rid of the clones would then likely make it a lot more maintainable as well as reducing its size.

You can find a variety of clone detectors, these specifically for PHP:

Bergmann's PHPCPD
SourceForge PMD
Our CloneDR

ranked in least-to-most capability order (IMHO with my strong personal self-interest in CloneDR) in terms of qualitatively different ability to detect interesting clones.

If the code is badly written, a lot of it might be dead. It would be worthwhile to find out which part executes in practice, and which does not. A test coverage tool can give you good insight into the answer for this question, even in the absence of tests (you simply exercise your program by hand). What the test coverage tool says executes, obviously isn't dead. What doesn't execute... might be worth further investigation to see if you can remove it. A test coverage tool is also useful to tell you how much of the code is exercised by your unit tests, as suggested by another answer. Finally, a test coverage tool can help you find where some of the functionality is: exercise the functionality from the outside, and whatever code the test coverage tool says is executed is probably relevant.

Our PHP Test Coverage Tool can collect test coverage data.

answered Sep 22 '22 01:09

Ira Baxter

Related questions
                            
                                Can I display all the cookies I set in PHP?
                            
                                str_replace() with associative array
                            
                                TCPDF: HTML table and page breaks
                            
                                PHP - Check if the page run on Mobile or Desktop browser [duplicate]
                            
                                How to get all post parameters in Symfony2? [duplicate]
                            
                                How do I check if the request is made via AJAX in CodeIgniter?
                            
                                CakePHP 2.0 - How to make custom error pages?
                            
                                Detecting SSL With PHP [duplicate]
                            
                                sort array based on the dateTime in php
                            
                                What's your experience with Doctrine ORM? [closed]
                            
                                file_get_contents => PHP Fatal error: Allowed memory exhausted
                            
                                Empty string instead of null values Eloquent
                            
                                How secure is PHP?
                            
                                PDO were rows affected during execute statement
                            
                                PHP function to delete all between certain character(s) in string
                            
                                Get timestamp of today and yesterday in php
                            
                                How to get Hours from Date in PHP & Cakephp?
                            
                                How can I make sure a float will always be rounded up with PHP?
                            
                                Making a HTTP GET request with HTTP-Basic authentication
                            
                                search associative array by value

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Tips for refactoring a 20K lines library [closed]

Tags:

oop

php

open-source

refactoring

jblue

People also ask

2 Answers

Bruce Alderson

Ira Baxter

Recent Activity

Donate For Us