Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Finding corruption in a linked list

Tags:

c++

c

I had an interview today for a developer position and was asked an interesting techincal question that i did not know the answer to. I will ask it here to see if anyone can provide me with a solution for my curiosity. It is a multi-part question:

1) You are given a singly linked list with 100 elements (integer and a pointer to next node), find a way to detect if there is a break or corruption halfway through the linked list? You may do anything with the linked list. Note that you must do this in the list as it is iterating and this is verification before you realise that the list has any issues with it.

Assuming that the break in the linked list is at the 50th element, the integer or even the pointer to the next node (51st element) may be pointing to a garbage value which is not necessarily an invalid address.

2) Note that if there is a corruption in the linked list, how would you minimize data loss?

like image 245
John Baum Avatar asked Jun 01 '12 05:06

John Baum


People also ask

How do you find a corrupted node in a linked list?

If you can do anything to the linked list, what you can do is to calculate the checksum of each element and store it on the element itself. This way you will be able to detect corruption even if it's a single bit error on the element.

How would you find out if one of the pointers in a doubly linked list is corrupted or not?

Assuming that it's a doubly linked list: If it's the "next" pointer that got corrupted, one can begin at the tail and using the "previous" pointer, traverse the list towards the head while maintaining a reference to the last element that was traversed.

Which search is best for linked list?

Binary Search is usually fast and efficient for arrays because accessing the middle index between two given indices is easy and fast(Time Complexity O(1)). But memory allocation for the singly linked list is dynamic and non-contiguous, which makes finding the middle element difficult.


4 Answers

To test for a "corrupted" integer, you would need to know what the range of valid values is. Otherwise, there is no way to determine that the value in any given (signed) integer is invalid. So, assuming you have a validity test for the int, you would always check that value before iterating to the next element.

Testing for a corrupted pointer is trickier - for a start, what you need to do is check the value of the pointer to the next element before you attempt to de-reference it, and ensure it is a valid heap address. That will avoid a segmentation fault. The next thing is to validate that what the pointer points at is in fact a valid linked list node element - that's a bit trickier? Perhaps de-reference the pointer into a list element class/struct, and test the validity of the int and "next" pointer, if they are also good, then can be pretty sure the previous node was good also.

On 2), having discovered a corrupted node, [if the next pointer is corrupted] what you should do is set the "next pointer" of the previous node to 'NULL' immediately, marking it as the end of the list, and log your error etc etc. if the corruption was just to the integer value, but not to the "next" element pointer, then you should remove that element from the list and link the previous and following nodes together instead - as no need to throw the rest of the list away in that case!

like image 157
Ozraptor Avatar answered Oct 05 '22 17:10

Ozraptor


For the first part - Overload the new operator. When ever a new node is allocated allocate some additional space before and after the node and put some known values there. In traversal each node can be checked if it is in between the known values.

like image 40
Superman Avatar answered Oct 05 '22 17:10

Superman


If you at design time know that corruption may become a critical issue, you could add a "magic value" as a field into the node data structure which allows you to identify whether some data is likely to be a node or not. Or even to run through memory searching for nodes.

Or double some link information, i.e. store the address of the node after the next node in each node, such that you can recover if one link is broken.

The only problem I see is that you have to avoid segmentation faults.

like image 20
JohnB Avatar answered Oct 05 '22 19:10

JohnB


If you can do anything to the linked list, what you can do is to calculate the checksum of each element and store it on the element itself. This way you will be able to detect corruption even if it's a single bit error on the element.

To minimize data loss perhaps you can consider having storing the nextPtr in the previous element, that way if your current element is corrupted, you can always find the location of the next element from the previous.

like image 37
tytchong Avatar answered Oct 05 '22 18:10

tytchong