I just looked up the Set
interface and found that it mostly (or completely) only redeclares functions which are already in the Collection
interface. Set
itself extends Collection
, so doesn't that mean that the Set
interface automatically has all the functions from Collection
? So why are they redeclared then?
For example, Set
redeclares this:
/**
* Returns the number of elements in this set (its cardinality). If this
* set contains more than <tt>Integer.MAX_VALUE</tt> elements, returns
* <tt>Integer.MAX_VALUE</tt>.
*
* @return the number of elements in this set (its cardinality)
*/
int size();
/**
* Returns <tt>true</tt> if this set contains no elements.
*
* @return <tt>true</tt> if this set contains no elements
*/
boolean isEmpty();
And the declaration in Collection
:
/**
* Returns the number of elements in this collection. If this collection
* contains more than <tt>Integer.MAX_VALUE</tt> elements, returns
* <tt>Integer.MAX_VALUE</tt>.
*
* @return the number of elements in this collection
*/
int size();
/**
* Returns <tt>true</tt> if this collection contains no elements.
*
* @return <tt>true</tt> if this collection contains no elements
*/
boolean isEmpty();
This seems very redundant to me. Why not just define the Set
interface as:
public interface Set<E> extends Collection<E> {}
I think there is no single difference between those interfaces, right?
Of course I am not asking about the different semantics / meaning of Set
. I know that. I am just asking about if it technically (i.e. to the compiler) has any difference. I.e., speaking generally:
interface A { void foo(); }
interface B extends A { void foo(); }
interface C extends A {}
Now, is there any difference between A
, B
or C
?
While the contract (i.e. what is said in the documentation) can really be different for some functions (as for add
), there is a valid reason to redeclare them: To be able to put a new documentation, i.e. to define the new contract.
However, there are also functions (like isEmpty
) which have exactly the same documentation / contract. Why are they also redeclared?
The Set interface contains only methods inherited from Collection and adds the restriction that duplicate elements are prohibited. Set also adds a stronger contract on the behavior of the equals and hashCode operations, allowing Set instances to be compared meaningfully even if their implementation types differ.
Map is a more specific version of a Collection that has a Key -> Data structure. Collection is just the interface that has data structures for storing data in Java.
The set interface is present in java. util package and extends the Collection interface is an unordered collection of objects in which duplicate values cannot be stored. It is an interface that implements the mathematical set.
Technically for the compiler it makes no difference at all.
However, a set cannot have duplicate entries whereas a Collection can. This is worth knowing about.
Because of this, the methods semantics for parameters, return values and what happens can mean different things. Redeclaring also allows the javadoc to become more specific. For example for add():
Set: @return true if this set did not already contain the specified element
Collection: @return true if this collection changed as a result of the call
The meaning for set is more specific.
Even for methods that are not more specific, it enables the javadoc to be nicer. For example, for size() : "Returns the number of elements in this set (its cardinality)." which is closer to the language people used to mathematical sets will understand.
The API documents summarise this by saying: "The Set interface places additional stipulations, beyond those inherited from the Collection interface, on the contracts of all constructors and on the contracts of the add, equals and hashCode methods. Declarations for other inherited methods are also included here for convenience. (The specifications accompanying these declarations have been tailored to the Set interface, but they do not contain any additional stipulations.)"
The answer is in the java6 API for set.
"The Set interface places additional stipulations, beyond those inherited from the Collection interface, on the contracts of all constructors and on the contracts of the add, equals and hashCode methods. Declarations for other inherited methods are also included here for convenience. (The specifications accompanying these declarations have been tailored to the Set interface, but they do not contain any additional stipulations.)"
There is more to a method than its signature.
In this case, the documentation of these methods has been tailored to sets, in terms of pre- and post-conditions, and terminology.
They are redeclared because, even if the names are same, they have different meaning. The add
method in the Set
is a specific implementation of the generic add
method in Collection
.
The intention is to explicitly specify that the add
method os the Set
is very different from the add method of Collection
.
Why not just define the Set interface as:
public interface Set<E> extends Collection<E> {}
If it has been done this way, there would be no place where the contract of a Set
can be specified. How would I know that by implementing the add
method of the Set
, I should not allow duplicates?
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With