Strongly Typed String

The Setting

I have a prototype class TypedString<T> that attempts to "strongly type" (dubious meaning) strings of a certain category. It uses the C#-analogue of the curiously recurring template pattern (CRTP).

`class TypedString<T>`

public abstract class TypedString<T>     : IComparable<T>     , IEquatable<T>     where T : TypedString<T> {     public string Value { get; private set; }      protected virtual StringComparison ComparisonType     {         get { return StringComparison.Ordinal; }     }      protected TypedString(string value)     {         if (value == null)             throw new ArgumentNullException("value");         this.Value = Parse(value);     }      //May throw FormatException     protected virtual string Parse(string value)     {         return value;     }      public int CompareTo(T other)     {         return string.Compare(this.Value, other.Value, ComparisonType);     }      public bool Equals(T other)     {         return string.Equals(this.Value, other.Value, ComparisonType);     }      public override bool Equals(object obj)     {         return obj is T && Equals(obj as T);     }      public override int GetHashCode()     {         return Value.GetHashCode();     }      public override string ToString()     {         return Value;     } }

The TypedString<T> class can now be used to eliminate code duplication when defining a bunch of different "string categories" throughout my project. An example simple usage of this class is in defining a Username class:

`class Username` (example)

public class Username : TypedString<Username> {     public Username(string value)         : base(value)     {     }      protected override string Parse(string value)     {         if (!value.Any())             throw new FormatException("Username must contain at least one character.");         if (!value.All(char.IsLetterOrDigit))             throw new FormatException("Username may only contain letters and digits.");         return value;     } }

This now lets me use the Username class throughout my whole project, never having to check if a username is correctly formatted - if I have an expression or variable of type Username, it's guaranteed to be correct (or null).

Scenario 1

string GetUserRootDirectory(Username user) {     if (user == null)         throw new ArgumentNullException("user");     return Path.Combine(UsersDirectory, user.ToString()); }

I don't have to worry about formatting of the user string here - I already know it's correct by nature of the type.

Scenario 2

IEnumerable<Username> GetFriends(Username user) {     //... }

Here the caller knows what it's getting as the return just based on the type. An IEnumerable<string> would require reading into the details of the method or documentation. Even worse, if someone were to change the implementation of GetFriends such that it introduces a bug and produces invalid username strings, that error could silently propagate to callers of the method and wreak all kinds of havoc. This nicely typed version prevents that.

Scenario 3

System.Uri is an example of a class in .NET that does little more than wrap a string that has a huge number of formatting constraints and helper properties/methods for accessing useful parts of it. So that's one piece of evidence that this approach isn't totally crazy.

The Question

I imagine this kind of thing has been done before. I already see the benefits of this approach and don't need to convince myself any more.

Is there a downside I may be missing?
Is there a way this could come back to bite me later?

633

asked Jun 03 '13 23:06

Timothy Shields

1 Answers

General Thoughts

I'm not fundamentally against the approach (and kudos for knowing/using the CRTP, which can be quite useful). The approach allows metadata to be wrapped around a single value, which can be a very good thing. It's extensible too; you can add additional data to the type without breaking interfaces.

I don't like the fact that your current implementation seems to depend heavily on exception-based flow. This may be perfectly appropriate for some things or in truly exceptional cases. However, if a user was trying to pick a valid username, they could potentially throw dozens of exceptions in the process of doing so.

Of course, you could add exception-free validation to the interface. You must also ask yourself where you want the validation rules to live (which is always a challenge, especially in distributed applications).

WCF

Speaking of "distribution": consider the implications of implementing such types as part of a WCF data contract. Ignoring the fact that data contracts should usually expose simple DTOs, you also have the problem of proxy classes which will maintain your type's properties, but not its implementation.

Of course, you can mitigate this by placing the parent assembly on both client and server. In some cases, this is perfectly appropriate. In other cases, less so. Let's say that the validation of one of your strings required a call to a database. This would most likely not be appropriate to have in both the client/server locations.

"Scenario 1"

It sounds like you are seeking consistent formatting. This is a worthy goal and works great for things like URIs and perhaps usernames. For more complex strings, this can be a challenge. I've worked on products where even "simple" strings can be formatted in many different ways depending on context. In such cases, dedicated (and perhaps reusable) formatters may be more appropriate.

Again, very situation-specific.

"Scenario 2"

Even worse, if someone were to change the implementation of GetFriends such that it introduces a bug and produces invalid username strings, that error could silently propagate to callers of the method and wreak all kinds of havoc.

IEnumerable<Username> GetFriends(Username user) { }

I can see this argument. A few things come to mind:

A better method name: GetUserNamesOfFriends()
Unit/integration testing
Presumably these usernames are validated when they are created/modified. If this is your own API, why wouldn't you trust what it gives you?

Side note: when dealing with people/users, an immutable ID is probably more useful (people like changing usernames).

"Scenario 3"

System.Uri is an example of a class in .NET that does little more than wrap a string that has a huge number of formatting constraints and helper properties/methods for accessing useful parts of it. So that's one piece of evidence that this approach isn't totally crazy.

No argument there, there are many such examples in the BCL.

Final Thoughts

There's nothing wrong with wrapping a value into a more complex type so that it may be described/manipulated with richer metadata.
Centralizing validation in a single place is a good thing, but make sure you pick the right place.
Crossing serialization boundaries can present challenges when logic resides within the type being passed.
If you are mainly focused on trusting the input, you could use a simple wrapper class that lets the callee know that it is receiving data that has been validated. It doesn't matter where/how this validation has occurred.

ASP.Net MVC uses a similar paradigm for strings. If a value is IMvcHtmlString, it is treated as trusted and not encoded again. If not, it is encoded.

101

answered Sep 22 '22 05:09

Tim M.

Related questions
                            
                                Android Alpha, Beta for Paid Apps on Google Play Developer Console
                            
                                psql: FATAL: connection requires a valid client certificate
                            
                                Camera not working/saving when using Cache Uri as MediaStore.EXTRA_OUTPUT
                            
                                Android Swipe View with Tabs Without Using the V4 support library
                            
                                Clueless About a (Possible) Android Memory Leak
                            
                                iphone keyboard not hiding when tapping the screen
                            
                                Using emoji as identifier names in c++ in Visual Studio or GCC
                            
                                How to mask the domain forward from Google domains without Google app
                            
                                How to disable key preview in popup keyboard (not in main softkeyboard layout)?
                            
                                mpandroidchart - How can I avoid the repeated values in Y-Axis?
                            
                                Chrome dev tools pauses on exceptions in blackboxed script
                            
                                Chrome sends two requests when downloading a PDF (and cancels one of them)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Strongly Typed String

Tags:

The Setting

`class TypedString<T>`

`class Username` (example)

Scenario 1

Scenario 2

Scenario 3

The Question

Timothy Shields

People also ask

1 Answers

General Thoughts

WCF

"Scenario 1"

"Scenario 2"

"Scenario 3"

Final Thoughts

Tim M.

Recent Activity

Donate For Us

Strongly Typed String

Tags:

The Setting

class TypedString<T>

class Username (example)

Scenario 1

Scenario 2

Scenario 3

The Question

Timothy Shields

People also ask

1 Answers

General Thoughts

WCF

"Scenario 1"

"Scenario 2"

"Scenario 3"

Final Thoughts

Tim M.

Related questions

Recent Activity

Donate For Us

`class TypedString<T>`

`class Username` (example)