I would like to create a protocol like the following: <pre class="prettyprint lang-swift prettyprint-override"><code>protocol Parser { func parse() -> ParserOutcome<?> } enum ParserOutcome<Result> { case result(Result) case parser(Parser) } </code></pre> I want to have parsers that return either a result of a specific type, or another parser. If I use an associated type on <code>Parser</code>, then I can't use <code>Parser</code> in the <code>enum</code>. If I specify a generic type on the <code>parse()</code> function, then I can't define it in the implementation without a generic type. How can I achieve this? <hr> Using generics, I could write something like this: <pre class="prettyprint lang-swift prettyprint-override"><code>class Parser<Result> { func parse() -> ParserOutcome<Result> { ... } } enum ParserOutcome<Result> { case result(Result) case parser(Parser<Result>) } </code></pre> This way, a <code>Parser</code> would be parameterized by the result type. <code>parse()</code> can return a result of the <code>Result</code> type, or any kind of parser that would output either a result of the <code>Result</code> type, or another parser parameterized by the same <code>Result</code> type. With associated types however, as far as I can tell, I'll always have a <code>Self</code> constraint: <pre class="prettyprint lang-swift prettyprint-override"><code>protocol Parser { associatedtype Result func parse() -> ParserOutcome<Result, Self> } enum ParserOutcome<Result, P: Parser where P.Result == Result> { case result(Result) case parser(P) } </code></pre> In this case, I can't have any type of parser that would return the same <code>Result</code> type anymore, it has to be the same type of parser. I would like to obtain the same behavior with the <code>Parser</code> protocol as I would with a generic definition, and I would like to be able to do that within the bounds of the type system, without introducing new boxed types, just like I can with a normal generic definition. It seems to me that defining <code>associatedtype OutcomeParser: Parser</code> inside the <code>Parser</code> protocol, then returning an <code>enum</code> parameterized by that type would solve the problem, but if I try to define <code>OutcomeParser</code> that way, I get the error: <blockquote> Type may not reference itself as a requirement </blockquote>

I wouldn't be so quick to dismiss type erasures as "hacky" or "working around [...] the type system" – in fact I'd argue that they work with the type system in order to provide a useful layer of abstraction when working with protocols (and as already mentioned, used in the standard library itself e.g <code>AnySequence</code>, <code>AnyIndex</code> & <code>AnyCollection</code>). As you said yourself, all you want to do here is have the possibility of either returning a given result from a parser, or another parser that works with the same result type. We don't care about the specific implementation of that parser, we just want to know that it has a <code>parse()</code> method that returns a result of the same type, or another parser with that same requirement. A type erasure is perfect for this kind of situation, as all you need to do is take a reference to a given parser's <code>parse()</code> method, allowing you to abstract away the rest of the implementation details of that parser. It's important to note that you aren't losing any type safety here, you're being exactly as precise about the type of the parser as you requirement specifies. If we look at a potential implementation of a type-erased parser, <code>AnyParser</code>, hopefully you'll see what I mean: <pre class="prettyprint"><code>struct AnyParser<Result> : Parser { // A reference to the underlying parser's parse() method private let _parse : () -> ParserOutcome<Result> // Accept any base that conforms to Parser, and has the same Result type // as the type erasure's generic parameter init<T:Parser where T.Result == Result>(_ base:T) { _parse = base.parse } // Forward calls to parse() to the underlying parser's method func parse() -> ParserOutcome<Result> { return _parse() } } </code></pre> Now in your <code>ParserOutcome</code>, you can simply specify that the <code>parser</code> case has an associated value of type <code>AnyParser<Result></code> – i.e any kind of parsing implementation that can work with the given <code>Result</code> generic parameter. <pre class="prettyprint"><code>protocol Parser { associatedtype Result func parse() -> ParserOutcome<Result> } enum ParserOutcome<Result> { case result(Result) case parser(AnyParser<Result>) } ... struct BarParser : Parser { func parse() -> ParserOutcome<String> { return .result("bar") } } struct FooParser : Parser { func parse() -> ParserOutcome<Int> { let nextParser = BarParser() // error: Cannot convert value of type 'AnyParser<Result>' // (aka 'AnyParser<String>') to expected argument type 'AnyParser<_>' return .parser(AnyParser(nextParser)) } } let f = FooParser() let outcome = f.parse() switch outcome { case .result(let result): print(result) case .parser(let parser): let nextOutcome = parser.parse() } </code></pre> You can see from this example that Swift is still enforcing type-safety. We're trying to wrap a <code>BarParser</code> instance (that works with <code>String</code>s) in an <code>AnyParser</code> type erased wrapper that expects an <code>Int</code> generic parameter, resulting in a compiler error. Once <code>FooParser</code> is parameterised to work with <code>String</code>s instead of <code>Int</code>, the compiler error will be resolved. <hr> In fact, as <code>AnyParser</code> in this case only acts as a wrapper for a single method, another potential solution (if you really detest type erasures) is to simply use this directly as your <code>ParserOutcome</code>'s associated value. <pre class="prettyprint"><code>protocol Parser { associatedtype Result func parse() -> ParserOutcome<Result> } enum ParserOutcome<Result> { case result(Result) case anotherParse(() -> ParserOutcome<Result>) } struct BarParser : Parser { func parse() -> ParserOutcome<String> { return .result("bar") } } struct FooParser : Parser { func parse() -> ParserOutcome<String> { let nextParser = BarParser() return .anotherParse(nextParser.parse) } } ... let f = FooParser() let outcome = f.parse() switch outcome { case .result(let result): print(result) case .anotherParse(let nextParse): let nextOutcome = nextParse() } </code></pre>

Status of the features needed to make this work: <ul> <li>Recursive protocol constraints (SE-0157) Implemented (Swift 4.1) </li> <li>Arbitrary requirements in protocols (SE-0142) Implemented (Swift 4) </li> <li>Generic Type Aliases (SE-0048) Implemented (Swift 3) </li> </ul> <hr> Looks like this is currently not possible without introducing boxed types (the "type erasure" technique), and is something looked at for a future version of Swift, as described by the Recursive protocol constraints and Arbitrary requirements in protocols sections of the Complete Generics Manifesto (since generic protocols are not going to be supported). When Swift supports these two features, the following should become valid: <pre class="prettyprint lang-swift prettyprint-override"><code>protocol Parser { associatedtype Result associatedtype SubParser: Parser where SubParser.Result == Result func parse() -> ParserOutcome<Result, SubParser> } enum ParserOutcome<Result, SubParser: Parser where SubParser.Result == Result> { case result(Result) case parser(P) } </code></pre> With generic <code>typealias</code>es, the subparser type could also be extracted as: <pre class="prettyprint lang-swift prettyprint-override"><code>typealias SubParser<Result> = Parser where SubParser.Result == Result </code></pre>

Protocol function with generic type

Tags:

types

generics

swift

swift-protocols

associated-types

I would like to create a protocol like the following:

protocol Parser {
    func parse() -> ParserOutcome<?>
}

enum ParserOutcome<Result> {
    case result(Result)
    case parser(Parser)
}

I want to have parsers that return either a result of a specific type, or another parser.

If I use an associated type on Parser, then I can't use Parser in the enum. If I specify a generic type on the parse() function, then I can't define it in the implementation without a generic type.

How can I achieve this?

Using generics, I could write something like this:

class Parser<Result> {
    func parse() -> ParserOutcome<Result> { ... }
}

enum ParserOutcome<Result> {
    case result(Result)
    case parser(Parser<Result>)
}

This way, a Parser would be parameterized by the result type. parse() can return a result of the Result type, or any kind of parser that would output either a result of the Result type, or another parser parameterized by the same Result type.

With associated types however, as far as I can tell, I'll always have a Self constraint:

protocol Parser {
    associatedtype Result

    func parse() -> ParserOutcome<Result, Self>
}

enum ParserOutcome<Result, P: Parser where P.Result == Result> {
    case result(Result)
    case parser(P)
}

In this case, I can't have any type of parser that would return the same Result type anymore, it has to be the same type of parser.

I would like to obtain the same behavior with the Parser protocol as I would with a generic definition, and I would like to be able to do that within the bounds of the type system, without introducing new boxed types, just like I can with a normal generic definition.

It seems to me that defining associatedtype OutcomeParser: Parser inside the Parser protocol, then returning an enum parameterized by that type would solve the problem, but if I try to define OutcomeParser that way, I get the error:

Type may not reference itself as a requirement

885

asked Jul 07 '16 05:07

rid

2 Answers

I wouldn't be so quick to dismiss type erasures as "hacky" or "working around [...] the type system" – in fact I'd argue that they work with the type system in order to provide a useful layer of abstraction when working with protocols (and as already mentioned, used in the standard library itself e.g AnySequence, AnyIndex & AnyCollection).

As you said yourself, all you want to do here is have the possibility of either returning a given result from a parser, or another parser that works with the same result type. We don't care about the specific implementation of that parser, we just want to know that it has a parse() method that returns a result of the same type, or another parser with that same requirement.

A type erasure is perfect for this kind of situation, as all you need to do is take a reference to a given parser's parse() method, allowing you to abstract away the rest of the implementation details of that parser. It's important to note that you aren't losing any type safety here, you're being exactly as precise about the type of the parser as you requirement specifies.

If we look at a potential implementation of a type-erased parser, AnyParser, hopefully you'll see what I mean:

struct AnyParser<Result> : Parser {

    // A reference to the underlying parser's parse() method
    private let _parse : () -> ParserOutcome<Result>

    // Accept any base that conforms to Parser, and has the same Result type
    // as the type erasure's generic parameter
    init<T:Parser where T.Result == Result>(_ base:T) {
        _parse = base.parse
    }

    // Forward calls to parse() to the underlying parser's method
    func parse() -> ParserOutcome<Result> {
        return _parse()
    }
}

Now in your ParserOutcome, you can simply specify that the parser case has an associated value of type AnyParser<Result> – i.e any kind of parsing implementation that can work with the given Result generic parameter.

protocol Parser {
    associatedtype Result
    func parse() -> ParserOutcome<Result>
}

enum ParserOutcome<Result> {
    case result(Result)
    case parser(AnyParser<Result>)
}

...

struct BarParser : Parser {
    func parse() -> ParserOutcome<String> {
        return .result("bar")
    }
}

struct FooParser : Parser {
    func parse() -> ParserOutcome<Int> {
        let nextParser = BarParser()

        // error: Cannot convert value of type 'AnyParser<Result>'
        // (aka 'AnyParser<String>') to expected argument type 'AnyParser<_>'
        return .parser(AnyParser(nextParser))
    }
}

let f = FooParser()
let outcome = f.parse()

switch outcome {
case .result(let result):
    print(result)
case .parser(let parser):
    let nextOutcome = parser.parse()
}

You can see from this example that Swift is still enforcing type-safety. We're trying to wrap a BarParser instance (that works with Strings) in an AnyParser type erased wrapper that expects an Int generic parameter, resulting in a compiler error. Once FooParser is parameterised to work with Strings instead of Int, the compiler error will be resolved.

In fact, as AnyParser in this case only acts as a wrapper for a single method, another potential solution (if you really detest type erasures) is to simply use this directly as your ParserOutcome's associated value.

protocol Parser {
    associatedtype Result
    func parse() -> ParserOutcome<Result>
}

enum ParserOutcome<Result> {
    case result(Result)
    case anotherParse(() -> ParserOutcome<Result>)
}


struct BarParser : Parser {
    func parse() -> ParserOutcome<String> {
        return .result("bar")
    }
}

struct FooParser : Parser {
    func parse() -> ParserOutcome<String> {
        let nextParser = BarParser()
        return .anotherParse(nextParser.parse)
    }
}

...

let f = FooParser()
let outcome = f.parse()

switch outcome {
case .result(let result):
    print(result)
case .anotherParse(let nextParse):
    let nextOutcome = nextParse()
}

answered Oct 11 '22 13:10

Hamish

Status of the features needed to make this work:

Recursive protocol constraints (SE-0157) Implemented (Swift 4.1)
Arbitrary requirements in protocols (SE-0142) Implemented (Swift 4)
Generic Type Aliases (SE-0048) Implemented (Swift 3)

Looks like this is currently not possible without introducing boxed types (the "type erasure" technique), and is something looked at for a future version of Swift, as described by the Recursive protocol constraints and Arbitrary requirements in protocols sections of the Complete Generics Manifesto (since generic protocols are not going to be supported).

When Swift supports these two features, the following should become valid:

protocol Parser {
    associatedtype Result
    associatedtype SubParser: Parser where SubParser.Result == Result

    func parse() -> ParserOutcome<Result, SubParser>
}

enum ParserOutcome<Result, SubParser: Parser where SubParser.Result == Result> {
    case result(Result)
    case parser(P)
}

With generic typealiases, the subparser type could also be extracted as:

typealias SubParser<Result> = Parser where SubParser.Result == Result

answered Oct 11 '22 14:10

rid

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Protocol function with generic type

Tags:

types

generics

swift

swift-protocols

associated-types

rid

People also ask

2 Answers

Hamish

rid

Recent Activity

Donate For Us

Protocol function with generic type

Tags:

types

generics

swift

swift-protocols

associated-types

rid

People also ask

2 Answers

Hamish

rid

Related questions

Recent Activity

Donate For Us