Alexandros Salazar

The Ghost of Swift Bugs Future

2015-06-12T22:50:03-07:00

Update: I wrote this with Xcode 7 β1, and playgrounds crashed a lot at the time. As a result, I gave up on testing all the cases, and a lot of errors creeped into the snippets. They are now corrected, thanks to (among others) @CalQL8ed_K_OS and @IanKay, who both corrected me and shamed me into fixing things. Thanks guys!

So Swift 2 is out, and they fixed enums with variable payloads, so the party is on.

I haven’t had a chance to play with it too much, but watching the [Protocol-Oriented Programming in Swift][pop] session, a particular construct struck me as the most likely source of arcane, incomprehensible bugs in the future. I expect it to be the novice’s crucible, similar to the way deallocated delegates would lead to crashes in the days before the weak attribute was introduced. I’m not yet sure what the searches will look like, but the fundamental question will be a variation of:

“Why does the method that I wrote overriding protocol extension X never get called?”

Stack Overflow will no doubt provide short answers. Here is my longer, more in-depth answer, hoping to explain the details to some lost soul.

• • • • •

[Protocol extensions][pe] are a new feature in Swift that allows default implementations of methods to be shared across types that conform to the protocol. The exact semantics place them in the vague mixin/typeclass area that other languages implement in different ways—I don’t claim to know enough about the semantics of the construct in any language to get more specific than that. It allows for shorter code and better modularity, and I think it is a fantastic feature. (And if you haven’t watched the session, go do that. Now. I’ll wait.)

However.

However, the dispatch rules can get confusing. To say the least.

Let’s begin with a quick intro; suppose we declare a protocol:

protocol Formattable {
    /// A string.
    var content:String { get }

    /// An formatting function. 
    func formattedContent() -> String
}

As stated, a protocol extension allows us to provide a default implementation for formattedContent():

extension Formattable {
    func formattedContent() -> String {
        return self.content
    }
}

But another thing we can do is add entirely new methods to our protocol:

extension Formattable {
    func debugFormattedContent() -> String {
        return "Content: \(self.content)"
    }
}

And now, any type that implements Formattable has access to both methods. The Swift standard library uses this to have one centrally defined sorting algorithm, dependent on a type conforming to the Comparable protocol. This is very different from how things were in Swift 1.x, where defining a sort([Self]) method on Comparable would require every type to have its own distinct implementation—hence the existence of large numbers of top-leve functions like map, reduce, and, indeed, sort.

Once you understand how protocol extensions work, their power is impressive, and their allure irresistible. But as with all siren songs, dangers lurk. Consider this implementation of Formattable:

struct Day : Formattable {

    var content:String

    func formattedContent() -> String {
        return "Today is \(self.content)"
    }

    func debugFormattedContent() -> String {
        return "Day: \(self.content)"
    }
}

Seems simple enough; it overrides both methods. Now see if you can predict what happens with each of these calls:

let a = Day(content:"Monday")
let b:Formattable = Day(content:"Monday")

a.formattedContent()
b.formattedContent()
a.debugFormattedContent()
b.debugFormattedContent()

Given it a go? Did you come up with:

a.formattedContent() // "Today is Monday"
b.formattedContent() // "Today is Monday"
a.debugFormattedContent() // "Day: Monday"
b.debugFormattedContent() // "Content: Monday"

If yes, and you understand why, congratulations; you don’t need to read any further. If you expected b.debugFormattedContent() to be the same as a.debugFormattedContent(), read on.

• • • • •

In Objective-C and Swift 1.x a protocol, which could not be extended with methods, was an interface that defined behavior that a type conformed to. That is, the conforming type guaranteed that it implemented every function in the protocol. So in Swift 1.x, suppose we had:

protocol A {
    func m1() -> String
}

struct B : A {
    func m1() -> String {
        return "hello"
    }
}

Then let a = B() and let a:A = B() were the same thing as far as calling m1 is concerned: they will both call the only implementation available, which is the one in B. This is still true in Swift 2.

Protocol extensions, however, allow a form of polymorphism, which brings up the question of which method to dispatch. Suppose in Swift 2 we extend A:

extension A {
     func m1() -> String {
        return "hello"
    }

    func m2() -> String {
        return "planet"
    }
}

And suppose we want B to override both methods:

struct B : A {
    func m1() -> String{
        return "greetings"
    }

    func m2() -> String {
        return "earthling"
    }
}

Now, for any instance of B : A, there are two possible implementations of m1 and m2: the one defined in B, and the one defined in the A extension. Which one should we choose?

Suppose we decide we should always choose the one in the type that implements A, if it exists. And suppose we have another type C defined as follows:

struct C : A {
    func m2() -> String {
        return "darkness, my old friend"
    }
}

Now suppose we have a heterogeneous array and call the methods:

let a:[A] = [B(), C()]
let b = a.map {$0.m1()} // ["greetings", "hello"]
let c = a.map {$0.m2()} // ["earthling", "darkness, my old friend"]

This is what we would expect. So far so good. Suppose, however, that we call a function:

func callM2(arr:[A]) -> [String]{
    return arr.map { $0.m2() }
}

let a:[A] = [B(), C()]
let b = a.map {$0.m1()} // ["greetings", "hello"]
let c = callM2(a) // ["earthling", "darkness, my old friend"]

This time, the result, which is the same, might actually surprise the author of callM2 very much. After all m2 is not defined in the original protocol A. So if they chose to call it, it must be because they expected the specific implementation of the extension.

By always calling the value in the type’s implementation, then, we forever hide the default implementation of the extension, even in cases where it would be expected. The solution Swift 2 adopted is to call the default implementation when the protocol is explicitly specified. So let’s look back at our example:

let a = Day(content:"Monday")
let b:Formattable = Day(content:"Monday")

a.debugFormattedContent() // "Day: Monday"
b.debugFormattedContent() // "Content: Monday"

Since b is explicitly specified as being a Formattable, the method that gets called is the default implementation in the extension. Since a is instead inferred to be a Day, the method that gets called on it is the implementation in Day.

In the case of the A and B protocols, this translates into Swift always calling the default implementation for the elements of the array:

func callM2(arr:[A]) -> [String]{
    return arr.map { $0.m2() }
}

let a:[A] = [B(), C()]
let b = a.map {$0.m1()} // ["greetings", "hello"]
let c = callM2(a) // ["planet", "planet"]

This brings up an interesting question. Is it possible to get the type’s implementation every time, instead of the extension’s? Or have we lost that ability forever? If you look back at the call to m1(), you’ll see that it is always the type’s implementation that gets called. The difference is that m1 is declared in A, whereas m2 is declared in the extension. So if we wanted m2 to be called on the actual type of the array’s elements, we would declare it in A.

• • • • •

The rules for dispatch for protocol extensions, then, are:

IF the inferred type of a variable is the protocol:
- AND the method is defined in the original protocol
  - THEN the runtime type’s implementation is called, irrespective of whether there is a default implementation in the extension.
- AND the method is not defined in the original protocol,
  - THEN the default implementation is called.
ELSE IF the inferred type of the variable is the type
- THEN the type’s implementation is called.

Note the use of “runtime type” in the first THEN clause. This refers to the type of the variable when the program is actually running, as opposed to the type the compiler infers. This is relevant if we write a function callM1() as follows:

func callM1(arr:[A]) -> [String] {
    return arr.map($0.m1)
}

Here, the compiler doesn’t know the type of the elements of arr; it only knows that they all have an m1 method. At runtime, however, each element will have a specific type, perhaps B, or perhaps C. That’s when the method that gets called will be determined. This is referred to as [dynamic dispatch][].

By contrast, in the callM2 implementation, per the rules we just established, the compiler will know exactly what method to call: the one in the extension to A. This is referred to as static dispatch and, incidentally, allows for certain optimizations (general rule: if you know something at compile time, odds are you can optimize something with that knowledge).

• • • • •

So, that’s it. As I said, I expect this to be a very common error among beginners; in fact, I expect to run into this issue multiple times myself. It’s just too easy to forget when things are dispatched dynamically and when they’re dispatched statically.

Incidentally, if anyone knows of other reasons to dispatch statically on extensions, please let me know; I’ll amend the post to reflect them.

[pop]: https://developer.apple.com/videos/wwdc/2015/?id=408
[pe]: https://developer.apple.com/library/prerelease/ios/documentation/Swift/Conceptual/Swift_Programming_Language/Protocols.html#//apple_ref/doc/uid/TP40014097-CH25-ID521
[dynamic dispatch]: https://en.wikipedia.org/wiki/Dynamic_dispatch

ReactiveCocoa II: Reacting to Events

2015-05-22T14:45:11-07:00

In my [previous post][rac-intro], I introduced [ReactiveCocoa][rac] and went over the basic steps of creating a SignalProducer. I showed how you can process the events via the pipe forward operator |>, and teased a little at the end about the various functional compositions you can work with. What I did not go into was how to react to those events. Given that this is functional reactive programming that we’re talking about, perhaps a follow-up on that is not the worst idea.

Even if it’s three weeks overdue.

• • • • •

As I discussed, events come in four varieties: .Next(T), .Error(E), .Completed, and .Interrupted. Working with a Signal (which remember, has neither beginning nor end), we listen for those events by observing them:

someSignal.observe(next: {
    data in
        displayData(data)
})

Signals typically don’t have errors and don’t terminate, so that’s all we need to do for most signals.

SignalProducers, on the other hand, can run the full gamut. And since they are started on demand, their API is a little different:

someSignalProducer.start(next: {
    data in
        displayData(data)
}, error: {
    error in
        displayError(error)
}, interrupted: {
        handleInterruption()
})

The start and observe methods have very similar calling conventions, and all take the parameters next:, error:, interrupted:, and completed:, which have default values that do nothing. That’s why in the last snippet, I was able to do nothing on completion: if I don’t need to inform the user that a request finished, there’s no need to include a parameter for it.

• • • • •

Okay, so now we know how to create signals and signal producers, and how to listen to them. Are we done?

Theoretically, yes. But in our work with RAC, there have been many patterns that have come up over and over again that are supported, but that are not always obvious to implement. To help maybe pave the way for others, I’m going to go over a few common examples.

Let’s get back to Comic Cathy. The data stored in her local store is missing the cover images, which she wants to load from the server. If you remember, her producer function was:

func comicCollectionProducer()
       -> SignalProducer<[Comic], RetrievalError> {

    let localFetchProducer = SignalProducer(result:localComics())
        |> mapError(retrievalErrorFromStoreError)

    let networkFetchProducer = SignalProducer.try(networkComics)
        |> mapError(retrievalErrorFromNetworkError)

    return localFetchProducer |> concat(networkFetchProducer)
}

What she wants to do is this: take every group of comics returned by the producer, and for each comic, request the cover image. At this point, we need to start making things a little more realistic. In order to do this, what Cathy has written is a SignalProducer that will fetch the image and send events with the data (or any errors). She has a function that will return it:

func producerToFetchImageForComic(comic:Comic)
       -> SignalProducer<UIImage, NSError>

The first problem is that this function takes a single Comic, but we will be getting an array of Comics. What we want, then, is to use this function to create a new function:

func producerToFetchImagesForComics(comics:[Comic])
       -> SignalProducer<[UIImage], NSError>

We can’t use a plain old for-loop, because SignalProducers are asynchronous by nature. Instead, we have to break down the task into steps:

Create a stream of individual comics from the array.
Replace each comic in the stream with the corresponding image.
When the stream completes, recombine the entire stream into one array of images.

ReactiveCocoa has functions that let us do each of these things. First off, much like it has a convenience initializer for a Result, it also has a convenience initializer for an array of values:

let individualComicProducer = 
    SignalProducer<Comic, NSError>(values:comics)

When started, this producer will emit the values one by one and then complete. So that’s step one. Step three is also easy. RAC has a function called collect that waits for a signal to complete, and then forwards all the values in an array. In effect, it’s the reverse of the convenience initializer and would look like this:

let comics:SignalProducer<Comic, NSError>
    = individualComicProducer |> collect

As before, the pipe forward operator is a shorthand for collect(individualComitProducer), and returns a SignalProducer. This is important: there is no way to get the values put into a producer out of it without using the start function; in effect, the producer adds a context that can’t be discarded. But that’s okay, because our producerToFetchImagesForComics function returns a producer.

This leaves step two in our list. That’s the trickiest one. What we want to do is:

Take every comic handed to us.
Call the producerToFetchImageForComic function.
Return the image we receive as a result of the producer.

In type terms we have a producer of type SignalProducer<Comic, NSError>. And we have a function that turns Comic values into SignalProducer<UIImage, NSError> values. What we want is a SignalProducer<UIImage, NSError>.

Hmm. We’ve seen this pattern before. It’s like what we want to do with Result sometimes:

result:Result<T, E>
COMBINED_WITH
    f:T -> Result<U, E>
SHOULD_RETURN
newResult:Result<U, E>

That’s our old friend flatMap!

newResult = result.flatMap { value in f(value) }

So does RAC have flatMap for signals? Of course it does! RAC loves flatMap! It loves flatMap so much, it has three versions of it, depending on what behavior you want out of your signal. I can’t go into too much detail about what each does,¹ so I’ll stick with the one that will preserve the order of the comics: flatMap(.Concat). It works like this:

let signalOfImages = individualComicProducer
    |> flatMap(.Concat, producerToFetchImageForComic)

And therefore, the complete function looks like this:

func producerToFetchImagesForComics(comics:[Comic])
       -> SignalProducer<[UIImage], NSError> {

    return SignalProducer(values:comics)
        |> flatMap(.Concat, producerToFetchImageForComic)
        |> collect
}

Again, step back for a second and realize that it took me multiple paragraphs to explain three lines of code — which exactly match the three conceptual steps we wanted to execute. So while there was a lot of explanation for the details, the mapping from concept to code is almost one-to-one. Once you become fluent with the various functions in the API, you might not even think of things in ambiguous English steps — the transformations might naturally map into collects and takeWhiles and signalOns and flatMaps until running across a problem that doesn’t match those fundamentals becomes rare, and a warning sign that the problem statement is unclear.

• • • • •

Now Cathy has her function to get the images. How will she integrate it in her app? Ideally, she wants a producer that will return both the images and the comics, together. This is a trickier thing to get right, because it involves creating a producer that calls producers, which always feels dirty to me. Here’s how Cathy goes about it:

func comicsCollectionDisplayProducer()
    -> SignalProducer<([Comic], [UIImage]), NSError> {

    return SignalProducer { sink, disposable in
        let disp = comicCollectionProducer()
            |> start(next: {
                comics in
                let disp2 = producerToFetchImagesForComics(comics)
                    |> start(next: {
                        images in
                        sendNext(sink, (comics, images))
                    }
                disposable.add(disp2)
            }
        disposable.add(disp)
    }
}

We’re now seeing the use of that disposable parameter. It exists to ensure that if the enclosing SignalProducer errors out or is terminated, no events from the inner producers are propagated along the computational chain. In other words, it’s a way way of doing manual computation management. I don’t love it, but I haven’t found a better way to do it, so I accept it. The rule is: whenever you observe or start something, you should have worked out how to manage the Disposable that is returned. It’s similar to retain/release management. In this case, the initializer provides a CompositeDisposable to which Cathy can add any disposables she creates. The initializer is responsible for managing the disposable it provides.

Cathy’s implementation works great. It does exactly what it needs to do. And yet it still looks ugly. There’s a lot of deep nesting going on, a lot of closures getting called back and forth, and it generally feels clunky.

There are ways around this. For one, none of the solutions so far change the fundamental nature of what the code is doing: the new functions are composed from previously written functions. Because RAC makes composition natural, that’s a tendency that can go too far; sometimes the solution is a different approach to the problem. For instance, instead of producerToFetchImageForComic, we could have a similar function that returned a tuple:

func producerToFetchInfoForComic(comic:Comic)
       -> SignalProducer<(Comic, UIImage), NSError>

The tuple pairs the comic with its corresponding images — a very useful pairing. That change would propagate up the chain, and lead us to a top-level function whose signature would be:

func comicsCollectionDisplayProducer()
    -> SignalProducer<[(Comic, UIImage)], NSError>

See if you can work out how. This pairs each comic with its corresponding image explicitly and is decidedly cleaner.²

This is only one approach to resolving the nesting issue, and what I want to emphasize is that the old [Haskell joke][haskell-joke] applies equally well to RAC: “An hour of meditation, followed by the emission of a single ‘fold’ expression.” Just because what you’ve come up with works doesn’t mean that it’s the best or cleanest way to do it. Trust your instinct if something feels ugly. There may be better ways of doing it.

Update: [Justin Spahr-Summers][jss] was nice enough to provide a much better implementation of comicsCollectionDisplayProducer():

func comicsCollectionDisplayProducer()
    -> SignalProducer<([Comic], [UIImage]), NSError> {
    return comicCollectionProducer()
        |> flatMap(.Concat) { comics in
            producerToFetchImagesForComics(comics)
                |> map { images in (comics, images)
        }
}

He’s using the form of flatMap that takes an explicit closure, which I hadn’t thought about. I like it much better because is avoids the use of disposables entirely. Since this pattern comes up over and over again, I recommend learning it; I’m probably going to go back and look through some of our code and see if we can use it to clean things up.

• • • • •

With these two posts, you should have the basics to start writing code that leverages RAC. We have found it great to work with, despite some annoying warts, like a nagging feeling that disposables should not exist. While there is a steep learning curve, the [issues page][rac-issues] on the ReactiveCocoa project is full of active, helpful, and knowledgeable users who try their best to help newcomers. As the popularity of the framework grows (and I expect it to keep growing), so will the quality and promptness of the help available.

So give RAC a try. If you’re already familiar with functional programming, it will be like seeing an old friend. If you’re not, it will teach you new ways of thinking about your software. And in both cases, it will lead to very expressive code that is far more testable than the code that typically finds its way in iOS apps.

¹ I literally can’t. I am at this point not even positive that all three version of flatMap obey the three monad laws, but I believe they do. The most interesting thing is that there is yet another method, mapError that is technically a flatMap, but on the error part of the Signal. All in all, SignalProducers and Signals are much more complex than regular monads, and I’m not going to pretend I can tell you much more about their theoretical underpinnings, though I suspect that would be a fantastic topic for a post or even paper.↩︎

² We could get the same result by operating on the arrays we get from comicsCollectionDisplayProducer, but that would assume that they are of the same size and in the same order. Written properly, that would include all sorts of error checking that this approach sidesteps.↩︎

[haskell-joke]: http://www.quora.com/What-are-some-unofficial-mottos-of-programming-languages
[rac-intro]: http://nomothetis.svbtle.com/an-introduction-to-reactivecocoa
[rac-issues]: https://github.com/ReactiveCocoa/ReactiveCocoa/issues
[rac]: https://github.com/ReactiveCocoa/ReactiveCocoa
[jss]: https://twitter.com/jspahrsummers

An Introduction to ReactiveCocoa

2015-04-28T21:08:05-07:00

A lot of the [posts][result-post] I’ve [written][error-handling-2] [so][optional-chaining] [far][implicit-optionals] are by and large foundational work. They are, so to speak, table stakes for functional programming. But once at the table, it’s hard to know exactly where to go. There are many great articles on using these principles to, e.g., [parse JSON][eidhof], but at the end of the day, that’s one problem, there are [solid solutions][argo] out there, and it doesn’t need to be solved again. Parsing JSON is hardly a reason to adopt functional programming wholesale. Functional programming should help you write better code.

Over the past couple of months, our team at work has been developing an application in pure Swift using the pre-release versions of [ReactiveCocoa][], and it has been a complete joy. We have been able to test far more of our code than ever before in unit tests, we have been able to break it into tiny functions that are easy to review on their own, and we have been having a ton of fun. Since RAC, as it is often called, uses and expands on a lot of the topics I’ve written about in the past, I thought it would be good to share.

Let’s get the introductions out of the way. ReactiveCocoa is a functional reactive programming (FRP) framework developed by GitHub, primarily [Justin Spahr-Summers][jss] and [Josh Abernathy][joshaber]. FRP, for its part, is a specific way of writing and architecting software that creates a malleable abstraction for timelines; RAC implements one version of it for iOS and OS X.

The good folks at GitHub are about to release version 3.0, which is the one we have been using and I will focus on. While version 3.0 is (mostly) backward compatible with version 2.0, we have actually not used 2.0 at all; the basic Swift API has served us well so far, and understanding it will give you the tools to explore the more elaborate features at your leisure. So with that said, let’s get started. I assume you have no idea what FRP is about and will try to build things up from scratch.

As a result, this will be long.

• • • • •

At the base of FRP is the notion of events. Events are simply things that happen — which is obviously a concept that every type of programming supports. However, in ReactiveCocoa, events are first-class citizens; in fact, they have their own type. Here is a summary:¹ ²

enum Event<T, E: ErrorType> {
    case Next(T)
    case Error(E)
    case Completed
    case Interrupted
}

The .Error case is the simplest: it represents an error event. The .Next, .Completed and .Interrupted cases are a little different: they imply an ordering. What does a .Next event follow? What is .Completed? What got .Interrupted? Say hello to the next fundamental type: the signal.

struct Signal<T, E:ErrorType> { /*…*/ }

A Signal<T, E> is a sequence of Event<T,E>s in time, with precise semantics: every event must be of type .Next, except the last one. The last one can either be an .Error, a .Completed, or an .Interrupted. But the key factor is that these events carry information. That’s the generic types T and E, denoting arbitrary and error-specific information in both in Signal and Event.

The information T can be anything: the components of a data stream, the contents of a text field over time, or even a Void type that signifies something happened, but doesn’t have any actual data (think of a signal that represents button presses; we need to know they happened, but there is no data associated with the button press event).

Here is a valid sequence of events for a signal, one that enumerates the first three letters of the alphabet:

.Next("a") -- .Next("b") -- .Next("c") -- .Completed

Here’s another, that enumerates the result of dividing 3 by 3, 2, 1, and 0:

.Next(1) -- .Next(1.5) -- .Next(3) -- .Error("Division by zero")

The remaining case, .Interrupted, can come up when a signal is forcibly stopped, but it has been relatively rare for us, and the framework often handles that case transparently.

• • • • •

Let’s be honest, the examples above for a signal were pretty artificial. That’s because in real life, signals actually come in two flavors, typically referred to as “hot” signals and “cold” signals, and I wanted to avoid mixing them up.

Signal represents hot signals: signals that have no beginning and typically no end, but are simply a set of events in the world that can be observed. UI interactions, for instance, fall nicely within this. Button presses are a signal: they don’t really have a beginning, they just happen. But so do events like push notifications. Signal can represent any stream of such events, possibly combined.

For instance, suppose we want to update the screen on button press and push notification. We can represent both these events with a single Signal<Void, NoError>, where NoError is a built-in type that, you guessed it, means the signal can’t error out. This makes sense, since there is no notion of a button press being an error from the application’s standpoint, nor of a push notification being one. The timeline for a signal like that is dead simple:

…  --  .Next(Void) -- .Next(Void) -- .Next(Void) -- …

In our experience, most of our Signal instances have had NoError as their error type. When something doesn’t have a well-defined beginning or end, it becomes more convenient to model it as never failing.

Cold signals, by contrast, are signals that encapsulate a behavior that can be started and that often finishes. A network call is an excellent example: it is started on demand, and it can succeed, returning the data, or it can fail, returning an error code. The type we use for cold signals is SignalProducer:

struct SignalProducer<T, E:ErrorType> { /*…*/ }

Like Signal, a SignalProducer emits Events. The big difference comes in the way the timelines will typically look. For instance, let’s think again about our network call. Its type would likely be SignalProducer<NSData, NetworkError>, where we assume we have a NetworkError type that conforms to ErrorType. We have several possible timelines for this signal. One is the successful network call:

| -- .Next(data) -- .Completed

Here, I have used | as an indication that the producer was explicitly started. Another timeline is the bad call:

| -- .Error(.NotFound)

Finally, another one is the cancelled call:

| -- .Interrupted

But what makes this representation really powerful is that there is no need to assume all the data returns at once. If we have a long-lived data task, say an NSURLSessionDownloadTask that calls a delegate many times during its execution, it can still be represented by the same type. Here are the equivalent timelines in that case:

| -- .Next(data1) -- .Next(data2) -- .Completed

| -- .Next(data1) -- .Error(.ConnectionLost)

| -- .Next(data1) -- .Interrupted

Thus, a SignalProducer<NSData, NetworkError> is a generalized representation of a network call that can be adapted to any specific case.

• • • • •

Okay, all of this may make sense, but it still doesn’t explain why anyone would go through the effort of representing things as Signals or SignalProducers. Nor do we yet know how to create them, or use them. So let’s look at the first part, creating a SignalProducer.

Comic Cathy is writing an app that keeps track of her comic book collection. The collection is on a server, but she doesn’t want to download the entire thing every time she launches her app, so she has a local store. When the app launches, she wants to populate a table view with her collection. However, she doesn’t want to wait until the app is done syncing; she wants to display what’s in the store first, and then load any updates.

Fresh from learning about SignalProducers, Cathy thinks about her data. She sees that what she wants will come in two steps: first an array of existing comics, and then an array with any new comics. Getting the first array could fail if the local store produces an error, and getting the second array could fail if the network call produces an error. Either way, that would be a retrieval error. Oh, and if the store fails, she doesn’t want to make the network call and show the user confusing or incomplete info. Perfect! Cathy can define a function that returns the appropriate producer:

func comicCollectionProducer()
       -> SignalProducer<[Comic], RetrievalError>

What should the implementation be? Let’s make a few simplifying assumptions. Let’s assume that both retrieving the comic info from the local store and retrieving it from the network are synchronous calls. Let’s say the functions are:³

func localComics() -> Result<[Comic], LocalStoreError>
func networkComics() -> Result<[Comic], NetworkError>

If we look at the API for SignalProducer, we see that the main initializer has a strange type:

public init(_ startHandler:
            (Signal<T, E>.Observer, CompositeDisposable) -> ())

Ouch. What does that mean? Let’s break it down. First of all, init takes a closure. This is called the startHandler because it gets called when the start method is called on the producer. Now let’s look at the parameters. The first parameter to the handler is a Signal<T, E>.Observer; this is, in common parlance, a sink: it’s where we send the events that the producer generates. The second parameter is a disposable. This is a memory management mechanism that is specific to ReactiveCocoa; for now, we can ignore it.

Armed with this knowledge, Cathy writes the following implementation for her function:

func comicCollectionProducer()
       -> SignalProducer<[Comic], RetrievalError> {

    return SignalProducer { sink, disposable in
        switch localComics() {
        case .Success(let comics):
            sendNext(sink, comics)
        case .Failure(let error)
            sendError(sink, retrievalErrorForStoreError(error))
            return // errors terminate the signal
        }

        switch networkComics() {
        case .Success(let comics):
            sendNext(sink, comics)
            sendCompleted(sink)
        case .Failure(let error)
            sendError(sink, retrievalErrorForNetworkError(error))
        }
    }
}

She first fetches the comics from the local store and sends them along by calling sendNext. This creates a .Next event of the appropriate type and emits it on the sink. If that fails, she sends an error to the sink, first turning it into a RetrievalError.

If the local fetch completes successfully, she carries out the network call and repeats the process. The only difference is that, since there is no more work to be done after sending the network data, she calls sendComplete, thus terminating the signal.

To her delight, everything works.

• • • • •

Still, this doesn’t seem like it’s a brilliant argument for using ReactiveCocoa, does it? Or for thinking all those posts about error handling and flatMap were particularly useful. That’s because Cathy’s implementation doesn’t make use of the standard library of functions that ships with RAC and, indeed, with any FRP framework.

You see, signals are fundamentally collections. And just like one can define map, reduce, flatMap, and other functions on arrays, one can define similar functions on signals and signal producers. So the power of this representation is in the way it allows us to manipulate signals as collections. For instance, what if I told you that the initializer above could be rewritten as:

func comicCollectionProducer()
       -> SignalProducer<[Comic], RetrievalError> {

    let localFetchProducer = SignalProducer(result:localComics())
        |> mapError(retrievalErrorFromStoreError)

    let networkFetchProducer = SignalProducer.try(networkComics)
        |> mapError(retrievalErrorFromNetworkError)

    return localFetchProducer |> concat(networkFetchProducer)
}

Now it’s looking more interesting, isn’t it? But of course, a lot more dense. So let’s take a look line by line at what is happening. I am going to focus on the meaning of the lines, and less on the mechanics of how every detail is achieved, because part of the power of FRP is that it gives you a vocabulary that abstracts those mechanics away.

At a high level, we are creating two producers and concatenating them. The |> operator is an extremely versatile and powerful operator whose mechanics we are going to ignore for now. When you read it, just read “take the thing on the left, and do the thing on the right to it once the signal is started”.

That last bit is crucial, by the way. The |> operator creates a specification. It doesn’t do anything during the call to comicCollectionProducer; instead, it defers all actions to the moment when the SignalProducer returned by the function is started.

Reading the return line in that light, we see it says “take the producer on the left and concatenate to it the producer on the right”. In this context, “concatenate” means “wait until the first one is done, and then start the second one”. Simple. Crucially, the second one is started only if the first one completes; if it is interrupted or errors out, the second one is never started. This is exactly the behavior Cathy wants.

Now let’s take a look at how we create the two producers. The first producer, localFetchProducer, is created in two steps. First, we create a new SignalProducer from the result of the localComics() call. This equivalent to writing the following:

SignalProducer { sink, disposable in
        switch localComics() {
        case .Success(let comics):
            sendNext(sink, comics)
            sendCompleted(sink)
        case .Failure(let error)
            sendError(sink, error)
        }
}

It’s such a common thing to want to write that the framework provides this convenience initializer. Now if you look at the code carefully, you’ll see that the type of this producer is SignalProducer<[Comic], LocalStoreError>. However, the signature of the comicCollectionProducer function calls for the error to be a RetrievalError. That’s where the second part of the creation comes in.

Per our previous semantics, the second part of the initialization of localFetchProducer says “take the signal producer and map its errors to new errors using the retrievalErrorFromStoreError function”. Again, it looks like this:

    |> mapError(retrievalErrorFromStoreError)

In other words, if the first signal results in an array of comics, this line has no effect. If, however, it results in an error, it takes that error and uses the retrievalErrorFromStoreError to turn it into a RetrievalError. Since this is a specification rather than a direct action, what the |> operator returns is actually another signal producer, with type SignalProducer<[Comic], RetrievalError>. Victory! That’s what we wanted.

The second producer is slightly different. We want to make the network call, but it’s very important that it happen after the local store fetch, because we don’t want to block or delay that fetch (remember that networkComics is synchronous).

If we were to use the same initializer as before, networkComics would get called at initialization, i.e. during the comicCollectionProducer call. That’s fine for the local store call, but definitely not for the network call, which should not be made if, say, the local store call ends in error.

Fortunately, that too is a very common scenario, and SignalProducer has the try static function that instead of taking a Result takes a closure that returns a Result. This function gets called only when the start method is called on the producer. Effectively, it can take a function, like networkComics, and wait until it is started to execute it.

Once again, the signal producer returned by SignalProducer.try(networkComics) has the wrong type: SignalProducer<[Comic], NetworkError>. Like before, we deal with that through mapError, which is the second line of this call.

• • • • •

If you’re still with me, you waded through twelve paragraphs to explain five lines of code. First of all, congratulations. Second, isn’t that cool? The semantics of that code are precise and concise — and more abstract than anything I’ve ever been able to write with any framework. At the end of the day, this is what that code is saying:

Try to fetch things locally, and turn any errors into something we can understand.
If that succeeds, do a network fetch, and again turn any errors into things we can understand.

The fact that we can express it almost as concisely as we can say it at that high level is incredible. And in addition to being concise and high-level, each part of this process is testable in isolation:

The network and local fetches can be tested by themselves.
The error transformations can be tested by themselves.

Finally, if we make networkComics and localComics parameters to comicCollectionProducer, the entire chain can be unit tested. In a completely controlled manner. That’s truly golden.

• • • • •

Oh, man. It’s the end of the day, and Cathy just decided she really would rather wait for everything to sync. And she really wants to show her work to her friend in like an hour. How can she rewrite the whole thing to return everything at once quickly and without error?

Turns out, she doesn’t have to. All she needs to do is change the return line in comicCollectionProducer from this:

return localFetchProducer |> concat(networkFetchProducer)

to this:

return localFetchProducer |> concat(networkFetchProducer)
       |> reduce([]) { $0 + $1 }

It’s that simple. I promise.

• • • • •

Alright, I’ve shown you a reasonably thorough example of how to create a SignalProducer, including creating simple ones from primitives and combining them into the producer we want using the |> operator and various higher order functions. That’s half of using FRP. The other half is how to use the events the producer emits. This post has grown enormous already, so I’m going to leave that aspect for [my next post][part-ii]. Stay tuned.

¹ Swift 1.2 still doesn’t support declarations like that one; the cases have to instead take Box<T> and Box<E> types. I have removed that for simplicity and because I’m hopeful that one day, that blight on my soul will be lifted and this post will be both easy to follow and valid Swift.↩︎

² SWIFT 2 MAKES THIS CORRECT CODE!! Ahem. Carry on.↩︎

³ I’ve [written before][result-post] about the Result enum, and [Robert Napier][] wrote a nice implementation that has been [merged][result-mf] into the microframeworks maintained by [Rob Rix][].↩︎

[Robert Napier]: http://twitter.com/cocoaphony
[result-mf]: https://github.com/antitypical/Result
[Rob Rix]: https://twitter.com/rob_rix
[result-post]: http://nomothetis.svbtle.com/error-handling-in-swift
[error-handling-2]: http://nomothetis.svbtle.com/error-handling-in-swift-part-ii
[optional-chaining]: http://nomothetis.svbtle.com/understanding-optional-chaining
[implicit-optionals]: http://nomothetis.svbtle.com/implicitly-unwrapped-optionals-in-depth
[ReactiveCocoa]: http://reactivecocoa.io
[argo]: https://github.com/thoughtbot/Argo
[eidhof]: http://chris.eidhof.nl/posts/json-parsing-in-swift.html
[jss]: https://twitter.com/jspahrsummers
[joshaber]: https://twitter.com/joshaber
[part-ii]: http://nomothetis.svbtle.com/reactivecocoa-ii-reacting-to-signals

You Are More Than a Coder

2015-02-25T09:35:32-08:00

The answer — by demonstration — would take care of that, too.

— Isaac Asimov, [The Last Question][last-question]

From time to time, I stumble across something beautiful and true. It happened recently, and this is me trying to share it. It has a formal name, the [Curry-Howard correspondence][chc], but it’s one of those rare bits of knowledge that, once known, feel so inevitable, they almost don’t need a name. It may not impress you as deeply as it did me; you may not see the point; you may not care. But it is the most profound and elegant thing I know about programming.

• • • • •

Programming is an act of creation. Constrained by business needs, by hardware limitations, by our own reach which may exceed our grasp, we are still, in the end, creators. But — we are not artists. There is elegance in what we do; it is not the goal. There can be beauty in the product; it is subordinate to function. There can even be transcendence; it is not a given. We have obligations: stability, reliability, functionality, effectiveness, productivity, testability, maintainability; we have jobs, and if art results, fantastic; if it doesn’t, that’s fine.

Neither are we engineers or scientists. We don’t design physical objects; we don’t care about gravity, electricity, or magnetism; evolutions is irrelevant to us; neuroscience is a mystery. We create in an abstract world that interacts, in points, with the concrete, but that is separate from it. We manipulate ideas; we transform data; we process knowledge; we present concepts. Under it all, no matter how we dress things up, what we do is invent new ways to move around zeros and ones. There’s a name for our kind: we are mathematicians.

Some of us may not think so. Some of us hated mathematics in school, some ran from calculus, some balked at statistics. Some of us came to programming through an altogether different route, via the liberal or fine arts. Some of us may even be bad mathematicians. But mathematicians we all are. That is what the Curry-Howard correspondence tells us.

• • • • •

If you’re like me, your first instinct when you look at a function is to wonder what it does. Take the following Ruby method:

def add_numbers(arr)
  arr.reduce { |memo, num|
    memo + num
  }
end

What it does is clear: it adds ([or concatenates…][types-as-units]) the values in an array. But let’s look at it a different way; let’s look at only the declaration: def add_numbers(arr). Even by itself, it carries information: it says there is a method that takes an object as a parameter and returns another object.¹

Let’s look at a similar method in Swift:

func addNumbers(arr:[Int]) -> Int {
    return reduce(arr, 0) { memo, num in
        return memo + num
    }
}

What does the declaration tell us this time? It tells us hat there is a function that takes an Array of Ints and returns an Int. Obviously this declaration conveys more information than the Ruby one; that’s an important point and I’ll get back to it later. For now, let’s just recognize that in both languages, declarations contain information.

I’m still phrasing the information in terms of what the function or method does. It’s equally appropriate to phrase it as a statement about the world:

The Ruby method declaration states that there exists a way to transform an object into a (possibly different) object.
The Swift function declaration states that there exists a way to transform an Array of Ints into a single Int.

Once you start making statements about the world, though, you have to back them up. Doing so in this case is easy: write at an implementation. How do you show that there is a way to transform an object into another object? Write code that does it! How do you show there is a way to transform an array of integers into a single integer? Write code that does it! And how do you know your implementation is valid? You type-check it.

And this is the important caveat. A proof of a declaration is valid if it type-checks. But that doesn’t mandate a particular implementation. Here is a Ruby method that type-checks,² i.e. that matches its declaration:

def add_numbers(arr)
  return nil
end

This implementation doesn’t do what the function name says, but that doesn’t prevent it from type-cheking. Likewise, this Swift implementation function fulfills its declaration, despite not adding the numbers either:

func addNumbers(arr:[Int]) -> Int {
    return 0
}

From this, we can get to the essence of the Curry-Howard correspondence, which is that type systems define a logic: a set of statements that can be made and proven using only types. Function declarations are the statements; implementations are the proof. Since the name of a function isn’t part of its type, it places no constraints on the proof; that’s why we can have a Swift function named addNumbers that does no such thing. What we can’t do is declare our addNumbers function as above, but have it return a String. Its type doesn’t allow it.

Put all this another way, every time we write a function declaration, we state a theorem, and every time we write an implementation that type-checks, we prove it. As I said, we are all mathematicians.

• • • • •

If type systems define the statements that can be made and proven, that would imply that different type systems let you prove different things. And indeed, that is the case. Some type systems are powerful and others are not, in a very precise sense. Ruby’s type system is not very powerful, because not many statements can be made with it. In fact, only one statement can be made. Here it is:

Every group of objects can be transformed into an object.

To be a little more charitable, there is in fact an infinity of statements that can be made, but they all follow the same pattern:

There is a way to create an object.
There is a way to transform an object into an object.
There is a way to transform two objects into an object.
There is a way to transform three objects into an object.
Etc.

This is a fundamental limitation of the language — but notice that it doesn’t make the language any less useful for software development. It’s simply that its type system isn’t very expressive.

Swift, by contrast, is much more expressive. Here are a few statements that can be made with Swift function declarations:

There is a way to transform a string into an integer.
There is a way to transform a URL into an HTTP response code.
There is a way to transform a touch and a map view into geographical coordinates.
There is a way to transform an array of integers into an image.

As before, we need to remember that statements being more specific is not the same as their admitting only one implementation (in other words, a unique proof). However, an expressive type system serves to narrow down the possible implementations to only those that satisfy the statements’ type, and an automated type checker serves to verify that the type declarations are respected — that is, proven.

And this is where we come full circle to how this knowledge is changing the way I program. Because while the Curry-Howard correspondence shows that there is a direct mapping between doing something and proving that it can be done, there is a world of difference in how I approach a task and how I approach a proof.

In the past, my mental process when I wrote a function has been:

What does my function need to do?
What parameters does it need to do it?
Implement it.

I am now shifting to a different way of thinking:

What truth do I know about the world?
Prove it.

The first two questions are condensed into one, because the parameters are the first part of any theorem: given X, Y is true. By tying them at the hip, I find that I often have to think about whether the statement makes sense before starting to write code, particularly if I try to write in a [referentially transparent][rt] style.

The action item, though, is where the main difference lies. An implementation has to work; a proof must be ironclad. While the type checker will verify that my implementation proves the function declaration, I am invariably trying to prove a much more specific statement than can be expressed in a declaration. For instance, I would not be trying to prove just that a touch and a map can be transformed into a coordinate; I would be trying to prove that they can be transformed into the coordinate of the point on the map where the touch occurred. This is not expressible in the function declaration, but it is still a provable fact about the world. I find that thinking about my functions as proofs makes me write better code, code that is more cognizant of edge cases and error conditions.

This approach is completely language-agnostic and type-system agnostic. Yes, stronger type systems allow the function declaration to be more specific. Since all strong type systems typically include a type checker in the compiler toolchain, a side-benefit is that they also verify the declaration’s proof is valid.³ But once again, valid is not necessarily useful. I’m trying to encourage a different way of thinking about what it means to program, not pushing a particular language or type system.

• • • • •

I’ve tried to keep this post light on the theory of the Curry-Howard correspondence, because what was revelatory to me wasn’t the mathematics of it, but rather the idea that I could approach programming as an exercise in proving theorems. For actual details, I highly recommend Alyssa Carter’s [fantastic introduction][type-systems-and-logic] to the formalism of type systems and the logics they correspond to; it’s an incredible read and I learned a lot from it — with a grin on my face the whole time. This stuff is just cool.

If you’re not so inclined, though, I still hope you have gotten something out of this rumination. If nothing else this: that you, the programmer, are a mathematician. Every time you’ve written a function declaration, you have had something to prove. And proven it you have, over and over again.

¹ In Ruby, every scope has a return value, and it is the last value computed in the scope. Therefore, every function has a return value, whether return is explicitly called or not. In addition, everything in Ruby is an object (including entities other languages might treat as primitives, like nil).↩︎

² Ruby does have types; they’re just so simple that a Ruby method is consistent with its declaration by virtue of being syntactically correct.↩︎

³ And as [increasingly powerful type systems][idris] start appearing, more and more of the proof will be verifiable.↩︎

[last-question]: http://www.multivax.com/last_question.html
[types-as-units]: http://nomothetis.svbtle.com/types-as-units
[chc]: http://en.wikipedia.org/wiki/Curry–Howard_correspondence
[type-systems-and-logic]: https://codewords.hackerschool.com/issues/one/type-systems-and-logic
[rt]:http://en.wikipedia.org/wiki/Referential_transparency_(computer_science)
[idris]:http://www.idris-lang.org

Types as Units

2015-02-18T20:20:31-08:00

A few years ago, Steve Yegge wrote a [great piece][yegge] arguing that software developers can be divided into conservatives and liberals, and moreover that, like in politics, there are some issues where the dividing lines are very clear. The first issue on his list was type systems. Given how different Swift’s type system is from Objective-C’s, I’m going to take a more general look at types as a concept. You’ll find that, like many people who were enthusiastic about Swift from the get-go, I am a software conservative on this issue.

The prototypical example of a liberal language is Ruby. Ruby has no compile-time type checking. As long as your syntax looks like it belongs to a Ruby program, the interpreter will happily load it and try to do something with it — but it will crash your app if you messed up and tried to add a Restaurant and a PoisonDart (you have an interesting problem space).

The paragon of conservative languages is Haskell. Its compiler will exhaustively check every implication of your declared types and only produce an executable if they are consistent. Your application will rarely crash once it’s running, but the compiler will not let you get away with trying to juggle an Airplane.

On the spectrum, Swift falls much closer to Haskell than to Ruby. Objective-C as used by most people I know and open-source projects I’ve seen is still closer to Haskell than to Ruby, but only by a little; it can potentially be very close to Ruby, as it uses dynamic dispatch under the hood and practically everything can be cast to id.

But really … what are these “types” that so divide us?

• • • • •

Before I ever wrote a line of code, I studied engineering. Early physics classes emphasized the importance of units: a ball doesn’t have a speed of 5, it has a speed of 5 meters per second. Non-controversial, but there’s a subtlety here. Suppose I want to find another speed. Nothing in arithmetic prevents me from multiplying 5 meters per second by 10 apples per pear. I’ll get fifty…meter-apples per second-pear? That makes zero sense. But why?

To say that 5 is the speed of a ball implies that 5 has units of distance per unit of time, and vice versa. The reason the result was nonsense is clearly that meter-apples per second-pear is by definition not a unit of speed — any such unit must be some distance over some time interval.

Put that way, units seem tautological, except for this: arithmetic doesn’t understand units. Units constrain arithmetic, defining how quantities with units combine to create quantities with other units — and letting you know that certain arithmetical operations give nonsense results for your problem and certain operations aren’t allowed at all.

For instance, you can always multiply quantities with disparate units — but you might get results that are in meter-apples per second-pear, which might not be what you were looking for. On the other hand, you can’t add quantities with disparate units: 5 meters per second can’t be added 20 inches per year; you have to convert one to unit to the other. Units annotate our quantities and imbue them with additional information that, when enforced, keeps results consistent.

• • • • •

Several of my classes featured equations galloping across the whiteboard like Cossack armies in the Russian winter, and my professors often brought up [dimensional analysis][] as a useful tool for double-checking our work. After a derivation that would often take a page or two, they recommended looking at the resulting units of all the quantities before plugging the final numbers in, to make sure that we were getting the right units out.

None of us ever did that.

It’s not that we didn’t care. We were learning to design airplanes; an error would result fiery death; of course we cared. (Okay, we didn’t care that much; we were students.) The problem was that it was an extra set of computations that was time-consuming and that was about as good as looking at our results and making sure they passed the smell test, i.e. that they were in the right ballpark. It wouldn’t save us on a test if we really screwed up, but we trusted our symbolic manipulation skills enough to say that if 1. we got a result and 2. the result was in the right ballpark, the result was probably right.

If we’d had a way to put the final algebraic result in a computer and have it spit out the units, though, a lot more of us would have done it. After all, manually computing the units is tedious (and error-prone), but using an automated tool to sanity-check our work would have been just common sense.

Here’s where I’m going with this: types in software development are directly equivalent to units in engineering and science. They aren’t necessary to do your work, they don’t prevent you from screwing up, and they don’t prevent you from being inconsistent — but they help you understand what the entities you’re dealing with are.

Automated type checking, on the other hand, is a tool that helps you be consistent. And the more powerful your type system, the surer you can be of your code’s internal consistency.

• • • • •

The value of consistency to a software project is hard to quantify. Consistency doesn’t equal quality, and it doesn’t even equal sanity. But I am a conservative when it comes to type systems because it does equal focus.

Back in school, when I made a mistake that resulted in a quantity with nonsense units, I felt like an idiot. Seriously, at what point did I think I’d multiply speed by distance and get a pressure? The mistake was completely preventable and I had nevertheless burned time thinking about my problem with an inconsistent mental model of the world. For however many computations until I caught a given unit error, my mental model of the universe had that error as a valid thing to do.

And sometimes the model would stay wrong for a good long while! A page or two of calculations! If someone had told me, from the get-go, that what I was trying to do was nonsense, all that work would have been saved, and I would have spent more time building an appropriate model and solving the actual problem. That someone, when I program, is the compiler.

In a language like Ruby, a type error will go undetected until runtime. Consider this simple method to add the numbers in an array:

def add_numbers(arr)
  arr.reduce { |memo, obj|
    memo + obj
  }
end

It does its job magnificently — as long as we’re passing in an array with numbers. If we pass in an array with strings, though, it concatenates them. Most of the time, this is fine. But what if I am manipulating an array of numbers represented as strings, say ["1", "2", "3"]. Mentally, I could very well be thinking of that array as numbers, and when I pass it to the array, I would get back "123" instead of 6. At some point, this inconsistency will bite me — but not until I’ve expended significant effort writing code that assumes there is no inconsistency.

In Swift, by contrast, the method is explicit:

func addNumbers(nums:[Int]) -> Int {
    return reduce(nums, 0) { memo, num in
        return memo + num
    }
}

I could never call addNumbers with ["1", "2", "3"], because that array’s type would be inferred as [String], and the compiler would immediately let me know that I am trying to do something inconsistent. That wouldn’t by itself solve the problem (do I need to make the method more generic, or do I need to convert the strings into integers?), but at least it would prevent me from writing code as if there were no problem.

• • • • •

I emphasize that this is personal preference. I can’t claim that I will write code faster, or that my code will be better, due to the type system. I merely say that the compiler will catch a certain class of mistake early in the development process, and force me to keep an internally-consistent model of the world — to focus on my model. I value that, but others might prefer the freewheeling development afforded to them by Ruby. That’s fine.

What I do want to emphasize, though, is how much like units types are. You can do science and engineering correctly without doing dimensional analysis. But if you get into an inconsistent state with your units, they will [bite you][gimli-glider] — perhaps [catastrophically][mars-orbiter]. The only way to avoid such issues is to do (correct) dimensional analysis before using the results of any calculations.

Software has it a little easier. During development, it’s okay to get in an inconsistent state: the app will crash in the test server, or the compiler will whine, and the bug will be fixed before being rolled out to the real world. Even in production, most applications aren’t so critical that an occasional crash will have dire consequences.

Some are, though. And those applications — avionics, guidance, air traffic control — are often written in type safe languages like [Ada][]. Even when written in more traditional languages, their code style guidelines are [very stringent][jpl-code-style], in hopes that the compiler and analyzer will catch more bugs. Isn’t that interesting?

It makes sense, if you think about it. Mission critical software has a very high cost of failure, and is often difficult to test until it is actually controlling a plane, or guiding a missile, or directing air traffic — and those are terrible, horrible, no good, very bad times to discover an internal consistency error. Even in a testing context, crashing a test plane is a multi-million dollar error, and I’m sure air traffic control validation tests are not cheap.

What type systems give us, then, is a particular class of verifiable facts about our code base. In some contexts, knowing those facts is not particularly useful; in others, it is absolutely critical. But no matter what the actual needs of the software system itself, one can have a preference for up-front verifiability or a preference for implicit trust in the developer’s abilities. I tend to prefer verifiability, stodgy conservative that I am.

• • • • •

“Verifiability” … that’s very very close to “provability”, isn’t it …? Does that mean that type systems let us prove certain things?

Yes, as a matter of fact.

And this newfound knowledge is slowly changing how I approach programming. I’ll go into that soon.

[yegge]: https://plus.google.com/110981030061712822816/posts/KaSKeg4vQtz
[dimensional analysis]:http://en.wikipedia.org/wiki/Dimensional_analysis
[Ada]:http://en.wikipedia.org/wiki/Ada_(programming_language)#History
[gimli-glider]:http://en.wikipedia.org/wiki/Gimli_Glider
[mars-orbiter]:http://en.wikipedia.org/wiki/Mars_Climate_Orbiter#Cause_of_failure
[jpl-code-style]:http://spinroot.com/gerard/pdf/P10.pdf

Magical Future Swift Is (Almost) Here

2015-02-15T09:01:00-08:00

I’ve been holed up dealing with Metal for a bit (which I was completely unfamiliar with, and therefore unable to blog about), but Swift 1.2 came out last week, rousing me out of my graphics-induced coma. And boy was I in for a surprise. Remember my posts on [monads], and all my talk of Magical Future Swift? It’s here!

Well…almost.

• • • • •

A brief recap for those who don’t remember: I motivated my discussion of Magical Future Swift by talking about the nested optionals problem, wherein Birthday Beth wants to have a super fun birthday party, but only if all of her friends show up:

func partyGameForFriend1(friend1:Friend?,
                        friend2:Friend?, 
                        friend3:Friend?) -> Game {
    if let f1 = friend1 {
        if let f2 = friend2 {
            if let f3 = friend3 {
                return Game.Superfun(f1, f2, f3)
            }
        }
    }

    return Game.Regular
}

Recognizing how awkward this was (as has everyone who has used Swift 1.1 and below), I made the suggestion that Magical Future Swift would allow something different: for-comprehensions:

// Magical Future Swift
func partyGameForFriend(friend1:Friend?,
                        friend2:Friend?, 
                        friend3:Friend?) -> Game {

    let optionalGame = for {
        f1 <- friend1
        f2 <- friend2
        f3 <- friend3
    } yield {
        Game.Superfun(f1, f2, f3)
    }

    return optionalGame ?? Game.Regular
}

Basically, the <- operator unwraps the optionals, and calls the yield block with the unwrapped result, but only if all the optionals are unwrapped. If you’ve looked at Swift 1.2, this should be awfully familiar. It’s the new, supercharged if-let:

// Swift 1.2
func partyGameForFriend(friend1:Friend?,
                        friend2:Friend?, 
                        friend3:Friend?) -> Game {

    if let f1 = friend1,
       let f2 = friend2,
       let f3 = friend3  {

        return Game.Superfun(f1, f2, f3)
    }

    return Game.Regular
}

I didn’t discuss the possibility at the time (and to be truthful, was only vaguely aware of it), but Swift 1.2 adds some extra sweetness to the deal in the form of guard clauses. Suppose the super fun party only works if Beth’s first two friends are still on good terms. With the new if-let syntax, we can do that without an extra if block:

// Swift 1.2
func partyGameForFriend(friend1:Friend?,
                        friend2:Friend?, 
                        friend3:Friend?) -> Game {

    if let f1 = friend1,
       let f2 = friend2,
       let f3 = friend3 
       where f1.isGoodFriendsWith(f2) {

        return Game.Superfun(f1, f2, f3)
    }

    return Game.Regular
}

This will only call the super fun game branch if all the values exist and f1 is still good friends with f2. Pretty powerful stuff. To make things even better, we can use an optional unwrapped in an early let declaration in a later let declaration. Suppose, for instance, that inviting friend3 always results in her best friend showing up (if she has one). We could write:

// Swift 1.2
func partyGameForFriend(friend1:Friend?,
                        friend2:Friend?, 
                        friend3:Friend?) -> Game {

    if let f1 = friend1,
       let f2 = friend2,
       let f3 = friend3,
       let f4 = f3.bestFriend() // Friend.bestFriend() -> Friend?
       where f1.isGoodFriendsWith(f2) {

        return Game.Superfun(f1, f2, f3)
    }

    return Game.Regular
}

This works as long as the right side of the binding (i.e. f3.bestFriend()) returns an optional—which makes sense; if it didn’t, we could simply work it in the body of the if-let block.

The only real difference between Magical Future Swift’s for-comprehensions and the if-let version is that if-let does not return a value; instead it behaves as a traditional control flow statement. But this is, in fact, a little limiting.

• • • • •

If you remember, for-comprehensions weren’t limited to optionals. They could handle any monadic type: optionals, lists, futures, errors, etc. Swift 1.2’s if-let syntax doesn’t let us do that—and the fact that if-let statements don’t return a value actually makes it somewhat difficult to adapt them.

Let’s be specific. What would it look like if we used if-let in the context of lists? Let’s review the example where Scheduling Sam tries to create match-ups between players from different cities for a chess tournament, presented as a for-comprehension:

func matchupsForCities(cities:[City]) -> [Matchup] {

    return for {
        city <- cities
        p1 <- teamAForCity(city)
        p2 <- teamBForCity(city)
    } yield {
        Matchup(city:city, player1:p1, player2:p2)
    }
}

With the appropriate semantics, the if-let idiom can be adapted:

func matchupsForCities(cities:[City]) -> [Matchup] {

    var matchups = [Matchup]()
    if let city = cities
       let p1 = teamAForCity(city)
       let p2 = teamBForCity(city) {
        matchups.append(Matchup(city:city,
                             player1:p1,
                             player2:p2))
    }

    return matchups
}

This follows the same rules as the optional binding: the right side of the let bindings must be lists, and the left side will be whatever the elements of the list are: City or Player, in our case. But there’s that nasty var that we have to declare at the beginning, because if-let doesn’t return values. And if it did return values, we’d have to introduce a new keyword, like yield, to differentiate what is supposed to be returned as a value from the if-let statement from an early return call that exits the function.

To add a further blemish, if-let isn’t very intuitive in this context. While we could get used to reading it, in the case of lists, as “do this if there are still city-and-two-player-combinations that haven’t been processed”, it’s not the first interpretation; for-let would probably work better.

So while I really like the if-let idiom for optionals, it’s not easily extensible. What happens when we deal with futures and promises? At that point, should we stick with if-let, try for-let, or have a more dedicated when-let or after-let? What about other monadic types? Does Swift have to provide specific syntax for every monad we could possibly want to use? The advantage of for-comprehensions is that they’re just generic enough that they can acceptably work in any context. The if-let idiom is less intuitive to expand.

Of course, some of this is habit, and, much like before, I have no particular intuition as to where the language is headed. In fact, I’d say that my focus on for-comprehensions made me miss the obvious if-let extensions that made optional handling better, despite them having been discussed on the developer forums a few times.

• • • • •

At the end of the day, I have to try to remember Chris Lattner’s admonition that Swift is intended above all to be a practical language. The new if-let syntax solves an acute existing problem; full for-comprehensions would be nice to have in my view, but people aren’t exactly clamoring for them (I checked by searching the dev forums; nowhere to be found).

So Magical Future Swift is almost here. But it might stay almost here for quite a while. As for myself, I look forward to Swift Next. Extrapolating the release schedule so far, it would be shipping sometime in June.

Say, does Apple have some sort of event in June where it might showcase it?

[monads]: http://nomothetis.svbtle.com/the-culmination-part-ii

Swift for Scripting

2014-10-09T08:55:27-07:00

So Swift 1.0 has come and gone, and Swift 1.1 is just around the corner. As we’re getting closer to a more stable shape for the language, I’m interested in its potential as a scripting language for OS X. In case you missed it, Swift [can be used][shell-script] as a scripting language by invoking it via the good old-fashioned shebang syntax:

#!/usr/bin/env xcrun swift

import Foundation

… // do the things

However, being able to invoke it and being able to do something useful with it are two very different issues. Swift is not yet ready for useful scripting — but I think it has the potential.

• • • • •

Brevity is the soul of scripting. There are a few things any scripting language needs to get right, to support out-of-the-box in a solid and reasonably intuitive sense. They are:

File I/O (including file system traversal, permissions, and of course reading and writing)
String manipulation (including sensible methods for trimming strings, as well as strong regular expression support)
Option parsing — because any non-trivial script will take options

At this point, Swift is good at none of these. String manipulation is reasonably close, minus the complicated regex syntax, but file I/O is shackled to the [NSFileManager API][], which is powerful but insanely verbose, and [Objective-C option parsing][obj-c-opts] libraries are also not notable for pithiness.

So a good place to start to make Swift usable is to write a good library to support strings (ExSwift is already reasonable), a good library to do file I/O (preferably with strong defaults for line-by-line parsing of a file), and a good library for option parsing. With that, the foundation is laid.

• • • • •

That’s the language foundation. If Swift is to take off as a scripting language, it also needs infrastructure: a good framework management system. Apple does provide strong support for frameworks, with /Library/Frameworks being reserved for third-party frameworks that the compiler automatically picks up. It also provides [versioning guidelines][apple-versions] that are very close to semantic versioning, minus the obligation that the main version identifier be a number.

However, this is short of what’s needed. What we really need is [Homebrew][] for OS X modules. I would expect that CocoaPods is a better starting point than Homebrew for actual implementation purposes, of course, since it already addresses a similar problem for applications, but the general idea is there.

Irrespective of how it works, though, it brings up the problem of proper versioning. The most consistent and well-defined versioning scheme I’ve come across is [Semantic Versioning][semver], which is also the system used by CocoaPods. If the tools quickly emerge to support this standard, a lot of heartache, confusion, and dependency hell is going to be easily avoidable going forward.

Good versioning tools should support two main things:

Consistent bumping of versions.
Automated API diffing that determines whether a bump between two points in code should be a major, minor, or patch bump.

This would allow automated versioning, helping to reduce human error, and allowing versions to live up to their contract more consistently.

• • • • •

With this in mind, over the past few weeks, I’ve worked on two libraries that help a little bit with Swift’s command-line scripting.

The first and most complete is [SemverKit][]. It’s a library for parsing version strings according to the semantic versioning spec, with additional support for bumping versions in a consistent manner. Applications include:

Dependency resolution
Automated versioning (coupled with some API diff tools)
Stabilization of CocoaPod libraries (this can be useful if you use internal libraries with alpha/beta versions)

SemverKit is ready for general usage; it implements version equality and comparison in a spec-compliant manner and supports metadata as well.

The second, which is less feature-complete, is [OptionKit][], a library for parsing command-line options. It supports the most common scenarios (flags, named commands, required parameters), but does not have any advanced features (sub-parsers, for instance). Still, I believe it can be taken out for a spin, and will serve basic needs. As more advanced uses are needed, it can expand.

Incidentally, both of these depend on custom implementations of the Result object, which has been implemented many times, most notably by [Rob Napier][]‘s [LlamaKit][]. LlamaKit is a great example of what should be one of the foundational libraries of all Swift — including command-line Swift. It could then live in /Library/Frameworks and would be easily imported into a project.

• • • • •

So there we have it. I believe Swift is an excellent candidate for scripting on OS X, being that it’s naturally pithier than Objective-C, and has strong type inferencing that reduces typing overhead. In addition, it’s more productive to be able to script and write application code in the same language.

However, we are not there yet. There is still a lot of work to do to set up the plumbing. I hope OptionKit and SemverKit help a little along the way. How about it? Want to join the fun?

[shell-script]: http://practicalswift.com/2014/06/07/swift-scripts-how-to-write-small-command-line-scripts-in-swift/
[apple-versions]:https://developer.apple.com/library/mac/documentation/macosx/Conceptual/BPFrameworks/Concepts/VersionInformation.html#//apple_ref/doc/uid/20002255-BCIECADD
[NSFileManager API]: https://developer.apple.com/library/mac/documentation/Cocoa/Reference/Foundation/Classes/NSFileManager_Class/Reference/Reference.html
[obj-c-opts]: https://github.com/mysteriouspants/ArgumentParser
[Homebrew]: http://brew.sh
[semver]: http://semver.org
[SemverKit]: https://github.com/nomothetis/SemverKit
[OptionKit]: https://github.com/nomothetis/OptionKit
[Rob Napier]: https://twitter.com/cocoaphony
[LlamaKit]: https://github.com/LlamaKit/LlamaKit

A Gotcha When Testing Swift Frameworks with Xcode 6

2014-09-11T03:25:26-07:00

After spending most of the summer delving into what Swift can and can’t do as a functional programming language, I’ve turned my attention to writing real library code in Swift. I do have a day job, after all, and while thinking about Magical Future Swift is a fun exercise, I still need to learn how to do the basic everyday things. Which is where I was tripped up by the default framework test settings.

The issue occurred when I created a new Cocoa framework project, in Swift. By default these come with tests, so you’d be forgiven for thinking that the tests just run when you type Cmd-U, as they would for an app. You would be wrong, however. Most of the “Product” menu is disabled and gives that heart-sinking “you can’t do what you think you can do” sound when you try the shortcuts.

What happens is that by default, the test bundle is not added to the test configuration of the framework’s build scheme. Here’s how you reenable it. Say you named your framework “test” (I’m creative like that). Select it and then select “Edit Scheme…”:

Then go to “Test” and hit the + in the lower left-hand corner of the detail view:

Select your test bundle (my clever naming pays off dividends…):

Make any modifications you want — in this case, let’s add a location for the tests to run against, and click “Close”:

Now all your testing shortcuts should be enabled and working (your running shortcuts still won’t be, since a framework can’t be run on its own).

Happy testing!

Addendum: Deriving the Third Monad Law From Nested Comprehensions

2014-08-30T16:35:36-07:00

The third monad law declares that the following identity must hold for a monad M, where a:M<A>, f: A -> M, and g: B -> M<C>:

(a >>=- f) >>=- g   ==   a >>=- { b in f(x) >>=- g }

To motivate this, we first want to show that the snippet below is a form of the left-hand-side of the identity:

// Form 1
let val = for {
    b <- for {
             ignore <- doSomething()
             b1 <- doSomething()
         } yield {
             b1
         }
    c <- process(b) // process(val:Int?) -> Int?
} yield {
    c
}

Then we want to show that Form 2 below is a form of the right-hand-side:

// Form 2
let val = for {
    ignore <- doSomething()
    b <- doSomething()
    c <- process(b) // process(val:Int?) -> Int?
} yield {
    c
}

Intuitively, these two snippets should do the same thing, which is the impetus for enshrining the behavior in a law.

• • • • •

Remembering that the right-hand side of a <- assignment is monadic, we can write Form 1 as:

// Form 1
let optB = doSomething() >>=- { ignore in
    doSomething() >>=- { b in
        lift(b)
    }

let val = optB >>=- { b in
    process(b) >>= { c in {
        lift( c)
    }
}

Clearly, we can rewrite the snippet like this:

// Form 1
let val = doSomething() >>=- { ignore in
    doSomething() >>=- lift
} >>=- { b in
    process(b) >>=- lift
}

Notice, incidentally, that nested for-comprehensions are the equivalent of chained >>=- calls. Using the second monad law (see, it’s useful), we can simplify it as:

// Form 1
let val =  doSomething() >>=- { ignore in
    doSomething()
} >>=- { b in
    process(b)
}

Now let’s simplify by introducing the definition of
doSomethingWhileIgnoring:

func doSomethingWhileIgnoring(ignored:Int) -> Int? {
    return doSomething()
}

This takes us to the final form, with added parentheses to emphasize that >>=- is a left-associative operator.

// Form 1
let val = (doSomething() >>=- doSomethingWhileIgnoring) >>=- process

• • • • •

For the second snippet, we’re going to work backwards. This means I’m going to start with what I already know I’m trying to get to, and I’m going to show you that it’s equivalent to Form 2. This is just as valid as going from Form 2 to the final result, but it’s easier to follow. What I want to show is that this snippet is the same as Form 2:

// Form 2 candidate
let val = doSomething() >>=- { ignore in
    doSomethingWhileIgnoring(ignore) >>=- process
}

So let’s get to substituting:

// Form 2 candidate
let val = doSomething() >>=- { ignore in
    { _ in doSomething() }(ignore) >>=- { b in
        process(b)
    }
}

And now, using the second law, but in reverse, we have:

// Form 2 candidate
let val = doSomething() >>=- { ignore in
    { _ in doSomething() }(ignore) >>=- { b in
        process(b) >>=- { c in
            lift(c)
        }
    }
}

Finally, it’s pretty clear that the second line is an unnecessary wrapper, and that we can write it as:

// Form 2 candidate
let val = 
    doSomething() >>=- { ignore in
        doSomething() >>=- { b in
            process(b) >>=- { c in
                lift(c)
            }
        }
    }

But this is precisely the >>=- version of Form 2.

• • • • •

Again, remember that this appendix is not proving the law, but instead showing how it naturally comes out of expecting intuitive semantics from for-comprehensions. In the context of monads, “law” doesn’t mean something observed to be true, but rather a prescribed rule that a type must follow in order to be called a monad.

The Culmination: Final Part

2014-08-30T16:25:50-07:00

In my [last post][culmination-ii], I finally came out and admitted that all my posts about errors and optionals have been about monads. And to justify this gross act of deception, I posited for-comprehensions, a nifty new syntax in Magical Future Swift that improves how we can work with optionals and arrays—a syntax that only works with monads.

So monads are entities that work with for-comprehensions. But for-comprehensions must work in a sane manner, that is, intuitively. I’ve already gone over two behaviors that are required; they are called the first two monad laws. There is one missing. You’ll be stunned to know it is the third law.

• • • • •

The first law addressed using a for-comprehension with a lifted value. The second law addressed the behavior of comprehensions when nothing was done to the assigned values. One last thing we want to be able to do with comprehensions is nest them.

To see why, suppose we have a function that has side effects with type signature doSomething() -> Int?. We call this function to do things for us, and we potentially get a number back. Here’s something we might want to do with this function:

func doSomethingTwice() -> Int? {
    return for {
        ignore <- doSomething()
        b <- doSomething()
    } yield {
        b
    }
}

In effect, we call the function twice, but we only care about the result the second time. There is no need to do this with a comprehension, of course, but if comprehensions are available, it must work, and the whole point is to discover what monad laws make comprehensions intuitive. Now, let’s use this function:

let a = for {
    b <- doSomethingTwice()
    c <- process(b) // process(val:Int?) -> Int?
} yield {
    c
}

We can substitute the definition of doSomethingTwice:

// Form 1
let val = for {
    b <- for {
             ignore <- doSomething()
             b1 <- doSomething()
         } yield {
             b1
         }
    c <- process(b) // process(val:Int?) -> Int?
} yield {
    c
}

I call it “Form 1” because we know that the return value of the first call gets discarded (though still made), which means we should be able to write that snippet like this:

// Form 2
let val = for {
    ignore <- doSomething()
    b <- doSomething()
    c <- process(b) // process(val:Int?) -> Int?
} yield {
    c
}

Let me be clear. When I said “which means” above, I meant that it made intuitive sense. There is nothing that guarantees that this is the case. It’s our job to guarantee it—and that is the role of the third monad law.

The the third law follows from turning these two different forms into their >>=- versions. Doing so is nothing overly complicated, but it is a bit lengthy, so I’ve put the details in an [addendum]. The end result is that Form 1 can be rewritten as:

let val = (doSomething() >>=- doSomethingWhileIgnoring) >>=- process

Here, doSomethingWhileIgnoring(Int) -> Int? is a wrapper around doSomething that matches the required type for the right-hand-side of >>=-. The parentheses are there to emphasize that >>=- is defined as a left-associative operator.

As for Form 2, it can be rewritten as:

let val = doSomething() >>=- { ignore in
    doSomethingWhileIgnoring(ignore) >>= process
}

So these two forms have to be equivalent for our for-comprehensions to work. What we are dealing with here is a monadic value a:M<A> (the output of doSomething()), and two functions: f:A -> M (doSomethingWhileIgnoring) and g:B -> M<C> (process). Our law then becomes that these two forms must be the same for all monads:

// Form 1                // Form 2
(a >>=- f) >>=- g   ==   a >>=- { b in f(b) >>= g }

That’s the third monadic law, for your sugary syntactic pleasure.

• • • • •

Whew! That was a lot of ground to cover, so let me give you a brief recap. A very common problem when dealing with optionals is dealing with multiple ones. We quickly end up with a bunch of nested if-let statements that look very bad—especially since most of the time, the recovery from any one of them failing is the same.

Other languages with optionals deal with this problem by introducing syntactic sugar; Scala calls it for-comprehensions, while Haskell calls it do-notation. They are similar, though not identical, and I endowed Magical Future Swift with the Scala version.

While investigating for-comprehensions I showed that they work for a specific class of types, which have one type parameter, and which follow three laws. These types are called monads, and they follow the following rules:

There exists a function, by convention called lift: A -> M<A>, which takes a non-monadic value a:A and wraps it in a monadic value M<A>.
There exists a function, by convention called >>=-, whith signature (val:A, f:A -> M) -> M.
First monad law: for any unwrapped value a:A and function f:A -> M, lift(a) >>=- f == f(a).
Second monad law: for any monadic value m:M<A>, m >>=- return == m.
Third monad law: for any monadic value m:M<A>, and functions f:A -> M, g:B -> M<C>, (m >>=- f) >>=- g == m >>=- { x in f(x) >>=- g }.

The three laws allow for-comprehensions to work the way we would intuitively expect them to:

The first law allows us to apply for-comprehensions to unwrapped values by lifting them into the relevant monad.
The second law allows us to have an identity operator in for-comprehensions, for the times when we don’t actually want to modify the unwrapped values.
The third law allows us to nest for-comprehensions.

The wonderful thing about for-comprehensions is that they don’t only apply to optionals—they are a general way of dealing with monads. They can apply to arrays, results, futures—any type that follows the rules above.

These rules might seem much ado about nothing, but if you look at a for-comprehension, it tends to look like a regular imperative program—but with some logic abstracted away. Beth’s birthday problem looked like regular assignments—except we didn’t have to worry about the nil case; everything was taken care of. Sam’s scheduling problem looked like it was doing regular assignments as well—abstracting away the fact that the code was iterating through multiple lists at the same time.

This means that each monad, in a way, defines its own imperative semantics when used in a for-comprehension—but in a way that still provides type safety and referential transparency. For-comprehensions therefore provide the best of both the imperative and the functional world.

• • • • •

Alright, I’m almost done. There’s one thing I still need to justify. I claimed, rather brashly, that monads had to do with the future of Swift. Of course, since I don’t work at Apple I can’t know for certain. But every language that has introduced optionals has had to introduce the associated syntactic sugar—dealing with them is too painful otherwise.

So because Scala, Haskell, and F-Sharp all sooner or later felt the need to introduce for-comprehensions, I feel fairly confident in predicting that they will make their way into Swift. What this would mean for the rank-and-file developer is some new syntax; this doesn’t require a deep understanding of monads—the whole point of for-comprehensions is that they work intuitively. I happen to believe, however, that taking the time to truly understand the semantics of your language can only help.

And if we can be a step ahead of the game by looking toward the future, so much the better.

• • • • •

Thanks for sticking with me through this series. There were seven total posts—beginning with [error][error-handling-i] [handling][error-handling-ii], continuing with [understanding][optionals-i] [optionals][optionals-ii], and ending with the [three][culmination-i] [culmination][culmination-ii] posts. I hope I managed to convey why the monadic pattern is important, and to explain where the three laws come from.

“Important”, though, is not the same as “required knowledge”. The details of RAM fetching are important, but most developers can ignore them, and so it is with monads. At the end of the day, it is never the point of a construct to make life more difficult. As a dev, you’ll only have to worry about monads when trying to implement a type that can be used in a for-comprehension. For the rest, life will go on as usual.

But a little safer, and a little more functional. And isn’t that a good thing?

[error-handling-i]: http://nomothetis.svbtle.com/error-handling-in-swift
[error-handling-ii]: http://nomothetis.svbtle.com/error-handling-in-swift-part-ii
[optionals-i]: http://nomothetis.svbtle.com/understanding-optional-chaining
[optionals-ii]: http://nomothetis.svbtle.com/implicitly-unwrapped-optionals-in-depth
[culmination-i]: http://nomothetis.svbtle.com/the-culmination-i
[culmination-ii]: http://nomothetis.svbtle.com/the-culmination-part-ii
[addendum]: http://nomothetis.svbtle.com/third-monad-law-derivation