Intro
I know.. yet another Monad tutorial. :)
Have you ever thought "hmm, these all look kinda same":
- Promise, Future, Rx, IO, Monad
await
,flatMap
,do
,for
then you might be right!
Let's see!
First we'll see how and why the Monad
abstraction came into Haskell (a purely functional programming language).
Then we will see a nicer syntax for writing functions that operate on monads.
And lastly I will show you that await
/for
/do
is "just nicer syntax" for flatMap
.
Haskell
A few relevant things about Haskell:
- it is a lazy language
- it wants to separate pure functions from impure functions (actions)
Laziness
By "lazy" we mean evaluation is not strict.
In Java for example, when you call a function myFun(a, b)
, the order of evaluation is strict and consistent:
- first, the arguments are evaluated: left to right, one by one
- function is evaluated
But in Haskell that's not the case. Nothing is evaluated until needed.
So if the parameter a
is not used in the body of myFun
it will not be evaluated at all.
This is fine+desirable+performant when your functions are pure (not doing anything),
but it is a big issue when they do side effects: write to a file/db etc.
For example, if you want these actions to be executed:
f1
- write to a filef2
- read from that file
you need to make sure that f1
always gets evaluated before f2
!!
In Haskell you are never sure because eval order is unspecified..
The next Haskell will be strict.
~ Simon Peyton Jones
Pure functions
Pure functions are like mathematical functions, they do calculations, and only return new values (no mutable variables).
They are only considering "insides" of a program, its own memory.
Impure functions
Impure functions go beyond our program, they "go outside", play and get dirty.
They read/write to a file/console/database etc.
Haskell wants you not to get dirty, and play as much as you can inside (stay safe).
But how does it know which functions are "impure"?
Well, by marking them with IO
wrapper type (which is "a monad").
Main function
"Normal" programming languages have a main
function, which usually looks something like this:
but in Haskell you have this:
Haskell marks the main
as an IO action, so by definition it is impure.
History and pre-history
Before monads were introduced, main function's type looked like this:
main :: [Response] -> [Request]
.
Example taken from StackOverflow:
In a nutshell, you had to write all of the impure stuff that your whole program will do as a return value.
That is represented as a list of requests: [Request]
.
Return values from those IO actions are delivered in the [Response]
list, that you use inside the program logic.
The number of responses is the same as the number of requests you gave.
So you have to keep in mind the indices, which is a bummer.
What if you add one request in the middle? You'd have to change all indices after it...
Which request belongs to which response? That's really hard to see.
We can already see that this way of writing a program is very cumbersome, unreadable, and limited.
Notice also that the approach above works only because Haskell is lazy!
Monads
Back to IO t
. The IO t
is an action that will do some side effects before returning a value of type t
.
This can be anything: writing to disk, sending HTTP requests etc.
Examples of impure functions in Haskell:
You are probably familiar with some functions that operate on the value inside the IO
!
For example, we have fmap :: Functor f => (a -> b) -> f a -> f b
which transforms the value inside any Functor f
(monad is a functor).
But what about chaining, sequencing actions one after another?
How can we ensure that getChar
executes strictly before putChar
?
Monads to the rescue! Its core function is called flatMap
(or bind
, or >>=
..):
The flatMap
function:
- takes an
IO a
action - takes a function that takes
a
(thea
fromIO a
) - returns a new
IO b
So there we have it, Monad in all its glory! :)
It's "just" a semicolon (;
), duhh, sequencing operations.
Let's see our solution now:
In Scala you'd write val echo = getChar.flatMap(putChar)
.
This is the reason why Monads were introduced in Haskell.
In short, Haskell is the world's finest imperative programming language.
~ Simon Peyton Jones
Syntax sugar for Monads
Haskell and some other languages have built-in syntax for Monads.
Haskell has "do notation" and Scala has "for comprehensions".
It makes them more readable by flipping sides:
Scala:
The <-
symbol gets translated into >>=
by Haskell's compiler.
In case of Scala, it gets turned into a flatMap
.
It turns out that Monads are useful not only in the IO
context, but for other types too.
Whenever you have unwanted nesting like Wrapper[Wrapper[T]]
, you need to "flatMap that shit", hence you need a Monad.
If you have List[List[String]]
you probably needed a flatMap
instead of map
.
If you have Option[Option[String]]
, same thing.
You can imagine doing the same example above with a List[T]
, where c
would be one element of the list.
Async / Await
After some time it came to my mind that we are doing a similar thing in JS/C#/Kotlin with await
.
It is almost the same thing, we are "pulling a value from a Promise/Task" (~Monads):
Before this we used to write "normal callback functions":
Seems like then
corresponds to flatMap
, and await
corresponds to <-
syntax sugar.
Some noticable differences:
- do/for is general, while await is specific just for Promise
- do/for in statically typed languages is checked for proper types, while in JS you're on your own
My opinions
To me, it feels very awkward to program in a lazy programming language.
It is hard to reason about and you have to use monads for doing even the simplest IO operations.
If you have a pure function and want to println
a value.. nope, you gotta wrap everything in an IO
.. wack.
When used inside a normally strict language like Scala or Java, it "spreads like a virus".
If you have an IO[String]
, all code that calls it must be an IO[..]!! It's viral! No bueno!
You have to ask yourself: is all that complexity really worth it? IMHO it is not, 99% of the time.
Especially today when we have JVM virtual threads.
So, in my opinion, use Monads/Rx/whatever only when you have to!
The simpler the program - the better.
For example, in Java you can use threads and concurrent datastructures. Web servers like Tomcat, Jetty etc. are working just fine with a thread-per-request model.
But in JS you don't have that liberty, you need to use Promises.
That's because JS doesn't have "normal threads", is has only one thread with an event-loop so you have to program asynchronous code.
I hope this gave you a better insight into scary Monads and the FP way of handling IO.
Additional resources
- Tackling the Awkward Squad by Simon Peyton Jones
- Essential Effects by Adam Rosien
- State of Loom by Ron Pressler (comparing RX vs threads)
- Future vs IO by Diogo Castro
- The Observable disguised as an IO Monad by Luis Atencio
- Benefits of IO discussion on Reddit