[...] the question of whether Machines Can Think [...] is about as relevant as the question of whether Submarines Can Swim.
In “project” lessons, we’ll stop pummeling you with new theory for a brief moment, and instead we’ll work through a program together. Theory is necessary to learn to program, but reading and understanding actual programs is just as important.
Our project in this lesson is to build an automaton, a little program that performs a task in a virtual world. Our automaton will be a mail-delivery robot picking up and dropping off parcels.
The village of Meadowfield isn’t very big. It consists of 11 places with 14 roads between them. It can be described with this array of roads:
The network of roads in the village forms a graph. A graph is a collection of points (places in the village) with lines between them (roads). This graph will be the world that our robot moves through.
The array of strings isn’t very easy to work with. What we’re interested in is the destinations that we can reach from a given place. Let’s convert the list of roads to a data structure that, for each place, tells us what can be reached from there.
Given an array of edges, buildGraph
creates a map object that, for each node, stores an array of connected nodes.
It uses the split
method to go from the road strings, which have the form "Start-End"
, to two-element arrays containing the start and end as separate strings.
Our robot will be moving around the village. There are parcels in various places, each addressed to some other place. The robot picks up parcels when it comes to them and delivers them when it arrives at their destinations.
The automaton must decide, at each point, where to go next. It has finished its task when all parcels have been delivered.
To be able to simulate this process, we must define a virtual world that can describe it. This model tells us where the robot is and where the parcels are. When the robot has decided to move somewhere, we need to update the model to reflect the new situation.
If you’re thinking in terms of object-oriented programming, your first impulse might be to start defining objects for the various elements in the world: a class for the robot, one for a parcel, maybe one for places. These could then hold properties that describe their current state, such as the pile of parcels at a location, which we could change when updating the world.
This is wrong.
At least, it usually is. The fact that something sounds like an object does not automatically mean that it should be an object in your program. Reflexively writing classes for every concept in your application tends to leave you with a collection of interconnected objects that each have their own internal, changing state. Such programs are often hard to understand and thus easy to break.
Instead, let’s condense the village’s state down to the minimal set of values that define it. There’s the robot’s current location and the collection of undelivered parcels, each of which has a current location and a destination address. That’s it.
And while we’re at it, let’s make it so that we don’t change this state when the robot moves but rather compute a new state for the situation after the move.
The move
method is where the action happens. It first checks whether there is a
road going from the current place to the destination, and if not, it
returns the old state since this is not a valid move.
Then
it creates a new state with the destination as the robot’s new place.
But it also needs to create a new set of parcels—parcels that the robot
is carrying (that are at the robot’s current place) need to be moved
along to the new place. And parcels that are addressed to the new place
need to be delivered—that is, they need to be removed from the set of
undelivered parcels. The call to map
takes care of the moving, and the call to filter
does the delivering.
Parcel objects aren’t changed when they are moved but re-created. The move
method gives us a new village state but leaves the old one entirely intact.
The move causes the parcel to be delivered, and this is reflected in the next state. But the initial state still describes the situation where the robot is at the post office and the parcel is undelivered.
Data structures that don’t change are called immutable or persistent. They behave a lot like strings and numbers in that they are who they are and stay that way, rather than containing different things at different times.
In JavaScript, just about everything can be changed, so working with values that are supposed to be persistent requires some restraint. There is a function called Object.freeze
that changes an object so that writing to its properties is ignored.
You could use that to make sure your objects aren’t changed, if you want
to be careful. Freezing does require the computer to do some extra
work, and having updates ignored is just about as likely to confuse
someone as having them do the wrong thing. So we usually prefer to just
tell people that a given object shouldn’t be messed with and hope they
remember it.
Why are we going out of our way to not change objects when the language is obviously expecting me to?
Because it helps me understand my programs. This is about complexity management again. When the objects in my system are fixed, stable things, we can consider operations on them in isolation—moving to Alice’s house from a given start state always produces the same new state. When objects change over time, that adds a whole new dimension of complexity to this kind of reasoning.
For a small system like the one we are building in this lesson, we could handle that bit of extra complexity. But the most important limit on what kind of systems we can build is how much we can understand. Anything that makes your code easier to understand makes it possible to build a more ambitious system.
Unfortunately, although understanding a system built on persistent data structures is easier, designing one, especially when your programming language isn’t helping, can be a little harder. We’ll look for opportunities to use persistent data structures in this course, but we’ll also be using changeable ones.
A
delivery robot looks at the world and decides in which direction it
wants to move. As such, we could say that a robot is a function that
takes a VillageState
object and returns the name of a nearby place.
Because we want robots to be able to remember things, so that they can make and execute plans, we also pass them their memory and allow them to return a new memory. Thus, the thing a robot returns is an object containing both the direction it wants to move in and a memory value that will be given back to it the next time it is called.
Consider what a robot has to do to “solve” a given state. It must pick up all parcels by visiting every location that has a parcel and deliver them by visiting every location that a parcel is addressed to, but only after picking up the parcel.
What is the dumbest strategy that could possibly work? The robot could just walk in a random direction every turn. That means, with great likelihood, it will eventually run into all parcels and then also at some point reach the place where they should be delivered.
Here’s what that could look like:
Remember that Math.random()
returns a number between zero and one—but always below one. Multiplying
such a number by the length of an array and then applying Math.floor
to it gives us a random index for the array.
Since
this robot does not need to remember anything, it ignores its second
argument (remember that JavaScript functions can be called with extra
arguments without ill effects) and omits the memory
property in its returned object.
To put this sophisticated robot to work, we’ll first need a way to create a new state with some parcels. A static method (written here by directly adding a property to the constructor) is a good place to put that functionality.
We don’t want any parcels that are sent from the same place that they are addressed to. For this reason, the do
loop keeps picking new places when it gets one that’s equal to the address.
Let’s start up a virtual world.
It takes the robot a lot of turns to deliver the parcels because it isn’t planning ahead very well. We’ll address that soon.
For a more pleasant perspective on the simulation, you can use the runRobotAnimation
function that’s available in this lesson. This runs the simulation, but instead of outputting text, it shows you the robot moving around the village map.
runRobotAnimation(VillageState.random(), randomRobot);
The way runRobotAnimation
is implemented will remain a mystery for now, but after you’ve read the later lessons of this course, which discuss JavaScript integration in web browsers, you’ll be able to guess how it works.
We should be able to do a lot better than the random robot. An easy improvement would be to take a hint from the way real-world mail delivery works. If we find a route that passes all places in the village, the robot could run that route twice, at which point it is guaranteed to be done. Here is one such route (starting from the post office):
To implement the route-following robot, we’ll need to make use of robot memory. The robot keeps the rest of its route in its memory and drops the first element every turn.
This robot is a lot faster already. It’ll take a maximum of 26 turns (twice the 13-step route) but usually less.
Still, we wouldn’t really call blindly following a fixed route intelligent behavior. The robot could work more efficiently if it adjusted its behavior to the actual work that needs to be done.
To do that, it has to be able to deliberately move toward a given parcel or toward the location where a parcel has to be delivered. Doing that, even when the goal is more than one move away, will require some kind of route-finding function.
The problem of finding a route through a graph is a typical search problem. We can tell whether a given solution (a route) is a valid solution, but we can’t directly compute the solution the way we could for 2 + 2. Instead, we have to keep creating potential solutions until we find one that works.
The number of possible routes through a graph is infinite. But when searching for a route from A to B, we are interested only in the ones that start at A. We also don’t care about routes that visit the same place twice—those are definitely not the most efficient route anywhere. So that cuts down on the number of routes that the route finder has to consider.
In fact, we are mostly interested in the shortest route. So we want to make sure we look at short routes before we look at longer ones. A good approach would be to “grow” routes from the starting point, exploring every reachable place that hasn’t been visited yet, until a route reaches the goal. That way, we’ll only explore routes that are potentially interesting, and we’ll find the shortest route (or one of the shortest routes, if there are more than one) to the goal.
Here is a function that does this:
The exploring has to be done in the right order—the places that were reached first have to be explored first. We can’t immediately explore a place as soon as we reach it because that would mean places reached from there would also be explored immediately, and so on, even though there may be other, shorter paths that haven’t yet been explored.
Therefore, the function keeps a work list. This is an array of places that should be explored next, along with the route that got us there. It starts with just the start position and an empty route.
The search then operates by taking the next item in the list and exploring that, which means all roads going from that place are looked at. If one of them is the goal, a finished route can be returned. Otherwise, if we haven’t looked at this place before, a new item is added to the list. If we have looked at it before, since we are looking at short routes first, we’ve found either a longer route to that place or one precisely as long as the existing one, and we don’t need to explore it.
You can visually imagine this as a web of known routes crawling out from the start location, growing evenly on all sides (but never tangling back into itself). As soon as the first thread reaches the goal location, that thread is traced back to the start, giving us our route.
Our code doesn’t handle the situation where there are no more work items on the work list because we know that our graph is connected, meaning that every location can be reached from all other locations. We’ll always be able to find a route between two points, and the search can’t fail.
This robot uses its memory value as a list of directions to move in, just like the route-following robot. Whenever that list is empty, it has to figure out what to do next. It takes the first undelivered parcel in the set and, if that parcel hasn’t been picked up yet, plots a route toward it. If the parcel has been picked up, it still needs to be delivered, so the robot creates a route toward the delivery address instead.
Let’s see how it does.
This robot usually finishes the task of delivering 5 parcels in about 16 turns. That’s slightly better than routeRobot
but still definitely not optimal.
It’s hard to objectively compare robots by just letting them solve a few scenarios. Maybe one robot just happened to get easier tasks or the kind of tasks that it is good at, whereas the other didn’t.
Write a function compareRobots
that takes two robots (and their starting memory). It should generate
100 tasks and let each of the robots solve each of these tasks. When
done, it should output the average number of steps each robot took per
task.
For the sake of fairness, make sure you give each task to both robots, rather than generating different tasks per robot.
You’ll have to write a variant of the runRobot
function that, instead of logging the events to the console, returns the number of steps the robot took to complete the task.
Your
measurement function can then, in a loop, generate new states and count
the steps each of the robots takes. When it has generated enough
measurements, it can use console.log
to output the average for each robot, which is the total number of steps taken divided by the number of measurements.
Can you write a robot that finishes the delivery task faster than goalOrientedRobot
? If you observe that robot’s behavior, what obviously stupid things does it do? How could those be improved?
If you solved the previous exercise, you might want to use your compareRobots
function to verify whether you improved the robot.
The main limitation of goalOrientedRobot
is that it
considers only one parcel at a time. It will often walk back and forth
across the village because the parcel it happens to be looking at
happens to be at the other side of the map, even if there are others
much closer.
One possible solution would be to compute routes for all packages and then take the shortest one. Even better results can be obtained, if there are multiple shortest routes, by preferring the ones that go to pick up a package instead of delivering a package.
Most data structures provided in a standard JavaScript environment aren’t very well suited for persistent use. Arrays have slice
and concat
methods, which allow us to easily create new arrays without damaging the old one. But Set
, for example, has no methods for creating a new set with an item added or removed.
Write a new class PGroup
, similar to the Group
class from the previous lesson, which stores a set of values. Like Group
, it has add
, delete
, and has
methods.
Its add
method, however, should return a new PGroup
instance with the given member added and leave the old one unchanged. Similarly, delete
creates a new instance without a given member.
The class should work for values of any type, not just strings. It does not have to be efficient when used with large amounts of values.
The
constructor shouldn’t be part of the class’s interface (though you’ll
definitely want to use it internally). Instead, there is an empty
instance, PGroup.empty
, that can be used as a starting value.
Why do you need only one PGroup.empty
value, rather than having a function that creates a new, empty map every time?
The most convenient way to represent the set of member values is still as an array since arrays are easy to copy.
When
a value is added to the group, you can create a new group with a copy
of the original array that has the value added (for example, using concat
). When a value is deleted, you filter it from the array.
The class’s constructor can take such an array as argument and store it as the instance’s (only) property. This array is never updated.
To add a property (empty
)
to a constructor that is not a method, you have to add it to the
constructor after the class definition, as a regular property.
You need only one empty
instance because all empty groups are the same and instances of the
class don’t change. You can create many different groups from that
single empty group without affecting it.
ExplorableJS is a course by the Learning Technologies Research Group of RWTH Aachen University. It is based on "Eloquent JavaScript" (3rd Edition, 2018) by Marijn Haverbeke. Content is reused according to CC-BY-NC 3.0. ExplorableJS is licensed as CC-BY-NC 4.0.