If you’ve used any flavour of Unix for development, you’ve probably installed software from source with this magic incantation:
./configure
make
make install
I know I’ve typed it a lot, but in my early days using Linux, I didn’t really understand what it meant. I just knew that if I wanted to install software this was the spell to recite.
Recently, I’ve been building my own Unix tools, and I wanted to tap into this standard install process; not only is it familiar to many Unix users, it’s also a great starting point for building a package for Homebrew and the various Linux and BSD package managers.
It was time to dig into the Unix Grimoire and find out what the incantation does.
What does ./configure, make, make install
do
There are three distinct steps in this process:
configure:
configure the softwareThe
configure
script is responsible for getting ready to build the software on your specific system. It makes sure all of the dependencies for the rest of the build and install process are available, and finds out whatever it needs to know to use those dependencies.Unix programs are often written in C, so we’ll usually need a C compiler to build them. In these cases, the
configure
script will establish that your system does indeed have a C compiler, find out what it’s called and where to find it.make
: Build the softwareOnce
configure
has done its job successfully, we can invokemake
to build the software. This runs a series of tasks defined in aMakefile
to build the finished program from its source code.The tarball you download usually doesn’t include a finished
Makefile
. Instead, it comes with a template calledMakefile.in
and theconfigure
script produces a customisedMakefile
specific to your system.make install
: Install the softwareNow that the software is built and ready to run, the files can be copied to their final destinations. The
make install
command will copy the built program, and its libraries and documentation, to the correct locations.This usually means that the program’s binary will be copied to a directory on your
PATH
, the program’s manual page will be copied to a directory on yourMANPATH
, and any other files it depends on will be safely stored in the appropriate place.Since the install step is also defined in the
Makefile
, where the software is installed can change based on options passed to theconfigure
script, or things theconfigure
script discovered about your system.Depending on where the software is being installed, you might need escalated permissions for this step so you can copy files to system directories. Using
sudo
will often do the trick.
Where do these UNIX scripts come from
All of this works because a configure
script examines your system, and uses
the information it finds to convert a Makefile.in
template into a Makefile
,
but where do the configure
script and the Makefile.in
template come from?
If you’ve ever opened up a configure
script, or associated Makefile.in
, you
will have seen that they are thousands of lines of dense shell script. Sometimes
these supporting scripts are longer than the source code of the program they
install.
Even starting from an existing configure
script, it would be very daunting to
manually construct one. Don’t worry, though: these scripts aren’t built by hand.
Programs that are built in this way have usually been packaged using a suite of
programs collectively referred to as autotools. This
suite includes autoconf
, automake
, and many other programs, all of which
work together to make the life of a software maintainer significantly easier.
The end user doesn’t see these tools, but they take the pain out of setting up an install process that will run consistently on many different flavours of Unix.
Packaging a ‘Hello world’ program with Unix Autotools
Let’s take a simple “Hello world” C program, and see what it would take to package it with autotools.
Here’s the source of the program, in a file called main.c
:
// src/configure.ac
#include <stdio.h>
int
main(int argc, char* argv[])
{
printf("Hello world\n");
return 0;
}
Creating the configure script
Instead of writing the configure script by hand, we need to create a
configure.ac
file written in m4sh—a combination of m4 macros and POSIX
shell script—to describe what the configure script needs to do.
The first m4 macro we need to call is AC_INIT
, which will
initialise autoconf and set up some basic information about the program we’re
packaging. The program is called helloworld
, the version is 0.1
, and the
maintainer is george@thoughtbot.com
:
AC_INIT([helloworld], [0.1], [george@thoughtbot.com])
We’re going to use automake
for this project, so we need to initialise that
with the AM_INIT_AUTOMAKE
macro:
AM_INIT_AUTOMAKE
Next, we need to tell autoconf about the dependencies our configure script needs
to look for. In this case, the configure script only needs to look for a C
compiler. We can set this up using the AC_PROG_CC
macro:
AC_PROG_CC
If there were other dependencies, then we’d use other m4 macros here to discover
them; for example the AC_PATH_PROG
macro looks for a given
program on the user’s PATH
.
Now that we’ve listed our dependencies, we can use them. We saw earlier that a
typical configure
script will use the information it has about the user’s
system to build a Makefile
from a Makefile.in
template.
The next line uses the AC_CONFIG_FILES
macro to tell
autoconf that the configure script should do just that: it should find a file
called Makefile.in
, substitute placeholders like @PACKAGE_VERSION@
with
values like 0.1
, and write the results to Makefile
.
AC_CONFIG_FILES([Makefile])
Finally, having told autoconf everything our configure script needs to do, we
can call the AC_OUTPUT
macro to output the script:
AC_OUTPUT
Here’s the whole thing. Not bad, compared to the 4,737 lines the configure
script
it’s going to produce!
# src/configure.ac
AC_INIT([helloworld], [0.1], [george@thoughtbot.com])
AM_INIT_AUTOMAKE
AC_PROG_CC
AC_CONFIG_FILES([Makefile])
AC_OUTPUT
We’re almost ready to package up and distribute our program, but we’re still
missing something. Our configure
script will expect a Makefile.in
file that
it can substitute all of those system-specific variables into. Next, we will create it.
Creating the Makefile
As with the configure
script, the Makefile.in
template is very long and
complex. So instead of writing it by hand, we write a shorter Makefile.am
file, which automake
will use to generated the Makefile.in
for us.
First, we need to set some options to tell automake about the layout of the
project. Since we’re not following the standard layout of a GNU project, we warn
automake that this is a foreign
project:
AUTOMAKE_OPTIONS = foreign
Next, we tell automake that we want the Makefile to build a program called
helloworld
:
bin_PROGRAMS = helloworld
There’s a lot of information packed into this line, thanks to automake’s uniform naming scheme.
The PROGRAMS
suffix is called a primary. It tells automake what
properties the helloworld
file has. For example, PROGRAMS
need to be built,
whereas SCRIPTS
and DATA
files don’t need to be built.
The bin
prefix tells automake that the file listed here should be installed to
the directory defined by the variable bindir
. There are various directories
defined for us by autotools—including bindir
, libdir
, and
pkglibdir
—but we can also define our own.
For example, if we wanted to install some Ruby scripts as part of our program,
we could define a rubydir
variable and tell automake to install our
Ruby files there:
rubydir = $(datadir)/ruby
ruby_DATA = my_script.rb my_other_script.rb
Additional prefixes can be added before the install directory to further nuance automake’s behaviour.
Since we’ve defined a PROGRAM
, we need to tell automake where to find its
source files. In this case, the prefix is the name of the program these source
files build, rather than the place where they will be installed:
helloworld_SOURCES = main.c
Here’s the whole Makefile.am
file for our helloworld
program. As with the
configure.ac
and the configure
script, it’s a lot shorter than the
Makefile.in
that it generates:
# src/Makefile.am
AUTOMAKE_OPTIONS = foreign
bin_PROGRAMS = helloworld
helloworld_SOURCES = main.c
Putting it all together
Now we’ve written our config files, we can run autotools and generate the
finished configure
script and Makefile.in
template.
First, we need to generate an m4 environment for autotools to use. In your terminal, run:
aclocal
Now we can run autoconf
to turn our configure.ac
into a configure
script,
and automake
to turn our Makefile.am
into a Makefile.in
:
autoconf
automake --add-missing
The --add-missing
option adds any missing files required to build the package, whenever possible.
Distributing the ‘Hello World’ UNIX package
The end user doesn’t need to see our autotools setup, so we can distribute the
configure
script and Makefile.in
without all of the files we used to
generate them.
Fortunately, autotools will help us with distribution too. The Makefile contains all kinds of interesting targets, including one to build a tarball of the project containing all of the files we need to distribute:
./configure
make dist
You can even test that the distribution tarball can be installed under a variety of conditions:
make distcheck
Unix: an overview of configure, make, make install
Now we know where this incantation comes from and how it works!
We’ve successfully packaged a program. From the maintainer’s side, this is what it takes:
aclocal # Set up an m4 environment
autoconf # Generate configure from configure.ac
automake --add-missing # Generate Makefile.in from Makefile.am
./configure # Generate Makefile from Makefile.in
make distcheck # Use Makefile to build and test a tarball to distribute
The end user will have the ‘Hello world’ program ready to be installed with the magical incantation:
./configure # Generate Makefile from Makefile.in
make # Use Makefile to build the program
make install # Use Makefile to install the program