Gerby

online tag-based view for large LaTeX documents

Gerby

Introduction

If you have a LaTeX document which

is large (probably several hundreds of pages at least)
is regularly updated
needs to be externally referenced often

you will run into the problem that

large PDFs are not easily navigable
PDFs of any size are not very searchable
the internal numbering changes often, making external references outdated

Gerby addresses these problems by providing an online tag-based view, instead of just having a big PDF. Gerby is tailored towards making large online textbooks and reference works more accessible.

In case you were wondering, a gerbe is a kind of stack (in the mathematical sense), and the software was originally meant for the Stacks project.

Online view

When everything has been assigned a tag, Gerby will convert your LaTeX document into small HTML files, one for each tag. We then import these into the database, which in turn is used by a Flask website. This way you get

a page for every tag
easy navigation around your document
a fulltext search

Additionally

the bibliography is taken care of
there are statistics pages for each tag
it is possible to comment on each tag, so that readers can interact when there are questions, typos or mistakes

Showcase

The main example are

The Stacks project: an open source textbook and reference work on algebraic geometry, by Johan de Jong (Gerby was originally developed for this purpose)
Kerodon: an online resource for homotopy-coherent mathematics, by Jacob Lurie.

If you are interested in using the system for your own project and have questions, please get in touch!

How to use it

The process consists of 3 parts, which we will explain in detail below.

run plastex on your LaTeX file and tags file
run the import script on the output from step 2
start the server

The result of each step is

a large number of small HTML and auxiliary files
an SQLite3 database containing the output of step 1, together with all the metadata
a local web server that serves your Gerby instance

We are in the process of making the code more generic, please get in touch if you are interested in running your own instance.

Hello world

In gerby-project/hello-world we have collected a minimal working example of what it means to use Gerby and plasTeX for your own project.

The main files here are:

document.tex contains a minimal LaTeX document, with labels
tagger.py is an example script to automate the tag assignment
configuration.py is the Gerby configuration file for our example project

and most importantly .travis.yml contains all the steps that are needed to setup the system. This script is run automatically on a virtual server, so all steps are guaranteed to work (provided your configuration approximates that of the virtual server).

Some remarks:

the file document.tex cannot be built by pdflatex: we have to remove the amsthm package
similarly other packages might break plasTeX, so you should keep your packages as vanilla as possible, and work with custom preambles

The file .travis.yml should be self-explanatory, as it lists the shell commands one needs to execute to set up everything. The subtle steps are probably step 4 and 5 of the installation. One way of avoiding this is by using full paths in the configuration file.

If you are more ambitious, you can also set up the Stacks project, following some more elaborate instructions.

Gerby

online tag-based view for large LaTeX documents

Gerby

Introduction

Tags

Online view

Showcase

How to use it

Hello world

Contributors