Home >Backend Development >Python Tutorial >Deploying Docs-as-Code on AWS: Building Dynamic Documentation Sites in MkDocs and Docusaurus
In this article, I will guide you step by step to create a dynamic documentation site, adaptable to any project, where you can connect your documentation to a database to extract and display data, ensuring the information is always up to date. We will also explore how to automate the entire process, from content generation to deployment in the cloud with AWS.
The solution includes support for charts and diagrams, continuous integration (CI/CD) using a simple workflow in GitHub Actions, and automatic deployment using Terraform. Let’s get started!
Documentation and its updates are an important process in many companies that develop software, often carried out using different tools, many of which are paid solutions.
Therefore, in recent times, the concept of "doc as code" has emerged. This means using the same tools and workflows used in software development to manage, version, and deploy documentation.
This approach not only allows for better tracking of the documentation but also facilitates its maintenance and ensures alignment with the same best practices used in software development, not just in the code but also in the documentation.
For the development of these sites, it is essential to understand some practices and tools that allow us to implement this approach. Below is a detailed list of the most important aspects to cover in this tutorial.
MkDocs is a static site generator written in ?Python, designed specifically for documenting projects. Its goal is to simplify creating documentation using Markdown files, which are easy to write and read.
With minimal configuration, MkDocs converts Markdown files into a navigable and well-structured documentation website, making it ideal for developers and teams who want to keep their documentation up to date.
MkDocs Material is an advanced theme for MkDocs that follows Google’s Material Design guidelines.
Mermaid is a JavaScript library for creating diagrams and charts from text. By integrating with MkDocs Material, Mermaid allows you to generate visualizations such as flowcharts, entity-relationship diagrams, and other charts within the documentation without external tools.
Jinja is a library that allows embedding variables and data from Python dictionaries into HTML, making web pages dynamic. This library is commonly used for generating dynamic HTML and sending personalized emails.
Docusaurus is an open-source project developed by Meta in 2007 that simplifies the creation, deployment, and maintenance of documentation websites in a fast and efficient way. It allows the use of Markdown and MDX to write content, while its core built on React enables full customization of the styles to fit the specific needs of the project.
Additionally, Docusaurus supports Mermaid through the @docusaurus/theme-mermaid plugin, enabling the inclusion of charts and diagrams directly within the documentation.
Diagram as Code is an approach that allows you to create diagrams through code, rather than using traditional graphic tools. Instead of manually building diagrams, you write code in a text file to define the structure, components, and connections of your diagrams.
This code is then translated into graphical images, making it easier to integrate and document in software projects. It's especially useful for creating and updating architectural and flow diagrams programmatically.
As previously mentioned, Diagrams allows you to generate blueprints using the icons of major cloud technologies. The representation of these diagrams is done through nodes, and in our example, we’ll use all cloud-related nodes and AWS services.
For more details on how I created this, you can read my article about Diagram as Code, and the full implementation can be found in this repository:
Diagram as Code is an approach that allows you to create diagrams through code instead of traditional graphic tools. Instead of manually building diagrams, you can write code in a text file to define the structure, components, and connections of your diagrams.
This code is then translated into graphical images, making it easier to integrate and document in software projects, where it is especially useful for creating and updating architectural and flow diagrams programmatically.
Diagrams is a ?Python library that implements the Diagram as Code approach, enabling you to create architectural infrastructure diagrams and other types of diagrams through code. With Diagrams, you can easily define cloud infrastructure components (such as AWS, Azure, and GCP), network elements, software services, and more, all with just a few lines of code.
In this use case, I will create a documentation site for a machine learning project involving ? hospital data. The goal is to build an interactive documentation site using MkDocs initially and later migrate it to Docusaurus. The site will include both static and dynamic components to meet specific requirements, such as embedding visual diagrams and updating data dynamically from a SQLite database.
For this reason our documentation site will have the following pages:
In this section, we’ll walk through the steps to set up a documentation project using MkDocs from scratch and explain its organized directory structure.
To get started, you'll need to install the following ?Python libraries:
Install MkDocs and the Material
pip install mkdocs mkdocs-material
Install additional libraries to enable dynamic content updating
pip install aiosql pandas sqlite3 jinja2 shutil
Initialize the Project
Start by creating a new MkDocs project. Run the following commands in your terminal:
mkdocs new mkdocs cd mkdocs
This command creates a basic MkDocs project with a default structure.
Explore the Directory Structure
Once the MkDocs site is created, you need to add the following files and folders, as they are not included by default.
Remember, the links to the repository are provided at the end of this post for your reference, and each component will be explained in detail below.
pip install mkdocs mkdocs-material
Component | Directory | Description |
---|---|---|
Database (db) | db | Contains the SQLite database (hospital.db) and queries (metadata.sql, person.sql) to manage dynamic data. Learn more about managing SQL queries in Python in my previous article: Python Projects with SQL. |
?️ Templates & Pages | template | Markdown templates: index.md, tables.md, architecture.md, glossary.md. Supports Mermaid diagrams, embedded images, and database-driven content. |
?️ Static Content (docs) | docs | Final site generated by update.py, including images (img/) and dynamic content populated from template. |
? Infrastructure (infraestructure) | infraestructure | Terraform scripts (main.tf, variables.tf) to deploy an S3 bucket for documentation hosting. |
Once we have our project structure set up, we will configure it step by step, starting with the mkdocs.yml file. This file defines the structure and settings for your documentation site. Here's how it should be structured:
mkdocs.yml
pip install mkdocs mkdocs-material
In this configuration file, you can primarily see in the nav section the pages that will be accessible from the menu. Then, we specify the Mermaid extension, which will be explained in the next section. Finally, the theme section applies the Material theme, enabling styling and components available within this library.
As mentioned earlier, Mermaid is a JavaScript library for creating diagrams and charts from text. Below, we will see some examples. In our case, we will use it to generate an Entity Relationship Diagram (ERD) on the tables page of the documentation.
In the repository, you will be able to see how to construct this code based on the Entity Relationship Diagram (ERD) found in the official Synthea documentation. You can also check the example of the tables page in the following link: tables.md.
To enable dynamic content generation for our documentation site, we’ll use Jinja to process templates and replace placeholders with actual data. Below is a step-by-step breakdown:
Set Up a templates Folder
Create a folder named templates to store all Markdown files for the site. These files should include placeholders. For instance, in index.md, you might have placeholders like {{database.version_date}} and {{database.version}}.
Utilize Placeholders
Placeholders are dynamic variables in the Markdown files. These variables will be updated automatically using Python dictionaries to inject relevant data.
Generate Dynamic Content with update.py
pip install mkdocs mkdocs-material
pip install aiosql pandas sqlite3 jinja2 shutil
By following these steps, you can automate the updating process for your documentation site, ensuring the content remains dynamic and relevant without manual edits.
In the next example, we will update the content in the tables.md file to show an example of the persons table from the database. To do this, we will create a placeholder {{table.person}} within the Markdown file. The idea is to dynamically fetch the data from the persons table, and then use the Jinja library along with pandas to convert the query results into a Markdown table format.
Here’s an example of how the tables.md file looks with the placeholder:
mkdocs new mkdocs cd mkdocs
The process is as follows:
? docs/ ├── ? img/ ├── `architecture.md` ├── `glossary.md` ├── `index.md` ├── `tables.md` ├── ? template/ │ ├── ? db/ │ │ ├── ? data/ │ │ │ ├── hospital.db │ │ ├── ? queries/ │ ├── `architecture.md` │ ├── `glossary.md` │ ├── `index.md` │ ├── `tables.md` │ └── `update.py` ? infraestructure/ ? github/ ├── ? workflows/ │ ├── main.yml ? mkdocs.yml
This way, the documentation always reflects up-to-date data, displaying dynamic examples based on the actual content from the database.
In the following sections, I will provide detailed steps and insights on how to implement a documentation site using Docusaurus. This includes setup, customization, and deployment options.
To get started with Docusaurus, we follow a quick setup process, which is very similar to the steps we used for MkDocs but with different tools.
pip install mkdocs mkdocs-material
pip install aiosql pandas sqlite3 jinja2 shutil
mkdocs new mkdocs cd mkdocs
The configuration file docusaurus.config.js is where we customize the title, theme, navigation, and enable features like Mermaid for diagram rendering.
Example snippet for enabling Mermaid:
pip install mkdocs mkdocs-material
To customize the homepage, we modify the src/components/HomepageFeatures/index.js file. Here, you can adjust the FeatureList object to update the features displayed on the homepage.
Just like in MkDocs, Docusaurus supports Markdown files for content, and we organize the structure as follows:
pip install aiosql pandas sqlite3 jinja2 shutil
__category__.json Example:
mkdocs new mkdocs cd mkdocs
To incorporate dynamic content, such as database tables, we use a ?Python script named update.py, which you can find in the repository.
This script fetches data from a SQLite database and processes the Markdown files stored in the templates folder. It then updates these files with the fetched data and copies them into the docs folder, preparing them for site rendering.
This workflow ensures that the content remains up-to-date and ready for deployment, following a similar approach to what we implemented with MkDocs.
In this section, we will cover the deployment process for both MkDocs and Docusaurus using AWS S3 for hosting. While the deployment steps are the same for both tools, the installation processes differ, with MkDocs being Python-based and Docusaurus being JavaScript-based.
To deploy a static documentation site to AWS S3, we use Terraform to provision and configure the required resources. The setup defines the S3 bucket, enables static website hosting, and configures public access with a bucket policy to allow read-only access. You can find the main.tf file in the repository.
You can access the complete Terraform file and the corresponding configurations for deploying the site in the repository:
Terraform Configuration File:
GitHub Action Workflow for Automatic Deployment: A CI/CD pipeline to automate the deployment process is also included in the repository.
GitHub Actions Configuration
Make sure to configure your AWS credentials in the GitHub repository secrets under Settings > Secrets > Actions. This will allow GitHub Actions to securely access your AWS account and perform actions like uploading files to S3 when you push changes to the main branch.
Below are the links to all the code to deploy your documentation site. If you find it useful, you can leave a star ⭐️ and follow me to receive notifications of new articles. This will help me grow in the tech community and create more content.
MkDocs is an excellent solution for implementing a documentation portal that can be easily updated with code, helping to keep your software development project documentation up-to-date and versioned.
In this repository, I have created a simple site to document the data model and machine learning project.
The documentation will include charts, tables, and architecture examples, providing a comprehensive and easy-to-understand guide on how to implement this framework in combination with two other ?Python libraries.
Documentation and its updates are an important process in many companies that develop software, where this process is carried out using different tools, many of which are paid solutions.
Therefore, in recent times, the concept of "doc as code" has emerged. This means using the same tools and workflow used in software development to manage, version, and…
Docusaurus is an excellent solution for implementing a documentation portal that can be easily updated with code, helping to keep your software development project documentation up-to-date and versioned.
In this repository, I have created a simple site to document the data model and machine learning project.
The documentation will include charts, tables, and architecture examples, providing a comprehensive and easy-to-understand guide on how to implement this framework in combination with two other ?Python libraries.
Documentation and its updates are an important process in many companies that develop software, where this process is carried out using different tools, many of which are paid solutions.
Therefore, in recent times, the concept of "doc as code" has emerged. This means using the same tools and workflow used in software development to manage, version, and deploy documentation…
Both solutions are easy to implement, but in the following items, we can explore some differences, and what is the best solution depends on the context, knowledge, and complexity you may need to implement.
The above is the detailed content of Deploying Docs-as-Code on AWS: Building Dynamic Documentation Sites in MkDocs and Docusaurus. For more information, please follow other related articles on the PHP Chinese website!