Building a Model Context Protocol Server using Jina.ai and FastMCP in Python
In this post, we'll discuss the Model Context Protocol and why it might be important, then walk through building an MCP server that talks to Jina.ai, adding web search and fact-checking functionality to Claude Desktop using Python and FastMCP.
Anthropic announced the Model Context Protocol (MCP) around Thanksgiving last year. Although it garnered some attention, it has arguably received less recognition than it deserves, considering it could be a pivotal stepping stone in developing the next layer of the AI software stack.
The Model Context Protocol (MCP) is a standardized communication protocol designed specifically for large language models (LLMs).
Think of it as the "HTTP of AI"—just as HTTP standardized how web browsers communicate with web servers, MCP standardizes how LLM applications communicate with tools and data sources.
The current landscape of LLM development faces several hurdles:
Tool Integration Complexity: Each LLM service (OpenAI, Anthropic, etc.) has its own way of implementing tool calls and function calling, making it complex to build portable tools.
Context Management: LLMs need access to various data sources and tools, but managing this access securely and efficiently has been challenging.
Standardization: Without a standard protocol, developers must rebuild integration layers for each LLM platform they want to support.
MCP solves these challenges by providing a single, standardized protocol for exposing tools and context to any compliant LLM application.
MCP follows a client-server architecture with three main components:
MCP Server: A service that exposes tools, resources, and prompts to clients.
MCP Client: The application that connects to MCP servers and manages communication between the LLM and the servers. Client support is in its early stages: only a handful of tools implement any part of the protocol specification so far, and some functionality is not yet supported by any client.
And, of course, the LLM...
The workflow is straightforward: the client connects to a server, discovers the tools and resources it exposes, and the LLM then invokes those tools through the client as needed.
The security situation is more nuanced. Servers using stdio transport are typically colocated with the client, so API keys are not necessarily exposed to the internet. Still, they do seem to get passed around fairly casually, in my opinion.
These keys needed to be loaded into the client when the server started so they could be passed to the child process, and they even appeared in the desktop app logs, which was...concerning.
The widespread use of API keys is a broader issue affecting Gen AI services, platforms, and tooling. Companies like Okta and Auth0 are working on a solution to manage and authorize Gen AIs without relying on keys.
Anthropic officially supports low-level SDKs for TypeScript, Python, and Kotlin. Higher-level wrappers have recently been created that already cover much of the boilerplate and add other nice features, such as a CLI for debugging, inspecting, and installing servers on the client, making it easier to develop MCP servers.
The fast, Pythonic way to build MCP servers.
Model Context Protocol (MCP) servers are a new, standardized way to provide context and tools to your LLMs, and FastMCP makes building MCP servers simple and intuitive. Create tools, expose resources, and define prompts with clean, Pythonic code:
# demo.py
from fastmcp import FastMCP

mcp = FastMCP("Demo 🚀")

@mcp.tool()
def add(a: int, b: int) -> int:
    """Add two numbers"""
    return a + b
That's it! Give Claude access to the server by running:
fastmcp install demo.py
FastMCP handles all the complex protocol details and server management, so you can focus on building great tools. It's designed to be high-level and Pythonic - in most cases, decorating a function is all you need.
FastMCP is one such framework. We'll now explore how to create an almost practical tool for reading websites, answering search queries through the web, and fact-checking information. We will be using Jina.ai.
It is a very slick service that provides a "Search Foundation platform" that combines "Embeddings, Rerankers, and Small Language Models" to aid businesses in building Gen AI and Multimodal search experiences.
You will need uv installed. It is the recommended way to create and manage Python projects. It's part of a relatively recent but exciting Python toolchain called astral.sh. I recommend you check it out.
It aims to be a one-stop shop for managing projects, dependencies, virtual environments, versions, linting, and executing Python scripts and modules. It's written in Rust. Do with that information what you will.
You will also need to install the Claude Desktop App. For our purposes, the Claude Desktop App will serve as the MCP Client and is a key target Client for Anthropic.
Full Walkthrough here:
https://dev.to/asragab/building-a-model-context-protocol-server-using-jinaai-and-fastmcp-in-python-1od8
Using uv you can initialize a project with:

uv init mcp-jinaai-reader --python 3.11

This will create a folder called mcp-jinaai-reader and a .python-version along with a pyproject.toml.

cd mcp-jinaai-reader
uv venv

This will create a virtual env corresponding to the Python version we chose. After creating the environment, uv will print instructions on how to activate it for the session:

source .venv/bin/activate

Add a src directory and install the one dependency we need:

uv add fastmcp

Create a .env file at the project root and add your JINAAI_API_KEY to the file. You can obtain one for free by signing up at Jina. In general, any API keys or other env variables your server needs to run will go in this file.

JINAAI_API_KEY=jina_*************

In the src directory, create a server.py file...and we should be able to get to the code.

Starting with the imports: httpx will be the library we use here to make HTTP requests, and we need the urlparse method to help us determine whether a string is possibly a valid URL.

from fastmcp import FastMCP
import httpx
from urllib.parse import urlparse
import os

# Initialize the MCP server
mcp = FastMCP("search", dependencies=["uvicorn"])

This initializes the server; the first argument is the tool's name. I am not 100% sure why uvicorn needs to be explicitly added as a dependency here, since it is a transitive dependency of FastMCP, but it does seem to be required. It is likely due to how the fastmcp CLI (more on that shortly) installs the server. If you have other dependencies, you must add them here so the client knows it needs to install them before running the server; we will see how that works in a moment.

You can probably suss out the pattern here, but Jina uses different subdomains to route particular requests. The search endpoint expects a query, the reader endpoint expects a URL, and the grounding endpoint can provide the LLM with a specific response or answer.

Grounding is a much larger topic and is used with other techniques, such as RAG and fine-tuning, to help LLMs reduce hallucinations and improve decision-making.
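The subdomain pattern can be captured as a few constants. This is a sketch; the constant names are my own, but the subdomains are Jina's documented endpoints:

```python
# Jina routes requests by subdomain:
#   s.jina.ai — web search (expects a query)
#   r.jina.ai — reader (expects a URL to fetch and convert to text)
#   g.jina.ai — grounding / fact-checking (expects a statement)
SEARCH_URL = "https://s.jina.ai/"
READER_URL = "https://r.jina.ai/"
GROUNDING_URL = "https://g.jina.ai/"
```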
The annotation @mcp.tool does a lot of the heavy lifting. Similar annotations exist in the library for resources and prompts. The annotation extracts the details of the function signature and return type to create an input and output schema for the LLM to call the tool. It configures the tool so the client understands the server's capabilities, and it registers the function as the handler for the configured tool.
Next, you'll notice that the function is async. No runtime configuration is needed, and no asyncio.run stuff either. If you need to, for some reason, run the server as a standalone service, you do need to handle some of this yourself. There is an example in the FastMCP repo for how to do this.
The function body is reasonably uninteresting; it validates whether it is receiving a URL, sets the appropriate headers, calls the Jina endpoint, and returns the text.
And that's it...
fastmcp dev src/server.py --with-editable .
Running the above command will start the MCP Inspector, a tool the SDK provides to test and debug server responses. The --with-editable flag allows you to make changes to the server without relaunching the inspector (highly, HIGHLY recommended).
You should see the inspector start up.
By default, the inspector runs on port 5173 and the server (the code you just wrote) runs on port 3000. You can change these by setting the SERVER_PORT and CLIENT_PORT environment variables before invocation.
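For example, assuming a POSIX shell (the port values here are illustrative), that might look like:

```
CLIENT_PORT=5174 SERVER_PORT=3001 fastmcp dev src/server.py --with-editable .
```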
If all goes well, you should see something like the following. On the left you can add the environment variables you'll need; here, JINAAI_API_KEY is the only one.
If you click Tools in the top menu bar and then List Tools, you should see the tools we created. Notice that the docstring serves as the description of each tool.
Clicking on a particular tool will bring up the textboxes for you to enter the parameters needed to call the tool.
After you are satisfied things are working as expected, you are now ready to install the server on the Claude Desktop App client.
fastmcp install src/server.py -f .env
will do this. I am sure that in the future it will support other clients, but for now, this is all you need to do. The -f .env flag passes the env variables to the app client.
What this does under the hood is update claude_desktop_config.json, providing the command and arguments needed to run the server. By default, this uses uv, which must be available on your PATH.
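For illustration, the resulting entry in claude_desktop_config.json looks roughly like this. The server name and path come from this project, but the exact args that fastmcp generates may differ:

```json
{
  "mcpServers": {
    "search": {
      "command": "uv",
      "args": ["run", "--with", "fastmcp", "fastmcp", "run", "src/server.py"],
      "env": {
        "JINAAI_API_KEY": "jina_*************"
      }
    }
  }
}
```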
If you now open the Claude Desktop App, go to the menu bar, click Claude > Settings, and then click Developer, you should see the name of the tool you set when initializing the server.
Clicking on it should bring up its config. Not only will you see how it gets executed, but in the Advanced Options you'll see the env variables that have been set.
You can also edit this config directly, but I wouldn't necessarily recommend it here.
If all goes well, when you go to the Desktop App you should see no errors. (If you do, the Settings page has a button for checking the logs so you can investigate from there.)
Additionally, you should see a hammer symbol with the number of individual tools you have at your disposal. (Note: yours should probably be two, unless you've installed other MCP servers.)
Rather than invoking the tool directly, you chat with the app as you normally would, and when it encounters a situation where it deduces the tool would be helpful, it will ask if you want to use it. No additional code or configuration is necessary.
I think it relies on both the tool name and the description to decide whether it is appropriate, so it's worth crafting a clear, simple description of what the tool does.
You will get a prompt like the following:
And you can just "chat" with it. Admittedly, the tool as written sometimes runs into issues: occasionally it decides it can't access the internet, and sometimes it fails to retrieve results, but sometimes you get this:
This had kind of a natural flow: it read the page, provided a summary, and I could ask it to go to a specific article and read that.
Hopefully, that gave you some insight into MCP servers. There's plenty to read and watch, but one more site I'll recommend is glama.ai: they keep a fairly comprehensive list of available MCP servers to download and try out, including other web search tools that are more reliable than our toy example. Check it out, and thank you for following along.