I Created an AI Companion that Monitors My Screen and Helps Fix My Screw Ups ✨-Python Tutorial-php.cn

Home

Backend Development

Python Tutorial

I Created an AI Companion that Monitors My Screen and Helps Fix My Screw Ups ✨

PHPz

Aug 09, 2024 am 07:46 AM

Lately, I've been struggling with my addiction to binge-watching Naruto. While it's enjoyable, it definitely doesn't help me deliver shareholder value. ?

So, why not build an AI personal assistant that monitors my screen and lets me know if I am overdoing something I should not do, like watching anime? ?

Considering the rapid development in AI in the past year, I decided to use a multi-modal language model to monitor my screen and let me know when I'm spending too much time on non-productive activities.

So, here’s how I did it.

Configure OpenAI GPT-4o, multi-modal AI model.
Use a screen analyzer tool from Composio to monitor the screen.
Pass the screenshots to GPT at regular intervals.
Rendering the message from GPT as a notification in the system.

I Created an AI Companion that Monitors My Screen and Helps Fix My Screw Ups ✨

In this article, I will also explain how you can build your personal AI friend using OpenAI and Composio.

Composio - Your AI Agent Tooling Platform

Composio is an open-source platform that equips your AI agents with tools and integrations. It lets you extend the ability and versatility of your AI agents through integration tools like code interpreter, RAG, Embedding and integrations like GitHub, Slack, Jira, etc.

I Created an AI Companion that Monitors My Screen and Helps Fix My Screw Ups ✨

Please help us with a star. ?

It would help us to create more articles like this ?

Star the Composio.dev repository ⭐

Prerequisites for Building AI Friend

To successfully complete the project, you will need the following.

OpenAI SDK and API key: To interact with the LLM.
Composio: For accessing image analysing tool.
PyAutoGUI: To automate interactions on the screen.
Osascript: To execute AppleScript commands for controlling macOS applications.

So, let’s get started.

Let's Get Started ?

Begin by creating a Python virtual environment.

python -m venv ai-friend
cd ai-friend
source bin/activate

Now, install the following dependencies.

pip install composio-core
pip install composio-openai openai
pip install pyautogui

Next, Create a .env file and add environment variables for the OpenAI API key.

OPENAI_API_KEY=your API key

Set Up Composio

You can use the CLI to set up Composio easily.

First, log in to your account by running the following command.

composio login

Finish the login flow to proceed further.

Now, update apps.

composio apps update

Now, you are ready to move to the coding part.

Building the AI Friend

Now that you have set up the environment, let's hop on to the coding part.

First, import the libraries and initialize the toolsets.

import dotenv
from openai import OpenAI

from composio_openai import App, ComposioToolSet
from composio.utils.logging import get as get_logger

logger = get_logger(__name__)


# Load environment variables from .env
dotenv.load_dotenv()

# Initialize tools.
openai_client = OpenAI()
composio_toolset = ComposioToolSet()

# Retrieve actions
actions = composio_toolset.get_tools(apps=[App.SYSTEMTOOLS, App.IMAGEANALYSERTOOL])

So, in the above code block,

We imported all the required libraries and modules.
Loaded the variables defined in the .env file.
Created an instance of OpenAI() and ComposioToolSet.
Retrieved the Actions from SYSTEMTOOLS and IMAGEANALYSERTOO.

So, here is what these tools do.

SYSTEM TOOLS: The system tools have two Actions: push notifications and screen capture.
IMAGEANALYSERTOOL: This tool has only one Action: analyzes images using multi-modal LLMs like GPT-4o and Claude Sonnet, etc.

If you want to examine the code and how it works, check the code files for system tools and the image analyser tool.

Note: Actions in Composio are tasks that your agent can perform, such as clicking a screenshot, sending a notification, or sending a mail.

Set Up OpenAI Assistant

Now, define a clear and concise prompt for the agent. This is crucial for agent performance. You can alter the prompts based on your requirements.

assistant_instruction = (
"""You are an intelligent and proactive personal productivity assistant.
Your primary tasks are:
1. Regularly capture and analyze screenshots of the user's screen.
2. Monitor user activity and provide timely, helpful interventions.

Specific responsibilities:
- Every few seconds, take a screenshot and analyze its content.
- Compare recent screenshots to identify potential issues or patterns.
- If you detect that the user is facing a technical or workflow problem:
- Notify them with concise, actionable solutions.
- Prioritize non-intrusive suggestions that can be quickly implemented.
- If you notice extended use of potentially distracting websites or applications (e.g., social media, video streaming):
- Gently remind the user about their productivity goals.
- Suggest a brief break or a transition to a more focused task.
- Maintain a balance between being helpful and not overly disruptive.
- Tailor your interventions based on the time of day and the user's apparent work patterns.

Operational instructions:
- You will receive a 'CHECK' message at regular intervals. Upon receiving this:
1. Take a screenshot using the screenshot tool.
2. Then, analyse that screenshot using the image analyser tool.
3. Then, check if the user uses distracting websites or applications.
4. If they are, remind them to do something productive.
5. If they are not, check if the user is facing a technical or workflow problem based on previous history.
6. If they are, notify them with concise, actionable solutions.
7. Try to maintain a history of the user's activity and notify them if they are doing something wrong.

Remember: Your goal is to enhance productivity while respecting the user's autonomy and work style."""
)
assistant = openai_client.beta.assistants.create(
name="Personal Productivity Assistant",
instructions=assistant_instruction,
model="gpt-4-turbo",
tools=actions, # type: ignore
)
# create a thread
thread = openai_client.beta.threads.create()
print("Thread ID: ", thread.id)
print("Assistant ID: ", assistant.id)

In the above code block,

A detailed assistant instruction is provided.
Created a new assistant instance with the previously defined instruction, model name, and previously defined actions.
Finally, create a thread for interaction with the models.

Define and Run the Assistant

Now, define a function for running the assistants.

def check_and_run_assistant():
    logger.info("Checking and running assistant")

    # Send 'CHECK' message to the assistant
    message = openai_client.beta.threads.messages.create(
        thread_id=thread.id,
        role="user",
        content="CHECK",
    )

    # Execute Agent
    run = openai_client.beta.threads.runs.create(
        thread_id=thread.id,
        assistant_id=assistant.id,
    )

    # Execute function calls
    run_after_tool_calls = composio_toolset.wait_and_handle_assistant_tool_calls(
        client=openai_client,
        run=run,
        thread=thread,
    )

# Run the assistant check every 10 seconds
while True:
    check_and_run_assistant()

Here’s what is going on in the above code.

Send a 'CHECK' Message: This sends a "CHECK" message to the assistant in the specified thread to ensure the model is responsive.
Execute Agent: Creates a run for the assistant using the specified thread and assistant IDs.
Handle Tool Calls: Waits for and handles tool calls made by the assistant using the Composio toolset.
Loop the Agent: Loop the agent so it runs and monitors your workflow continuously.

Finally, execute the file by running the Python file and letting your new AI friend keep you focused on your goals.

The agent monitors your screen and sends a notification when it sees you doing something you should not.

The full code can be found here

Here is an example of the agent in action.

Next Steps

In this article, you built your personalised AI friend that monitors your activity. However, adding external integrations such as a Calendar or Gmail tool can make it even more useful. This lets you know if you have some events to attend or important emails to respond to.

You can do it easily with Composio’s wide array of integrations, from GitHub and Calendar to Slack, Discord, and more.

If you want to see more AI-related articles, let me know in the comments and give us a star on GitHub.

Star the Composio.dev repository ⭐

Thank you for reading! <script> // Detect dark theme var iframe = document.getElementById('tweet-1820129229683454160-179'); if (document.body.className.includes('dark-theme')) { iframe.src = "https://platform.twitter.com/embed/Tweet.html?id=1820129229683454160&theme=dark" } </script>

The above is the detailed content of I Created an AI Companion that Monitors My Screen and Helps Fix My Screw Ups ✨. For more information, please follow other related articles on the PHP Chinese website!

Statement

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

The 2-Hour Python Plan: A Realistic ApproachApr 11, 2025 am 12:04 AM

You can learn basic programming concepts and skills of Python within 2 hours. 1. Learn variables and data types, 2. Master control flow (conditional statements and loops), 3. Understand the definition and use of functions, 4. Quickly get started with Python programming through simple examples and code snippets.

Python: Exploring Its Primary ApplicationsApr 10, 2025 am 09:41 AM

Python is widely used in the fields of web development, data science, machine learning, automation and scripting. 1) In web development, Django and Flask frameworks simplify the development process. 2) In the fields of data science and machine learning, NumPy, Pandas, Scikit-learn and TensorFlow libraries provide strong support. 3) In terms of automation and scripting, Python is suitable for tasks such as automated testing and system management.

How Much Python Can You Learn in 2 Hours?Apr 09, 2025 pm 04:33 PM

You can learn the basics of Python within two hours. 1. Learn variables and data types, 2. Master control structures such as if statements and loops, 3. Understand the definition and use of functions. These will help you start writing simple Python programs.

How to teach computer novice programming basics in project and problem-driven methods within 10 hours?Apr 02, 2025 am 07:18 AM

How to teach computer novice programming basics within 10 hours? If you only have 10 hours to teach computer novice some programming knowledge, what would you choose to teach...

How to avoid being detected by the browser when using Fiddler Everywhere for man-in-the-middle reading?Apr 02, 2025 am 07:15 AM

How to avoid being detected when using FiddlerEverywhere for man-in-the-middle readings When you use FiddlerEverywhere...

What should I do if the '__builtin__' module is not found when loading the Pickle file in Python 3.6?Apr 02, 2025 am 07:12 AM

Error loading Pickle file in Python 3.6 environment: ModuleNotFoundError:Nomodulenamed...

How to improve the accuracy of jieba word segmentation in scenic spot comment analysis?Apr 02, 2025 am 07:09 AM

How to solve the problem of Jieba word segmentation in scenic spot comment analysis? When we are conducting scenic spot comments and analysis, we often use the jieba word segmentation tool to process the text...

How to use regular expression to match the first closed tag and stop?Apr 02, 2025 am 07:06 AM

How to use regular expression to match the first closed tag and stop? When dealing with HTML or other markup languages, regular expressions are often required to...

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Best Graphic Settings

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

2 weeks agoByDDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

WWE 2K25: How To Unlock Everything In MyRise

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

VSCode Windows 64-bit Download

A free and powerful IDE editor launched by Microsoft

SublimeText3 English version

Recommended: Win version, supports code prompts!

Zend Studio 13.0.1

Powerful PHP integrated development environment

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7463

CakePHP Tutorial

1376

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers