Analyzes the entire history of a macOS Messages conversation

These details have not been verified by PyPI

Project links

Project description

iMessage Conversation Analyzer

Copyright 2020-2024 Caleb Evans
Released under the MIT license

iMessage Conversation Analyzer (ICA) is a fully-typed Python library that will read the contents of an iMessage conversation via the Messages app's database on macOS. You can then gather various metrics of interest on the messages. The library also includes a CLI utility for easy use.

Much of this program was inspired by and built using findings from this blog post by Yorgos Askalidis.

Caveats

Please note that currently, you can only query conversations between you and a single other person (i.e. group chats are currently unsupported).

It should also be clarified that this program only supports macOS, since the presence of the chat.db file in your user Library directory is essential for the program to function.

Installation

1. Install Python 3

To install Python 3, you'll need to install the Apple Command Line Tools, which you can install by running:

xcode-select --install

Don't worry if you see a long download time; it will shorten rather quickly.

2. Set up virtualenv

pip3 install virtualenv

virtualenv --python=python3 .virtualenv
source .virtualenv/bin/activate

3. Install project depdendencies

pip install -r requirements.txt

Usage

The package includes both a Command Line API for simplicity/convenience, as well as a Python API for maximum flexibility.

Command Line API

To use ICA from the command line, simply invoke the ica command. The minimum required arguments are:

A path to an analyzer file to run, or the name of a built-in analyzer
The first and last name of the contact, via the -c/--contact flag
1. If the contact has no last name on record, you can just pass the first name

Other formats

You can optionally pass the -f/--format flag to output to a specific format like CSV (supported formats include csv, excel/xlsx, and markdown/md).

ica message_totals -c 'Jane Fernbrook' -f csv

ica ./my_custom_analyzer.py -c 'Jane Fernbrook' -f csv

Writing to a file

Finally, there is an optional -o/--output flag if you want to output to a specified file. ICA will do its best to infer the format from the file extension, although you could also pass --format if you have special filename requirements.

ica transcript -c 'Thomas Riverstone' -o ./my_transcript.xlsx

Built-in analyzers

The library includes several built-in analyzers so that you can use ICA out of the box:

message_totals: a summary of message and reaction counts, by person and in total, as well as other insightful metrics
attachment_totals: lists count data by attachment type, including number of Spotify links shared, YouTube videos, Apple Music, etc.
most_frequent_emojis: count data for the top 10 most frequently used emojis across the entire conversation
totals_by_day: a comprehensive breakdown of message totals for every day you and the other person have been messaging in the conversation
transcript: a full, unedited transcript of every message, including reactions, between you and the other person (attachment files not included)

Again, to call one of these built-in analyzers, just pass it as the first argument to the ica command:

ica most_frequent_emojis -c 'Jane Fernbrook'

Python API

The Python API is much more powerful, allowing you to integrate ICA into any type of Python project that can run on macOS. All of the built-in analyzers themselves (under the ica/analyzers directory) actually use this API.

# get_my_transcript.py

import pandas as pd

import ica


# Export a transcript of the entire conversation
def main() -> None:
    # Allow your program to accept all the same CLI arguments as the `ica`
    # command; you can skip calling this if have other means of specifying the
    # contact name and output format
    cli_args = ica.get_cli_args()
    # Retrieve the dataframes corresponding to the massaged contents of the
    # database; dataframes include `message` and `attachment`
    dfs = ica.get_dataframes(
        contact_name=cli_args.contact_name,
        timezone=cli_args.timezone
    )
    # Send the results to stdout (or to file) in the given format
    ica.output_results(
        pd.DataFrame(
            {
                "timestamp": dfs.messages["datetime"],
                "is_from_me": dfs.messages["is_from_me"],
                "is_reaction": dfs.messages["is_reaction"],
                # U+FFFC is the object replacement character, which appears as
                # the textual message for every attachment
                "message": dfs.messages["text"].replace(
                    r"\ufffc", "(attachment)", regex=True
                ),
            }
        ),
        # The default format (None) corresponds to the pandas default dataframe
        # table format
        format=cli_args.format,
        # When output is None (the default), ICA will print to stdout
        output=cli_args.output,
    )


if __name__ == "__main__":
    main()

You can run the above program using the ica command, or execute it directly like any other Python program.

ica ./get_my_transcript.py -c 'Thomas Riverstone'

python ./get_my_transcript.py -c 'Thomas Riverstone'

python -m get_my_transcript -c 'Thomas Riverstone'

You're not limited to writing a command line program, though! The ica.get_dataframes() function is the only function you will need in any analyzer program. But beyond that, feel free to import other modules, send your results to other processes, or whatever you need to do!

You can also import any built-in analyzer (for your own post-processing) via the ica.analyzers namespace.

Errors and exceptions

BaseAnalyzerException: the base exception class for all library-related errors and exceptions
ContactNotFoundError: raised if the specified contact was not found
ConversationNotFoundError: raised if the specified conversation was not found
FormatNotSupportedError: raised if the specified format is not supported by the library

Using a specific timezone

By default, all dates and times are in the local timezone of the system on which ICA is run. If you'd like to change this, you can pass the --timezone / -t option to the CLI with an IANA timezone name.

ica totals_by_day -c 'Daniel Brightingale' -t UTC

ica totals_by_day -c 'Daniel Brightingale' -t America/New_York

The equivalent option for the Python API is the timezone parameter to ica.get_dataframes:

dfs = ica.get_dataframes(contact_name=my_contact_name, timezone='UTC')

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.1.0

Jan 29, 2026

3.0.0

Jan 13, 2026

3.0.0rc1 pre-release

Jan 13, 2026

2.9.0

Nov 8, 2025

2.8.0

Jul 2, 2025

2.7.0

Mar 5, 2025

2.6.0

Feb 2, 2025

2.5.0

Jan 3, 2025

2.5.0rc5 pre-release

Jan 3, 2025

2.5.0rc3 pre-release

Jan 2, 2025

2.4.0

Dec 27, 2024

2.3.0

Mar 6, 2024

2.2.0

Feb 22, 2024

This version

2.1.0

Feb 8, 2024

2.0.0

Feb 6, 2024

2.0.0b1 pre-release

Feb 6, 2024

1.2.3

Jan 20, 2024

1.2.3b2 pre-release

Jan 20, 2024

1.2.3b1 pre-release

Jan 20, 2024

1.2.2 yanked

Jan 20, 2024

Reason this release was yanked:

Missing other critical fixes

1.2.1

Jan 19, 2024

1.2.0

Jan 17, 2024

1.1.0

Mar 6, 2021

1.0.1

Mar 6, 2021

1.0.0

Mar 6, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

imessage-conversation-analyzer-2.1.0.tar.gz (30.1 kB view details)

Uploaded Feb 8, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

imessage_conversation_analyzer-2.1.0-py3-none-any.whl (26.0 kB view details)

Uploaded Feb 8, 2024 Python 3

File details

Details for the file imessage-conversation-analyzer-2.1.0.tar.gz.

File metadata

Download URL: imessage-conversation-analyzer-2.1.0.tar.gz
Upload date: Feb 8, 2024
Size: 30.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for imessage-conversation-analyzer-2.1.0.tar.gz
Algorithm	Hash digest
SHA256	`9d31495609ef02427fc4ad58ee64948d09e901e770a96ce6ff20006ed6bf4535`
MD5	`0c552dfce3b501f98a55856af4a2a334`
BLAKE2b-256	`75fbf68c1720d49726cb2fa1cb85ee28c307740cc43c2ed3a7e57ec1a91f1118`

See more details on using hashes here.

File details

Details for the file imessage_conversation_analyzer-2.1.0-py3-none-any.whl.

File metadata

Download URL: imessage_conversation_analyzer-2.1.0-py3-none-any.whl
Upload date: Feb 8, 2024
Size: 26.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for imessage_conversation_analyzer-2.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d0152849cc78326adabe423baa6f34c841b69b1a375c0928984d1fc9075b7806`
MD5	`3b28e80597f3e5e2f9e07f6c543747da`
BLAKE2b-256	`f6754091284ffedea24bea70406089b3423d56589520ea48d762358085327113`

See more details on using hashes here.

imessage-conversation-analyzer 2.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

iMessage Conversation Analyzer

Caveats

Installation

1. Install Python 3

2. Set up virtualenv

3. Install project depdendencies

Usage

Command Line API

Other formats

Writing to a file

Built-in analyzers

Python API

Errors and exceptions

Using a specific timezone

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes