GitHub - jacopofar/adium-to-avro: Produce an Avro-formatted corpus of chat messages from Adium logs

1 min read Original article ↗

Skip to content

Navigation Menu

This repository was archived by the owner on Dec 13, 2022. It is now read-only.

adium-to-avro

Produce an Avro-formatted corpus of chat messages from Adium logs

The resulting avro file is suitable for another tool for generating Markov models.

How to use

Just run sbt run, there are no options. It will produce a TSV and an Avro file containing all of the messages in Adium logs, from each kind of protocol (IRC, XMPP, Telegram, etc.)