Registration now open for the RATOM 2020 Hackathon!

Registration for the 2020 RATOM Hackathon is now open!

Click here to register

Dates: October 19-21, 2020

This is a chance to get involved with other members of community working on email preservation, dig in to state-of-the-art open source tools, learn some new skills, or simply to explore a new problems space and get feedback in a friendly and inclusive environment.

The hackathon is a free virtual event, and is open to everyone. We’ve provided three suggested areas of interest in the registration to help bring together those interested in similar features, workflows, or automation tasks.

Recently released: libratom 0.4.3

We’ve released libratom 0.4.3, which includes a new flag option (-m) for the entity and report CLI commands, allowing users to populate the message table of the sqlite3 database output with message bodies (stripped of markup and inline attachments) and headers. This option is intended to facilitate a broader range of dataset production workflows, particularly supporting downstream statistical content analysis and ML tasks.

This release also includes additional improvements to the feedback provided by existing commands, bug fixes, and dependency updates.

You can download this release on PyPI or read detailed instructions on installation and usage on GitHub.

Recently released: libratom 0.4.1

We’ve released libratom 0.4.1, which introduces a new CLI command to allow users to batch export preselected messages from PST files using machine-generated JSON. This command is intended to complement the operation of our (in development – see the ratom-deploy repository on GitHub) web interface allowing collecting organizations to select, appraise, and export messages for delivery. More info coming soon!

This release also includes additional improvements to the feedback provided by existing commands, bug fixes, and dependency updates.

You can download this release on PyPI, or read detailed instructions on installation and usage on GitHub.

Recently released: libratom 0.3.0

We’ve released libratom 0.3.0, which introduces a new CLI command to inspect locally installed spaCy models and install specific versions of models as desired. This release includes additional minor bug fixes and dependency updates.

You can download this release on PyPI, or read detailed instructions on installation and usage on GitHub.

RATOM Activities in Review

You can track our development progress on GitHub – our core email processing library and tools, our Jupyter notebooks, and our web-based selection and appraisal tool deployment, server, and UI repositories.

Community needs and workflows drive our development efforts, and we work with a wide range of practitioners and organizations to refine the tools as the project advances. A selection of groups we’ve presented to or conducted workshops with recently is listed below.

Open Preservation Foundation Webcast (November 13, 2019)
“Review, Appraisal, and Triage of Mail Project” (Slides) (OPF link and member recording)

BitCurator User Forum 2019 (October 24-25, 2019)
“NLP and Machine Learning for Digital Curation – Part 2 (Email Analysis)” (Slides)

NSDA Digital Preservation 2019 (October 16-17, 2019)
“Tackling Archival Email with Natural Language Processing and Machine Learning” (Slides)

Learn@DLF 2019 (October 13, 2019)
“NLP and Machine Learning for Born-Digital Materials Part 2 (Collections Containing Email)” (Slides)

iPRES 2019 (September 16-20, 2019)
“RATOM: Review, Appraisal, and Triage of Mail Project” (Slides)

Archival Education and Research Institute 2019 (July 8-12, 2019)
“Review, Appraisal, and Triage of Mail Project: Early Development Demos” (Slides)