Open building blocks

Addressing financial crime data analysis takes many shapes: from transaction screening to building knowledge graphs, each use case needs its own solutions. We think that open source technology can provide much-needed building blocks needed to create cooperation in this space.

Open Source AML/Fintech

  • moov.io maintains dozens of open source fintech components, from implementing ACH parsing, BSA e-filing to watchlist screening.
  • Marble build an open source real-time AML and fraud decision system.
  • Ballerine is an open-source data orchestration platform for risk decisioning.
  • Tazama is a charity building an open source transaction monitoring system focussed around ISO20022 messages.
  • IBM AMLSim is a multi-agent based simulator that generates synthetic banking transaction data together with a set of known money laundering patterns - mainly for the purpose of testing machine learning models and graph algorithms.

FollowTheMoney Ecosystem

The FollowTheMoney data model was first developed at OCCRP in order to provide a comprehensive data language around financial crime investigations. It has since been adopted by several other projects, including OpenSanctions.

  • Aleph is an open source data platform. The tool was built to help investigative journalists track people and companies, usually as part of their corruption investigations.
    • ingest-file is the component of Aleph that converts unstructured data - such as documents, PDFs, emails - into an FtM entity graph.
    • memorious is the crawling framework used by OCCRP's Aleph team. It can inject both structured data and documents into an Aleph, where they are turned into FtM entities.
  • FollowTheMoney itself includes a CLI tool that can convert CSV and SQL data to FtM, and export entities to various formats, including Gephi GEXF, Neo4J Cypher, etc.
  • Investigraph as an open source ETL framework producing FtM data. It was developed by investigativedata.io, who also maintain a data catalog with FtM open data assets focussed on the EU institutions and Germany.
  • thebeast is also an FtM data processing tool, focussed on YAML-defined pipelines for large-scale processing.

Open Data Community

Governments and organisations around the world have committed to publishing their data under open licenses in order to promote public re-use. Here's some other international organisations who support an open data commons focussed around financial crime-related data: