Get Started

Interactive Document Intelligence for ESG

Automatically consolidate unstructured documents into a unified, searchable and thematically categorized database. Leverage sentiment analysis to accelerate analysis.

The goal of this plug and play solution is to automatically consolidate unstructured documents into a unified, searchable and thematically categorized database, leverage sentiment analysis to accelerate analysis, and provide end users with an interactive dashboard to analyze their document corpus all within Dataiku.  More details on the specifics of the solution can be found on the knowledge base.


  • Rapid time-to-insight using an interactive, purpose-built dashboard showing high level trends via time series and peer group comparison, alongside drill downs to underlying documents, all without needing technical knowledge.
  • Discover ‘clusters’ of words that form around specific topics of interest via topic analytics, revealing new insights and perspectives.
  • Easily and quickly adjust core components to suit the needs and preferences of the business, whether technical or topic focused, thanks to the highly customizable and modular design .

Business Overview

Inflows in ESG products have increased by 140% in 2020 to represent above $40 trillion assets.  The move to ESG is accelerated by the surge in regulation impacting all players across the value chain combined with a growing number of large-scale industry initiatives such as the Net Zero Banking Alliance launched by the UN in 2021. 

The data sources required to effectively embed ESG into financial processes, including KYC, trade finance, credit scoring, and investments, are many and varied. The ability to leverage unstructured data through document intelligence is critical. Currently, organizations rely on individuals to read sections of these documents, or search for relevant materials without a systematic way of categorizing and understanding the data.

This solution automatically consolidates unstructured document data into a unified, searchable and automatically categorized database, with insight accessible via a powerful and easy to use dashboard. Using a modular ESG keyword database (which can be enhanced or swapped out for other topics with ease) the solution can be  used to tackle questions such as: 

  • What ESG topics are being addressed within a portfolio or document collection, and which are rarely tackled?
  • What firms or offerings are facing challenges or successes associated with ESG topics of interest, e.g., relating to environmental impact?
  • What documents or entities are ESG outliers according to my document collection, positive and negative?
  • What ESG trends emerge over time around topics and firms associated with them?


  • Requires Dataiku v9+
  • Prior to installation, your Dataiku instance Admin will need to create a code environment.  The full list of requirements can be found here.
  • This adapt and apply solution can be installed and used right away in one of two ways:
    • On your Dataiku instance click + New Project > Industry Solutions > Search for Interactive Document Intelligence for ESG
    • Download the .zip project file for your Dataiku version and import it directly to your Dataiku instance