Interactive Document Intelligence for ESG

Automatically consolidate unstructured documents into a unified, searchable and thematically categorized database. Leverage sentiment analysis to accelerate analysis.

This plug-and-play solution automatically consolidates unstructured documents into a unified, searchable, and thematically categorized database, uses sentiment analysis to speed up review, and gives end users an interactive dashboard for analyzing their document corpus, all within Dataiku. More details on the solution can be found in the knowledge base. This solution is only available on installed instances.

Business Overview

Inflows into ESG products increased by 140% in 2020, to represent more than $40 trillion in assets. The move to ESG is accelerated by a surge in regulation affecting all players across the value chain, combined with a growing number of large-scale industry initiatives such as the Net Zero Banking Alliance, launched by the UN in 2021.

The data sources required to effectively embed ESG into financial processes, including KYC, trade finance, credit scoring, and investments, are many and varied. The ability to leverage unstructured data through document intelligence is critical. Currently, organizations rely on individuals to read sections of these documents or search for relevant material, with no systematic way of categorizing and understanding the data.

This solution automatically consolidates unstructured document data into a unified, searchable, and automatically categorized database, with insights accessible via a powerful and easy-to-use dashboard. Using a modular ESG keyword database (which can easily be enhanced or swapped out for other topics), the solution can be used to tackle questions such as:

  • What ESG topics are being addressed within a portfolio or document collection, and which are rarely tackled?
  • What firms or offerings are facing challenges or successes associated with ESG topics of interest, e.g., relating to environmental impact?
  • What documents or entities are ESG outliers, positive or negative, within my document collection?
  • What ESG trends emerge over time around topics and firms associated with them?
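As a rough illustration of how a modular keyword database can drive thematic categorization, the Python sketch below tags a piece of text with topics by simple keyword matching. The topic names, the keyword lists, and the function `tag_topics` are hypothetical and are not taken from the solution itself, which may work quite differently.

```python
import re
from collections import Counter

# Hypothetical keyword database (topic -> keywords), not the solution's own.
# Swapping in a different dict retargets the tagger to other themes.
ESG_KEYWORDS = {
    "environmental": ["emissions", "carbon", "renewable"],
    "social": ["diversity", "labor", "community"],
    "governance": ["board", "audit", "compliance"],
}


def tag_topics(text, keyword_db=ESG_KEYWORDS):
    """Count keyword hits per topic (case-insensitive, whole words only)."""
    words = Counter(re.findall(r"[a-z]+", text.lower()))
    hits = {
        topic: sum(words[kw] for kw in keywords)
        for topic, keywords in keyword_db.items()
    }
    # Keep only the topics that matched at least once.
    return {topic: count for topic, count in hits.items() if count > 0}


sample = "The board approved a plan to cut carbon emissions and expand renewable energy."
print(tag_topics(sample))
```

Because the keyword database is passed in as an argument, replacing it with another topic dictionary changes the categorization without touching the matching logic.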


Highlights

  • Rapid time-to-insight using an interactive, purpose-built dashboard showing high-level trends via time series and peer group comparison, alongside drill-downs to underlying documents, all without needing technical knowledge.
  • Discover ‘clusters’ of words that form around specific topics of interest via topic analytics, revealing new insights and perspectives.
  • Easily and quickly adjust core components to suit the needs and preferences of the business, whether technical or topic-focused, thanks to the highly customizable and modular design.