SAS format reader

SAS is a software suite that can work with data from a variety of sources and perform statistical analysis on it.

This plugin provides a custom format reader to import the contents of a .sas7bdat data file into your DSS project.

This implementation will work with almost any SAS file but may be slower than the default one provided with DSS.
It is provided by the Pandas library shipped with DSS, and may change when future Pandas versions are released.

Dataiku DSS screenshot showing how to import your SAS file directly in DSS

Import your SAS data directly in DSS.

Plugin information

Version1.1.0
AuthorDataiku (Thomas Labadie)
Released2017-12-20
Last updated2017-12-20
LicenseApache Software License
Source codeGithub
Reporting issuesGithub

How to use

You need to import your .sas7bdat file into DSS and then choose the "SAS using Pandas" format, as shown in the picture above.

The "chunksize" parameter lets you choose how many lines will be read during each iteration ; a higher value means faster read but higher memory usage.

Implementation choices:

  • Date values may be kept in their original SAS format (number of seconds since 1960-1-1) depending on the Pandas version.
  • Integer values may be transformed into double values.