Derived Data

Derived Data lets you to reference results in new queries (query stacking) and eases cohort analysis. In Arcadia Instant, we use derived data for computed fields in data modeling, weighted sums and averages, custom binning, for set-based and group-based analysis, and for combining data from different data sources.

Derived data enables you to reference query results in new queries, in essence "stacking" results from sub-select queries. The derived data feature also supports cohort analysis, where a set of data from a report is used (joined back) in another report, and allows you to build computed columns for re-use.

You must first define derived data, perhaps define additional derived data, and then use it. There also options for saving, viewing and deleting existing derived data measures.

Availability Note. By default, the Derived Data option is turned off. It can be turned on by a user with administrative privileges; see Enabling Derived Data.

Derived Data is very useful in determining weighted averages and other, more complex calculations. For example, in the dataset World Life Expectancy, life expectancy is reported at the level of each country, for each year. If we wanted to determine the life expectancy by region or subregion, we have to calculate a weighted average of life expectancies. You can also parametrize derived data definitions using bracket notation.

The following steps demonstrate how to use derived data on a table visual based on the dataset World Life Expectancy [data source samples.world_life_expectancy]. The initial set-up follows:

using derived data on a table visual
  1. Define Derived Data
  2. Define Additional Derived Data
  3. Use Derived Data
  4. View Derived Data Definitions
  5. Save Derived Data
  6. Delete Derived Data Definitions
Tip. The following video demonstrates how to derive data using nested sort, and store intermediate results in multi-pass calculations.