Distinct values from a cube

We have a postgresql table with about 28 million facts with a `financial_year` column. Users can use the babbage API to essentially query the distinct `financial_year` values, which is about 10 unique values.

Postgresql seems to be very naive when doing `SELECT DISTINCT financial_year FROM table` because it runs a table scan even though `financial_year` has an index, which takes 60+ seconds. This seems to [be a known problem with postgresql](https://explainextended.com/2009/05/03/postgresql-optimizing-distinct/).

How have others solved this problem? Do we split out the financial_year data (and all the other dimensions of a fact) into a separate table?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Distinct values from a cube #24

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Distinct values from a cube #24

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions