How will you determine which column will be appropriate for Index, that can avoid skewness?
I have given below the syntax. Before assigning a column as Index, make sure you check data in that column is not skewed.
HASHAMP(HASHBUCKET(HASHROW(col whose amp distribution that needs to be checked; eg: PI)))
,CAST(COUNT(*) AS DECIMAL(18,0))
GROUP BY 1
ORDER BY 1
PS: This alone cannot let you determine PI. As, you may sometimes have to decide based on how frequently it is used for Joining purposes etc.