Is your feature request related to a problem? Please describe.
Spark provides ANALYZE TABLE to collect table statistics, which help the optimizer produce better query plans.
We have a case where we compare against another engine that analyzes its tables before running queries.
To get a fair comparison, I want the same capability in the scripts.
I want to add a parameter to enable or disable running "analyze table compute statistics" before the queries.
With statistics available, we can test with spark.sql.cbo.enabled = true to see whether the queries perform better.
Describe the solution you'd like
Add one parameter that enables an additional step between creating the tables and running the queries.
When it is enabled, run "ANALYZE TABLE table_name COMPUTE STATISTICS;" on each table.
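
A minimal sketch of what the optional step could look like, assuming the scripts are driven from Python. The function name `analyze_tables`, its `enabled` parameter, and the `run_sql` callable are hypothetical names for illustration, not part of the existing scripts. In a PySpark script `run_sql` would typically be `spark.sql`.

```python
from typing import Callable, Iterable, List


def analyze_tables(
    run_sql: Callable[[str], object],
    table_names: Iterable[str],
    enabled: bool = True,
) -> List[str]:
    """Run ANALYZE TABLE ... COMPUTE STATISTICS on each table.

    run_sql is any callable that executes one SQL statement
    (assumption: spark.sql in a PySpark script). Returns the
    statements that were executed, or an empty list when disabled.
    """
    if not enabled:
        # Step is skipped entirely when the parameter is off.
        return []

    executed = []
    for name in table_names:
        statement = f"ANALYZE TABLE {name} COMPUTE STATISTICS"
        run_sql(statement)
        executed.append(statement)
    return executed
```

It would be called once after the tables are created and before the first query, for example `analyze_tables(spark.sql, table_names, enabled=args.analyze_tables)`, where `args.analyze_tables` stands for whatever name the new parameter ends up with.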