What is Presto?

Presto is an open-source distributed SQL query engine for running interactive analytic queries against data sources of all sizes, ranging from gigabytes to petabytes. Presto was designed and written from the ground up for interactive analytics and approaches the speed of commercial data warehouses while scaling to the size

What is the performance of Prestos at the SF10000 scale?

At the SF10000 (10TB) scale, the maximum performance gain versus Prestos was 1,700x for BI/reporting queries and up to 3,000x for ad hoc queries, with tremendous savings on infrastructure cost. The biggest observation is that Prestos did not do well on relatively small clusters (fewer than 8 nodes) or on Parquet files with a large row group size.

What is the response time for Presto?

Presto is targeted at analysts who expect response times ranging from sub-second to minutes. Presto breaks the false choice between having fast analytics using an expensive commercial solution or using a slow “free” solution that requires excessive hardware.

How many nodes does Starburst Presto require?

At the SF1000 (1TB) scale, Starburst Presto requires at least 12 worker nodes to achieve the same performance as a 4-node Dremio engine. In other words, for the same performance as Dremio, Starburst required 3.4x higher cost and 3x as many nodes (despite leveraging higher-end memory-optimized instances).
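
The arithmetic behind these ratios can be checked with a short Python sketch. It uses only the figures quoted above (12 nodes, 4 nodes, 3.4x cost). The function names are hypothetical, and the per-node figure is an inference that assumes cost scales linearly with node count over the same run time, which the benchmark text does not state.

```python
# Figures quoted in the answer above.
STARBURST_NODES = 12   # "at least 12 worker nodes"
DREMIO_NODES = 4       # "4-node Dremio engine"
COST_RATIO = 3.4       # Starburst cost relative to Dremio


def node_ratio(starburst_nodes: int, dremio_nodes: int) -> float:
    """How many times more nodes Starburst needs than Dremio."""
    return starburst_nodes / dremio_nodes


def implied_per_node_price_ratio(cost_ratio: float, nodes_ratio: float) -> float:
    """Per-node price ratio implied by the total cost ratio.

    Assumption (not from the source): total cost is proportional to
    node count times per-node price, with equal run time on both sides.
    """
    return cost_ratio / nodes_ratio


if __name__ == "__main__":
    ratio = node_ratio(STARBURST_NODES, DREMIO_NODES)
    per_node = implied_per_node_price_ratio(COST_RATIO, ratio)
    print(f"node ratio: {ratio:.1f}x")
    print(f"implied per-node price ratio: {per_node:.2f}x")
```

Under that assumption, the 3.4x cost figure exceeds the 3x node count because each Starburst node costs somewhat more, which is consistent with the remark about higher-end memory-optimized instances.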
