uk.co.nichesolutions.presto:presto-ml

Presto - Machine Learning Plugin

License

License

GroupId

GroupId

uk.co.nichesolutions.presto
ArtifactId

ArtifactId

presto-ml
Last Version

Last Version

0.143-CUSTOM
Release Date

Release Date

Type

Type

presto-plugin
Description

Description

Presto - Machine Learning Plugin

Download presto-ml

Dependencies

compile (1)

Group / Artifact Type Version
com.facebook.thirdparty : libsvm jar 3.18.1

provided (14)

Group / Artifact Type Version
uk.co.nichesolutions.presto : presto-spi jar 0.143-CUSTOM
uk.co.nichesolutions.presto : presto-main jar 0.143-CUSTOM
javax.validation : validation-api jar 1.1.0.Final
org.openjdk.jol : jol-core jar 0.2
com.fasterxml.jackson.core : jackson-annotations jar 2.4.4
com.fasterxml.jackson.core : jackson-core jar 2.4.4
com.fasterxml.jackson.core : jackson-databind jar 2.4.4
javax.inject : javax.inject jar 1
it.unimi.dsi : fastutil jar 6.5.9
io.airlift : json jar 0.124
io.airlift : slice jar 0.19
io.airlift : concurrent jar 0.124
com.google.guava : guava jar 18.0
com.google.inject : guice jar 4.0

test (4)

Group / Artifact Type Version
org.testng : testng jar 6.9.6
uk.co.nichesolutions.presto : presto-tests jar 0.143-CUSTOM
uk.co.nichesolutions.presto : presto-tpch jar 0.143-CUSTOM
uk.co.nichesolutions.presto : presto-main test-jar 0.143-CUSTOM

Project Modules

There are no modules declared in this project.

Presto Build Status

Presto is a distributed SQL query engine for big data.

See the User Manual for deployment instructions and end user documentation.

Requirements

  • Mac OS X or Linux
  • Java 8 Update 40 or higher (8u40+), 64-bit
  • Maven 3.2.3+ (for building)
  • Python 2.4+ (for running with the launcher script)

Building Presto

Presto is a standard Maven project. Simply run the following command from the project root directory:

mvn clean install

On the first build, Maven will download all the dependencies from the internet and cache them in the local repository (~/.m2/repository), which can take a considerable amount of time. Subsequent builds will be faster.

Presto has a comprehensive set of unit tests that can take several minutes to run. You can disable the tests when building:

mvn clean install -DskipTests

Running Presto in your IDE

Overview

After building Presto for the first time, you can load the project into your IDE and run the server. We recommend using IntelliJ IDEA. Because Presto is a standard Maven project, you can import it into your IDE using the root pom.xml file. In IntelliJ, choose Open Project from the Quick Start box or choose Open from the File menu and select the root pom.xml file.

After opening the project in IntelliJ, double check that the Java SDK is properly configured for the project:

  • Open the File menu and select Project Structure
  • In the SDKs section, ensure that a 1.8 JDK is selected (create one if none exist)
  • In the Project section, ensure the Project language level is set to 8.0 as Presto makes use of several Java 8 language features

Presto comes with sample configuration that should work out-of-the-box for development. Use the following options to create a run configuration:

  • Main Class: com.facebook.presto.server.PrestoServer
  • VM Options: -ea -Xmx2G -Dconfig=etc/config.properties -Dlog.levels-file=etc/log.properties
  • Working directory: $MODULE_DIR$
  • Use classpath of module: presto-main

The working directory should be the presto-main subdirectory. In IntelliJ, using $MODULE_DIR$ accomplishes this automatically.

Additionally, the Hive plugin must be configured with location of your Hive metastore Thrift service. Add the following to the list of VM options, replacing localhost:9083 with the correct host and port (or use the below value if you do not have a Hive metastore):

-Dhive.metastore.uri=thrift://localhost:9083

Using SOCKS for Hive or HDFS

If your Hive metastore or HDFS cluster is not directly accessible to your local machine, you can use SSH port forwarding to access it. Setup a dynamic SOCKS proxy with SSH listening on local port 1080:

ssh -v -N -D 1080 server

Then add the following to the list of VM options:

-Dhive.metastore.thrift.client.socks-proxy=localhost:1080

Running the CLI

Start the CLI to connect to the server and run SQL queries:

presto-cli/target/presto-cli-*-executable.jar

Run a query to see the nodes in the cluster:

SELECT * FROM system.runtime.nodes;

In the sample configuration, the Hive connector is mounted in the hive catalog, so you can run the following queries to show the tables in the Hive database default:

SHOW TABLES FROM hive.default;
uk.co.nichesolutions.presto

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Versions

Version
0.143-CUSTOM