apache-spark-1.5

3

Solved

This might be a very simple question. But is there any simple way to measure the execution time of a spark job (submitted using spark-submit)? It would help us in profiling the spark jobs based on...

apache-spark apache-spark-mllib apache-spark-1.5

Fluorene asked 30/4, 2016 at 0:28

3

Solved

Convert null values to empty array in Spark DataFrame

I have a Spark data frame where one column is an array of integers. The column is nullable because it is coming from a left outer join. I want to convert all null values to an empty array so I don'...

apache-spark dataframe apache-spark-sql apache-spark-1.5

Schwinn asked 7/1, 2016 at 16:55

3

Solved

How to transpose dataframe in Spark 1.5 (no pivot operator available)?

I want to transpose following table using spark scala without Pivot function I am using Spark 1.5.1 and Pivot function does not support in 1.5.1. Please suggest suitable method to transpose follo...

scala apache-spark apache-spark-sql apache-spark-1.5

Tepee asked 25/3, 2016 at 7:26

6

Solved

"INSERT INTO ..." with SparkSQL HiveContext

I'm trying to run an insert statement with my HiveContext, like this: hiveContext.sql('insert into my_table (id, score) values (1, 10)') The 1.5.2 Spark SQL Documentation doesn't explicitly sta...

apache-spark apache-spark-sql pyspark apache-spark-1.5 hivecontext

Reamer asked 25/11, 2015 at 17:55

2

Solved

How to limit decimal values to 2 digits before applying agg function?

I am following this solution from one of the stack overflow post, my only requirement here is how can I limit the values that I want to sum to 2 digit after the decimal before applying the df.agg(s...

scala apache-spark apache-spark-sql apache-spark-1.5

Boletus asked 17/1, 2017 at 19:40

1

Solved

How to get Precision/Recall using CrossValidator for training NaiveBayes Model using Spark

Supossed I have a Pipeline like this: val tokenizer = new Tokenizer().setInputCol("tweet").setOutputCol("words") val hashingTF = new HashingTF().setNumFeatures(1000).setInputCol("words").setOutpu...

apache-spark apache-spark-mllib apache-spark-ml apache-spark-1.5

Divided asked 12/6, 2016 at 19:59

2

Solved

Passing additional jars to Spark via spark-submit

I'm using Spark with MongoDB, and consequently rely on the mongo-hadoop drivers. I got things working thanks to input on my original question here. My Spark job is running, however, I receive warn...

mongodb hadoop jar apache-spark apache-spark-1.5

Nablus asked 27/11, 2015 at 16:43

1

Solved

Save Spark Dataframe into Elasticsearch - Can’t handle type exception

I have designed a simple job to read data from MySQL and save it in Elasticsearch with Spark. Here is the code: JavaSparkContext sc = new JavaSparkContext( new SparkConf().setAppName("MySQLtoEs"...

elasticsearch apache-spark elasticsearch-hadoop apache-spark-1.5

Depopulate asked 19/9, 2015 at 10:21

1

Saving / exporting transformed DataFrame back to JDBC / MySQL

I'm trying to figure out how to use the new DataFrameWriter to write data back to a JDBC database. I can't seem to find any documentation for this, although looking at the source code it seems like...

apache-spark apache-spark-sql apache-spark-1.5

Embroidery asked 16/9, 2015 at 23:6

1

Solved

How to connect Zeppelin to Spark 1.5 built from the sources?

I pulled the latest source from the Spark repository and built locally. It works great from an interactive shell like spark-shell or spark-sql. Now I want to connect Zeppelin to my Spark 1.5, acco...

apache-spark apache-zeppelin apache-spark-1.5

Landowska asked 30/8, 2015 at 7:28

apache-spark-1.5 Questions

Recommended topics

Hot tags