cloudera Questions

4

Let's suppose we have a table: Owner | Pets ------------------------------ Jack | "dog, cat, crocodile" Mary | "bear, pig" I want to get as a result: Owner | Pets -----------------------------...
Lashawnda asked 23/5, 2016 at 19:38

7

Solved

We have the following string which is a valid JSON written to a file on HDFS. { "id":"tag:search.twitter.com,2005:564407444843950080", "objectType":"activity", "actor":{ "objectType":"perso...
Ripply asked 19/2, 2015 at 10:48

8

I am using Cloudera on a VM that I am playing around with. Unfortunately, I am having issues copying data to HDFS. I am getting the following: [cloudera@localhost ~]$ hadoop fs -mkdir i...
Meatman asked 27/3, 2014 at 1:31

8

I'm trying to run a very simple task with MapReduce. mapper.py: #!/usr/bin/env python import sys for line in sys.stdin: print line my txt file: qwerty asdfgh zxc Command line to run the job: ...
Muzzleloader asked 27/3, 2017 at 14:06

5

Solved

I am getting: Application application_1427711869990_0001 failed 2 times due to AM Container for appattempt_1427711869990_0001_000002 exited with exitCode: -1000 due to: Not able to initialize user...
Unconventionality asked 1/4, 2015 at 17:52

3

Solved

I installed HBase from the Cloudera 5.3.3 distribution, and when I run HBase everything seems to be working fine... When I try to assign hbase.master.port via /etc/hbase/conf/hbase-site.xml it does not ...
Peaked asked 3/7, 2015 at 9:36

4

I have a table pos.pos_inv in hdfs which is partitioned by yyyymm. Below is the query: select DATE_ADD(to_date(from_unixtime(unix_timestamp(Inv.actvydt, 'MM/dd/yyyy'))),5), to_date(from_unixtime...
Winthrop asked 20/8, 2016 at 1:23

4

I have been using Cloudera's hadoop (0.20.2). With this version, if I put a file into the file system, but the directory structure did not exist, it automatically created the parent directories: ...
Renie asked 7/5, 2014 at 16:41

2

Solved

Is there a way to alter the location that a database points to? I tried the following ways: alter database <my_db> set DBPROPERTIES('hive.warehouse.dir'='<new_hdfs_loc>'); alter data...
Spoon asked 1/6, 2015 at 16:11

1

From time to time, when developers run Hive queries, they get the following error (first log entry below). When I look through the Hive logs on the node, I see that Metastoreclient lost connectio...
Azpurua asked 28/4, 2016 at 16:24

1

Solved

We are testing our Hadoop applications as part of migrating from Hortonworks Data Platform (HDP v3.x) to Cloudera Data Platform (CDP) version 7.1. While testing, we found the issue below while trying t...
Vaporescence asked 13/4, 2021 at 7:26

4

From time to time I get the following errors in Cloudera Manager: This DataNode is not connected to one or more of its NameNode(s). and The Cloudera Manager agent got an unexpected response fr...
Gosnell asked 24/2, 2014 at 21:17

1

I have a Hive table created from a JSON file. CREATE external TABLE logan_test.t1 ( name string, start_time timestamp ) ROW FORMAT SERDE 'org.apache.hive.hcatalog.data.JsonSerDe' WITH SERDEPROPERTI...
Zoroaster asked 9/6, 2017 at 22:00

1

Solved

I worked on CDH a long time back (around 1 year ago) and am planning to start again. Now we have CDH and HDP, and Hortonworks has been acquired by Cloudera. Is HDP being developed actively? Or is CDH being...

1

I am trying to read a Parquet file, perform some operations on it, and save the result as Parquet on HDFS. I am doing it using Spark. While doing so I am getting the following exception. java.io.EOFEx...
Luz asked 22/4, 2016 at 11:25

2

Solved

I am trying to execute a Spark application built through Scala IDE through my standalone Spark service running on cloudera quickstart VM 5.3.0. My cloudera account JAVA_HOME is /usr/java/default ...
Sorrel asked 27/11, 2015 at 10:43

2

How can I extract the date from a timestamp value in Impala? E.g. time = 2018-04-11 16:05:19 should become 2018-04-11
Collide asked 24/6, 2018 at 20:19

2

Solved

I use Spark 1.6.0 with Cloudera 5.8.3. I have a DStream object and plenty of transformations defined on top of it, val stream = KafkaUtils.createDirectStream[...](...) val mappedStream = stream.tr...
Macrophage asked 1/2, 2017 at 12:59

3

Solved

I tried adding a new host to the cluster on CentOS. It fails on install and gives "Installation failed. Failed to detect root privileges" in the status. I know that Cloudera needs the user to have pass...
Ohara asked 27/1, 2015 at 13:33

2

Solved

I am very new to Docker and the Hadoop ecosystem. I have installed Docker on Ubuntu 16.04 and run the Hadoop image from Cloudera inside a new Docker container. But when I try to run any command in hdf...
Bouley asked 10/2, 2017 at 3:15

1

Solved

I want to find my Hive version. I am using the Cloudera Hadoop distribution. Is it possible to find the Hive version using the Hive editor?
Pickel asked 3/2, 2020 at 6:35

2

Solved

I recently upgraded my Cloudera environment from 5.8.x (hadoop 2.6.0, hdfs-1) to 6.3.x (hadoop 3.0.0, hdfs-1) and after some days of data loads with moveFromLocal, I just realized that the DFS Used...
Categorize asked 7/1, 2020 at 11:16

7

I'm on a W8 machine, where I use Python (Anaconda distribution) to connect to Impala in our Hadoop cluster using the Impyla package. Our hadoop cluster is secured via Kerberos. I have followed the ...
Uralite asked 24/1, 2016 at 15:27

8

Solved

I recently upgraded my cluster from Apache Hadoop 1.0 to CDH 4.4.0. I have a WebLogic server on another machine from which I submit jobs to this remote cluster via the MapReduce client. I still want to u...
Gorey asked 27/9, 2013 at 6:21

4

I know there have been many posts regarding this exception, but I am not able to fix this issue. I think the classpath has to be edited to resolve it. I am trying to run a program called DistMap in ha...
Cisco asked 24/9, 2013 at 6:05
