HDFS Questions

17

I am running Spark on Windows 7. When I use Hive, I see the following error: The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rw-rw-rw- The permissions are set ...
Bronchitis asked 10/12, 2015 at 7:46

4

I upgraded to the latest version of Cloudera. Now I am trying to create a directory in HDFS: hadoop fs -mkdir data. I am getting the following error: Cannot Create /user/cloudera/data. Name Node is in Safe...
Cockcroft asked 10/6, 2017 at 3:30

4

Solved

How do you, through Java, list all files (recursively) under a certain path in HDFS? I went through the API and noticed FileSystem.listFiles(Path, boolean), but it looks like that method doesn't exis...
Dunghill asked 8/6, 2012 at 0:51

9

Solved

I have 3 data nodes running. While running a job I am getting the following error: java.io.IOException: File /user/ashsshar/olhcache/loaderMap9b663bd9 could only be replicated to 0 ...
Drinkable asked 22/3, 2013 at 13:29

0

In my Java application I have an implementation of a file-system layer, where my file class is a wrapper for Hadoop filesystem methods. I am upgrading from hadoop3-1.9.17 to hadoop3-2.2.8 and ...

12

Solved

I know du -sh in common Linux filesystems. But how do I do that with HDFS?
Dysuria asked 28/6, 2011 at 9:7
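
The HDFS analogue of `du -sh` is `hdfs dfs -du -s -h <path>`. A minimal Python wrapper around that command line (a sketch; it assumes the `hdfs` CLI is on the PATH):

```python
import subprocess

def hdfs_du_command(path, summarize=True, human_readable=True):
    """Build the `hdfs dfs -du` command line for `path`."""
    cmd = ["hdfs", "dfs", "-du"]
    if summarize:
        cmd.append("-s")   # one summarized total, like du -s
    if human_readable:
        cmd.append("-h")   # sizes in K/M/G, like du -h
    cmd.append(path)
    return cmd

# On a machine with HDFS configured:
# subprocess.run(hdfs_du_command("/user/someuser"), check=True)
```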

2

I have Spark code which saves a dataframe to an HDFS location (a date-partitioned location) in JSON format using append mode: df.write.mode("append").format('json').save(hdfsPath) Sample HDFS locat...
Detrimental asked 3/9, 2019 at 18:15

4

Solved

I am trying to connect to a Spark cluster running within a virtual machine with IP 10.20.30.50 on port 7077 from within a Java application and run the word count example: SparkConf conf = new SparkC...
Agential asked 5/11, 2016 at 14:58

8

Is it possible to save a pandas data frame directly to a parquet file? If not, what would be the suggested process? The aim is to be able to send the parquet file to another team, which they can ...
Ingeminate asked 9/12, 2016 at 18:20

6

Solved

-put and -copyFromLocal are documented as identical, while most examples use the verbose variant -copyFromLocal. Why? The same goes for -get and -copyToLocal.
Belvabelvedere asked 18/10, 2011 at 17:29

9

I am new to Spark and I have a question. I have a two-step process in which the first step writes a SUCCESS.txt file to a location on HDFS. My second step, which is a Spark job, has to verify if that ...
Melloney asked 22/5, 2015 at 20:55
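
One common way to gate the second step is to ask the `hdfs` CLI whether the marker file exists: `hdfs dfs -test -e <path>` exits 0 if and only if it does. A sketch (the path and the second-step function are hypothetical):

```python
import subprocess

def hdfs_test_command(path):
    """Build the existence-test command line for `path`."""
    return ["hdfs", "dfs", "-test", "-e", path]

def hdfs_path_exists(path):
    """Return True if `path` exists in HDFS, using the hdfs CLI."""
    return subprocess.run(hdfs_test_command(path)).returncode == 0

# if hdfs_path_exists("/jobs/step1/SUCCESS.txt"):
#     run_spark_step()   # hypothetical second step
```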

5

Solved

I'm trying to run a spark application using bin/spark-submit. When I reference my application jar inside my local filesystem, it works. However, when I copied my application jar to a directory in h...
Epiphysis asked 26/2, 2015 at 10:18

2

Could somebody give me a hint on how I can copy a file from a local filesystem to an HDFS filesystem using PyArrow's new filesystem interface (i.e. upload, copyFromLocal)? I have read the documentat...
Ejective asked 28/7, 2021 at 11:11

2

I am currently developing a Flink 1.4 application that reads an Avro file from a Hadoop cluster. Running it in local mode in my IDE works perfectly fine, but when I submit it to the Jobman...
Smoothtongued asked 14/2, 2018 at 10:12

1

The following error occurs when reading a Parquet file from HDFS: 2020-06-04 14:11:23 WARN TaskSetManager:66 - Lost task 44.0 in stage 1.0 (TID 3514, 192.168.16.41, executor 1): java.lang.Runti...
Reed asked 4/6, 2020 at 14:28

6

Solved

I am new to NoSQL solutions and want to play with Hive. But installing HDFS/Hadoop takes a lot of resources and time (maybe because of my inexperience, but I have no time to do this). Are there ways to i...
Erebus asked 24/1, 2014 at 10:10

5

Solved

Now I have some Spark applications which store output to HDFS. Since our Hadoop cluster consists of a NameNode H/A pair, and the Spark cluster is outside the Hadoop cluster (I know that is bad practice), I...
Naphthyl asked 12/6, 2015 at 6:52
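
The usual fix is to give the Spark side the NameNode-HA client settings, so it resolves the logical nameservice instead of a single physical host. A sketch of the properties involved (the nameservice `ns1` and NameNode ids/hosts below are placeholders):

```python
def ha_client_conf(nameservice, namenodes):
    """Return the dfs.* client properties for an HA nameservice.

    `namenodes` maps a logical id (e.g. "nn1") to "host:rpc-port".
    """
    conf = {
        "fs.defaultFS": f"hdfs://{nameservice}",
        "dfs.nameservices": nameservice,
        f"dfs.ha.namenodes.{nameservice}": ",".join(namenodes),
        f"dfs.client.failover.proxy.provider.{nameservice}":
            "org.apache.hadoop.hdfs.server.namenode.ha."
            "ConfiguredFailoverProxyProvider",
    }
    for nn_id, addr in namenodes.items():
        conf[f"dfs.namenode.rpc-address.{nameservice}.{nn_id}"] = addr
    return conf

# Applying to a running SparkContext (hypothetical `sc`):
# hconf = sc._jsc.hadoopConfiguration()
# for k, v in ha_client_conf("ns1", {"nn1": "host1:8020",
#                                    "nn2": "host2:8020"}).items():
#     hconf.set(k, v)
```

The same key/value pairs can equally live in an `hdfs-site.xml` shipped with the Spark application.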

12

I'm getting the following error when attempting to write to HDFS as part of my multi-threaded application: could only be replicated to 0 nodes instead of minReplication (=1). There are 1 datanode(s...
Auscultation asked 15/3, 2016 at 15:42

4

I have been using Cloudera's Hadoop (0.20.2). With this version, if I put a file into the file system but the directory structure did not exist, it automatically created the parent directories: ...
Renie asked 7/5, 2014 at 16:41
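
In Hadoop 2 and later the shell no longer creates missing parents implicitly; instead `-mkdir` gained a `-p` flag (like `mkdir -p`) to request them explicitly. A sketch of the call (assumes the `hadoop` CLI is on the PATH):

```python
import subprocess

def hdfs_mkdir_command(path, parents=True):
    """Build the `hadoop fs -mkdir` command line for `path`."""
    cmd = ["hadoop", "fs", "-mkdir"]
    if parents:
        cmd.append("-p")   # create missing parent directories
    cmd.append(path)
    return cmd

# On a machine with Hadoop configured:
# subprocess.run(hdfs_mkdir_command("/user/x/a/b/c"), check=True)
```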

7

Solved

I have a big distributed file on HDFS, and each time I use sqlContext with the spark-csv package, it first loads the entire file, which takes quite some time. df = sqlContext.read.format('com.databricks...
Larva asked 31/5, 2017 at 6:15

2

Solved

Is there a way to alter the location that a database points to? I tried the following ways: alter database <my_db> set DBPROPERTIES('hive.warehouse.dir'='<new_hdfs_loc>'); alter data...
Spoon asked 1/6, 2015 at 16:11

8

Solved

Are they supposed to be equal? If so, why do the "hadoop fs" commands show the HDFS files while the "hdfs dfs" commands show the local files? Here is the Hadoop version information: Hadoop 2.0.0-mr...
Ossicle asked 9/8, 2013 at 8:37

4

Solved

I'm storing files on HDFS in Snappy compression format. I'd like to be able to examine these files on my local Linux file system to make sure that the Hadoop process that created them has performed...
Macrocosm asked 21/5, 2013 at 16:23

28

I have Hadoop installed in this location: /usr/local/hadoop. Now I want to list the files in DFS. The command I used is: hduser@ubuntu:/usr/local/hadoop$ bin/hadoop dfs -ls This g...
Karlynkarma asked 26/3, 2014 at 7:53

2

Solved

I'm using the example in this link here to copy contents from one directory in HDFS to another directory in HDFS. The copying of the file works, but it creates a new subdirectory in the target vs. just...
Leakage asked 21/5, 2017 at 5:1

© 2022 - 2024 — McMap. All rights reserved.