elastic-map-reduce Questions
1
Solved
I am trying to enable bad input skipping on my Amazon Elastic MapReduce jobs. I am following the wonderful recipe described here:
http://devblog.factual.com/practical-hadoop-streaming-dealing-with...
Sec asked 22/8, 2012 at 10:48
1
Solved
I am trying to parse data from commoncrawl.org using hadoop streaming. I set up a local hadoop to test my code, and have a simple Ruby mapper which uses a streaming ARCfile reader. When I invoke my...
Chaste asked 25/1, 2012 at 8:32
3
Solved
I have tried a simple Map/Reduce task using Amazon Elastic MapReduce and it took just 3 mins to complete the task. Is it possible to re-use the same instance to run another task.
Even though I ha...
Poundfoolish asked 30/7, 2011 at 0:27
1
Solved
I've noticed that neither mrjob nor boto supports a Python interface to submit and run Hive jobs on Amazon Elastic MapReduce (EMR). Are there any other Python client libraries that supports running...
Arabia asked 23/5, 2011 at 22:36
2
Solved
When files are transferred to nodes using the distributed cache mechanism in a Hadoop streaming job, does the system delete these files after a job is completed? If they are deleted, which i presum...
Astrahan asked 19/12, 2010 at 15:57
© 2022 - 2024 — McMap. All rights reserved.