elastic-map-reduce Questions

1

Solved

I am trying to enable bad input skipping on my Amazon Elastic MapReduce jobs. I am following the wonderful recipe described here: http://devblog.factual.com/practical-hadoop-streaming-dealing-with...
Sec asked 22/8, 2012 at 10:48

1

Solved

I am trying to parse data from commoncrawl.org using hadoop streaming. I set up a local hadoop to test my code, and have a simple Ruby mapper which uses a streaming ARCfile reader. When I invoke my...
Chaste asked 25/1, 2012 at 8:32

3

Solved

I have tried a simple Map/Reduce task using Amazon Elastic MapReduce and it took just 3 mins to complete the task. Is it possible to re-use the same instance to run another task. Even though I ha...
Poundfoolish asked 30/7, 2011 at 0:27

1

Solved

I've noticed that neither mrjob nor boto supports a Python interface to submit and run Hive jobs on Amazon Elastic MapReduce (EMR). Are there any other Python client libraries that supports running...
Arabia asked 23/5, 2011 at 22:36

2

Solved

When files are transferred to nodes using the distributed cache mechanism in a Hadoop streaming job, does the system delete these files after a job is completed? If they are deleted, which i presum...
Astrahan asked 19/12, 2010 at 15:57

© 2022 - 2024 — McMap. All rights reserved.