25 Jan 2011 05:45
Hadoop is ignoring my hadoop.tmp.dir variable
Greetings,
I'm encountering the following problem with Hadoop (0.20.2): for
whatever reason, Hadoop seems to be completely
ignoring my hadoop.tmp.dir variable. I expect I've just made some
stupid mistake, but I'm a bit stuck. Can anyone offer any suggestions?
From my core-site.xml:
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
<property>
<name>hadoop.tmp.dir</name>
<value>/media/hadoop-tmp/</value>
</property>
...
However, when I run a job, the spill data is written to
/tmp/hadoop-hadoopuser, and printing
configuration.get("hadoop.tmp.dir") confirms that this is indeed the
temporary directory in use.
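
One way to narrow this down is to check which core-site.xml the job's JVM can actually see. Hadoop looks up core-site.xml as a classpath resource, so if the edited file is not the one on the classpath, the default value is used instead. The following is only a rough sketch using the plain JDK (no Hadoop classes); the class name TmpDirCheck and the helper lookup are made up for illustration, and it reads a single file rather than reproducing Hadoop's full resource-merging logic.

```java
import java.io.InputStream;
import java.net.URL;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical diagnostic helper, not part of Hadoop.
public class TmpDirCheck {

    /** Returns the <value> of the named <property> in a Hadoop-style config stream, or null if absent. */
    static String lookup(InputStream xml, String key) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().parse(xml);
        NodeList props = doc.getElementsByTagName("property");
        for (int i = 0; i < props.getLength(); i++) {
            Element p = (Element) props.item(i);
            NodeList names = p.getElementsByTagName("name");
            NodeList values = p.getElementsByTagName("value");
            if (names.getLength() == 0 || values.getLength() == 0) {
                continue; // skip malformed <property> entries
            }
            if (key.equals(names.item(0).getTextContent().trim())) {
                return values.item(0).getTextContent().trim();
            }
        }
        return null;
    }

    public static void main(String[] args) throws Exception {
        // Resolve core-site.xml the same way any classpath resource is resolved.
        URL url = Thread.currentThread().getContextClassLoader().getResource("core-site.xml");
        if (url == null) {
            System.out.println("core-site.xml is not on the classpath");
            return;
        }
        System.out.println("core-site.xml found at: " + url);
        try (InputStream in = url.openStream()) {
            System.out.println("hadoop.tmp.dir = " + lookup(in, "hadoop.tmp.dir"));
        }
    }
}
```

Run it with the same classpath as the job. If it reports no core-site.xml, or a different file from the one that was edited, that would explain why the /tmp default is still in effect.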
I believe this is the cause of another error I am running into:
11/01/24 20:46:17 WARN mapred.LocalJobRunner: job_local_0002
org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find
taskTracker/jobcache/job_local_0002/attempt_local_0002_m_000000_0/output/file.out
in any of the configured