@InterfaceAudience.Public public class MultiTableOutputFormat extends org.apache.hadoop.mapreduce.OutputFormat<ImmutableBytesWritable,Mutation>
Hadoop output format that writes to one or more HBase tables. The key is
taken to be the table name while the output value must be either a
Put
or a Delete
instance. All tables must already exist, and
all Puts and Deletes must reference only valid column families.
Write-ahead logging (WAL) for Puts can be disabled by setting
WAL_PROPERTY
to WAL_OFF
. Default value is WAL_ON
.
Note that disabling write-ahead logging is only appropriate for jobs where
loss of data due to region server failure can be tolerated (for example,
because it is easy to rerun a bulk import).
Modifier and Type | Class and Description |
---|---|
protected static class |
MultiTableOutputFormat.MultiTableRecordWriter
Record writer for outputting to multiple HTables.
|
Modifier and Type | Field and Description |
---|---|
static boolean |
WAL_OFF
Property value to disable write-ahead logging
|
static boolean |
WAL_ON
Property value to use write-ahead logging
|
static String |
WAL_PROPERTY
Set this to
WAL_OFF to turn off write-ahead logging (WAL) |
Constructor and Description |
---|
MultiTableOutputFormat() |
Modifier and Type | Method and Description |
---|---|
void |
checkOutputSpecs(org.apache.hadoop.mapreduce.JobContext context) |
org.apache.hadoop.mapreduce.OutputCommitter |
getOutputCommitter(org.apache.hadoop.mapreduce.TaskAttemptContext context) |
org.apache.hadoop.mapreduce.RecordWriter<ImmutableBytesWritable,Mutation> |
getRecordWriter(org.apache.hadoop.mapreduce.TaskAttemptContext context) |
public static final String WAL_PROPERTY
WAL_OFF
to turn off write-ahead logging (WAL)public static final boolean WAL_ON
public static final boolean WAL_OFF
public void checkOutputSpecs(org.apache.hadoop.mapreduce.JobContext context) throws IOException, InterruptedException
checkOutputSpecs
in class org.apache.hadoop.mapreduce.OutputFormat<ImmutableBytesWritable,Mutation>
IOException
InterruptedException
public org.apache.hadoop.mapreduce.OutputCommitter getOutputCommitter(org.apache.hadoop.mapreduce.TaskAttemptContext context) throws IOException, InterruptedException
getOutputCommitter
in class org.apache.hadoop.mapreduce.OutputFormat<ImmutableBytesWritable,Mutation>
IOException
InterruptedException
public org.apache.hadoop.mapreduce.RecordWriter<ImmutableBytesWritable,Mutation> getRecordWriter(org.apache.hadoop.mapreduce.TaskAttemptContext context) throws IOException, InterruptedException
getRecordWriter
in class org.apache.hadoop.mapreduce.OutputFormat<ImmutableBytesWritable,Mutation>
IOException
InterruptedException
Copyright © 2007–2019 Cloudera. All rights reserved.