Hadoop Series (Part 2)
- II. Hadoop configuration parameters
- 1. etc/hadoop/core-site.xml
- 2. etc/hadoop/hdfs-site.xml
- 3. etc/hadoop/mapred-site.xml
II. Hadoop configuration parameters
http://hadoop.apache.org/docs/r2.7.3/hadoop-project-dist/hadoop-common/ClusterSetup.html
TODO: organize the individual configuration parameters
1. etc/hadoop/core-site.xml

| Parameter | Value | Notes |
| --- | --- | --- |
| fs.defaultFS | NameNode URI | |
| io.file.buffer.size | 131072 | Size of read/write buffer used in SequenceFiles. |
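
As a rough illustration of how the two parameters above might appear in the file, here is a minimal core-site.xml sketch. The host name `namenode.example.com` and port `9000` are placeholders chosen for this example, not values from the table; substitute the address of your own NameNode.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Assumption: host and port below are placeholders for your NameNode. -->
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://namenode.example.com:9000</value>
  </property>
  <!-- 131072 bytes = 128 KiB read/write buffer, as listed in the table. -->
  <property>
    <name>io.file.buffer.size</name>
    <value>131072</value>
  </property>
</configuration>
```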
2. etc/hadoop/hdfs-site.xml

| Parameter | Value | Notes |
| --- | --- | --- |
| dfs.namenode.name.dir | Path on the local filesystem where the NameNode stores the namespace and transaction logs persistently. | If this is a comma-delimited list of directories, then the name table is replicated in all of the directories, for redundancy. |
| dfs.hosts / dfs.hosts.exclude | List of permitted/excluded DataNodes. | If necessary, use these files to control the list of allowable DataNodes. |
| dfs.blocksize | 268435456 | HDFS block size of 256 MB for large file systems. |
| dfs.namenode.handler.count | 100 | More NameNode server threads to handle RPCs from a large number of DataNodes. |
| dfs.datanode.data.dir | Comma-separated list of paths on the local filesystem of a DataNode where it should store its blocks. | If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. |
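
The settings above could be combined into a single hdfs-site.xml along the following lines. This is only a sketch: every directory and file path shown (`/data/1/dfs/nn`, `/etc/hadoop/conf/dfs.hosts`, and so on) is a hypothetical placeholder, while the numeric values are the ones from the table. The NameNode reads `dfs.namenode.name.dir` and the DataNodes read `dfs.datanode.data.dir`, so sharing one file across both roles should be harmless.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Assumption: two NameNode metadata directories, ideally on separate disks. -->
  <property>
    <name>dfs.namenode.name.dir</name>
    <value>/data/1/dfs/nn,/data/2/dfs/nn</value>
  </property>
  <!-- Assumption: hypothetical include/exclude files listing DataNode hosts. -->
  <property>
    <name>dfs.hosts</name>
    <value>/etc/hadoop/conf/dfs.hosts</value>
  </property>
  <property>
    <name>dfs.hosts.exclude</name>
    <value>/etc/hadoop/conf/dfs.hosts.exclude</value>
  </property>
  <!-- 268435456 bytes = 256 * 1024 * 1024, i.e. a 256 MB block size. -->
  <property>
    <name>dfs.blocksize</name>
    <value>268435456</value>
  </property>
  <property>
    <name>dfs.namenode.handler.count</name>
    <value>100</value>
  </property>
  <!-- Assumption: DataNode block directories, one per physical device. -->
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>/data/1/dfs/dn,/data/2/dfs/dn</value>
  </property>
</configuration>
```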
3. etc/hadoop/mapred-site.xml

| Parameter | Value | Notes |
| --- | --- | --- |
| mapreduce.framework.name | yarn | Execution framework set to Hadoop YARN. |
| mapreduce.map.memory.mb | 1536 | Larger resource limit for maps. |
| mapreduce.map.java.opts | -Xmx1024M | Larger heap size for child JVMs of maps. |
| mapreduce.reduce.memory.mb | 3072 | Larger resource limit for reduces. |
| mapreduce.reduce.java.opts | -Xmx2560M | Larger heap size for child JVMs of reduces. |
| mapreduce.task.io.sort.mb | 512 | Higher memory limit while sorting data for efficiency. |
| mapreduce.task.io.sort.factor | 100 | More streams merged at once while sorting files. |
| mapreduce.reduce.shuffle.parallelcopies | 50 | Higher number of parallel copies run by reduces to fetch outputs from a very large number of maps. |
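
A mapred-site.xml carrying the application settings above might look like the sketch below; all values are taken directly from the table. Note that in each pair the JVM heap (`-Xmx`) is smaller than the container limit (1024 MB inside 1536 MB for maps, 2560 MB inside 3072 MB for reduces), which leaves headroom for non-heap memory inside the container.

```xml
<?xml version="1.0"?>
<configuration>
  <property>
    <name>mapreduce.framework.name</name>
    <value>yarn</value>
  </property>
  <!-- Container limit for map tasks, with a smaller JVM heap inside it. -->
  <property>
    <name>mapreduce.map.memory.mb</name>
    <value>1536</value>
  </property>
  <property>
    <name>mapreduce.map.java.opts</name>
    <value>-Xmx1024M</value>
  </property>
  <!-- Container limit for reduce tasks, with a smaller JVM heap inside it. -->
  <property>
    <name>mapreduce.reduce.memory.mb</name>
    <value>3072</value>
  </property>
  <property>
    <name>mapreduce.reduce.java.opts</name>
    <value>-Xmx2560M</value>
  </property>
  <!-- Sort and shuffle tuning. -->
  <property>
    <name>mapreduce.task.io.sort.mb</name>
    <value>512</value>
  </property>
  <property>
    <name>mapreduce.task.io.sort.factor</name>
    <value>100</value>
  </property>
  <property>
    <name>mapreduce.reduce.shuffle.parallelcopies</name>
    <value>50</value>
  </property>
</configuration>
```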

MapReduce JobHistory Server settings:

| Parameter | Value | Notes |
| --- | --- | --- |
| mapreduce.jobhistory.address | MapReduce JobHistory Server host:port | Default port is 10020. |
| mapreduce.jobhistory.webapp.address | MapReduce JobHistory Server Web UI host:port | Default port is 19888. |
| mapreduce.jobhistory.intermediate-done-dir | /mr-history/tmp | Directory where history files are written by MapReduce jobs. |
| mapreduce.jobhistory.done-dir | /mr-history/done | Directory where history files are managed by the MR JobHistory Server. |
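
The JobHistory Server properties would normally sit in the same mapred-site.xml. The fragment below is a sketch only: `historyserver.example.com` is a placeholder host invented for this example, while the ports and directories are the ones listed above.

```xml
<?xml version="1.0"?>
<configuration>
  <!-- Assumption: historyserver.example.com is a placeholder host name. -->
  <property>
    <name>mapreduce.jobhistory.address</name>
    <value>historyserver.example.com:10020</value>
  </property>
  <property>
    <name>mapreduce.jobhistory.webapp.address</name>
    <value>historyserver.example.com:19888</value>
  </property>
  <!-- Directories for in-progress and completed job history files. -->
  <property>
    <name>mapreduce.jobhistory.intermediate-done-dir</name>
    <value>/mr-history/tmp</value>
  </property>
  <property>
    <name>mapreduce.jobhistory.done-dir</name>
    <value>/mr-history/done</value>
  </property>
</configuration>
```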