hadoop系列（二） ^有更新！

2018-02-26 | 0 评论 | 2,426 浏览

37套精品Java架构师高并发高性能高可用分布式集群电商缓存性能调优设计项目实战视教程置顶！有更新！
二、hadoop参数配置说明
1. etc/hadoop/core-site.xml
2. etc/hadoop/hdfs-site.xml
3. etc/hadoop/mapred-site.xml

37套精品Java架构师高并发高性能高可用分布式集群电商缓存性能调优设计项目实战视教程置顶！有更新！

二、hadoop参数配置说明

http://hadoop.apache.org/docs/r2.7.3/hadoop-project-dist/hadoop-common/ClusterSetup.html

TODO :整理各个配置参数

1. etc/hadoop/core-site.xml

Parameter	Value	Notes
fs.defaultFS	NameNode URI	hdfs://host:port/
io.file.buffer.size	131072	Size of read/write buffer used in SequenceFiles.

2. etc/hadoop/hdfs-site.xml

Parameter	Value	Notes
dfs.namenode.name.dir	Path on the local filesystem where the NameNode stores the namespace and transactions logs persistently.	If this is a comma-delimited list of directories then the name table is replicated in all of the directories, for redundancy.
dfs.hosts / dfs.hosts.exclude	List of permitted/excluded DataNodes.	If necessary, use these files to control the list of allowable datanodes.
dfs.blocksize	268435456	HDFS blocksize of 256MB for large file-systems.
dfs.namenode.handler.count	100	More NameNode server threads to handle RPCs from large number of DataNodes.
dfs.datanode.data.dir	Comma separated list of paths on the local filesystem of a DataNode where it should store its blocks.	If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices.

3. etc/hadoop/mapred-site.xml

Parameter	Value		Notes
mapreduce.framework.name	yarn		Execution framework set to Hadoop YARN.
mapreduce.map.memory.mb	1536		Larger resource limit for maps.
mapreduce.map.java.opts	-Xmx1024M		Larger heap-size for child jvms of maps.
mapreduce.reduce.memory.mb	3072		Larger resource limit for reduces.
mapreduce.reduce.java.opts	-Xmx2560M		Larger heap-size for child jvms of reduces.
mapreduce.task.io.sort.mb	512		Higher memory-limit while sorting data for efficiency.
mapreduce.task.io.sort.factor	100		More streams merged at once while sorting files.
mapreduce.reduce.shuffle.parallelcopies	50		Higher number of parallel copies run by reduces to fetch outputs from very large number of maps.
Parameter		Value		Notes
mapreduce.jobhistory.address		MapReduce JobHistory Server host:port		Default port is 10020.
mapreduce.jobhistory.webapp.address		MapReduce JobHistory Server Web UI host:port		Default port is 19888.
mapreduce.jobhistory.intermediate-done-dir		/mr-history/tmp		Directory where history files are written by MapReduce jobs.
mapreduce.jobhistory.done-dir		/mr-history/done		Directory where history files are managed by the MR JobHistory Server.

评论

发表评论