1. 概念
集群的意思是多台机器,最少有2台机器,一台机器从数据源中获取数据,将数据传送到另一台机器上,然后输出。接下来就要实现Flume集群搭建。集群如下图所示。
2. Flume搭建
2.1 部署准备
- 部署主机
-
192.168.9.139 host14
-
192.168.9.128 host15
-
host14主机下载flume软件包
-
# cd /opt/tools
-
# wget http://mirrors.tuna.tsinghua.edu.cn/apache/flume/1.7.0/apache-flume-1.7.0-bin.tar.gz
- 上传解压flume
-
# mkdir -p /apps/svr/flume/
-
# tar -zxvf /opt/tools/apache-flume-1.7.0-bin.tar.gz -C /apps/svr/flume/
2.2 部署Flume
部署的是集群,需要在2台机安装Flume,host14作为push推送数据,host15作为pull获取数据后显示出来。
- 修改配置文件
-
# cd /apps/svr/flume/apache-flume-1.7.0-bin/conf/
-
# cp flume-env.sh.template flume-env.sh
-
# cp flume-conf.properties.template flume-telent.conf
-
# vim flume-env.sh
-
export JAVA_HOME=/apps/svr/java/jdk1.8.0_172
- host15主机部署Flume
# scp -r /apps/svr/flume/ host15:/apps/svr/
- 验证flume
-
# /apps/svr/flume/apache-flume-1.7.0-bin/bin/flume-ng version
-
Flume 1.7.0
-
Source code repository: https://git-wip-us.apache.org/repos/asf/flume.git
-
Revision: 511d868555dd4d16e6ce4fedc72c2d1454546707
-
Compiled by bessbd on Wed Oct 12 20:51:10 CEST 2016
-
From source with checksum 0d21b3ffdc55a07e1d08875872c00523
2.3 Flume集群配置
- 配置push.conf
[host14]
# cd /apps/svr/flume/apache-flume-1.7.0-bin/conf
# vim push.conf
-
# Name the components on this agent
-
a2.sources= r1
-
a2.sinks= k1
-
a2.channels= c1
-
# Describe/configure the source
-
a2.sources.r1.type= spooldir
-
a2.sources.r1.spoolDir= /apps/svr/flume/logs
-
a2.sources.r1.channels= c1
-
# Use a channel which buffers events in memory
-
a2.channels.c1.type= memory
-
a2.channels.c1.keep-alive= 10
-
a2.channels.c1.capacity= 100000
-
a2.channels.c1.transactionCapacity= 100000
-
# Describe/configure the source
-
a2.sinks.k1.type= avro
-
a2.sinks.k1.channel= c1
-
a2.sinks.k1.hostname= host15
-
a2.sinks.k1.port= 8899
- 配置pull.conf
[host15]
# cd /apps/svr/flume/apache-flume-1.7.0-bin/conf
# vim pull.conf
-
# Name the components on this agent
-
a1.sources= r1
-
a1.sinks= k1
-
a1.channels= c1
-
# Describe/configure the source
-
a1.sources.r1.type= avro
-
a1.sources.r1.channels= c1
-
a1.sources.r1.bind= host15
-
a1.sources.r1.port= 8899
-
# Describe the sink
-
a1.sinks.k1.type= logger
-
a1.sinks.k1.channel = c1
-
# Use a channel which buffers events in memory
-
a1.channels.c1.type= memory
-
a1.channels.c1.keep-alive= 10
-
a1.channels.c1.capacity= 100000
-
a1.channels.c1.transactionCapacity= 100000
- 创建spoolDir目录
[host14]
# mkdir -p /apps/svr/flume/logs
2.4 Flume集群启动
- 启动pull主机
[host15]
# cd /apps/svr/flume/apache-flume-1.7.0-bin/
# ./bin/flume-ng agent -c conf -f conf/pull.conf -n a1 -Dflume.root.logger=INFO,console
显示如图所示则为启动成功
- 启动push主机
[host14]
# cd /apps/svr/flume/apache-flume-1.7.0-bin/
# ./bin/flume-ng agent -n a2 -c conf -f conf/push.conf -Dflume.root.logger=INFO,console
显示如图所示则为启动成功
- 验证连接
[host15]
显示如图所示表示连接成功
3. Flume测试
3.1 创建测试用例
[host14]
# cd /apps/svr/flume/logs/
# vim flume-use-case-test.log
-
HELLO WORLD!!!
-
HELLO FLUME!!!
3.2 验证测试
- pull主机
显示如图所示表示测试成功
- push主机
显示如图所示表示测试成功
结论:用例测试成功,证明Flume集群搭建成功。
原文地址:https://1csh1.github.io/2016/04/21/Flume%E9%9B%86%E7%BE%A4%E6%90%AD%E5%BB%BA/