Ambari Metrics Installation (1.0.0+)
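# Installing the Ambari Metrics Service with Ambari
This guide walks through installing and configuring the Ambari Metrics service from the Ambari console. Ambari Metrics is the monitoring system that ships with Ambari. It visualizes cluster resources, service components, and node status, so you can follow the health of your big data platform in real time.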
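# 1. Open the Add Service entry
Log in to the Ambari console, click the "..." button in the upper-left corner of the page to open the actions menu, and select Add Service to start adding a service.
Tip: New services can be added to an existing cluster on demand, and doing so does not affect other services that are already running.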
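# 2. Select the Ambari Metrics service
On the service selection page, find and check Ambari Metrics, then click Next to continue.
Note: Ambari Metrics is a core monitoring component that Ambari supports natively, so there is no extra plugin to download or integrate.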
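# 3. Assign components to hosts
On this page you assign each component to specific nodes:
- Metrics Collector: receives and aggregates metric data from all nodes
- Grafana: can be deployed on any management node
- Metrics Monitor: should be installed on every host, where it collects system-level metrics for that host (CPU, memory, disk, and so on)
Warning: If Metrics Monitor is not deployed on a host, that host reports no monitoring data, which can leave charts incomplete.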
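
The "Metrics Monitor on every host" rule is easy to check mechanically before you click Next. The following is a minimal sketch, not part of Ambari: the function name and the host list are hypothetical, and `hadoop3` is an invented host added only to show a gap.

```python
def hosts_missing_monitor(all_hosts, monitor_hosts):
    """Return the sorted hosts that have no Metrics Monitor assigned."""
    return sorted(set(all_hosts) - set(monitor_hosts))


# Hypothetical three-node cluster; hadoop3 was left out of the assignment.
all_hosts = ["hadoop1", "hadoop2", "hadoop3"]
monitor_hosts = ["hadoop1", "hadoop2"]

missing = hosts_missing_monitor(all_hosts, monitor_hosts)
print(missing)
```

Any host this check reports would have no data in the dashboards, so add Metrics Monitor to it before continuing.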
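# 4. Parameter configuration (optional)
On the "Customize Services" page you can adjust the Metrics service settings, including:
- The Grafana login account and password
- Collector startup configuration
- HA mode parameters
For HA, the parameter timeline.metrics.service.watcher.disabled controls whether the Watcher is enabled. When it is set to false, the Watcher monitors the Collector's running state, which helps the Collector recover automatically.
If you deploy Collector HA, it is recommended to set the Collector's listen address to 0.0.0.0:6188 so that other nodes can reach it.
Tip: If you are not configuring high availability, keep the default settings and click Next to skip this page.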
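# 5. Confirm the components after installation
After the installation succeeds, click "View Details" on the page to open the service detail page and confirm that the following components were deployed:
- Metrics Collector
- Grafana
- Metrics Monitor (should be present on every host)
Note: Click a task to expand its detailed execution log. If the deployment fails, use this log to troubleshoot common problems such as configuration errors or host network issues.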
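# 6. Return to the home page
Once deployment starts, Ambari shows the installation log and progress of each component in real time, so you can check whether each task completes successfully.
Tip: The service is running normally when every component status is green (Started).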
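
The "all green" rule can be expressed as a small check over the component states. This is a minimal sketch on hand-written data: the `STARTED` and `INSTALLED` state strings and the host assignments are assumptions modeled on the "Started" label in the console, not output captured from Ambari.

```python
def not_started(component_states):
    """Return (component, host, state) tuples whose state is not STARTED."""
    return [
        (name, host, state)
        for (name, host), state in sorted(component_states.items())
        if state != "STARTED"
    ]


# Hypothetical snapshot keyed by (component, host).
states = {
    ("Metrics Collector", "hadoop1"): "STARTED",
    ("Grafana", "hadoop1"): "STARTED",
    ("Metrics Monitor", "hadoop1"): "STARTED",
    ("Metrics Monitor", "hadoop2"): "INSTALLED",  # installed but not running
}

problems = not_started(states)
print(problems)
```

An empty result corresponds to every component showing green in the console.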
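# 7. Check the running status
Back on the home page, Ambari shows the overall running status of the Metrics service.
Click Metrics to open the detail page, where you can inspect each component's logs, status, listening ports, and other information.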
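# 8. Enable Collector high availability (optional)
Set timeline.metrics.service.watcher.disabled=false. This parameter controls whether the Watcher is enabled. When it is set to false, the Watcher monitors the Collector's running state, which helps the Collector recover automatically.
If you deploy Collector HA, it is recommended to set the Collector's listen address to 0.0.0.0:6188 so that other nodes can reach it.
Then open the Hosts page, choose the target host, and add a Metrics Collector component to it.
Finally, restart the Ambari Metrics service.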
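
Before restarting, it can help to sanity-check the two HA-related values. This is a minimal sketch that assumes the settings have been exported as plain `key=value` text. The helper names are hypothetical, and only the `timeline.metrics.service.watcher.disabled` key and the `0.0.0.0:6188` address come from this guide.

```python
def parse_properties(text):
    """Parse simple key=value lines, ignoring blank lines and # comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def watcher_enabled(props):
    """The Watcher is on only when the 'disabled' flag is explicitly false.

    A missing key is treated as "not enabled" here; that is a choice made
    for this sketch, not a statement about the real default.
    """
    flag = props.get("timeline.metrics.service.watcher.disabled", "")
    return flag.lower() == "false"


def split_address(address):
    """Split 'host:port' into (host, port) with the port as an int."""
    host, _, port = address.rpartition(":")
    return host, int(port)


def listens_on_all_interfaces(address):
    """True when the address binds to 0.0.0.0, so other nodes can connect."""
    host, _ = split_address(address)
    return host == "0.0.0.0"


props = parse_properties("timeline.metrics.service.watcher.disabled=false")
print(watcher_enabled(props))
print(split_address("0.0.0.0:6188"))
print(listens_on_all_interfaces("0.0.0.0:6188"))
```

A loopback address such as `127.0.0.1:6188` would fail the last check, which is the situation the HA recommendation is meant to avoid.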
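# 9. Verify the Collector startup log
Inspect the startup output in /var/log/ambari-metrics-collector/ambari-metrics-collector.out and look for the key log entries below.
If you enabled Collector high availability and deployed two nodes, the startup log contains aggregation partition information like the following, with one ONLINE partition per Collector: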
2025-04-11 15:40:50,717 WARN org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Phoenix metrics system already initialized!
2025-04-11 15:40:50,717 INFO org.apache.ambari.metrics.AMSApplicationServer: Instantiating metrics collector at 0.0.0.0:6188
2025-04-11 15:40:50,774 INFO org.eclipse.jetty.util.log: Logging initialized @24576ms to org.eclipse.jetty.util.log.Slf4jLog
2025-04-11 15:40:50,954 WARN org.apache.hadoop.security.authentication.server.AuthenticationFilter: Unable to initialize FileSignerSecretProvider, falling back to use random secrets. Reason: Could not read signature secret file: /home/ams/hadoop-http-auth-signature-secret
2025-04-11 15:40:50,968 INFO org.apache.hadoop.http.HttpRequestLog: Http request log for http.requests.timeline is not defined
2025-04-11 15:40:50,975 INFO org.apache.hadoop.http.HttpServer2: Added global filter 'safety' (class=org.apache.hadoop.http.HttpServer2$QuotingInputFilter)
2025-04-11 15:40:50,979 INFO org.apache.hadoop.http.HttpServer2: Added filter static_user_filter (class=org.apache.hadoop.http.lib.StaticUserWebFilter$StaticUserFilter) to context timeline
2025-04-11 15:40:50,979 INFO org.apache.hadoop.http.HttpServer2: Added filter static_user_filter (class=org.apache.hadoop.http.lib.StaticUserWebFilter$StaticUserFilter) to context static
2025-04-11 15:40:51,319 INFO org.apache.hadoop.yarn.webapp.WebApps: Registered webapp guice modules
2025-04-11 15:40:51,325 INFO org.apache.hadoop.http.HttpServer2: Jetty bound to port 6188
2025-04-11 15:40:51,327 INFO org.eclipse.jetty.server.Server: jetty-9.4.43.v20210629; built: 2021-06-30T11:07:22.254Z; git: 526006ecfa3af7f1a27ef3a288e2bef7ea9dd7e8; jvm 1.8.0_202-b08
2025-04-11 15:40:51,373 INFO org.eclipse.jetty.server.session: DefaultSessionIdManager workerName=node0
2025-04-11 15:40:51,374 INFO org.eclipse.jetty.server.session: No SessionScavenger set, using defaults
2025-04-11 15:40:51,377 INFO org.eclipse.jetty.server.session: node0 Scavenging every 600000ms
2025-04-11 15:40:51,404 INFO org.eclipse.jetty.server.handler.ContextHandler: Started o.e.j.s.ServletContextHandler@64c781a9{static,/static,jar:file:/usr/lib/ambari-metrics-collector/hadoop-yarn-common-3.3.4.jar!/webapps/static,AVAILABLE}
2025-04-11 15:40:51,583 INFO TimelineMetricHostAggregatorMinute: 503 row(s) updated in aggregation.
2025-04-11 15:40:51,584 INFO TimelineMetricHostAggregatorMinute: Aggregated host metrics for METRIC_RECORD_MINUTE_UUID, with startTime = Fri Apr 11 15:35:00 CST 2025, endTime = Fri Apr 11 15:40:00 CST 2025
2025-04-11 15:40:51,584 INFO TimelineMetricHostAggregatorMinute: End aggregation cycle @ Fri Apr 11 15:40:51 CST 2025
2025-04-11 15:40:51,668 INFO org.apache.hadoop.hbase.client.ConnectionImplementation: Closing master protocol: MasterService
2025-04-11 15:40:51,673 INFO org.apache.zookeeper.ZooKeeper: Session: 0x300145edee00283 closed
2025-04-11 15:40:51,673 INFO org.apache.zookeeper.ClientCnxn: EventThread shut down for session: 0x300145edee00283
2025-04-11 15:40:52,414 INFO org.eclipse.jetty.server.handler.ContextHandler: Started o.e.j.w.WebAppContext@14b8a751{timeline,/,file:///tmp/jetty-0_0_0_0-6188-hadoop-yarn-common-3_3_4_jar-_-any-8039215916515613597/webapp/,AVAILABLE}{jar:file:/usr/lib/ambari-metrics-collector/hadoop-yarn-common-3.3.4.jar!/webapps/timeline}
2025-04-11 15:40:52,430 INFO org.eclipse.jetty.server.AbstractConnector: Started ServerConnector@65cf4d6d{HTTP/1.1, (http/1.1)}{0.0.0.0:6188}
2025-04-11 15:40:52,430 INFO org.eclipse.jetty.server.Server: Started @26232ms
2025-04-11 15:40:52,430 INFO org.apache.hadoop.yarn.webapp.WebApps: Web app timeline started at 6188
2025-04-11 15:41:20,697 INFO org.apache.ambari.metrics.core.timeline.availability.MetricCollectorHAController:
######################### Cluster HA state ########################
CLUSTER: ambari-metrics-cluster
RESOURCE: METRIC_AGGREGATORS
PARTITION: METRIC_AGGREGATORS_0 hadoop1_12001 ONLINE
PARTITION: METRIC_AGGREGATORS_1 hadoop2_12001 ONLINE
##################################################
2025-04-11 15:41:48,492 INFO org.apache.zookeeper.ZooKeeper: Session: 0x300145edee00277 closed
2025-04-11 15:41:48,492 INFO org.apache.zookeeper.ClientCnxn: EventThread shut down for session: 0x300145edee00277
2025-04-11 15:42:50,693 INFO TimelineClusterAggregatorSecond: Started Timeline aggregator thread @ Fri Apr 11 15:42:50 CST 2025
2025-04-11 15:42:50,694 INFO TimelineClusterAggregatorSecond: Skipping aggregation function not owned by this instance.
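
Rather than scanning the log by eye, the "Cluster HA state" block can be extracted with a short script. This is a minimal sketch that assumes the `PARTITION:` lines keep the three-field layout shown in the log above. The function names are hypothetical, and the sample text is copied from that log.

```python
import re

# Matches lines such as "PARTITION: METRIC_AGGREGATORS_0 hadoop1_12001 ONLINE".
PARTITION_RE = re.compile(r"^PARTITION:\s+(\S+)\s+(\S+)\s+(\S+)$")


def parse_ha_partitions(log_text):
    """Return (partition, instance, state) tuples found in the log text."""
    partitions = []
    for line in log_text.splitlines():
        match = PARTITION_RE.match(line.strip())
        if match:
            partitions.append(match.groups())
    return partitions


def all_online(partitions, expected=2):
    """True when the expected number of partitions exist and all are ONLINE."""
    return len(partitions) == expected and all(
        state == "ONLINE" for _, _, state in partitions
    )


sample = """\
######################### Cluster HA state ########################
CLUSTER: ambari-metrics-cluster
RESOURCE: METRIC_AGGREGATORS
PARTITION: METRIC_AGGREGATORS_0 hadoop1_12001 ONLINE
PARTITION: METRIC_AGGREGATORS_1 hadoop2_12001 ONLINE
##################################################
"""

partitions = parse_ha_partitions(sample)
print(partitions)
print(all_online(partitions))
```

To run it against a real Collector, read the contents of /var/log/ambari-metrics-collector/ambari-metrics-collector.out into a string and pass that string to `parse_ha_partitions` instead of `sample`.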