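【Posted】: 2017-07-18 13:19:24
【Question】:
Hadoop version: 2.7.3 DataNodes: 32 Block size: 512 MB Replication: 3
I cleared all data in HDFS with the command
hdfs dfs -rm -r /*
After clearing the HDFS cluster, a few DataNodes still show a nonzero Block Pool Used even though their block count is zero.
The output of hdfs fsck / is as follows: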
Connecting to namenode via http://ip-10-0-2-7:50070/fsck?ugi=ubuntu&path=%2F
FSCK started by ubuntu (auth:SIMPLE) from /10.0.2.7 for path / at Tue Jul 18 04:34:19 UTC 2017
Status: HEALTHY
Total size: 0 B
Total dirs: 1
Total files: 0
Total symlinks: 0
Total blocks (validated): 0
Minimally replicated blocks: 0
Over-replicated blocks: 0
Under-replicated blocks: 0
Mis-replicated blocks: 0
Default replication factor: 3
Average block replication: 0.0
Corrupt blocks: 0
Missing replicas: 0
Number of data-nodes: 32
Number of racks: 1
FSCK ended at Tue Jul 18 04:34:19 UTC 2017 in 1 milliseconds
The filesystem under path '/' is HEALTHY
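I know that block deletion is an asynchronous process, but Block Pool Used on these DataNodes has not dropped to 0, and it is taking a long time for the blocks to be released.
Can anyone help me figure out why block deletion is slow on these servers, or what else could be keeping the space in use?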
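To narrow down which DataNodes are lagging, one option is to inspect the per-node figures the NameNode reports. The sketch below is only an illustration and rests on assumptions: it expects the JSON string that the NameNode JMX bean NameNodeInfo exposes as LiveNodes (commonly reachable under /jmx on the NameNode web port), and it assumes each entry carries blockPoolUsed and numBlocks fields. Verify those names against your own cluster's output. The function name and the sample host names are hypothetical.

```python
import json


def nodes_with_block_pool_usage(live_nodes_json, min_bytes=1):
    """Return (host, block_pool_used, num_blocks) for nodes still holding space.

    live_nodes_json is assumed to be the JSON text of the LiveNodes
    attribute; the field names used here are assumptions to verify.
    """
    live_nodes = json.loads(live_nodes_json)
    lagging = []
    for host, info in live_nodes.items():
        used = int(info.get("blockPoolUsed", 0))
        if used >= min_bytes:
            lagging.append((host, used, int(info.get("numBlocks", 0))))
    # Largest remaining usage first, so the slowest nodes stand out.
    return sorted(lagging, key=lambda row: row[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical sample data, not taken from a real cluster.
    sample = json.dumps({
        "datanode-a:50010": {"blockPoolUsed": 0, "numBlocks": 0},
        "datanode-b:50010": {"blockPoolUsed": 4096, "numBlocks": 0},
    })
    for host, used, blocks in nodes_with_block_pool_usage(sample):
        print(host, used, blocks)
```

Running this periodically and comparing results shows whether the remaining usage is shrinking (deletion still in progress) or stuck at a fixed value.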
【Discussion】:
- Did you try the skip-trash option when deleting the files? For example: hdfs dfs -rm [-skipTrash] URI
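The suggestion above can be sketched as a small helper that builds the delete command with or without -skipTrash. This is a minimal illustration, not a recommended procedure: the function name is hypothetical, and whether trash is active at all depends on the cluster's fs.trash.interval setting. The helper only builds the argument list; it does not run anything.

```python
def build_rm_command(path, skip_trash=True):
    """Build the argv for a recursive HDFS delete (hypothetical helper)."""
    cmd = ["hdfs", "dfs", "-rm", "-r"]
    if skip_trash:
        # Delete immediately instead of moving files into the trash directory.
        cmd.append("-skipTrash")
    cmd.append(path)
    return cmd


if __name__ == "__main__":
    # Prints the command for review; "/tmp/example" is a placeholder path.
    print(" ".join(build_rm_command("/tmp/example")))
```

The resulting list can be passed to subprocess.run(cmd, check=True) once the path has been double-checked, since a delete with -skipTrash cannot be undone from the trash.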
Tags: hadoop hdfs hadoop2.7.3