Tu Zuoquan's blog: Error: recoverUnfinalizedSegments failed for r
Reposted from: https://blog.csdn.net/dudefu011/article/details/78463207#
I. Problem description
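With HA configured as planned, the NameNode does not stay up after the cluster is started. Right after startup, jps shows the NameNode process, but a minute or two later it is gone.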
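Further testing, however, turned up the following two cases:
1) If the JournalNodes are started first and HDFS afterwards, the NameNode starts and runs normally.
2) If everything is started with start-dfs.sh, all the services come up, but the NameNode exits after about two minutes; starting it again on its own with hadoop-daemon.sh start namenode succeeds, and it then runs stably.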
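Case 1 suggests a startup order that can be scripted. The following is only a minimal sketch, not the original author's procedure: the host names node2 and node3, the 60-second timeout, the use of ssh, and the function names are all assumptions made for illustration. Port 8485 is the default JournalNode RPC port; adjust it if dfs.journalnode.rpc-address is set differently.

```shell
#!/usr/bin/env bash
# Sketch: start the JournalNodes first, wait until their RPC port answers,
# and only then start the rest of HDFS.
# Assumptions: JN_HOSTS (node2/node3 are hypothetical), passwordless ssh,
# and hadoop-daemon.sh / start-dfs.sh being on PATH on every host.

JN_HOSTS="node1 node2 node3"
JN_PORT=8485   # default JournalNode RPC port

# Return 0 as soon as host:port accepts a TCP connection,
# or 1 once `timeout` seconds have passed without success.
wait_for_port() {
  local host=$1 port=$2 timeout=$3 waited=0
  while [ "$waited" -lt "$timeout" ]; do
    # bash /dev/tcp redirection; the subshell closes the socket when it exits
    if (exec 3<>"/dev/tcp/$host/$port") 2>/dev/null; then
      return 0
    fi
    sleep 1
    waited=$((waited + 1))
  done
  return 1
}

start_hdfs_after_journalnodes() {
  local host
  # Step 1: launch a JournalNode on every configured host.
  for host in $JN_HOSTS; do
    ssh "$host" "hadoop-daemon.sh start journalnode"
  done
  # Step 2: block until each JournalNode is reachable.
  for host in $JN_HOSTS; do
    wait_for_port "$host" "$JN_PORT" 60 || {
      echo "JournalNode on $host:$JN_PORT not reachable" >&2
      return 1
    }
  done
  # Step 3: start the remaining HDFS daemons.
  start-dfs.sh
}
```

After sourcing the file, calling start_hdfs_after_journalnodes on the node where start-dfs.sh is normally run reproduces the order from case 1. Nothing is started until that function is called.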
Now look at the NameNode log. Do not be put off by its length; every clue to the failure is in there:
2016-03-09 10:50:27,123 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: STARTUP_MSG:
/************************************************************
STARTUP_MSG: Starting NameNode
STARTUP_MSG: host = node1/192.168.56.201
STARTUP_MSG: args = []
STARTUP_MSG: version = 2.5.1
STARTUP_MSG: build = Unknown -r Unknown; compiled by 'root' on 2014-10-20T05:53Z
STARTUP_MSG: java = 1.7.0_09
************************************************************/
2016-03-09 10:50:27,132 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: registered UNIX signal handlers for [TERM, HUP, INT]
2016-03-09 10:50:27,138 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: createNameNode []
2016-03-09 10:50:27,465 INFO org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from hadoop-metrics2.properties
2016-03-09 10:50:27,623 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot period at 10 second(s).
2016-03-09 10:50:27,623 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: NameNode metrics system started
2016-03-09 10:50:27,625 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: fs.defaultFS is hdfs://hadoopha
2016-03-09 10:50:27,626 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: Clients are to use hadoopha to access this namenode/service.
2016-03-09 10:50:28,048 INFO org.apache.hadoop.hdfs.DFSUtil: Starting web server as: ${dfs.web.authentication.kerberos.principal}
2016-03-09 10:50:28,048 INFO org.apache.hadoop.hdfs.DFSUtil: Starting Web-server for hdfs at: http://node1:50070
2016-03-09 10:50:28,121 INFO org.mortbay.log: Logging to org.slf4j.impl.Log4jLoggerAdapter(org.mortbay.log) via org.mortbay.log.Slf4jLog
2016-03-09 10:50:28,128 INFO org.apache.hadoop.http.HttpRequestLog: Http request log for http.requests.namenode is not defined
2016-03-09 10:50:28,145 INFO org.apache.hadoop.http.HttpServer2: Added global filter 'safety' (class=org.apache.hadoop.http.HttpServer2