Andrew Beekhof | 14 Jan 2009 08:39
Picon

Re: When STONITH is not completed, a resource starts.


On Jan 14, 2009, at 2:52 AM, <renayama19661014@...>
<renayama19661014@... 
 > wrote:

> Hi,
>
> About movement of STONITH, I tested it.
> (heartbeat 2.99.2 + Pacemaker-1-0-6fd0eebd186e.tar.gz on  
> RHEL5.2(i386VM))
>
> When what I confirmed carries out STONITH from a DC node and a non- 
> DC node.
>
> I confirmed it in the next flow.
>
> 1)I make it the state that a resource starts in a standby node.
> 2)I change it so that a stop error occurs in a dummy resource.
> 3)I generate the monitor error of the dummy resource in a standby  
> node.
> 4)After a stop error, STONITH is carried out by a partner node.
> 5)Keep STONITH from a standby node waiting.
> 6)While STONITH is not completed, I reboot a standby node.

Is this in a two-node cluster?

> I watched log.

>
> Though STONITH from a DC node does not succeed, a resource is started.
(Continue reading)

renayama19661014 | 14 Jan 2009 09:59
Picon
Gravatar

Re: When STONITH is not completed, a resource starts.

Hi,

> > 1)I make it the state that a resource starts in a standby node.
> > 2)I change it so that a stop error occurs in a dummy resource.
> > 3)I generate the monitor error of the dummy resource in a standby  
> > node.
> > 4)After a stop error, STONITH is carried out by a partner node.
> > 5)Keep STONITH from a standby node waiting.
> > 6)While STONITH is not completed, I reboot a standby node.
> 
> Is this in a two-node cluster?
Yes.

> > Though STONITH from a DC node does not succeed, a resource is started.
> > When STONITH did not succeed, the resource was not started at a non- 
> > DC node.
> 
> I don't understand what you're saying here.
> The first statement says a resource was started and the second says it  
> wasn't... they can't both be true.

I'm sorry. 
It caused misunderstanding.

It is time when STONITH is carried out in the environment of two nodes by a standby node.

A resource is started without waiting for completion of STONITH from a DC node.
While STONITH is not completed, this problem happens if an active node fell.

I confirmed the same confirmation based on OpenAIS.
(Continue reading)

Andrew Beekhof | 14 Jan 2009 13:37
Picon

Re: When STONITH is not completed, a resource starts.

On Wed, Jan 14, 2009 at 09:59,  <renayama19661014@...> wrote:
> Hi,
>
>> > 1)I make it the state that a resource starts in a standby node.
>> > 2)I change it so that a stop error occurs in a dummy resource.
>> > 3)I generate the monitor error of the dummy resource in a standby
>> > node.
>> > 4)After a stop error, STONITH is carried out by a partner node.
>> > 5)Keep STONITH from a standby node waiting.
>> > 6)While STONITH is not completed, I reboot a standby node.
>>
>> Is this in a two-node cluster?
> Yes.
>
>> > Though STONITH from a DC node does not succeed, a resource is started.
>> > When STONITH did not succeed, the resource was not started at a non-
>> > DC node.
>>
>> I don't understand what you're saying here.
>> The first statement says a resource was started and the second says it
>> wasn't... they can't both be true.
>
> I'm sorry.
> It caused misunderstanding.
>
> It is time when STONITH is carried out in the environment of two nodes by a standby node.
>
> A resource is started without waiting for completion of STONITH from a DC node.
> While STONITH is not completed, this problem happens if an active node fell.

(Continue reading)

renayama19661014 | 15 Jan 2009 01:55
Picon
Gravatar

Re: When STONITH is not completed, a resource starts.

Hi Andrew,

> > It is time when STONITH is carried out in the environment of two nodes by a standby node.
> >
> > A resource is started without waiting for completion of STONITH from a DC node.
> > While STONITH is not completed, this problem happens if an active node fell.
> 
> So let me see if I understand this correctly...
> 
> You start with two healthy nodes.
Yes.

> 
> You cause a resource on A to fail, at which point B tries to shoot it.
Yes.

> 
> The stonith op never completes and before it times out, you restart B.
No.
It is node A to reboot. 
- Node A is the one that node B is going to shoot.

> 
> Resources get started on B.
Yes.
A dummy resource is started at the time of DC node B. 
When node B is not DC, it is not started.

> 
> Questions:
(Continue reading)

Andrew Beekhof | 15 Jan 2009 13:54
Picon

Re: When STONITH is not completed, a resource starts.


On Jan 15, 2009, at 1:55 AM, <renayama19661014@...>
<renayama19661014@... 
 > wrote:

> Hi Andrew,
>
>>> It is time when STONITH is carried out in the environment of two  
>>> nodes by a standby node.
>>>
>>> A resource is started without waiting for completion of STONITH  
>>> from a DC node.
>>> While STONITH is not completed, this problem happens if an active  
>>> node fell.
>>
>> So let me see if I understand this correctly...
>>
>> You start with two healthy nodes.
> Yes.
>
>>
>> You cause a resource on A to fail, at which point B tries to shoot  
>> it.
> Yes.
>
>>
>> The stonith op never completes and before it times out, you restart  
>> B.
> No.
> It is node A to reboot.
(Continue reading)

renayama19661014 | 16 Jan 2009 01:53
Picon
Gravatar

Re: When STONITH is not completed, a resource starts.

Hi Andrew,

> Ah!
> Can you log a bug for this please?

The log is attached to the first email. 

>I attach a result of hb_report. 
>- stonith_exec_dc.tar.gz (A result when STONITH was carried out by a DC node(ais-1)) 
>- stonith_exec_nodc.tar.gz(A result when STONITH was carried out by a non-DC node(ais-1)) 

Or do you need other log?

> >> Questions:
> >>
> >> Is the above accurate?
> >> Is only the dummy resource started, or are other ones started too?
> > Yes.
> 
> There were two alternatives in that question, the answer cant be  
> "yes" :)

The answer to the first question is no.
Because the reason is because nodes to reboot were different from your recognition.

The answer to the second question is a yes. 
Only a dummy resource starts. 
But I do not confirm it when I set more group resources.

Best Regards,
(Continue reading)

Andrew Beekhof | 19 Jan 2009 16:17
Picon

Re: When STONITH is not completed, a resource starts.


On Jan 16, 2009, at 1:53 AM, <renayama19661014@...>
<renayama19661014@... 
 > wrote:

> Hi Andrew,
>
>> Ah!
>> Can you log a bug for this please?
>
> The log is attached to the first email.
>
>> I attach a result of hb_report.
>> - stonith_exec_dc.tar.gz (A result when STONITH was carried out by  
>> a DC node(ais-1))
>> - stonith_exec_nodc.tar.gz(A result when STONITH was carried out by  
>> a non-DC node(ais-1))
>
> Or do you need other log?

no.  sorry, i just meant: "can you create an entry for it in bugzilla  
please?"

>
>
>>>> Questions:
>>>>
>>>> Is the above accurate?
>>>> Is only the dummy resource started, or are other ones started too?
>>> Yes.
(Continue reading)

renayama19661014 | 20 Jan 2009 01:36
Picon
Gravatar

Re: When STONITH is not completed, a resource starts.

Hi Andrew, 

> no.  sorry, i just meant: "can you create an entry for it in bugzilla  
> please?"
All right.
I register this problem in bugzilla.

Regards,
Hideo Yamauchi.

--- Andrew Beekhof <beekhof@...> wrote:

> 
> On Jan 16, 2009, at 1:53 AM, <renayama19661014@...>
<renayama19661014@... 
>  > wrote:
> 
> > Hi Andrew,
> >
> >> Ah!
> >> Can you log a bug for this please?
> >
> > The log is attached to the first email.
> >
> >> I attach a result of hb_report.
> >> - stonith_exec_dc.tar.gz (A result when STONITH was carried out by  
> >> a DC node(ais-1))
> >> - stonith_exec_nodc.tar.gz(A result when STONITH was carried out by  
> >> a non-DC node(ais-1))
> >
(Continue reading)


Gmane