Control of a deterministic system es a fuzzy environment over infinite planning horizon

June 6, 2017 | Autor: Janusz Kacprzyk | Categoria: Pure Mathematics, Fuzzy Sets and Systems

Descrição do Produto

Fuzzy Sets North-Holland

and

Systems

10 (1983)

291-298

291

CONTROL OF A DETERMINISTIC ENVIRONMENT OVER INFINITE Janusz KACPRZYK’” Sys~tns

Resectrch

Received

Institute.

January

Revised

May

and Piotr STANIEWSKI P&s/l

Academy

of Scimces.

0 I-147

Warstrw.

Pohd

I979

control infinite

of a time-invariant finite-state planning horizon is considered.

fuzzy constraints imposed on controls assumed to he state-dependent. The constraints and politics relating For the problem.

and fuzzy

deterministic system The fuzzy environment

fuzzy goals to he attained. The decision is assumed to he the

in a fuzzy is meant as

fuzzy constraints are intersection of fuzzy

fuzzy goals. The problem is to find an optimal strategy. i.e. a sequence of the control to the state, maximizing the mcmhership function of fuzzy decision. a functional equation is formulated. An algorithm of policy iteration type is

existence of a time-invariant optimal A numerical example is shown.

Keywords: Policy

IN A FUZZY G HORIZON

1979

The multistage environment over

given. The arc proved.

SYSTEM PLANNIN

Control

(ci~cision-making)

strategy

in a fuzzy

and

the convergence

cnvironmsnt.

Infinite

of the algorithm

planning

horizon.

iteration.

1. Introduction Many control processes in the real world proceed in a fuzzy environment, which-due to Bellman & Zadeh Cl]- may be formalized as fuzzy constraints imposed on controls and fuzzy goals to be attained. The problem is to find an optimal decision, i.e. a sequence of controls maximizing the membership function of fuzzy decision. Following the above source paper, a number of contributions appeared. see e.g. [2-91, discussing various aspects. However. in all of them the termination time of the control process is finite, i.e. fixed and specified (e.g. [l, 2, 4, 7, 9]), implicitly given by entering a specified termination set of states (e.g. [l]) or fuzzy (e.g. [3, 6, 81). There arc, however, some long, low-varying control processes in which the goal is just to maintain some level of activity, to avoid decay, etc. and the planning horizon is very long. Possible examples may concern many social processes, agriculture in numerous regions and countries. industries producing commodities for which the demand is saturated, etc. In such cases the use of conventional techniques, i.e. to iterate over subsequent control stages, is evidently inefficient and a different approach should be employed. The purpose of this paper is to present such a different approach. The ” On Rochelle,

Icavc NY

0165~0114/83/$03.00

at

Machine

10801.

Intelligence

Institute.

Hagan

School

of

Business.

USA (Q 1983 Elsevier

Science

Publishers

B.V.

(North-Holland)

Iona

College,

New

292

J. Kacprzyk.

P. Staniewski

termination time is assumed to be infinite. The system under control is a time-invariant finite-state deterministic one. The set of controls is finite as well. An optimal time-invariant strategy is sought. Its existence is proved and a converging algorithm of policy iteration type for determining it is given. A numerical example and some concluding remarks are presented. What concerns the notation used, the fuzzy sets will be denoted by capital Latin letters, e.a. A E X and - assumed to be defined in finite spaces X = {s,, , s,,,) given as A = p+,(x,)/s,f. .+ pa(s,)/s,,,, where ~~ is the membership function and F~(. 1y) means pLA conditioned on y. Moreover, ar\b=min(a.b), /\~,a,=a,r\.~.r\a,. ~~~,~=a,r\a,r\~~~ and i/J.) = max, (.). 2. Decision-making

in a fuzzy environment

over finite planning

Let us assume that the system under control deterministic one given by a state equation X,+I=f(X,,U,),

t=0,

1..

horizon

is a time-invariant

finite-state

.,

(1)

where: x,,,+, is the state (output) at the control stage r/t+ 1, +1+1 E X= {s,, . , s,,,}, u, is the control (input) at t, U,E U={c,, . . . ,ck} and f:Xx U-*X. The initial state is x,). At each control stage t the control is subjected to a state-dependent fuzzy constraint C(x,,)c U characterized by pc(u, 1x,). The rationale for the statedependency of fuzzy constraints is that, first, it exists in most real cases (e.g. in economic systems what we can invest depends usually upon what we have produced) and, second, the termination time is infinite, hence the timedependency is extremely inconvenient. The fuzzy goal G cX, characterized by am, is assumed to be the same at each control stage, which evidently stems from the very nature of the problem. The fuzzy decision D c U x U x . . . , characterized by kr,( uo, ui, . . . , x,,), is given by the intersection of fuzzy constraints and fuzzy goal at subsequent control stages with a discount factor b > 1, which reflects an obvious fact that what happens at earlier control stages should have more influence on the value of in than that what happens at later stages, i.e. D(x,,)

= C(x,,) n G n b(C(x,)

n G) n b’(C(x,)

n G) n . .

= 6 b’ ‘(C(q-,)T\G), ,=I or in terms of membership

Hence, the problem IA:, UT,. . . given by

(2)

functions

is to find an optimal

ELD(U$, UT>. . 1XO)=

decision, i.e. a sequence of controls

V (i bi-‘(/+(Ui-l U,).U I.... ,=I

1x,-,)Apa(f(xi-l,

.I)),). (4)

Control in a fuzzy

Evidently,

it is more convenient

a, :X+

environment

293

to express the solution by a policy function

U,

(5)

a, E A, relating at each control stage t the control u, E U to the state x, E X. The sequence of policy functions will be termed the strategy and denoted by S = policy function and strategy are denoted by a and (a,,, a,, . .). The time-invariant a Q= (a, a, . .), respectively. Thus, the problem (4) -expressed in terms of strategy S - is to find an optima1 strategy S* (time-invariant, if possible), such that cLD(s* I x,,) = v ILLJ(S 1x,J s with the following

preordering

S, 2 S2 e

(6)

defined in the space of strategies:

I+,(S, 1xc,)3 pD(S2 ( x0) for each X,,E X,

(7)

and, evidently, S*s Si for each Si. Let us now proceed to the determination of functional equation. Due to the time-invariance of the system under control and the time-independence of fuzzy constraints and fuzzy goal, (2) may be written as D(x,,) = Ctx,,) n G C-IbD(x,)

(8)

where D(x,)=(C(x,)nG)nb(C(x,)nG)n..

”

(9)

Thus, in general

D(x,) = c(x,) n G n bDb,+,). This is the desired functional used in the following form:

(10)

equation

for our problem.

fiD(S ) x,,) = pc(a,,(xo) 1x0) A k(f(xc,,

In the sequel it will be

a,,(x,,))) A bh(TS

1ftx,,, a,,(x,))) (11)

where T is a time shift operator, i.e. TS = (a ,, a2, .I. Evidently, (11) should be meant as the following set of simultaneous p..,(S) s,)= Fc(a,,(s,)) pD(S 1s2) = pc(a,,(sd

s,)~k(f(s,, I s2)~pG(f(sZ,

a,,(s,)))AbpD(TS

equations

jf(s,, ads,))),

a,(s2)))r\ bpD(TS ( f(s,, a,,(%))).

c12j

PD(S1s,)= f-da&,,) I s,,)AcLGV(S,,,, a,ds,,,)))A b(TS lfts,,, a,,(s,,,))). Let us now remark that since X and U are finite, then CL~(TS 1f(x,,, a,(x,))) = H(a,,)b~D(S

1x,,)

where H(a,,) is an m x m transition matrix and the matrix multiplication usual sense. Let us now introduce an operator L : [0, 11”’ -+ [0, l]“‘, such that L(a)w

= &da

1x~AI-%(~(xo,

a))r\H(a)bw.

(13) is in the

(14)

294

.I. Kacprzyk,

Thus, denoting

(a, S) = (a, a,,, aI, . _ .) we obtain

/JtJ((LI, S) I x,,) = L(a)kl(S We have now an important Theorem Proof.

P. Sraniewski

(15)

Ix.,,).

theorem:

1. The operator L(a) is monotone, i.e. if w, 3 w2, then L(aJw,

2 L(a)wz.

Evident.

As the consequences of Theorem Theorem

1 we obtain the following

three theorems.

2. If (16)

cLD(SX ( x,,) ?=/L”((U, S’“) 1x,,) v E A, tlren the strntegy

for each

Proof.

S” is optimal.

From the relation (15) and the assumptions L(U)~~(S”

/ X,,)~/L”W

for each a E A. Taking we obtain L(a,)/.b(S*

we have

/ x0).

an arbitrary

strategy

s = (a,,, a,, . . . ) and assuming u = a,,

1x,Jc pD(s* I x,,).

Using the superposition operator, we find L(a,,)L(a,) Applying

of the theorem

L(a,,)L(a,)

. . . t(a,, ~, ), which is obviously a monotone

. L(n,,)/LL,(S”

subsequently

1x,,)~L(n,,)L(a,).

the above relation,

!.$cs* 1x,,)~L(aJL(a,)

. L(a,,-,)CLD(S*

1x,,).

we finally get

. . U4,)p,W

1XCJ

= pD((a,,, a,, . . , a,,, S”) 1x,,) for each n. For n + = we obtain pIJs*

1x,,)> CLJJS1x,,)

for each S. Hence S* is optimal. Thorem

3. If pD((a, /&(a”

Proof.

q

S) 1 x,,) > ~~(s 1 x,,). then

I x,,) > /-b(S

(17)

1 x,,).

From (15) and the assumption

of the theorem we have

Uu)/-b(S I Xl,)>/-b(S Ix,,). Applying

a monotone

L”(a)pD(S

operator

L”-‘(a)

(X’J~L”-‘(a)CLD(S

we obtain 1x,,)

Conrrol

for each II >O. Applying following inequality:

in a

the principle

fuzzy

295

environtnenl

of mathematical

induction

we arrive at the

L”(a)kl(S 1Xo)=/.Lo((a,S) (X,,)>k(S I x0) for each

II

> 0. For II + 7, we find

p.,(tr” 1x,,)> /-LD(S ( x,,).

cl

Thus, if we only can improve our strategy, then a better time-invariant strategy may be found. Let us remark here that for a time-invariant strategy TS = S. Now WC formulate the following main theorem, which will form a basis of a procedure for the determination of an optimal strategy. Theorem

4. Let a E A arld for each i E { 1. rn} let us denote

u(s, 1E A. such

by B( i, a) the set of all

that

(18)

CLD(~~“Isi) pD(a”

(19)

( x,,).

Proof. The ith element of the vector pD((z, a’) ( x,,) is equal to j.h((z.

a”)

I s,) =

cLtax!c

control Decision

with control

fuzzy

of a nonfuxzy

termination control

termination

time,

of 21 fuzzy

time.

system Fuzzy

system

in 2,

Sets and in a fury

Lihat lebih banyak...

Control of a deterministic system es a fuzzy environment over infinite planning horizon

Descrição do Produto

Comentários