The interval versions of the Kalman filter and the EM algorithm
© Al-Gahtani et al.; licensee Springer 2012
- Received: 2 May 2012
- Accepted: 14 September 2012
- Published: 2 October 2012
In this paper, we study state space models represented by interval parameters and noise. We introduce an interval version of the Expectation Maximization (EM) algorithm for the identification of the interval parameters of the system. We also introduce a suboptimal interval Kalman filter for the identification and estimation of the state vectors. The work requires the introduction of the concept of interval random variables which we also include in this work together with a study of their interval statistical properties such as expectation, conditional expectation and variance. Although the interval Kalman filter introduced here is suboptimal, it successfully recovers the state vectors to a high precision in the simulation examples we have run.
- Probability Density Function
- Kalman Filter
- State Space Model
- Expectation Maximization Algorithm
- Interval Arithmetic
In a state space model, some parameters of the system, such as the coefficient matrices, may not be precisely known or may change gradually with time. One way to account for these uncertainties is to allow such parameters to be represented by interval entities. The question then arises as to how to extend identification and estimation techniques to interval settings.
To our knowledge, no attempt has been made so far to extend identification techniques such as the EM algorithm to interval state space models. In this work, we give one such extension.
In the existing literature, an optimal interval Kalman filter was attempted in . That attempt suffered from the lack of proper definitions and rigorous treatment. The idea in  was to replace the interval system setting with the ‘worst case inversion’ while keeping everything else unchanged. So, the ultimate treatment in  amounts to the application of the traditional Kalman filter to the system representing the worst case scenario. This way the authors were able to avoid the difficulties that arise when dealing with interval arithmetic and concepts. On the other hand, this algorithm cannot be called optimal and the concept of the optimal interval Kalman filter remains an open question.
In our work, we introduce a special interval arithmetic whose results are always contained in those produced by the traditional interval arithmetic [2, 3]. This arithmetic enables the extension of the Kalman filter as well as the EM algorithm to the interval setting in a true sense. Within our restricted interval arithmetic, the interval Kalman filter we introduce here is optimal. However, with respect to the more general interval arithmetic, our interval Kalman filter is suboptimal.
We introduce a special set of interval operations that will enable the extension of the usual linear system concepts to the interval setting in a seamless manner. The more general definitions of the interval operations can be found in . The arithmetic introduced here avoids such vague terms as ‘interval extension’, ‘inclusion function’, determinants etc. that have been used in the literature [1, 4–6].
with the usual restriction if • = ÷.
Observe that all operations in Definition 1 result in intervals since they can be regarded as continuous functions defined on the unit interval . For example, a typical element in is which is a continuous function of α. The operations in Definition 1 give similar results to the usual interval operations as given in  when , but generally they give only subintervals if . For example, if , then according to Definition 1, while the usual definition in  gives .
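For comparison, the usual (classical) interval operations referred to above can be sketched as follows. This is the standard endpoint-based arithmetic, not the special arithmetic of Definition 1; the class name `Interval` and its method `contains` are illustrative assumptions and do not come from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """Closed interval [lo, hi] with the classical endpoint-based operations."""
    lo: float
    hi: float

    def __post_init__(self):
        if self.lo > self.hi:
            raise ValueError("lower endpoint must not exceed upper endpoint")

    def __add__(self, other):
        return Interval(self.lo + other.lo, self.hi + other.hi)

    def __sub__(self, other):
        return Interval(self.lo - other.hi, self.hi - other.lo)

    def __mul__(self, other):
        # The extreme values are attained at products of endpoints.
        products = [self.lo * other.lo, self.lo * other.hi,
                    self.hi * other.lo, self.hi * other.hi]
        return Interval(min(products), max(products))

    def __truediv__(self, other):
        # The usual restriction: the divisor must not contain zero.
        if other.lo <= 0.0 <= other.hi:
            raise ZeroDivisionError("divisor interval contains zero")
        return self * Interval(1.0 / other.hi, 1.0 / other.lo)

    def contains(self, other):
        """True if `other` is a subinterval of this interval."""
        return self.lo <= other.lo and other.hi <= self.hi


x = Interval(1.0, 2.0)
diff = x - x   # Interval(lo=-1.0, hi=1.0)
quot = x / x   # Interval(lo=0.5, hi=2.0)
```

In this classical arithmetic, subtracting an interval from itself or dividing it by itself does not collapse to a single point, which is the kind of widening a more restrictive arithmetic would aim to avoid.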
These two properties, which are missing in the usual interval operations, will enable the extension of many results from usual state space models to interval state space models. On the other hand, these definitions were motivated by our attempt to arrive at a definition of interval random variables and to investigate the corresponding statistical properties. We feel that they are the natural ones for handling interval systems. This view is supported by the numerical results we obtained in the simulation examples (see Section 5). While we expected to obtain a construction of a suboptimal interval Kalman filter, the constructed filter was actually able to recover the exact simulated intervals rather than subintervals.
Interval vectors and matrices are defined similarly:
A vector is defined as
and the inequality holds componentwise.
A matrix is defined as
and the inequality holds componentwise.
provided that the involved operations make sense. In the same spirit, interval matrix operations are defined as follows:
The interval determinant is defined by
The interval adjoint is defined by
The interval inverse is defined by
Suppose that exists. We define the solution of the interval linear system to be
The last inclusion holds because if , then there is an and a with . Then . Thus, . Noting that is an interval vector and is minimal, we get that .
For the rest of this paper, we will use the special interval operations defined above.
The map q defines a metric in IR.
We begin by discussing the measurability of set-valued maps and then introduce the definition of an interval random variable. The basic definitions and more details can be found in . A measurable space consists of a basic set Ω together with a σ-algebra of subsets of Ω called measurable sets. Here, we consider closed convex value set-valued maps , i.e., is a closed convex subset of for each . This is the case when F is interval valued. The latter notion means that for each , the components of are closed intervals in ℝ.
Definition 3 Let be a measurable space and be a set-valued map. F is called measurable if the inverse image of each open set is a measurable set: if is open, then .
We are now in a position to introduce the definition of interval random variables and interval stochastic processes.
X is measurable, and
the function is continuous on X, where is the probability density function for the random variable x.
An interval stochastic process is an indexed set of interval random variables.
In order to study the expectations and variances of interval random variables, we need to discuss first the integral of set-valued maps and, in particular, interval-valued maps. The discussion begins with the notion of measurable selections.
Definition 5 Let be a measurable space and be a measurable set-valued map. A measurable selection of F is a measurable map satisfying for each .
F is measurable.
for every .
There exists a sequence of measurable selections of F such that
for each .
A countable family of measurable selections satisfying the last property is called dense.
Let be an interval-valued map. We define the two special functions and such that and , where for each . The next lemma shows that and are measurable selections of F when the latter is measurable.
Lemma 7 Let be a measurable interval-valued map. Then the point functions and are measurable selections of F.
Then and (here the inf and sup operations are taken componentwise). Since the inf and the sup operators preserve measurability, we see that the functions and are measurable selections of F. □
Thus, and . For every , the set is dense in the interval . Thus, .
Now suppose that is a measure space and is a set-valued map. A measurable selection f of F is an integrable selection if f is integrable with respect to the measure μ. The set of all integrable selections of F will be denoted by ℱ. The map F is called integrably bounded if there exists a μ-integrable function such that for μ-almost every . Here, B denotes the unit ball in . In this case, every measurable selection f of F is also an integrable selection since implies that , where denotes the Euclidean norm on .
We shall say that F is integrable if every measurable selection is integrable.
where . Hence, .
The second equality is an immediate consequence of this. □
It will always be assumed that both and are integrable.
In view of (3), we have the following corollary.
We shall say that Z is normally distributed if each is normally distributed. An interval stochastic process will be called normally distributed if for each , is normally distributed.
Guided by this and Lemma 9, we can define the interval expectation of the interval random variable Z as follows.
It should also be noted that the expectation of a vector random variable is the vector of expectations of its components.
The same is true if I is an interval vector and Z is an interval random variable.
To introduce covariance of two interval random variables Y, Z, we need to assume that the function is continuous on . Here, is the joint probability density function of the two random variables x, y.
where , , . This last equation provides a formula for computing the interval .
For interval random vectors, the above definitions hold componentwise.
The two interval random variables Y, Z will be called uncorrelated if for each , , , are uncorrelated. Therefore, Y, Z are uncorrelated if and only if .
It is now straightforward to check the following theorem.
The assumed continuous dependence of the probability density function (joint density function) on the random variable (variables) in an interval random variable (interval random variables) implies that the conditional probability density function is also continuous. This guarantees that the generalization of the conditional density function to the interval setting is always an interval.
The following theorem is easily checked.
with initial value .
4.1 The interval Kalman filter
is called the Kalman gain. The initial conditions are and .
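As a point of reference, one predict/update cycle of the classical (point-valued) scalar Kalman filter can be sketched as below. This is not the interval filter of this section: it assumes the scalar model x_k = a·x_{k-1} + w_k, y_k = c·x_k + v_k with noise variances q and r, and the names `kalman_step`, `a`, `c`, `q`, `r` are illustrative assumptions rather than the paper's notation.

```python
def kalman_step(x_est, p_est, y, a, c, q, r):
    """One predict/update cycle of the classical scalar Kalman filter.

    Assumed model: x_k = a*x_{k-1} + w_k,  y_k = c*x_k + v_k,
    with Var(w_k) = q and Var(v_k) = r.
    """
    # Prediction of the state and of its error variance
    x_pred = a * x_est
    p_pred = a * p_est * a + q
    # Kalman gain
    gain = p_pred * c / (c * p_pred * c + r)
    # Measurement update
    x_new = x_pred + gain * (y - c * x_pred)
    p_new = (1.0 - gain * c) * p_pred
    return x_new, p_new, gain


# Initial conditions (assumed values) followed by three identical measurements
x_hat, p_hat = 0.0, 1.0
for y in [1.0, 1.0, 1.0]:
    x_hat, p_hat, gain = kalman_step(x_hat, p_hat, y, a=1.0, c=1.0, q=0.0, r=1.0)
# x_hat is 0.75 and p_hat is 0.25 up to rounding
```

With a constant state and unit noise variances, the recursion reduces to averaging the prior mean with the measurements, which is why three measurements equal to 1 move the estimate from 0 to 3/4.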
4.1.1 The EM algorithm in interval setting
Initialize the procedure by selecting starting values for the elements of the parameter set and estimate .
Repeat steps 2 and 3 above until convergence is achieved.
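The iteration above can be illustrated with a minimal point-valued sketch, assuming that the steps between initialization and the convergence check are the usual E-step (Kalman smoothing) and M-step (parameter update) in the style of Shumway and Stoffer. It is not the interval algorithm: the model is scalar, x_t = phi·x_{t-1} + w_t, y_t = x_t + v_t, only the coefficient `phi` is re-estimated, and the noise variances `q`, `r` and the initial values `mu0`, `p0` are held fixed. All function and variable names are hypothetical.

```python
import random


def kalman_smoother(y, phi, q, r, mu0, p0):
    """Scalar Kalman filter and smoother; returns smoothed means, variances
    and lag-one covariances pc[t] = Cov(x_t, x_{t-1} | y_1..y_n)."""
    n = len(y)
    xp = [0.0] * (n + 1)   # predicted means (index 0 unused)
    pp = [0.0] * (n + 1)   # predicted variances (index 0 unused)
    xf = [mu0] + [0.0] * n  # filtered means, xf[0] is the initial mean
    pf = [p0] + [0.0] * n   # filtered variances
    gain = 0.0
    for t in range(1, n + 1):
        xp[t] = phi * xf[t - 1]
        pp[t] = phi * pf[t - 1] * phi + q
        gain = pp[t] / (pp[t] + r)
        xf[t] = xp[t] + gain * (y[t - 1] - xp[t])
        pf[t] = (1.0 - gain) * pp[t]
    # Backward smoothing pass
    xs, ps, j = xf[:], pf[:], [0.0] * n
    for t in range(n, 0, -1):
        j[t - 1] = pf[t - 1] * phi / pp[t]
        xs[t - 1] = xf[t - 1] + j[t - 1] * (xs[t] - xp[t])
        ps[t - 1] = pf[t - 1] + j[t - 1] ** 2 * (ps[t] - pp[t])
    # Lag-one covariance smoother
    pc = [0.0] * (n + 1)
    pc[n] = (1.0 - gain) * phi * pf[n - 1]
    for t in range(n, 1, -1):
        pc[t - 1] = (pf[t - 1] * j[t - 2]
                     + j[t - 1] * (pc[t] - phi * pf[t - 1]) * j[t - 2])
    return xs, ps, pc


def em_phi(y, phi, q=1.0, r=1.0, mu0=0.0, p0=1.0, tol=1e-6, max_iter=200):
    """EM iteration for phi only; all other parameters are held fixed."""
    n = len(y)
    for _ in range(max_iter):
        xs, ps, pc = kalman_smoother(y, phi, q, r, mu0, p0)   # E-step
        s10 = sum(xs[t] * xs[t - 1] + pc[t] for t in range(1, n + 1))
        s00 = sum(xs[t - 1] ** 2 + ps[t - 1] for t in range(1, n + 1))
        phi_new = s10 / s00                                   # M-step
        if abs(phi_new - phi) < tol:                          # convergence check
            return phi_new
        phi = phi_new
    return phi


# Simulated data with an assumed true coefficient of 0.8
random.seed(0)
true_phi, x, y = 0.8, 0.0, []
for _ in range(500):
    x = true_phi * x + random.gauss(0.0, 1.0)
    y.append(x + random.gauss(0.0, 1.0))
phi_hat = em_phi(y, phi=0.3)
```

Restricting the M-step to a single parameter keeps the sketch short; a full implementation would also update the noise variances and the initial state, and the interval version would carry out these computations in the interval arithmetic of Section 2.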
Dr. O. Al-Gahtani extends his appreciation to the Research Center of Teachers College, King Saud University for funding his work through the research group project No. RGP-TCR-07. The second and third authors would like to thank King Fahd University of Petroleum and Minerals for the excellent research facilities they provide.
- Chen G, Wang J, Shieh LS: Interval Kalman filtering. IEEE Trans. Aerosp. Electron. Syst. 1997, 33(1):250–259.
- Alefeld G, Herzberger J: Introduction to Interval Computations. Academic Press, San Diego; 1983.
- Rohn J: Inverse interval matrix. SIAM J. Numer. Anal. 1993, 3: 864–870.
- Bentbib AH: Conjugate directions method for solving interval linear systems. Numer. Algorithms 1999, 21: 79–86. 10.1023/A:1019149111226
- Kubica BJ, Malinowski K: Interval random variables and their application in queueing systems with long-tailed service times. SMPS 2006, 393–403.
- Chen W, Tan S: Robust portfolio selection using interval random programming. In: FUZZ-IEEE, Korea, 2009, August 20-24 (2009)
- Aubin J-P, Frankowska H: Set-Valued Analysis. Birkhäuser, Basel; 1990.
- Ekeland I, Témam R: Convex Analysis and Variational Problems. Classics in Applied Mathematics 28. SIAM, Philadelphia; 1999.
- Jazwinski A: Stochastic Processes and Filtering Theory. Academic Press, New York; 1970.
- Tanizaki H: Nonlinear Filtering: Estimation and Applications. Springer, Berlin; 1996.
- Bilmes JA: A gentle tutorial of the EM algorithm and its applications to parameter estimation for Gaussian mixture and hidden Markov models. Technical report TR-97–021, ICSI (1997)
- Dempster AP, Laird NM, Rubin DB: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. B 1977, 39: 1–38.
- Shumway RH, Stoffer DS: An approach to time series smoothing and forecasting using the EM algorithm. J. Time Ser. Anal. 1982, 3(4):253–264. 10.1111/j.1467-9892.1982.tb00349.x
- Shumway RH, Stoffer DS: Time Series Analysis and Its Applications. Springer, Berlin; 2006.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.