Automatic time series analytics with UComp!

Last August, 17th my paper on automatic identification of Unobserved Components models was finally published at the Journal of Statistical Software (https://www.jstatsoft.org/article/view/v103i09). The paper develops a methodology for running UC models without any prior assumptions. This methodology is implemented in UComp, a library written in several ‘languages’, like R, MATLAB/Octave and Python. R users may install the package from CRAN, MATLAB/Octave users may install it from GitHub (https://github.com/djpedregal), Python users should contact me directly.
There are many options to make a crafted manual estimation as well, with possibilities of automatic outliers identification, introduction of input variables, etc. Just have a go at it!
Any feedbacks are welcome!!

ISF2020

The 40th edition of the International Symposium of Forecasting 2020 just ended last Wednesday. It was initialled scheduled in June to be held at Rio de Janeiro, but we had to do it on-line (what a pity!).

I have contributed in two ways: i) giving a talk on session «Software and support systems II» entitled «Comprehensive automatic identification of Unobserved Components models», and ii) running a workshop on Thursday about «Modern State Space methods: a workshop for practitioners».

The talk was about UComp, a library for the automatic identification of structural unobserved components models (UC) useful both for time series decomposition and forecasting inspired by those of Andrew C. Harvey. The tool has been developed in C++ and is offered as a R package implemented with the help of the Rcpp api, and is also available as a MATLAB/Octave toolbox via mex files. Some of the salient features are: i) identification of models is done automatically by optimizing information criteria; ii) the type of models is more extense than usually assumed in this sort of models (damped trends, general ARMA coloured irregular components, etc.); iii) models allow for exogenous cycles and inputs; iv) automatic outlier detection is available; etc.

The workshop was about State Space modelling, and was mainly practical, covering some typical examples and more advanced as well, like Hodrick-Prescott filters, constant and time varying parameter regressions, dynamic harmonic regression, basic structural models, nesting in variables, time aggregation, hierarchical forecasting, and others. The last session was dedicated to the automatic identification of UC models with UComp.

Nada es Gratis

Hoy ha salido un post en el blog Nada es Gratis de José I Castillo y un servidor (aquí) sobre la crisis económica y las predicciones macroeconómicas que se nos vienen encima como consecuencia de la pandemia. Es obvio darse cuenta que la dificultad de tales predicciones es enorme, por lo que allí ofrecemos una sencilla herramienta que permite a quien se la descargue plantear sus propios escenarios, de forma que puede obtener la predicción del PIB para 2020. ¡Que aproveche!

ECOTOOL in PLOS ONE

Finally the ECOTOOL toolbox for Matlab was published in PLOS ONE (Time series analysis and forecasting with ECOTOOL).
It embodies several routines for identification, validation and forecasting of dynamic models. The toolbox includes a wide range of exploratory, descriptive and diagnostic statistical tools with visual support, designed in easy-to-use Graphical User Interfaces, and may be downloaded from here.
It also incorporates complex automatic procedures for identification, exact maximum likelihood estimation and outlier detection for many types of models available in the literature (like multi-seasonal ARIMA models, transfer functions, Exponential Smoothing, Unobserved Components, VARX). You may estimate, for example, exponential smoothing or unobserved components models with automatic identification of outliers, something I have never seen before. You also may run the automatic identification of ARIMA models with two seasonalities (like diurnal and weekly)…
Just with a few lines of code you may run a comprehensive analysis of time series. The toolbox is supplied with an in-depth documentation system and online help, and it containes many demos that will guide you through the process of time series modelling.
Have a go at it and keep in touch!

39th International Symposium on Forecasting

This conference just closed this evening at Thessaloniki (Greece). A big event with a lot of interesting people giving interesting talks, with a mixture of traditional greek dancing and many other exciting thigs to do / see / share.

I spoke about a library that hopefully will appear a soon as possible on the automatic identification of Unobserved Components (UC) models. The idea is as simple and as basic as saying that there are automatic identification algorithms for every sort of model around, except for UC models. However, such a simple idea is much more complex that one may think, especially if one is trying to do it in C++. The slides with preliminary results here.

SSpace tutorial

This entry is really a link to a short tutorial on how to deal in practice with SS systems with the help of SSpace in MATLAB.

SSpace is a piece of software designed to specify State Space models with a maximum of flexibility and a minimum of coding and with a lot of coding from the side of the developers that makes life much easier to the users. The tutorial is here.

M4 Conference

I just have been to this remarkable conference, that is a sort of spin-off of the M4 competition. It has been really nice, mainly because of the people involved and the high level of the talks. We had the chance to see how people from industry (big ones, actually, Amazon, Google, Microsoft, Uber and SAS) deal with the problem of forecasting in different contexts, as well as notorious academics.
The winners of the competition explained their methods and people from this big companies explained how their day-by-day forecasting problems look like. It was interesting to see how some of the presenters spoke of statistical or classical methods as opposed to Machine Learning or AI methods. The organizers did a great work in making the competition as transparent as possible and actually the results of all methods finally submitted may be replicated with the software provided by the contestants, making replicability straight away. The winners were a mixture of mature and quite young people, and statistical and «new» methods. The results were then well balanced and mark the way to go if we like to improve our forecasting abilities.
The environment was very cooperative and I had the chance to see quite open minded people in both sides of the academic and industry. A big pleasure!!

SSpace published in the Journal of Statistical Software

SSpace is a MATLAB toolbox for time series modeling and forecasting in a rather flexible and powerful State Space framework. It may be downloaded from Bitbucket, but the news is that it is now published in the Journal of Statistical Software (here).

There is a lot to do still, but any potential user that gives it a try would soon realize that it is very easy to use. I would recommend running the demos from beginning to end, since they are conceived as a tutorial that drives the user through the modeling process step by step. People who are not directly interested in State Space modeling may also take advantage of the toolbox, because there are several templates for implementing many sort of usual time series models in a very flexible way. In other words, there is no need to know anything about State Space approaches to exploit fully the toolbox.

There are still more toolboxes to come soon!!

R Course at the University of Seville

I have been invited to teach the course entitled ‘Introduction to Predictive Intelligence in R with applications’ at the Economic Analysis department PhD program in the University of Seville. Coming to Seville among good friends is always I gratifying experience.

The material of the course is here for anyone interested (in Spanish). The main objective of the course is to demonstrate the usefulness and power of R in the analysis of data, both in cross-sectional problems, as in time series and prediction. A set of predictive intelligence techniques such as regression, exponential smoothing, ad-hoc models and ARIMA will be progressively analysed. The course is eminently practical, with many examples and cases to analyze. It is divided into four sessions of two hours in which a theoretical introduction will be made for each technique, to then resolve the cases proposed in R. The course is of an introductory nature, but the students who complete it will have a good starting level to extend or adapt by themselves what they have learned to their own research topics. Attendees are encouraged to submit their own case studies.

ITISE2018

Last week I attended the ITISE2018 conference in Granada, Spain (5th International Conference on Time Series and Forecasting). It is a nice conference, not too big, quite multidisciplinary and in a rather nice setting. Attendants had also the chance to speak to really interesting people working on time series modelling and forecasting from many different perspectives, big data included. Some of the people around were first rank (Peter M Robinson or Andrew C Harvey among others are good examples).

I just gave a short talk (jointly prepared with D Villegas, M Villegas and JR Trapero) on SSpace (see slides here). This is a MATLAB toolbox for State Space modelling and forecasting in a very flexible and efficient way (it will come up soon in the Journal of Statistical Software). It has many advantages on other pieces of software about SS modelling:

  • It is free.
  • It incorporates up-to-date algorithms, models and methods on SS systems. It allows for rather general linear Gaussian models, non-Gaussian models and non-linear models.
  • The models are specified by writing a MATLAB function. In this way, the user has complete flexibility to specify the model.
  • Templates for general and particular models in SS forma are provided with the toolbox.
  • We have tried to make the toolbox as friendly as possible to the user. For example, there is only one function for filtering regardless of the type of model, this means that such function selects internally whether the model is linear-Gaussian, non-Gaussian or non-linear. Then, the user neither have to worry about the type of system is using nor select and remember the correct function name for the appropriate filtering function, reducing the risk of errors. This is just one example of many.
  • This design makes possible a full time series analysis with just a few functions with a code that is rather simple and repetitive, i.e. easy to remember. Even those who are not interested in State Space modelling would find the toolbox compelling. Function names were chosen following nemonic rules, so that the user have to remember just a few names and will be easy to locate those functions forgotten.
  • The toolbox is provided with eight examples with thorough explanations that introduce  the user to all the features of the toolbox.

In summary, a pleasant conference that gave me the opportunity to disseminate a bit the work we have done during the last years.