
A general caution about statistical models is that the available data has two major limitations. First one is it may not include the piece of data that really has the impact and influences the outputs the most. This piece of data may be not collected at all or simply dismissed as not important at the data selection process. Of course, a proper model building process should make this danger very small or non-existent. The second limitation comes with the fact that historical data is, well, historical and some new or possible developments could not be there at all. For example, no data set could be used to estimate changes in car market if women would be allowed to drive in an Arabic country. This has never happened so no data is available for the effect. Another example is for introduction of a lamp with better lifetime - the pure statistical approach would not give good answers for the effect on the lighting market effect of that.
Another danger for statistical models comes with the usage of huge data set - the bigger the data set the bigger the number of spurious co-variations, i.e. the relations found by pure coincidence. We could create a nice story around every detected relation but it would not make it any more real.
Sometimes I got the feeling that stat models strip the developers from responsibility - "this is what data shows".
Mechanical or Classic Models

Functional dependencies to be used in this type of models could be too complex to calculate or too rough due to insufficient knowledge about them. This situation is remedied by usage of approximating functions and that could kick back as some important insights could be missed.
An advantage of these models is that they provide a "what if" option to easily explore the outcomes with different sets of input files and assumptions. This includes also understanding the sensitivity to the initial parameters - the value ranges that provide same outcome is also a crucial piece of information.
Maybe the most important feature of classical models is that they are transparent - there is logical and calculation flows that could be traced and checked. This very helpful for the model acceptance in contrast to the statistical models where the statistical methods look like a black box for most of the managers while they understand well the logic of the market. This points to another important feature - this type of models require deeper understanding of the specifics of the model object. Stat models seem to be more independent from the subject.
Classical models require more precise definition of the questions to answer - the model built to a specific requirements and changing them could mean that it should be totally reworked and more input parameters obtained.
This post has been inspired by a discussion with a co-worker about the application and advantages of each model type. I hope it is a good starting point for further discussions and has some highlights to help better understand and select models.
No comments:
Post a Comment