13 Building Theories from Multiple Evidence Sources about the underlying phenomenon. Hayes is able only to speculate about some causes for this – for example, that the studies were run indifferent cultural contexts and by subjects with different levels of experience – but it is worth noting that these resulting hypotheses maybe of as much practical interest to the research community as a successful meta-analysis would have been.
A final application of meta-analysis in the software domain that is especially worthy of note was a study conducted by Galin and Avrahami (2005). These authors attempted to address the question of whether software quality assurance programs work by conducting a meta-analysis of studies examining the effects of the Capability Maturity Model (CMM) for software.
The authors point out that CMM has been one of the most widely-deployed software process improvement methods for an extended number of years, and so would be among the most likely approaches for which sufficient data would exist. For the same reason, this analysis was also a good test of the suitability of meta-analysis for software engineering research. In this case, the results were more positive 22 studies were found that examined the effects of the CMM on software
process improvement and, of these,
19 contained sufficiently detailed quantitative information to be suitable for analysis. The analysis did find substantial productivity gains when organizations achieved the initial improvement levels of the CMM (although data was missing that addressed higher levels of achievement).
In the end, the lesson learned about applying meta-analysis to software engineering seems to be that the heterogeneity of current empirical results is a major limitation in our ability to apply meta-analytic procedures (Miller, 2000). Because of the large amounts of variation from so many different context variables, which exists in any set of software
engineering experiments, we maybe unable to generate statistically definitive answers for many phenomena other than those with the largest effect sizes (e.g., organizations going from an undisciplined development process to achieving initial levels of the CMM). This is true even in cases which seem to lend themselves to cross-study analysis, for example, topics for which there
is a rich body of studies, some of which may even be replications of one another. For many other topics of interest which do not have such a rich set of studies, which tend to be the ones of most interest to researchers and practitioners, it is still an open question whether the studies undertaken so far are additive and can be combined via meta-analysis to contribute to an eventual body of knowledge.
Share with your friends: