Web service QoS prediction using improved software source code metrics

Sarathkumar Rangarajan; Huai Liu; Hua Wang

doi:10.1371/journal.pone.0226867

Abstract

Due to the popularity of Web-based applications, various developers have provided an abundance of Web services with similar functionality. Such similarity makes it challenging for users to discover, select, and recommend appropriate Web services for the service-oriented systems. Quality of Service (QoS) has become a vital criterion for service discovery, selection, and recommendation. Unfortunately, service registries cannot ensure the validity of the available quality values of the Web services provided online. Consequently, predicting the Web services’ QoS values has become a vital way to find the most appropriate services. In this paper, we propose a novel methodology for predicting Web service QoS using source code metrics. The core component is aggregating software metrics using inequality distribution from micro level of individual class to the macro level of the entire Web service. We used correlation between QoS and software metrics to train the learning machine. We validate and evaluate our approach using three sets of software quality metrics. Our results show that the proposed methodology can help improve the efficiency for the prediction of QoS properties using its source code metrics.

Citation: Rangarajan S, Liu H, Wang H (2020) Web service QoS prediction using improved software source code metrics. PLoS ONE 15(1): e0226867. https://doi.org/10.1371/journal.pone.0226867

Editor: Xiaodi Huang, Charles Sturt University, AUSTRALIA

Received: September 19, 2019; Accepted: December 8, 2019; Published: January 15, 2020

Copyright: © 2020 Rangarajan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data files are available from https://github.com/sarath08/WS_aggregation.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

Service-Oriented Architecture (SOA) has become a platform for processing large amounts of information and knowledge to provide essential data and services to the users [1, 2]. Web service is the basic unit for a SOA. Web Services(WS) are loosely coupled application programs designed to support business-to-business interoperability using XML based Simple Object Access Protocol(SOAP) [3, 4]. Developers write and publish Web services at service repositories where users can discover, deploy and orchestrate them according to their business needs.

The success of cloud computing depends heavily on the quality of services provided in wireless terminals with limited computing and storage power such as mobile phones [5–7]. WS are of paramount importance among different kinds of cloud services. Indeed the number of WSs is rapidly increasing every day [8]. According to an open source WS repository “programmableweb.com”, they have 22,367 WSslisted. Due to the proliferation of WSs, there are numerous functionally similar services available [9, 10]. Consequently, searching for WS using the functionality keyword is no longer valid. Therefore, non-functional property such as Quality of Service (QoS) have become pivotal criterion in Web service discovery, selection, recommendation and orchestration [11].

Many QoS-driven Web service selection techniques have been proposed and successfully utilized in SOA [12–14]. However, they use QoS values for the Web services made available by the repositories. The QoS data accessible at service repositories have challenges due to some real-world scenarios as stated below:

Web service repository turns into a complex hierarchy in various situations. Therefore, end users would take quite a lot of time to go through enormous QoS records [15].
Public service repository such as Universal Description Discovery and Integration (UDDI) may hold untrustworthy QoS information due to lack of monitoring. Thus, they might list unavailable services and outdated QoS information for a user’ query [16].
Commercial Web services require users to pay a subscription fee to use. Thus, if a user want to test Web service by themselves end up paying considerable amount of money. Obviously, it is not practically possible for a user to monitor and collect QoS data for all the functionally similar WSs.
Some public repositories collect feedback from users to acquire QoS data. However, QoS information is influenced by network and geological factors. As Internet is dynamic and vulnerable, it is not possible to get the same QoS values for different users from diverse locations for a particular service [17, 18].

Even though service-level agreement (SLA) contains QoS parameters of a WS, users are still unsure with what is the quality it actually achieves. Certainly, a fundamental pre-request is to predict the QoS values instead of using the data available at repositories. Coscia JLO, Crasso M, Mateos C, Zunino A, and Misra S [19] found a statistically significant and strong relationship among a number of conventional sources code-level metrics and the catalogue of WSDL level service metrics. Observing software quality metrics is a prevalent methodology to evaluate software maintainability. Each Web service comprises of many micro-level software components such as class, method and package. Therefore, source code metrics are calculated at micro-level and aggregated into macro level to represent the entire software efficiently. We hereby list out some of the common practices in software industries for aggregating source code metrics and its disadvantages [20]:

Simple average: Calculating the mean of metric results for individual elements of a system might not be efficient enough to represent. Because, it does not express the standard deviation and may mitigate the effects of unwanted values in the generalized result. In other words, average function simply smoothed the results but does not reflect the reality.
Weighted average: Weight could be used to differentiate less important components from critical components. However, defining the weight is very critical and may introduce problem of its own.
Statistical aggregation methods: Central tendency measures such as mean, median or standard deviation cannot be trusted due to the highly-skewed distribution nature of software.

Impact of aggregation schemes for source code metrics on Web service QoS prediction remains unexplored. We hypothesize that the metrics aggregation scheme plays a vital role for high correlation between many metrics at the file-level. Furthermore, the performance of QoS prediction models may be negatively affected by the potential loss of information due to summation and aggregation.

In this paper, we investigate the impact of source code metrics aggregation on correlation between QoS attributes and source code metrics. Our investigation will be based on three different sets of quality metrics namely object oriented quality metrics proposed by Chidamber SR, and Kemerer CF [21], Complexity metrics proposed by H.M.Sneed [22] and maintainability suite by Baski and Misra [23].

The remainder of the paper is organized as follows: Section 2 presents the motivation and specifications of the research. Section 3 highlights the related work in the research background. Section 4 briefs about the methodologies used to aggregate code metrics and the configuration of the learning machine. Section 5 explains the experimental set up used to validate the proposed approach and discusses the results on Web service QoS prediction. Finally, Section 6 concludes the paper with vision for future work.

2 Problem specification

The aim of the research is to investigate the impact of source code metrics aggregation to predict QoS properties. Chidamber and Kemerer explained source code metrics namely Lines of code, functional abstraction measurement, the coupling between object classes, average method complexity, weighted methods per class, McCabe’s cyclomatic complexity [21]. Coscia JLO, Crasso M, Mateos C, Zunino A, and Misra S introduces the possibility to predict the service interface maintainability or QoS by applying traditional software metrics in service implementation. They used source code metrics as a primary pointers to support the software programmers for developing services with more maintainability [19]. Mateos C, Crasso M, Zunino A, and Coscia JLO introduced an interesting correlation between the anti-patterns in the WSDLs and its object-oriented metrics identified from its source code [24]. Kumar L, Kumar M, and Rath SK presented a learning machine to predict the QoS properties such as reliability, response time, throughput, modularity, interoperability, availability, and testability by exploring the correlation between its object oriented software source code metrics [25].

However, less attention had been paid to the aggregation of source code metrics for the Web services so far. Even though a Web service can act as a single standalone system, it comprises of many methods and classes. Thus, aggregation of different micro-level metric values for each pieces of code helps to obtain a single value for global evaluation of the Web service. Arithmetic mean is the predominantly used aggregation function for most of the performance evaluation metric calculation models. However, most of the software quality metric values are much skewed in nature. Consequently, the simple mean function is not reliable against this kind of distributions. A known approach to reduce this problem is to select a known family of distribution such as log-normal, exponential or negative binomial and aggregate the metric value by fitting its observed parameters. This method however is not viable because whenever a new metric is introduced, we need to repeat the fitting process.

As a response to these challenges, we proposed a methodology to calculate source code metrics using micro level software components. We propose an aggregation model for Web service at micro-level by utilizing an inequality measure Theil index. Theil index is widely used in econometrics to study inequality of welfare or income distribution among various groups of people. Distribution of data in econometrics is very much like the data distribution happens in software engineering. As it is having the potential to summarize a large amount of data, it has been proposed recently as an aggregation scheme for software source code metrics.

To achieve the goal of this study, the objectives will be as follows:

Extracting the source code metrics for micro level attributes from Web service description file named Web Service Description Language (WSDL).
Calculating source code metrics values using the micro level software components.
Identifying the correlation between source code metrics (Example: Weighted methods per class, Lack of cohesion in methods) and QoS properties.
Automate the prediction of QoS properties using the correlation via Machine learning techniques.

We will extract the class files from the WSDL files obtained from QWS-WSDL dataset using WSDL2Java tool. We will calculate source code metrics at class level. Potential source code metrics will be obtained by feature selection and reduction for each quality of service. Finally, learning machine with various kernels will be trained to predict the QoS properties.

3 Related work

3.1 Web service & quality of service

In practice, it is very difficult for an end user to obtain QoS information. The user needs to spend large amount of resource, time and cost to invoke and measure QoS for all available Web services. Different users will get dissimilar QoS experiences while using the same Web service due to the dynamic nature of the network environment and geographically distributed locations [26]. Therefore, predicting the QoS properties of a Web service became an important step to be followed in service-oriented systems. Using available QoS values in invocation records to calculate the unavailable or missing QoS parameters is called QoS Prediction [27]. Collaborative Filtering (CF) technique is widely adapted in Web service community due to the success in commercial recommender systems. CF predicts unknown QoS values based on historical user data [28].

Predicted QoS values can be used as additional criteria to rank the matching results during the service discovery and selection process. Top ranked service holds an importance among the other services [29]. In service orchestration, considering the QoS of services is as important as combining functionalities of different services.

3.2 Software source code metrics

Mateos C, Crasso M, Zunino A, and Coscia JLO [24] discussed the methods available in code-first Web services to remove unnecessary anti-patterns. The authors worked on the hypothesis that the occurrence of anti-pattern can be avoided using object-oriented source code metrics. To find the occurrence of anti-pattern at WSDL level, they have considered eleven source code metrics such as: Average Parameter Count (APC), Response for the Class (RFC), Abstract Type Count (ATC), Coupling between the objects (CBO), Lack of Cohesion among the Methods (LOCM), Void Type Count (VTC), Cohesion among Methods of Class (CAM), Lack of cohesion in methods Henderson-Sellers version(LCOM3), Total Parameter Count (TPC), Weighted Methods per Class (WMC), and Empty Parameters Methods (EPM) [25, 30]. Mateos C, Crasso M, Zunino A, and Coscia JLO used a real-time Web service dataset to identify the correlation between object-oriented metrics and occurrence of anti-pattern by using well-known statistical methods. They also measured the impact of simple metric-driven code refactoring on the occurrence of anti-pattern to some of the generated WSDLs from the dataset. As a summary, Mateos C, Crasso M, Zunino A, and Coscia JLO observed that the complexity and maintainability of Web services can be predicted using object-oriented metrics and refactoring.

3.3 Correlation between source code metrics and QoS

A high correlation between traditional Object-oriented source code-level metrics and WSDL-level service metrics have been found by Charrad et al [29]. They used the most comprehensive and thoroughly evaluated set of metrics to calculate the maintainability of Web service using WSDL interfaces. The findings of this paper suggest that the software developers can avoid developing non-maintainable service by applying simple early code refactoring. As Java is widely used as a programming language to develop back-end services, the authors focused on java based Web services but their finding does not depend on the programming language.

Romano et al. tried to identify the list of source code metrics which can be used to predict Java interfaces that are vulnerable to change [31]. The source code metrics such as Lack of Cohesion among Methods (LCOM), Coupling between Object (CBO), Depth of inheritance tree (DIT), Number of Children (NOC), Weighted Method per Class (WMC), Response for Class (RFC), and Interface Usage Cohesion (IUC) were used along with fine-grained source code changes in interfaces of ten open-source Java-based systems [25, 30]. The correlation between the metrics of the source code and the fine-grained changes in the source code was tested empirically. Romano et al. concluded that the external interface cohesion source metrics have greatest association with the number of changes in source code.

3.4 Software metrics aggregation

Software metrics calculated at micro-level artefacts and aggregated to the macro-level artefact for the analysis. The popular aggregation technique used for source code metrics is mean [32], [33] even though there are increasing research works to demonstrate the inappropriateness of this technique [34], [35] due to the skewness of source code metrics distribution [36]. The sum is another popular aggregation technique. Chidamber et al. used the sum to aggregate complexity of individual methods to the class level in their metrics suite [21]. Alexander et al. [37] used the Theil index, a widely used inequality measure in econometrics to identify the wealth distribution to aggregate the software metrics values as they both share the same kind of data distribution. Theil index is not specific to a particular metric and can be used to aggregate a wide range of metrics.

4 Proposed work

4.1 Research framework

Fig 1 demonstrates our proposed structure and methodologies for this research. The system is made up of many steps and the first step is to extract java class files from WSDL files using WSDL2java software. The next step is to measure the various metrics of the source code. By following the steps described in Fig 1, we used CKJM_extended tool to measure the object-oriented source code metrics. For each Java class file, we calculate 19 metrics of source code. Then we use the Theil index as an aggregation technique to sum up a single value for all the metrics of the source code. We used Principal Component Analysis to remove irrelevant features to achieve dimensionality reduction. As a final step, we apply linear regression to predict the quality metrics for the Web services. By using the different combination of available source code metric sets, we got seven sets of metrics to evaluate the performance of regression learner. Performance of prediction model using different sets of metrics evaluated by calculating estimator evaluation metrics.

Download:

Fig 1. The proposed framework.

https://doi.org/10.1371/journal.pone.0226867.g001

4.2 Source code metrics aggregation

Data of wealth distribution inequality from economics and source code metrics of software are sharing similar structure. The Gini coefficient, a widely applied economics inequality measures attracted attention in the field of software metrics. It can be easily explained using Lorenz curve. However, the Gini coefficient has a major drawback as it cannot be decomposed [37]. Serebrenik et al. proposed another inequality measure named Theil index instead of Gini index as it is decomposable so it can be used not only to calculate the inequality but also to explain it. Moreover, it is not specific to any specific metrics so it can be used to aggregate wide range of metrics. [38]. Therefore, we preferred Theil index to calculate the aggregation for source code metrics.

4.3 Feature redcution & selection

We used Principal Component Analysis (PCA) as data preprocessing method for feature extraction and selection. For each PC (Principal Components), we calculate eigenvalue, variance percent, cumulative percentage and source code metrics interpreted. PCs with eigen value more than 1 are considered as a potential source code metrics coherts. Table 1 shows the PCA results of object oriented metrics. Out of 19 source code metrics, 12 metrics were identified as having potential to predict QoS properties. Tables 2 and 3 shows the PCA results for Baski & Misra metrics and Sneed’s metrics.

Download:

Table 1. PCA results for object oriented metrics.

https://doi.org/10.1371/journal.pone.0226867.t001

Download:

Table 2. PCA results for Baski & Misra metrics.

https://doi.org/10.1371/journal.pone.0226867.t002

Download:

Table 3. PCA results for Sneed’s metrics.

https://doi.org/10.1371/journal.pone.0226867.t003

4.4 Learning machine

The aim of this research is to examine the impact of aggregation methods of source code metrics (e.g. Lack of cohesion in methods, coupling between object classes) on predicting QoS characteristics (e.g. reaction time, accessibility, throughput, testability, interoperability, etc.). Therefore, We preferred to use a simple regression model called multiple linear regression model to employ the prediction. By fitting a linear equation to measured data, multiple linear regression aims to model the relationship between two or more independent variables and a response variable. We developed the training set according to the correlation standards defined in [25]. Then the number of latent variables need to be defined for each QoS property. The next step is to build a model to generate the value of the QoS property by using the training set knowledge base. The data set was submitted to a 10-fold cross-validated paired t-test analysis. In the 10-fold cross-validated paired t-test procedure we segment the dataset into 10 parts of equally sized, each of which is then used for analysis, while the remaining 10-1 parts (joined together) are used to train the regressor (i.e., generic k-fold cross-validation). For demonstrating the reliability of the regression learner, we used two different performance metrics (MAE, RMSE).

5 Experiments & analysis of results

5.1 Research questions

Our experimental study designed to answer the following two research questions:

5.1.1 RQ1: How does the proposed methodology improve the predictability of source code metrics?

A software system is not a single standalone system to provide the solution. Normally, it comprises many subordinate pieces of code such as class, method or function. Therefore, the software metrics must calculated in the micro level and should aggregated into macro level for representing the source code metric of a software system. Since software code metrics are highly skewed values, it is inappropriate to use simple statistical aggregation (mean, median, etc.,) methods. The proposed inequality distribution models are very successful in economics data, which is as skewed as software source code data.

5.1.2 RQ2: How are source code metrics used to predict quality of service properties of Web service using source code metrics?

During the software development life cycle, developers extract source code metrics to evaluate the maintainability to reduce the future issues with the system. Thus, source code metrics and quality of service properties are correlated. We will use the correlation between source code metrics and quality of service to predict the QoS of Web services. We use linear regression based learning machines to create a prediction model.

5.2 Variables and objects

5.2.1 Independent variable.

A technique under investigation is defined as independent variable. So, Theil index is selected as the independent variable for this research. Dutch statistician Henri Theil presented an inequality measure named Theil index [39]. Given a (continuous) univariate distribution function F with the support and the mean μ(F) the first Theil index is defined as: (1) The Theil index was introduced for the field of unequal income distribution aggregation. Let x₀ ∈ X is a particular value of income, F is a distribution of income in the population, and F(x₀) is the proportion of the population with the income x less than or equal to x₀. We calculate source code metrics at micro level and aggregate to represent the macro level software using Theil index. Metrics that have negative values can not be aggregated using Theil index because of the logarithmic calculation in its formula. Since logx for x ≤ 0 can produce an undefined value, Theil index may also be an undefined value if they contains non-positive values [37].

5.2.2 Dependent variable.

Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) are the two well-known statistical precision measurements utilized to assess the prediction results. MAE is the average absolute deviation of predictions to the ground truth data. For all test services and test QoS properties, MAE is calculated as: (2)

In the Eq 2, Q_ij denotes the observed QoS value of Web service j obtained from data set entry i; is the predicted QoS value; and N is the number of predicted values. The smaller value of MAE indicates better prediction result. RMSE can be expressed as: (3)

RMSE can be measured using Eq 3 to find out the differences between the actual and predicted values. Once the model yields more than 90% accuracy level, the learning machine can able to predict the five chosen QoS properties for a WSDL of a service.

5.2.3 Objects.

We used the WS dream dataset, an open source dataset produced by a group of researchers from The Chinese University of Hong Kong. WS dream dataset contains two versions of datasets and we used version 1 for the experiment. The dataset contains URLs of 5825 Web services and its response time and throughput readings from 339 geographically distributed users. The dataset also has more details about both the Web services and users such as IP address, country, continent, longitude, latitude, region, and city.

Our main goal of this research is to identify the implications of aggregation of source code metrics on QoS prediction. Therefore, we need to calculate the source code metrics for the target Web services using its Web Service Description Language (WSDL) is necessary. wsimport is a Java function to process WSDL file and extract class files for the corresponding Web services.

Since the dataset only contains URL for the WSDL of Web services, we developed a Java application to crawl the WSDL files using URLs from the dataset. Only 457 out of 5825 Web services have active WSDL available on the internet. As we mentioned earlier, wsimport was used to extract the Java class files. The number of class files per Web services ranges from one to 281. Fig 2 shows the number of class files per Web service. The x-axis represents the Web service ID and y-axis represents the number of Java files extracted from the Web service.

Download:

Fig 2. Number of Java files extracted from available WSDL.

https://doi.org/10.1371/journal.pone.0226867.g002

5.3 Empirical environment

5.3.1 Source code metrics.

We used three sets of metrics sucha as object oriented source code metrics proposed by Chidamber et al. [21], Baski and Misra Metrics [23] and Sneed’s metrics [22].

Chidamber and Kemerer Metrics: Ckjm_extended tool introduced by Chidamber et al. [21] can be utilized to calculate 19 size and structure software source code metrics from the generated Java class files. The metrics are Weighted methods per class, Depth of Inheritance Tree, Number of Children, Coupling between object classes, Response for a Class, Lack of cohesion in methods, Afferent coupling, Efferent coupling, Number of Public Methods for a class, Lack of cohesion in methods Henderson-Sellers version, Lines of Code, Data Access Metric, Measure of Aggregation, Measure of Functional Abstraction, Cohesion Among Methods of Class, Inheritance Coupling, Coupling Between Methods, Average Method Complexity, McCabe’s Cyclomatic Complexity.

We developed a Java based system to integrate with ckjm tool and calculated 19 metrics for all the 457 Web services. For the next step, we used R language to calculate the statistical calculation. R library called ineq have been used to calculate the Theil index I_Theil for the Web services.

Sneed’s tool: Sneed et al. [22] developed a tool named softAudit for measuring Web Service interfaces. The suite consists of nine different source code metrics to measure complexity of service interfaces: Data Complexity (Chapin Metric), Data Flow Complexity (Elshof Metric), Data Access Complexity (Card Metric), Interface Complexity (Henry Metric), Control Flow Complexity (Mccabe Metric), Decisional Complexity (Mcclure Metric), Branching Complexity (Sneed Metric), Language Complexity (Halstead Metric), Weighted Average Program Complexity. We used softaudit tool provided by Sneed et al. to calculate the 09 complexity metrics for all the Web service by processing micro-level class files. We also calculated 09 quality metrics using sneed’s tool namely Degree of Modularity, Degree Of Portability, Degree Of Reusability, Degree Of Testability, Degree Of Convertibility, Degree Of Flexibility, Degree Of Conformity, Degree Of Maintainability, Weighted Average Program Quality.

Baski and Misra Metrics: Baski and Misra [23] proposed a tool to compute six different complexity metrics of WSDL file These metrics are based on analyzing the WSDL and XSD schema elements. The metrics are Data Weight of a WSDL (DW), Distinct Message Ratio (DMR), Distinct Message Count (DMC), Message Entropy (ME), Message Repetition Scale (MRS) and Operations Per Service (OPS). Baski & Misra metrics have been calculated by processing the WSDL file for each web service rather than processing micro-level class files. We used Baski & Misra’s tool to calculate the six metrics for the 457 Web services WSDL files.

We used above mentioned three sets of quality metrics as an independent variables and quality metric values calculated using Sneed’s tool as a dependent variable. Normalization is very important to process the data for regression. So, we applied unsupervised normalization with the range of 0.0 to 1.0 using Weka tool. Then we grouped the metric sets with different combination to populate more sets of metrics to compare our results. Consequently, we have Chidamber and Kemerer Metrics (CKM), Baski and Misra Metrics (BSM), Sneed’s metrics (SM), CKM-BSM metrics, CKM-SM metrics, BSM-SM metrics and All metrics (AM). After grouping, we got seven different sets of metrics available. We applied linear regression to predict modularity, quality of service value calculated. The results show that Sneed’s metrics individually outperform the other sets of metrics. Object-oriented metrics also have the potential to predict the QoS value but not as efficient as Sneed’s metrics. Baski & Misra metrics have the lowest efficiency among the available group of metrics. In summary, metric values calculated at the micro-level have better QoS prediction efficiency than those at macro level.

5.4 Result analysis

Three Quality of service metrics namely Modularity, Testability, Maintainability, Reusability for the available Web service had been calculated using Sneed’ tool. All three sets of metrics and its combinations had been used to predict the metrics value using robust linear regression. Fig 3 shows the graph between actual modularity values and predicted modularity values. The RMSE and MAE values respectively, 0.284, 0.172 which not a better values for a linear prediction model. Fig 4 contains the comparative graph of predicted modularity values using CKJM metrics and actual modularity values. RMSE and MAE values are 0.017, 0.011. The prediction results are better than Baski & Misra metrics. The graph depicted in Fig 5 illustrates the comparative study between actual vs predicted modularity values using sneed’s metrics. The results shows the sneed metrics have higher potential to predict the quality metrics for the Web service compare to other two metrics. The RMSE and MAE values are 0.0085, 0.0057 that implies the prediction results are very good.

Download:

Fig 3. BM metrics vs modularity prediction.

https://doi.org/10.1371/journal.pone.0226867.g003

Download:

Fig 4. CKJM metrics vs modularity prediction.

https://doi.org/10.1371/journal.pone.0226867.g004

Download:

Fig 5. Prediction model for modularity using SM metrics.

https://doi.org/10.1371/journal.pone.0226867.g005

Figs 6, 7 and 8 shows the prediction results with different combination among the three sets of metrics. Neither combination produced better results than the sneed’ set of metrics. As shown in Fig 7, BSM-CKM metrics have least potential to produce better prediction of modularity. The prediction and actual modularity values using all metrics comparative graph shown in Fig 9. It also could not produce better results than SM metrics. Table 4 shows the summary of RMSE and AME values of different sets of metrics used for the robust linear prediction. Sneed has set of metrics as a stand-alone predictors to produce better results for modularity quality prediction. As stated in Figs 10, 11 and 12, all metrics combined produces slightly better result than BaSki and Misra (BSM) metrics for predicting testability. However, Sneed’s metrics yields way better results than the all metrics and BSM metrics.

Download:

Fig 6. Modularity prediction results for SM-CKM metrics.

https://doi.org/10.1371/journal.pone.0226867.g006

Download:

Fig 7. Modularity prediction results with BSM-CKM metrics.

https://doi.org/10.1371/journal.pone.0226867.g007

Download:

Fig 8. Modularity prediction results with BSM-SM metrics.

https://doi.org/10.1371/journal.pone.0226867.g008

Download:

Fig 9. Modularity predicted vs actual for All metrics.

https://doi.org/10.1371/journal.pone.0226867.g009

Download:

Fig 10. Testability prediction results with BSM metrics.

https://doi.org/10.1371/journal.pone.0226867.g010

Download:

Fig 11. Testability predicted vs actual for all metrics.

https://doi.org/10.1371/journal.pone.0226867.g011

Download:

Fig 12. Testability prediction results with Sneed’s metrics.

https://doi.org/10.1371/journal.pone.0226867.g012

Download:

Table 4. RMSE & MAE comparison for different sets of metrics for modularity.

https://doi.org/10.1371/journal.pone.0226867.t004

Figs 13, 14 and 15 shows that using all metrics combined and BSM metrics could not achieve better results for Reusability prediction. Sneed’ metrics produces slightly better results while compare with other sets of metrics. By comparing Figs 16 and 17, all metrics together produces better prediction results than BSM metrics. As stated in Fig 18 Sneed’ metrics shows better potential than over all metrics combined and BSM metrics for predicting Maintainability.

Download:

Fig 13. Reusability predicted vs actual for all metrics.

https://doi.org/10.1371/journal.pone.0226867.g013

Download:

Fig 14. Reusability prediction results with BSM metrics.

https://doi.org/10.1371/journal.pone.0226867.g014

Download:

Fig 15. Reusability predicted vs actual for Sneed’s metrics.

https://doi.org/10.1371/journal.pone.0226867.g015

Download:

Fig 16. Maintanability prediction results with BSM metrics.

https://doi.org/10.1371/journal.pone.0226867.g016

Download:

Fig 17. Maintainability predicted vs actual for all metrics.

https://doi.org/10.1371/journal.pone.0226867.g017

Download:

Fig 18. Maintanability prediction results with Sneed’s metrics.

https://doi.org/10.1371/journal.pone.0226867.g018

The comparison table shows that the CKM metrics and SM metrics individually produce better results. When CKM and SM combined the efficiency of the prediction quality is reduced indicated by the increasing MAE and RMSE values. BSM metrics have lower potential to predict the quality values individually. The results of BSM combined with SM is better than CKM. As a conclusion, we demonstrated that the CKM and SM metrics that are calculated from micro level have higher potential to predict the quality of a Web service.

6 Conclusion

QoS values have become an essential criterion to choose a suitable service from abundance of functionally similar Web services. On the other hand, service providers do not provide adequate QoS data, while unequal computing and network environment make the user provided QoS data invalid. Therefore, predicting the QoS values of Web service is an important step in service-oriented systems. One of the independent methods to predict the quality parameters of the Web service is to utilize the software code metrics. Web service is not just a single system, but it contains many classes and methods. Thus, source code metrics should be calculated from the micro level such as classes and methods to a macro level system. However, most of the current systems either calculates the code metrics at a macro level or use basic arithmetic average and mean to lift the metrics from class level to system level. Such methods may cover up the inefficient values and provide the values of a rounded-up metric for the predictor.

In this paper, we investigated the impact of calculating source code metrics from a micro level on predicting the quality metrics. We used three sets of metrics namely CKM, BSM and SM to validate our system. For CKM metrics, we calculated source code metrics at the class level and used the Theil index, a method to aggregate source code metrics without compromising the distributed nature of the software source code. For SM metrics, we used Sneed’s tool to calculate complexity metrics from the Web service class files. BSM metrics have been calculated using its macro-level WSDL file instead of class files. We applied linear regression to predict modularity, quality of service value calculated. The results show that SM metrics individually outperform the other sets of metrics. CKM metrics also have the potential to predict the QoS value but not as efficient as SM metrics. BSM metrics have the lowest efficiency among the available group of metrics. In summary, metric values calculated at the micro level have better QoS prediction efficiency than those at macro level.

References

1. Khalil F, Li J, Wang H. An integrated model for next page access prediction. IJ Knowledge and Web Intelligence. 2009;1(1/2):48–80.
- View Article
- Google Scholar
2. Khalil F, Wang H, Li J. Integrating markov model with clustering for predicting web page accesses. In: Proceeding of the 13th Australasian World Wide Web Conference (AusWeb07). AusWeb; 2007. p. 63–74.
3. Al-Shammary D, Khalil I, Tari Z, Zomaya AY. Fractal self-similarity measurements based clustering technique for SOAP Web messages. Journal of Parallel and Distributed Computing. 2013;73(5):664–676.
- View Article
- Google Scholar
4. Li J, Liu C, Zhou R, Wang W. XML keyword search with promising result type recommendations. World wide web. 2014;17(1):127–159.
- View Article
- Google Scholar
5. Peng M, Zeng G, Sun Z, Huang J, Wang H, Tian G. Personalized app recommendation based on app permissions. World Wide Web. 2018;21(1):89–104.
- View Article
- Google Scholar
6. Li J, Liu C, Xu J. XBridge-Mobile: efficient XML keyword search on mobile web data. Computing. 2014;96(7):631–650.
- View Article
- Google Scholar
7. Li M, Sun X, Wang H, Zhang Y, Zhang J. Privacy-aware access control with trust management in web service. World Wide Web. 2011;14(4):407–430.
- View Article
- Google Scholar
8. Su K, Xiao B, Liu B, Zhang H, Zhang Z. TAP: a personalized trust-aware QoS prediction approach for web service recommendation. Knowledge-Based Systems. 2017;115:55–65.
- View Article
- Google Scholar
9. Huang X. Usageqos: Estimating the qos of web services through online user communities. ACM Transactions on the Web (TWEB). 2013;8(1):1.
- View Article
- Google Scholar
10. Huang X, Huang W, Lai W. Uip: Estimating true rating scores of services through online user communities. In: 2016 IEEE Symposium Series on Computational Intelligence (SSCI). IEEE; 2016. p. 1–7.
11. Chen Z, Shen L, Li F, You D. Your neighbors alleviate cold-start: On geographical neighborhood influence to collaborative web service QoS prediction. Knowledge-Based Systems. 2017;138:188–201.
- View Article
- Google Scholar
12. Gupta R, Kamal R, Suman U. A QoS-supported approach using fault detection and tolerance for achieving reliability in dynamic orchestration of web services. International Journal of Information Technology. 2018;10(1):71–81.
- View Article
- Google Scholar
13. Li S, Wen J, Luo F, Gao M, Zeng J, Dong ZY. A New QoS-Aware Web Service Recommendation System Based on Contextual Feature Recognition at Server-Side. IEEE Transactions on Network and Service Management. 2017;14(2):332–342.
- View Article
- Google Scholar
14. Zhang Y, Lyu MR. QoS-Aware Web Service Searching. In: QoS Prediction in Cloud and Service Computing. Springer; 2017. p. 81–103.
15. Xu J, Zhu C, Xie Q. An Online Prediction Framework for Dynamic Service-Generated QoS Big Data. In: International Conference on Database Systems for Advanced Applications. Springer; 2017. p. 60–74.
16. Lin SY, Lai CH, Wu CH, Lo CC. A trustworthy QoS-based collaborative filtering approach for web service discovery. Journal of Systems and Software. 2014;93:217–228.
- View Article
- Google Scholar
17. Chen Z, Shen L, You D, Li F, Ma C. Alleviating Data Sparsity in Web Service QoS Prediction by Capturing Region Context Influence. In: International Conference on Collaborative Computing: Networking, Applications and Worksharing. Springer; 2016. p. 540–556.
18. Alexander H, Khalil I, Cameron C, Tari Z, Zomaya A. Cooperative web caching using dynamic interest-tagged filtered bloom filters. IEEE Transactions on Parallel and Distributed Systems. 2014;26(11):2956–2969.
- View Article
- Google Scholar
19. Coscia JLO, Crasso M, Mateos C, Zunino A, Misra S. Predicting web service maintainability via object-oriented metrics: a statistics-based approach. In: International Conference on Computational Science and Its Applications. Springer; 2012. p. 29–39.
20. Mordal K, Anquetil N, Laval J, Serebrenik A, Vasilescu B, Ducasse S. Software quality metrics aggregation in industry. Journal of Software: Evolution and Process. 2013;25(10):1117–1135.
- View Article
- Google Scholar
21. Chidamber SR, Kemerer CF. A metrics suite for object oriented design. IEEE Transactions on software engineering. 1994;20(6):476–493.
- View Article
- Google Scholar
22. Sneed HM. Measuring web service interfaces. In: Web Systems Evolution (WSE), 2010 12th IEEE International Symposium on. IEEE; 2010. p. 111–115.
23. Baski D, Misra S. Metrics suite for maintainability of extensible markup language Web Services. IET software. 2011;5(3):320–341.
- View Article
- Google Scholar
24. Mateos C, Crasso M, Zunino A, Coscia JLO. Detecting WSDL bad practices in code–first Web Services. International Journal of Web and Grid Services. 2011;7(4):357–387.
- View Article
- Google Scholar
25. Kumar L, Kumar M, Rath SK. Maintainability prediction of web service using support vector machine with various kernel methods. International Journal of System Assurance Engineering and Management. 2017;8(2):205–222.
- View Article
- Google Scholar
26. Chen Z, Shen L, Li F. Exploiting Web service geographical neighborhood for collaborative QoS prediction. Future Generation Computer Systems. 2017;68:248–259.
- View Article
- Google Scholar
27. He P, Zhu J, Xu J, Lyu MR. A Hierarchical Matrix Factorization Approach for Location-Based Web Service QoS Prediction. In: Service Oriented System Engineering (SOSE), 2014 IEEE 8th International Symposium on. IEEE; 2014. p. 290–295.
28. Zhu J, He P, Zheng Z, Lyu MR. A privacy-preserving QoS prediction framework for web service recommendation. In: Web Services (ICWS), 2015 IEEE International Conference on. IEEE; 2015. p. 241–248.
29. Charrad M, Ayadi NY, Ahmed MB. A semantic and QoS-aware broker for service discovery. Journal of Research and Practice in Information Technology. 2012;44(4):387.
- View Article
- Google Scholar
30. Kumar L, Krishna A, Rath SK. The impact of feature selection on maintainability prediction of service-oriented applications. Service Oriented Computing and Applications. 2017;11(2):137–161.
- View Article
- Google Scholar
31. Romano D, Pinzger M. Using source code metrics to predict change-prone java interfaces. In: 2011 27th IEEE International Conference on Software Maintenance (ICSM). IEEE; 2011. p. 303–312.
32. Lanza M, Marinescu R. Object-oriented metrics in practice: using software metrics to characterize, evaluate, and improve the design of object-oriented systems. Springer Science & Business Media; 2007.
33. Suh SD, Neamtiu I. Studying software evolution for taming software complexity. In: 2010 21st Australian Software Engineering Conference. IEEE; 2010. p. 3–12.
34. Lumpe M, Mahmud S, Vasa R. On the use of properties in java applications. In: 2010 21st Australian Software Engineering Conference. IEEE; 2010. p. 235–244.
35. Shatnawi R, Li W, Swain J, Newman T. Finding software metrics threshold values using ROC curves. Journal of software maintenance and evolution: Research and practice. 2010;22(1):1–16.
- View Article
- Google Scholar
36. Barkmann H, Lincke R, Löwe W. Quantitative evaluation of software quality metrics in open-source projects. In: 2009 International Conference on Advanced Information Networking and Applications Workshops. IEEE; 2009. p. 1067–1072.
37. Serebrenik A, van den Brand M. Theil index for aggregation of software metrics values. In: Software Maintenance (ICSM), 2010 IEEE International Conference on. IEEE; 2010. p. 1–9.
38. Vasa R, Lumpe M, Branch P, Nierstrasz O. Comparative analysis of evolving software systems using the Gini coefficient. In: 2009 IEEE International Conference on Software Maintenance. IEEE; 2009. p. 179–188.
39. Theil H. Economics and information theory. Studies in mathematical and managerial economics. North-Holland Pub. Co.; 1967. Available from: https://books.google.com.au/books?id=VVNVAAAAMAAJ.

[ref1] 1. Khalil F, Li J, Wang H. An integrated model for next page access prediction. IJ Knowledge and Web Intelligence. 2009;1(1/2):48–80.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Khalil F, Wang H, Li J. Integrating markov model with clustering for predicting web page accesses. In: Proceeding of the 13th Australasian World Wide Web Conference (AusWeb07). AusWeb; 2007. p. 63–74.

[ref3] 3. Al-Shammary D, Khalil I, Tari Z, Zomaya AY. Fractal self-similarity measurements based clustering technique for SOAP Web messages. Journal of Parallel and Distributed Computing. 2013;73(5):664–676.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref4] 4. Li J, Liu C, Zhou R, Wang W. XML keyword search with promising result type recommendations. World wide web. 2014;17(1):127–159.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref5] 5. Peng M, Zeng G, Sun Z, Huang J, Wang H, Tian G. Personalized app recommendation based on app permissions. World Wide Web. 2018;21(1):89–104.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref6] 6. Li J, Liu C, Xu J. XBridge-Mobile: efficient XML keyword search on mobile web data. Computing. 2014;96(7):631–650.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref7] 7. Li M, Sun X, Wang H, Zhang Y, Zhang J. Privacy-aware access control with trust management in web service. World Wide Web. 2011;14(4):407–430.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref8] 8. Su K, Xiao B, Liu B, Zhang H, Zhang Z. TAP: a personalized trust-aware QoS prediction approach for web service recommendation. Knowledge-Based Systems. 2017;115:55–65.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref9] 9. Huang X. Usageqos: Estimating the qos of web services through online user communities. ACM Transactions on the Web (TWEB). 2013;8(1):1.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref10] 10. Huang X, Huang W, Lai W. Uip: Estimating true rating scores of services through online user communities. In: 2016 IEEE Symposium Series on Computational Intelligence (SSCI). IEEE; 2016. p. 1–7.

[ref11] 11. Chen Z, Shen L, Li F, You D. Your neighbors alleviate cold-start: On geographical neighborhood influence to collaborative web service QoS prediction. Knowledge-Based Systems. 2017;138:188–201.
View Article
Google Scholar

[28] View Article

[29] Google Scholar

[ref12] 12. Gupta R, Kamal R, Suman U. A QoS-supported approach using fault detection and tolerance for achieving reliability in dynamic orchestration of web services. International Journal of Information Technology. 2018;10(1):71–81.
View Article
Google Scholar

[31] View Article

[32] Google Scholar

[ref13] 13. Li S, Wen J, Luo F, Gao M, Zeng J, Dong ZY. A New QoS-Aware Web Service Recommendation System Based on Contextual Feature Recognition at Server-Side. IEEE Transactions on Network and Service Management. 2017;14(2):332–342.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref14] 14. Zhang Y, Lyu MR. QoS-Aware Web Service Searching. In: QoS Prediction in Cloud and Service Computing. Springer; 2017. p. 81–103.

[ref15] 15. Xu J, Zhu C, Xie Q. An Online Prediction Framework for Dynamic Service-Generated QoS Big Data. In: International Conference on Database Systems for Advanced Applications. Springer; 2017. p. 60–74.

[ref16] 16. Lin SY, Lai CH, Wu CH, Lo CC. A trustworthy QoS-based collaborative filtering approach for web service discovery. Journal of Systems and Software. 2014;93:217–228.
View Article
Google Scholar

[39] View Article

[40] Google Scholar

[ref17] 17. Chen Z, Shen L, You D, Li F, Ma C. Alleviating Data Sparsity in Web Service QoS Prediction by Capturing Region Context Influence. In: International Conference on Collaborative Computing: Networking, Applications and Worksharing. Springer; 2016. p. 540–556.

[ref18] 18. Alexander H, Khalil I, Cameron C, Tari Z, Zomaya A. Cooperative web caching using dynamic interest-tagged filtered bloom filters. IEEE Transactions on Parallel and Distributed Systems. 2014;26(11):2956–2969.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref19] 19. Coscia JLO, Crasso M, Mateos C, Zunino A, Misra S. Predicting web service maintainability via object-oriented metrics: a statistics-based approach. In: International Conference on Computational Science and Its Applications. Springer; 2012. p. 29–39.

[ref20] 20. Mordal K, Anquetil N, Laval J, Serebrenik A, Vasilescu B, Ducasse S. Software quality metrics aggregation in industry. Journal of Software: Evolution and Process. 2013;25(10):1117–1135.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref21] 21. Chidamber SR, Kemerer CF. A metrics suite for object oriented design. IEEE Transactions on software engineering. 1994;20(6):476–493.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref22] 22. Sneed HM. Measuring web service interfaces. In: Web Systems Evolution (WSE), 2010 12th IEEE International Symposium on. IEEE; 2010. p. 111–115.

[ref23] 23. Baski D, Misra S. Metrics suite for maintainability of extensible markup language Web Services. IET software. 2011;5(3):320–341.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref24] 24. Mateos C, Crasso M, Zunino A, Coscia JLO. Detecting WSDL bad practices in code–first Web Services. International Journal of Web and Grid Services. 2011;7(4):357–387.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref25] 25. Kumar L, Kumar M, Rath SK. Maintainability prediction of web service using support vector machine with various kernel methods. International Journal of System Assurance Engineering and Management. 2017;8(2):205–222.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref26] 26. Chen Z, Shen L, Li F. Exploiting Web service geographical neighborhood for collaborative QoS prediction. Future Generation Computer Systems. 2017;68:248–259.
View Article
Google Scholar

[63] View Article

[64] Google Scholar

[ref27] 27. He P, Zhu J, Xu J, Lyu MR. A Hierarchical Matrix Factorization Approach for Location-Based Web Service QoS Prediction. In: Service Oriented System Engineering (SOSE), 2014 IEEE 8th International Symposium on. IEEE; 2014. p. 290–295.

[ref28] 28. Zhu J, He P, Zheng Z, Lyu MR. A privacy-preserving QoS prediction framework for web service recommendation. In: Web Services (ICWS), 2015 IEEE International Conference on. IEEE; 2015. p. 241–248.

[ref29] 29. Charrad M, Ayadi NY, Ahmed MB. A semantic and QoS-aware broker for service discovery. Journal of Research and Practice in Information Technology. 2012;44(4):387.
View Article
Google Scholar

[68] View Article

[69] Google Scholar

[ref30] 30. Kumar L, Krishna A, Rath SK. The impact of feature selection on maintainability prediction of service-oriented applications. Service Oriented Computing and Applications. 2017;11(2):137–161.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref31] 31. Romano D, Pinzger M. Using source code metrics to predict change-prone java interfaces. In: 2011 27th IEEE International Conference on Software Maintenance (ICSM). IEEE; 2011. p. 303–312.

[ref32] 32. Lanza M, Marinescu R. Object-oriented metrics in practice: using software metrics to characterize, evaluate, and improve the design of object-oriented systems. Springer Science & Business Media; 2007.

[ref33] 33. Suh SD, Neamtiu I. Studying software evolution for taming software complexity. In: 2010 21st Australian Software Engineering Conference. IEEE; 2010. p. 3–12.

[ref34] 34. Lumpe M, Mahmud S, Vasa R. On the use of properties in java applications. In: 2010 21st Australian Software Engineering Conference. IEEE; 2010. p. 235–244.

[ref35] 35. Shatnawi R, Li W, Swain J, Newman T. Finding software metrics threshold values using ROC curves. Journal of software maintenance and evolution: Research and practice. 2010;22(1):1–16.
View Article
Google Scholar

[78] View Article

[79] Google Scholar

[ref36] 36. Barkmann H, Lincke R, Löwe W. Quantitative evaluation of software quality metrics in open-source projects. In: 2009 International Conference on Advanced Information Networking and Applications Workshops. IEEE; 2009. p. 1067–1072.

[ref37] 37. Serebrenik A, van den Brand M. Theil index for aggregation of software metrics values. In: Software Maintenance (ICSM), 2010 IEEE International Conference on. IEEE; 2010. p. 1–9.

[ref38] 38. Vasa R, Lumpe M, Branch P, Nierstrasz O. Comparative analysis of evolving software systems using the Gini coefficient. In: 2009 IEEE International Conference on Software Maintenance. IEEE; 2009. p. 179–188.

[ref39] 39. Theil H. Economics and information theory. Studies in mathematical and managerial economics. North-Holland Pub. Co.; 1967. Available from: https://books.google.com.au/books?id=VVNVAAAAMAAJ.

Figures

Abstract

1 Introduction

2 Problem specification

3 Related work

3.1 Web service & quality of service

3.2 Software source code metrics

3.3 Correlation between source code metrics and QoS

3.4 Software metrics aggregation

4 Proposed work

4.1 Research framework

4.2 Source code metrics aggregation

4.3 Feature redcution & selection

4.4 Learning machine

5 Experiments & analysis of results

5.1 Research questions

5.1.1 RQ1: How does the proposed methodology improve the predictability of source code metrics?

5.1.2 RQ2: How are source code metrics used to predict quality of service properties of Web service using source code metrics?

5.2 Variables and objects

5.2.1 Independent variable.

5.2.2 Dependent variable.

5.2.3 Objects.

5.3 Empirical environment

5.3.1 Source code metrics.

5.4 Result analysis

6 Conclusion

References