APIs play a significant role in the sharing, utilization and integration of information and service assets for enterprises, delivering significant business value. However, the documentation of service APIs can often be incomplete, ambiguous, or even non-existent, hindering API-based application development efforts. In this paper, we introduce an approach to automatically mine the fine-grained message formats required in defining the APIs of services and applications from their interaction traces, without assuming any prior knowledge. Our approach includes three major steps with corresponding techniques: (1) classifying the interaction messages of a service into clusters corresponding to message types, (2) identifying the keywords of messages in each cluster, and (3) extracting the format of each message type. We have applied our approach to network traces collected from four real services which used the following application protocols: REST, SOAP, LDAP and SIP. The results show that our approach achieves much greater accuracy in extracting message formats for service APIs than current state-of-art approaches.
|Title of host publication||IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER)|
|Publication status||Published - 05 Apr 2018|
|Event||25th IEEE International Conference on Software Analysis, Evolution and Reengineering: SANER 2018 - University of Molise , Campobasso, Italy|
Duration: 20 Mar 2018 → 23 Mar 2018
|Conference||25th IEEE International Conference on Software Analysis, Evolution and Reengineering|
|Period||20/03/18 → 23/03/18|
Hossain, M. A., Versteeg, S., Han, J., Kabir, A., Jiang, J., & Schneider, J-G. (2018). Mining accurate message formats for service APIs. In IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER) (pp. 266-276). IEEE Xplore. https://doi.org/10.1109/SANER.2018.8330215