Mining accurate message formats for service APIs

Md Arafat Hossain, Steve Versteeg, Jun Han, Ashad Kabir, Jiaojiao Jiang, Jean-Guy Schneider

Research output: Book chapter/Published conference paper, Conference paper, peer-review

11 Citations (Scopus)


APIs play a significant role in the sharing, utilization and integration of information and service assets for enterprises, delivering substantial business value. However, the documentation of service APIs is often incomplete, ambiguous, or even non-existent, hindering API-based application development efforts. In this paper, we introduce an approach to automatically mine the fine-grained message formats required in defining the APIs of services and applications from their interaction traces, without assuming any prior knowledge. Our approach includes three major steps with corresponding techniques: (1) classifying the interaction messages of a service into clusters corresponding to message types, (2) identifying the keywords of messages in each cluster, and (3) extracting the format of each message type. We have applied our approach to network traces collected from four real services that use the following application protocols: REST, SOAP, LDAP and SIP. The results show that our approach achieves much greater accuracy in extracting message formats for service APIs than current state-of-the-art approaches.
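The three steps named in the abstract can be illustrated with a toy sketch. The abstract does not describe the paper's actual techniques, so everything here is an assumption made for illustration: the whitespace tokenizer, the clustering key (first token plus token count), the rule that a keyword is a token constant at one position across a cluster, and the `<VAR>` placeholder are all hypothetical stand-ins, not the authors' method.

```python
from collections import Counter, defaultdict


def tokenize(message):
    # Assumption: messages are text and whitespace-delimited.
    return message.split()


def cluster_messages(messages):
    """Step 1 (toy stand-in): group messages by first token and token count."""
    clusters = defaultdict(list)
    for message in messages:
        tokens = tokenize(message)
        key = (tokens[0] if tokens else "", len(tokens))
        clusters[key].append(tokens)
    return clusters


def find_keywords(cluster, threshold=1.0):
    """Step 2 (toy stand-in): a position holds a keyword when its most common
    token appears in at least `threshold` of the cluster's messages."""
    keywords = {}
    for pos in range(len(cluster[0])):
        token, count = Counter(tokens[pos] for tokens in cluster).most_common(1)[0]
        if count / len(cluster) >= threshold:
            keywords[pos] = token
    return keywords


def extract_format(cluster):
    """Step 3 (toy stand-in): keep keywords, replace other positions with <VAR>."""
    keywords = find_keywords(cluster)
    return " ".join(keywords.get(pos, "<VAR>") for pos in range(len(cluster[0])))


def mine_formats(messages):
    # One format string per cluster, sorted for a stable result.
    return sorted(extract_format(c) for c in cluster_messages(messages).values())


# Hypothetical trace, invented for this example.
trace = [
    "GET /users/17 HTTP/1.1",
    "GET /users/42 HTTP/1.1",
    "DELETE /users/17 HTTP/1.1",
    "DELETE /users/99 HTTP/1.1",
]
print(mine_formats(trace))
```

On this invented trace the sketch yields one template per request method, with the varying path replaced by `<VAR>`. Real traces for protocols such as LDAP or SIP would need far more robust clustering and field-boundary detection than this heuristic provides.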
Original language: English
Title of host publication: 25th IEEE International conference on software analysis, evolution and reengineering (SANER) 2018 Campobasso, Italy
Publisher: IEEE Xplore
ISBN (Electronic): 9781538649695
ISBN (Print): 9781538649701 (Print on demand)
Publication status: Published - 05 Apr 2018
Event: 25th IEEE International Conference on Software Analysis, Evolution and Reengineering: SANER 2018 - University of Molise, Campobasso, Italy
Duration: 20 Mar 2018 to 23 Mar 2018 (proceedings)


Conference: 25th IEEE International Conference on Software Analysis, Evolution and Reengineering

