Description

The research problem was identified and defined by the external industry, GSK Global Pty Ltd. After several sessions with direct feedbacks from GSK Global, the draft software has been developed and approved by our industry partner. GSK provided funding for this collaboration.

Methodologies used for developing the software tool:

This is a classic scene text recognition problem. There were two main challenges in this project. They are –
1.Real-time imaging data analysis, feature extraction, pattern recognition using machine learning and post-processing of the extracted data.
2.Processing the whole computational analysis locally in hand-held mobile device

This project has implemented algorithms from several open-source repositories and journals. The data processing pipeline for the implementation of this imaging data analysis broadly used following methods:
•We used CRAFT model developed by Baek, Lee, Han, Yun, and Lee (2019) for alpha-numeric text detection
•For sequencing model, we used an End-to-End Trainable Neural Network for Image-based Sequence Recognition, CRNN developed by Shi, Bai, and Yao (2016).
•For Feature extraction, we used Resnet model He, Zhang, Ren, and Sun (2016).
•For Word and character level Sequence labelling, we used LSTM based model developed Ma and Hovy (2016) .
•We used sequence learners based on recurrent neural networks (RNNs) developed by Graves, Fernández, Gomez, and Schmidhuber (2006)
•For Scene Text Recognition Model, we used training model developed by J. Baek et al. (2019)
•Finally, we used a customized pattern recognition model based on ISO 6346.
Period04 Aug 202230 Jan 2023
Work forGSK Global Pty Ltd
Degree of RecognitionNational

Keywords

  • image processing
  • data analysis
  • text recognition
  • data management

Sustainable Development Goals

  • SDG 11: Sustainable Cities and Communities

Countries where activity occurred

  • Australia