The aim is to build a simple python app/service which decodes sequences of digits from natural images (live camera). The model is primarily built on Convolutional Neural Networks (CNN) and Recurrent Neural Networks(RNN) using Python-Tensorflow framework. The model uses Long short-term memory (LSTM) cells to model RNNs. The project is still in progress.