Dietary tracking today requires constantly updating numerous nutritional metrics, such as the calorie and fat content of the food being consumed. This nutrition information is primarily conveyed to consumers through the nutrition labels found on all packaged food products.
However, consumers often find it challenging to make use of the information on these labels, whether because they are unfamiliar with nutritional terms or because they lack the time and motivation. It is therefore valuable to automate the data collection and interpretation process using computer vision techniques. To make dietary tracking more manageable and enjoyable, we present a computer vision solution that extracts data directly from the nutrition label on the food product itself.
We built a nutrition information extraction module that takes images as input, classifies those containing nutrition labels, and performs extraction on them. The module uses computer vision and optical character recognition (OCR) techniques to extract the relevant data from the images.
The module accepts one or more images as input and retains only those that contain a nutrition label.
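One lightweight way to implement such a filter (a hypothetical sketch, not necessarily the module's actual classifier) is to run a quick OCR pass and keep only images whose text contains enough nutrition-label keywords. The keyword set and threshold below are illustrative assumptions:

```python
# Hypothetical keyword-based filter; the keyword set and min_hits threshold
# are illustrative assumptions, not the module's actual classifier.
LABEL_KEYWORDS = {"nutrition facts", "serving size", "calories", "% daily value"}

def is_nutrition_label(ocr_text: str, min_hits: int = 2) -> bool:
    """Return True if the OCR'd text looks like a nutrition label."""
    text = ocr_text.lower()
    return sum(kw in text for kw in LABEL_KEYWORDS) >= min_hits
```

Images that fail this check are simply dropped before the more expensive pre-processing and extraction stages run.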
The pre-processing stage transforms each image into a clean, organized format in which text and numerical values are easily identifiable. Since only the relevant details (calories, cholesterol, etc.) are needed, pre-processing removes noise and irrelevant information from the image. The region of interest, i.e. the nutrition label box, is then identified and segmented.
The OCR module takes the pre-processed images and extracts all the text they contain. We use the open-source Tesseract engine, which is maintained by Google. Tesseract's output contains some inaccuracies as well as extraneous information, such as lines of dashes and other irrelevant special characters.
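The OCR pass and the subsequent cleanup could be sketched as follows; the cleanup regex covering dash rows and stray separator characters is an illustrative assumption:

```python
import re

def ocr_image(image_path: str) -> str:
    """Run Tesseract on a pre-processed image (requires pytesseract + Tesseract)."""
    import pytesseract  # deferred import: needs the Tesseract binary installed
    from PIL import Image
    return clean_ocr_text(pytesseract.image_to_string(Image.open(image_path)))

def clean_ocr_text(text: str) -> str:
    """Drop empty lines and separator rows (dashes, pipes, etc.) from OCR output."""
    kept = []
    for line in text.splitlines():
        line = line.strip()
        if not line or re.fullmatch(r"[-_=~.|*]+", line):
            continue
        kept.append(line)
    return "\n".join(kept)
```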
The Tesseract output may contain errors such as misspellings or misidentified characters, as well as additional characters that are not present on the actual label. This module organizes the nutrient information into key-value pairs and writes them out as an Excel sheet.
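A sketch of the parsing step, assuming a fixed list of nutrient names and a simple value-plus-unit pattern (both illustrative assumptions):

```python
import re

# Nutrient names to look for; the list is illustrative, not exhaustive.
NUTRIENTS = ["calories", "total fat", "saturated fat", "trans fat", "cholesterol",
             "sodium", "total carbohydrate", "dietary fiber", "sugars", "protein"]

def parse_nutrients(text: str) -> dict:
    """Map each recognised nutrient line to its value-plus-unit string."""
    pairs = {}
    for line in text.lower().splitlines():
        for name in NUTRIENTS:
            if line.startswith(name):
                m = re.search(r"\d+(?:\.\d+)?\s*(?:mg|g|kcal)?", line[len(name):])
                if m:
                    pairs[name] = m.group(0).strip()
                break
    return pairs
```

The resulting dictionary can then be written to a spreadsheet, e.g. with pandas via `DataFrame([pairs]).to_excel(...)`.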
Our module also falls back to the Google Cloud Vision OCR when, after text correction, many key-value pairs are still missing from the output. It compares both outputs and returns the better one.
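The fallback decision and output comparison might be sketched as below. The expected-nutrient set and 50% coverage threshold are assumptions, and the Google Cloud Vision call itself is omitted:

```python
# Nutrients we expect a complete parse to recover (illustrative set).
EXPECTED = {"calories", "total fat", "cholesterol", "sodium",
            "total carbohydrate", "protein"}

def needs_fallback(pairs: dict, min_coverage: float = 0.5) -> bool:
    """Trigger the cloud OCR when too few expected nutrients were recovered."""
    return len(EXPECTED & pairs.keys()) / len(EXPECTED) < min_coverage

def pick_best(tesseract_pairs: dict, vision_pairs: dict) -> dict:
    """Return whichever parse recovered more of the expected nutrients."""
    return max(tesseract_pairs, vision_pairs,
               key=lambda p: len(EXPECTED & p.keys()))
```

Keeping the cheaper Tesseract pass as the default and invoking the paid cloud API only on poor parses keeps per-image cost low.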
Traditionally, the data entry process is done manually, which can take a week or more for all images and is prone to human error. Any type of image can be given as input to the nutrition extraction module; it identifies and processes only the nutrition label images. The module can process a single image or run in batch mode; it takes approximately 10 seconds per image and can process 200 images in under half an hour in batch mode.
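Batch mode can be as simple as mapping the per-image pipeline over the input paths; the thread pool and the `process` callable below are illustrative assumptions:

```python
from concurrent.futures import ThreadPoolExecutor

def run_batch(paths, process, workers=4):
    """Apply the per-image pipeline (classify -> pre-process -> OCR -> parse)
    to every path, preserving input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(process, paths))
```

Since the pipeline is I/O- and subprocess-bound (file reads, the Tesseract binary), a thread pool is a reasonable way to overlap work on multiple images.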
The mundane alternative is a handwritten or electronic log of eating habits, with tedious calculations to keep one's progress up to date. Health and fitness applications now provide automated means of tracking nutritional data, but many still require the user to input all of the necessary information, which demands tedious repetition. Our module can be plugged into a mobile or web application to provide a simple and accurate means of tracking diet.