Business Challenge: Our midsize insurance client specializing in medical insurance had a conventional lead processing system that involved handling leads from diverse sources, such as telemarketing, websites, and partnerships. The existing workflow of the client involved following up with leads through cold calling and offering insurance policies. However, managing a vast variety of leads was challenging and required significant investment in resources, including hiring, training, and paying agents.
Therefore, the client approached Predera with a goal to optimize their resources and develop an efficient lead scoring system using machine learning. They wanted to enhance their lead processing system and identify leads that were more likely to convert into customers, thereby reducing the burden on their agents and minimizing expenses.
The insurance company also asked for our assistance in improving their lead scoring process using machine learning. The client had observed that not all leads were of equal value. They noted that their agents could spend up to three hours making cold calls to potential leads and only sell one policy. Conversely, on another day, the same agent could sell two or three policies within just 30 minutes of a call. The client wanted to streamline their lead scoring process to better identify leads that were more likely to convert into customers and reduce the time and effort spent on leads that were less likely to convert.
After consulting with the insurance company, we decided to develop a predictive Lead Scoring model that could forecast the likelihood of a lead converting into a customer and finally purchasing a policy. This would enable insurance agents to prioritize leads with high scores, saving time and resources.
To build the solution we took the following steps:
During this stage, we encountered issues with:
To address these problems, our engineers developed a Python module that contained various blocks, such as:
This approach enabled our data scientists to overcome data challenges and gain insights that improved the data collection logic and strengthened the ML lead scoring solution.
Our second solution aimed to improve lead efficiency through the use of the same features as before, with a different target variable. This time, we aimed to predict a continuous score, representing the number of sold policies by the call time spent on the lead by insurance agents. However, we encountered issues related to messy data related to call duration, including answering machine data and poor aggregation.
To solve this problem, we employed
We encountered another challenge in testing and validation. We couldn't use only converted leads because we couldn't know whether leads were sold while the model was still running a prediction for lead efficiency. To address this, we used the previous lead scoring model to test only those leads with high scores predicted by the ML model, reducing the number of non-sold leads and achieving better business metrics results.
By using our models, agents save time and focus on leads with high conversion potential, optimizing sales workflow and performance. We also deployed the models using AIQ a proprietary MLOPs platform of Predera and integrated them into the customer's system to avoid any integration challenges. We provide maintenance support and are available to fine-tune or retrain the models if needed.