Industry Healthcare, Insurance
Specialization Or Business Function
Technical Function Analytics (Predictive Modeling, Machine Learning)
Technology & Tools
About the Project: We would like to have one or more algorithms built that can use large, healthcare paid claim data sets and identify those claims most likely to be part of an accident (motor vehicle, slip and fall, etc), as well as most likely to NOT be part of an accident. It is possible that there may need to be separate algorithms for the distinct accident types.
About Us: We perform subrogation-related activities including the identification of claims that should be paid by another liable party (auto-medical insurance, homeowner’s insurance, etc). Our current identification methodology includes an ETL process that mines paid healthcare claims based on a number of criteria. Selected claims (or a single claim) are aggregated into a case for investigation as to whether or not the claim relates to an accident or injury for which a third party is responsible.
About the Existing Process: Currently, our process leverages a proprietary rule set, by which we both identify cases we definitely want to open, as well as claims we know we want to reject. We review the existing claim on its own merit, as well as claims for the same patient within a reasonable time period, to determine whether the full episode of care appears to be part of an accident. We augment this advanced process with human review of claims that are classified as “possible” candidates for selection.
About the Data: We have the claims data, our current rule sets, and our outcomes data, which would be included in this project.
Claims Data includes:
Existing Case Outcomes Data includes:
We are going to provide an Excel header file which shows, in more detail, the data elements which would be made available during the project.
About the Model: It is expected that the algorithm will leverage features, both from the claims, and potentially beyond the claims, to predict the likelihood of a claim being part of an accident. Some potentially predictive features may include:
About the Deliverable: The expected outcome of this project is that we have an algorithm, which can be inserted into our existing ETL process, and provides a single, key metric: likelihood of this claim being part of an accident. In addition, it is expected that the data scientist will demonstrate the expected claims whose decisioning changed between actual historical decision and modeled decision so that we can estimate profitability lift and ensure a sense of comfort in the expected outcome of the model.
Other Notes:
Matching Providers