Hi there đ, weâre Adesh and Shivam. Between Oct â23 and Feb â24 we spent many hours on Maya Health, a startup idea focused on automating medical coding.
We ultimately decided to pivot away from this idea, but we learned a lot through interviewing customers and industry experts, and building the product. We wanted to share some of those learnings with you.
Part 1. A new tech wave đ
When we started exploring potential startup ideas, ChatGPT was close to a year old and there was buzzing excitement around the possibilities of LLMs and Generative AI. Although we were skeptical that a blind adoption of LLMs alone could solve meaningful problems, we were excited by how this tech extended the software toolkit.
We spent time understanding how the core technology worked, mapping out new capabilities that LLMs were uniquely good at, and also itâs noteworthy limitations.
In parallel, we also explored unconventional service and usage-based business models that are made more exciting by LLMs. EvenUp, for example, uses in-house lawyers and paralegals greatly augmented by AI tools to help create âdemand packagesâ for law firms, an essential and time consuming work product for personal injury lawyers.
Part 2. The medical coding problem
Exploring the healthcare landscape
We both had worked in healthcare for years and had a few starting hypotheses about how new AI capabilities could help solve major inefficiencies in the industry. In particularly, we knew burnout rates in medicine were startlingly high. A major cause was physicians being tasked with non-clinical work they mostly hated.
In a poll conducted by Doximity, 46% of physicians said reducing the admin burden would be the most effective change to reduce burnout rates; more impactful than improving work-life balance or reducing clinical caseloads!
We began by mapping the workflows, paying particularly close attention to documentation requirements and money flows.
Medical coding
We quickly came across a potential candidate for a set of tasks that could be dramatically improved by LLMs: medical coding. Medical coding is the process of translating largely free text physician notes that contain healthcare diagnoses, procedures, and medical services into universal medical alphanumeric codes (e.g., ICD-10, CPT). The codes have expanded as the set of diagnoses and procedures has increased; in 2024 there were close to 100,000 codes across ICD and CPT standards.
Medical coding is a common practice globally, and itâs primarily used to collect standardized data for public health and scientific research. In the US ecosystem however, its role is broader. Medical codes are an integral part of insurance claims that all providers, from independent clinics to massive hospital groups, need to complete for every single patient encounter to get collect revenue.
Medical coding is still largely a manual process, conducted by a combination of in-house coders or consultants. Some key facts we found convincing about the medical coding problem:
- Manual coding is error prone, with the median encounter having a 20% error rate
- Denied claims due to coding errors result in the loss or delay of about $20 billion in the U.S.
- Many claims processing is delayed due to coding backlog which extends the AR cycle
- There is a 30% shortage of medical coders nationwide, with an imminent wave of retirements in the next five years (AMA, 2023)
Part 3. Validating the idea
We felt it was easier to get feedback on a concrete product pitch so we spun up a mock landing page. This was the primary value proposition we were testing:
NOTE
The Maya AI platform automatically codes over 90% of encounters and gets claims ready to bill. Our in-house coding team can complete the rest.
Customer and expert interviews
Healthcare is a labyrinthian market and is fragmented across provider type, provider size, state, EHR, funding model and medical specialty. We knew the needs of each segment could be markedly different so we wanted to start narrow and interview small to medium sized providers.
We interviewed 10 medical coders and physicians that worked at mid-sized mental health, family physicians and urgent care practices. We also spoke with 6 experts that had worked as medical coding consultants or ran revenue cycle management outsourcing firms.
đŹÂ Main themes from customer and expert interviews
- Large variation between medical specialty: Medical coding is more challenging and is a real burden in medical specialties where there is either a wide range of procedures (e.g. Emergency Medicine, Family Medicine, Urgent Care) or particularly complex procedures with a large number of decisions (e.g. Cardiac Surgery, Opthamalogy). However, meaningful segments of the market including specialized outpatient practices such as mental health or radiology donât feel that medical coding is a challenge.
- Midsized clinics offshore most of the medical coding work: Although there is an informal certificate body called the AAPC, medical coders donât require a license to practice. This has resulted in an off shoring of a large portion of medical coding work in mid sized clinics to Latin America and South Asia and has drove down cost significantly.
- The quality of physician documentation greatly affects coding: Physicians are often unaware of the medical coding and billing guidelines, they tend to under-document visits, miss documenting certain consultations or procedures, or incorrectly code encounters (at practices where physicians also do medical coding). This leads to under-billing, claim denials, or several follow-ups by the medical coder to assign the correct codes.
- Unaware of net impact of medical coding errors: Many small to mid-sized medical clinics donât diligently track and attribute the cause of insurance claim denials. As a result, they donât have a good grasp on the revenue theyâre losing out on because of medical coding related errors or delays.
- Every individual site requires EHR integration: Every individual site requires EHR integration which is still a largely manual process that requires expensive software engineers.
Key product and business model learnings
Along with running customer and expert interviews, we created a medical coding prototype, and a product market fit canvas to flesh out other elements of the business.
đ€Â Product and business model learnings
- Poor unit economics for mid-market customers: Every single clinic we deployed our coding automation product to requires a custom EMR integration and dedicated implementationâwe estimated these integrations would take 1-3 weeks of software engineering time and cost thousands of dollars. We explored vendors that could help accelerate this process (e.g. Metriport), but the cost of goods sold (COGS) remained significant. Also, the downward price pressure created by offshoring medical coding tasks to cheaper talent meant that the per client revenue was compressing over time.
- Conclusions from prototyping: The extremely low performance (micro F1 = 0.2) on MIMIC-4 using the out of the box Gemini Pro model highlighted several challenges that would make real-world implementation risky.
- Clinical coding requires exceptional precision since errors can impact patient care, billing accuracy, and regulatory compliance. The high dimensionality of ICD-10 (with over 70,000 codes) combined with complex coding rules and guidelines makes this a particularly challenging task for LLMs.
- Clinicians often document conditions using varied terminology and may include relevant historical context across multiple notes, requiring sophisticated temporal reasoning that current LLMs struggle with. Additionally, many codes require specific documentation elements to be present to justify their use - subtle distinctions that appear to be beyond current model capabilities.
- The hierarchical nature of ICD-10 means that selecting a code thatâs close but not exactly right can have significant downstream implications. With such a low F1-score, implementing this system would likely increase the workload for coding professionals, as they would need to verify and correct most of the modelâs predictions.
While we could go down the route of fine-tuning the LLM on a large dataset of validated clinical notes and ICD-10 codes, set up robust validation against official ICD-10 documentation, and implement monitoring for code accuracy, we decided against it.
Part 4. Pivoting away from medical coding
We remain excited about building products for small and midsized medical clinics, and generally are very optimistic about tech driven innovation in healthcare. But we decided to pivot away from medical coding as:
- Medical coding is not a hair on fire problem: Small and midsized clinics are often run by physicians who donât have the time to rigorously understand how much revenue theyâve lost due to medical coding errors. As a result, theyâre not actively searching for solutions that improve this element of revenue cycle management.
- Unattractive economics: Serving smaller and mid-market clinics with tech that requires EHR integration is challenging because of the large one-time integration cost relative to the annual revenue. This results in long payback periods.
- Services heavy: Over the next 5 years, we think medical coding can be close to fully automated in most medical specialities. However, as weâre not certainty how long it will take, the interim solutions require full-time or contracted staff to be in the loop which is expensive and further hurts the economics of the business.
Part 5. Our key learnings
- Solve a top 3 problem: In a B2B context, a problem thatâs not within the top 3 priorities for a buyer wonât get prioritized and solved. We asked customers to stack rank the problems theyâre facing using this question, âGiven you have 50 things to work on, is this problem top 5, bottom 5, or somewhere in the middle and why?â
- Relative product improvement matters: The goal should be to design a way of completing an existing customer âjobâ that is, along some axis, dramatically better. Incremental solutions donât generate the support and energy required to get a new startup off the ground.
- Understand macro and micro: In complex, heavily regulated environments itâs worth investing the time to understand and eventually become an expert in the broader industry. The macro industry context informs the day-to-day decisions related to customer, product and strategy.
- Talk to experts: The quickest way to understand the macro context of an industry is to talk to real experts who will have a large set of experiences that can help you quickly understand the history, influential trends, and red herrings in the industry. Founders and ex-founders are particularly excellent source of insight.
- Model unit economics: The exercise of creating a simple financial model early on in the discovery process sparks questions and conversations that are incredibly valuable. The model itself is almost certainly wrong, but accuracy is not the point anyways.