If your VLM based pipeline is really that good for OCR - and no reason to believe it cant be - why dont you just launch that as a product. The way I see it is that these are two separate products - VLM based OCR for messy documents, and automated credit review for developing markets.
I have some experience setting up automated OCR systems for one of the largest fintechs in Australia - and VLM based pipelines can definitely give an extra edge and this is easily a very large TAM market. However, existing players might also be upgrading their systems so might not be too easy to disrupt. That being said, credit analysis is also a very hard problem, but I am not sure how much quality OCR would help here.
Given what I know, I would focus on the VLM/OCR problem rather than the Credit scoring one.
If you are eyeing the South African market - I can promise you granting credit here is waaaaayyy ahead of the US. There is a very solid credit bureau and a few of the banks are already on the "use AI to process docs" train. For rest of Africa - they're bigger on using cellphone data (see Optasia). If you want some insight into the market - happy to have a chat (email on profile)
Solution to emerging markets capital access problem is not making the current predatory system more efficient, but investing in micro credit. Which will never happen at scale because it generates lower returns (better to have 10 bad payers than 1000 good payers)
Also, those markets, "venture capital" usually means vertical lending platforms. Healthtech? nah, just credit for dental treatment. Edutech? nah, just credit for classes. Etc.
Would love to chat, I recently wrapped up an initial version of an automated real estate appraisal review app which appears to have some of the same technical challenges and risks. https://getvalara.com / jwillis@valara.net
Would love to share notes. I was able to get away with landing.ai and some really careful schema design and multi-step workflow with a few agents sprinkled in at the end.
Sidenote: unlike in Tagalog, Kita means “day-care facility for children” in German, so names like Kita Capture and Kita Credit Agent could carry unintended connotations.
Since when did Germany victimize itself into emerging market category?
I would be even more worried by the fact that it resembles Keeta, a delivery brand owned by Meituan, which actually operates in a bunch of large emerging markets.
I have some experience setting up automated OCR systems for one of the largest fintechs in Australia - and VLM based pipelines can definitely give an extra edge and this is easily a very large TAM market. However, existing players might also be upgrading their systems so might not be too easy to disrupt. That being said, credit analysis is also a very hard problem, but I am not sure how much quality OCR would help here.
Given what I know, I would focus on the VLM/OCR problem rather than the Credit scoring one.
It's a very crowded space
Would love to share notes. I was able to get away with landing.ai and some really careful schema design and multi-step workflow with a few agents sprinkled in at the end.
I would be even more worried by the fact that it resembles Keeta, a delivery brand owned by Meituan, which actually operates in a bunch of large emerging markets.