Discussion about this post

User's avatar
Neural Foundry's avatar

Great framing on deployment as the actual bottleneck. That 2% catalog rate for economically valueable tasks is probaly still generous, we're basicaly retrofitting decades of tacit knowldge into structured eval sets from scratch. I saw similer patterns when tryin to automate parts of financial due deligence, the bottleneck wasn't the model quality but mapping what partners actually did vs what they thought they did.

Expand full comment

No posts

Ready for more?