Ssekirya Cosmah

Jan 26, 2026 • 2 min read

Refactoring FlowSpec AI for the Community

From Proprietary to Open-Source & Local-First

The journey of building FlowSpec AI has taken a massive turn. After a period of quiet development, I’m excited to share a major pivot in our philosophy: we are going Freemium, Open-Source, and Local-First.

Why the Pivot?

The decision was driven by two main factors:

  1. Community Accessibility: I want FlowSpec to be a tool for the entire React/Next.js ecosystem without the gatekeeping of high costs.

  2. Resilience: Recent internet shutdowns highlighted a major flaw in cloud-only AI. By allowing users to download and run FlowSpec locally, we provide total control and 100% uptime, regardless of the network situation.

The "Bootstrap" Tech Stack

Moving away from OpenAI’s expensive tokens meant getting creative with infrastructure. Without external funding, I’ve had to optimize every penny:

  • The Brain: Built with FastAPI and LangChain.

  • Vector Memory: Qdrant for high-performance embeddings.

  • The LLM: I’ve moved to a self-hosted Dolphin3 model running via Ollama.

  • The Hardware: I’m currently running this on a Contabo VPS. Here’s the kicker—it’s a CPU-only setup. No GPU. It’s a challenge, but it’s what’s fueling the mission right now.

The Current Hurdle: The Timeout Trap

I’ve spent the last few weeks refactoring the entire architecture to ensure it scales. However, I’ve hit a technical roadblock that I’m being transparent about: Inference Timeouts.

When testing the model via curl, I get code generation in under 15 seconds. However, when integrated into the FastAPI application, the terminal hits a timeout. I’m currently experimenting with different optimization paths, but the logic is proving stubborn.

Call for Collaborators 🤝

I’m not giving up, but I am looking for help. I am seeking FastAPI developers and DevOps enthusiasts who have experience with:

  • Self-hosting LLMs in production environments.

  • Asynchronous Python optimization for long-running tasks.

  • Managing inference on CPU-constrained hardware.

If you want to be part of a project revolutionizing security and testing for React web apps, let's build this together.

Get in touch:

FlowSpec AI is about more than just code; it’s about making high-level security tools accessible to every developer, everywhere.


#BuildInPublic #OpenSource #FastAPI #ReactJS #AI #CyberSecurity

Join Ssekirya on Peerlist!

Join amazing folks like Ssekirya and thousands of other builders on Peerlist.

peerlist.io/

It’s available... this username is available! 😃

Claim your username before it's too late!

This username is already taken, you’re a little late.😐

0

6

0