I'm deploying an LLM in a server that has primitive features, including a feasible networking infrastructure. The owners of the servers were asking my client to split the cost of creating an API networking infrastructure to allow the LLM (hosted in their A100 GPUs) to other servers.
The indicated that the estimated cost of developing this API infrastructure was going to be $12-15,000, developed in a time period of ~2.5 weeks with 4-5 developers with 90-95% chance of success.
They wanted my client to go halfsies of the total cost, to which he agreed.
A couple of week after the meeting, I proposed to him that I might be able to developed the networking infrastructure from their servers to any other server. To me it just seemed costly to spend $15,000 for an API. Having gained his trust, he agreed.
So today, I spend about 2.5 hours on setting up a SSH connection from the Linux based environment of the server to my local computer. The attempt was to connect my Windsurf agentic IDE to the server so I didn't have to use the terminal and or the outdated VS code editor innately integrated in their UI.
To by-pass the firewall of the server, I simply rellied on VPN service to make a safe encrypted connection between their server and my computer and.....BOOM!
Now I can use my Windsurf agentic IDE from my computer to connect to their server in order vibe code (while as before, I could only hard-code, ew!).
This now has to be tested on making inferential request to the deployed LLM but I think the current networking infrastructure will work on this.
I should make a video 🤔 about this