top of page
Search

Kateryna
Feb 113 min read
The Importance of Traffic Shaping and Rate Limiting in LLM Development with Docker, Trickle on Ubuntu
Learn how traffic shaping & rate limiting optimize LLM downloads, prevent network congestion, and reduce cloud bandwidth costs.
7 views0 comments

Chandan Kumar
Jan 254 min read
Stargate vs Deepseek R1 : The Diverging Paths of U.S. and China in AI Development
AI Cloud Infrastructure Investment The field of artificial intelligence (AI) is rapidly evolving, with breakthroughs and applications...
1 view0 comments

Chandan Kumar
Dec 19, 20245 min read
Setting Up a Developer Windows PC for AI Application Development - A Complete Guide
Windows PC for AI Application Dev with WSL Ollama. Python and Docker . Llama
81 views0 comments

Chandan Kumar
Nov 15, 20244 min read
Inferencing Options - TGI, VLLM, Ollama, and Triton
As AI applications become more selecting the right tool for model inference, scalability, and performance is increasingly important....
318 views0 comments


Kateryna
Nov 7, 20244 min read
Top Trends Shaping the AI Developer Ecosystem in 2024: Models, GPUs, Cloud Innovations, and More
The AI developer ecosystem is evolving rapidly, with groundbreaking advancements in models, performance optimization, GPU cloud...
8 views0 comments

Chandan Kumar
Aug 30, 20245 min read
Run Llama 3.1 405B with Ollama on H100 - QuickStart Guide on Denvr Cloud
Run Llama 3.1 405B with Ollama on Nvidia H100 - QuickStart Guide on Denvr Cloud
195 views0 comments

Chandan Kumar
Aug 14, 20243 min read
Deploy Chatbot style GenAI Chatbot with Llama 3
Running your own ChatGPT-style chatbot offers control, privacy, and customization. Leverage Llama 3.1 for efficient, compliant AI solutions.
71 views0 comments

Chandan Kumar
Mar 11, 20244 min read
GPU Scarcity, Inference and RAG 2.0
Facing GPU Scarcity for your AI workload and Navigating Challenges with Inference & RAG 2.0 Strategies
49 views0 comments


Chandan Kumar
Jan 26, 20245 min read
Navigating the Evolution of AI Models along with Cloud and GPU Technologies
The terms 'Cloud' and 'AI' continually evolve with each tech boom-and-bust cycle. This particular cycle stands out
16 views0 comments

Chandan Kumar
Jan 13, 20244 min read
Surviving the Age of AI - Developer Handbook
The age of AI brings with it a plethora of buzzwords and varying interpretations depending on different people and use cases. In this...
18 views0 comments


Chandan Kumar
May 31, 20213 min read
AI Cloud Data Centers: How Kubernetes is Reshaping Bare Metal Operations
Data centers shift from hardware to apps! Kubernetes ( & its distributions like RKE & Openshift) are reshaping the industry with cost-effici
16 views0 comments
bottom of page