The Hidden Cost of Running Local AI: Why Your Workstation Needs Better Cooling Than You Think Three months into building my local AI setup, I noticed something concerning. My RTX 3090 wasn’t just running hot during inference—it was throttling. The token generation I’d optimized for weeks suddenly slowed to a crawl, and the reason wasn’t …