Random Llama
Random Llama
ProductsSolutionsBlogCase StudiesContact
Get a Quote
Weekly Newsletter

Get AI & productivity insights weekly

Privacy-first tools, workflow tips, and early product access. No spam — unsubscribe anytime.

Random Llama Software

Texas-built weird tools and custom web platforms—fast shipping, no creepy tracking, no enterprise bloat.

Links
  • Home
  • Products
  • Case Studies
  • Blog
  • Solutions
  • Credentials
  • Contact
Services
  • Custom CMS
  • Booking Engines
  • Mobile Apps
  • AI Integration
  • Website Maintenance
Connect
  • Privacy Policy
  • Terms of Service
  • Cookie Policy

© 2026 Random Llama Software, LLC. All rights reserved. Privacy Policy

Back to Blog
ai-toolsopen-source

Cursor Got Caught and Gemini Is Coming for Your Memory

Robert HattalaMarch 29, 2026

Cursor launched Composer 2 on March 19 and called it "state-of-the-art programming intelligence." A developer found kimi-k2p5-rl-0317 buried in an API response a few days later. Composer 2 was Kimi K2.5 with extra fine-tuning, and Cursor never said that.

Cursor's Composer 2 Was Built on a Chinese Open-Source Model

Moonshot AI, backed by Alibaba, released Kimi K2.5 as an open-weight model earlier this year. Cursor used it as the base for Composer 2, added reinforcement learning on top, and shipped it without attribution.

Cursor's VP of dev education said "only ~1/4 of the compute came from the base." The co-founder called the omission a "miss." That's a polite way of saying they got caught by a developer reading API logs.

This matters beyond the drama. If a $50B company is shipping a top-tier coding assistant on a Chinese open-source foundation without disclosing it, the model provenance question just got serious for enterprise buyers.

Gemini Now Wants Your ChatGPT and Claude Memories

Google shipped an import tool on March 26 that lets you pull chat history and memories from ChatGPT or Claude straight into Gemini. Upload a ZIP or paste a summarization prompt, and Gemini absorbs your context.

The biggest switching cost between AI assistants isn't features. It's losing months of accumulated personalization. Google just removed that friction with one tool.

Anthropic built something similar for Claude about three weeks earlier. The memory portability race is on. For users that's good. For anyone betting on lock-in as a moat, it's a problem.

Gemini 3.1 Flash-Lite Is $0.25 per Million Tokens

Google also released Gemini 3.1 Flash-Lite recently with 2.5x faster response times and a price of $0.25 per million input tokens. For apps making lots of AI calls, that changes your infrastructure math in a real way.

Sub-dollar-per-million-tokens is becoming the default for commodity tasks. We're already routing lighter classification jobs through Flash-class models at Random Llama. The latency and cost both make production sense now.

The Cursor story and the Gemini memory importer are part of the same shift. The model layer is commoditizing fast, open-source foundations are everywhere, and the battle is moving to trust and data portability. Build on the product layer. Model dependencies are not a moat.

Related posts

AI Goes to Space While the Boardroom Plays Catch-Up

Microsoft and OpenAI loosened their partnership, Google wants data centers in orbit, and the chief AI officer just went mainstream. Here is what it means.

May 14, 2026

Today in AI: Pharma Deals, Hack Plots, and a Weird Twist

Novo Nordisk bets the whole company on OpenAI, Google stops an AI hack plot, and Anthropic finds its models were learning evil from sci-fi novels.

May 12, 2026

AI Today: Anthropic Booms, Pentagon Bails, Pharma Bets Big

Anthropic posts an 80x revenue jump, the Pentagon skips them entirely, Novo Nordisk bets the farm on OpenAI, and AI gets blamed for layoffs again.

May 11, 2026

Need custom software or maintenance?

We build privacy-first apps, booking engines, and full-stack platforms — and keep them running.

Browse SolutionsGet in Touch
All posts