Gemini Context MCP Server: Scalable Precision for Enterprise AI

Unlock Gemini's 2M-token context window for Cursor and other MCP-compatible tools, turning long, complex AI workflows into seamless results.


About Gemini Context MCP Server

What is Gemini Context MCP Server: Scalable Precision for Enterprise AI?

Gemini Context MCP Server is a high-performance middleware solution designed to maximize the capabilities of Gemini's 2M token context window. Built on the Model Context Protocol (MCP), this server provides enterprise-grade tools for managing extended conversations, optimizing API costs, and maintaining semantic context across sessions. It acts as a bridge between Gemini's advanced language models and MCP-compatible applications, enabling seamless integration with tools like Cursor, VS Code, and Claude Desktop.

Key Features of Gemini Context MCP Server: Scalable Precision for Enterprise AI

  • Enterprise-Scale Context Handling: Maintain continuous conversations across up to 2 million tokens with automatic session management and metadata-driven context tracking (sketched after this list)
  • Cost-Optimized Caching: Store frequently used prompts and instructions with TTL-based expiration, reducing API token consumption by up to 40%
  • Semantic Intelligence: Built-in semantic search allows retrieval of contextually relevant entries using vector similarity analysis
  • Automated Maintenance: Self-cleaning sessions and cache systems ensure optimal performance without manual intervention
  • Multi-Tool Compatibility: Works natively with MCP-enabled platforms through standardized API endpoints
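
To make the metadata-driven context tracking concrete, here is a minimal TypeScript sketch of session-scoped context entries under a token budget. The ContextEntry and Session shapes and the eviction policy are illustrative assumptions, not the server's actual types.

// Hypothetical data model for session-scoped context tracking (illustrative only).
interface ContextEntry {
  role: 'user' | 'assistant' | 'system';
  content: string;
  tokenCount: number;
  metadata?: Record<string, string>; // e.g. { topic: 'billing' }
}

interface Session {
  id: string;            // e.g. 'user-123'
  entries: ContextEntry[];
  totalTokens: number;   // bounded by the 2M-token window
  lastActivity: number;  // epoch ms, used for timeout cleanup
}

// Append an entry, evicting the oldest entries once the token budget is exceeded.
function addEntry(session: Session, entry: ContextEntry, maxTokens = 2_097_152): void {
  while (session.totalTokens + entry.tokenCount > maxTokens && session.entries.length > 0) {
    session.totalTokens -= session.entries.shift()!.tokenCount;
  }
  session.entries.push(entry);
  session.totalTokens += entry.tokenCount;
  session.lastActivity = Date.now();
}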


How to Use Gemini Context MCP Server: Scalable Precision for Enterprise AI

Installation & Setup


# Prerequisites: Node.js 18+, Gemini API key
git clone https://github.com/yourusername/gemini-context-mcp.git
cd gemini-context-mcp
npm install
cp .env.example .env
# Edit .env and set GEMINI_API_KEY
npm run build
node dist/mcp-server.js

Integration Examples

For VS Code extensions:

npm run install:vscode

To use in Node apps:


import { GeminiContextServer } from 'gemini-context-server';

const server = new GeminiContextServer({
  server: { sessionTimeoutMinutes: 60 }
});

const response = await server.processMessage('user-123', 'Explain quantum computing');
console.log(response);

Use Cases of Gemini Context MCP Server: Scalable Precision for Enterprise AI

Primary scenarios include:

  • Technical support chatbots maintaining multi-turn conversations
  • Document analysis tools requiring long-context summarization
  • API-intensive applications needing cost-effective caching strategies
  • Development environments benefiting from semantic code suggestions
  • Enterprise knowledge bases with version-controlled context entries

Gemini Context MCP Server FAQ

FAQ from Gemini Context MCP Server: Scalable Precision for Enterprise AI

How do I manage cache expiration?

Set TTL values during cache creation or update existing caches using the updateCacheTTL API. Default expiration aligns with session timeouts.
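
For example, a call along these lines extends an existing cache's lifetime (a sketch; the exact updateCacheTTL signature is an assumption):

// Sketch: extend an existing cache's TTL to one hour (3600 seconds).
// The (cacheName, ttlSeconds) signature is assumed, not confirmed.
await server.updateCacheTTL(cacheName, 3600);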

What models are supported?

Works with all Gemini-series models. Specify the model (e.g., gemini-2.0-flash, gemini-2.0-pro) via the GEMINI_MODEL environment variable.

Can I use this with custom tools?

Yes - the MCP interface allows creating custom tools in mcp-manifest.json following standardized schemas.
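
As a rough illustration, a custom tool entry in mcp-manifest.json might look like the following; the summarize_context tool and its exact fields are hypothetical, modeled on the standard MCP tool schema (name, description, JSON Schema input):

{
  "tools": [
    {
      "name": "summarize_context",
      "description": "Hypothetical custom tool that summarizes a session's context",
      "inputSchema": {
        "type": "object",
        "properties": {
          "sessionId": { "type": "string" }
        },
        "required": ["sessionId"]
      }
    }
  ]
}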

How is context security handled?

Sessions and caches are isolated by user identifier. Sensitive data storage requires integrating external database persistence (planned in v2.0).


Gemini Context MCP Server

A powerful MCP (Model Context Protocol) server implementation that leverages Gemini's capabilities for context management and caching. This server maximizes the value of Gemini's 2M token context window while providing tools for efficient caching of large contexts.

🚀 Features

Context Management

  • Up to 2M token context window support - Leverage Gemini's extensive context capabilities
  • Session-based conversations - Maintain conversational state across multiple interactions
  • Smart context tracking - Add, retrieve, and search context with metadata
  • Semantic search - Find relevant context using semantic similarity (see the sketch after this list)
  • Automatic context cleanup - Sessions and context expire automatically
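
The semantic search feature boils down to vector similarity. A minimal sketch, assuming each context entry already carries an embedding vector (how the server computes embeddings is not specified here):

// Rank context entries by cosine similarity to a query embedding (illustrative only).
interface EmbeddedEntry {
  content: string;
  embedding: number[];
}

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

function searchContext(entries: EmbeddedEntry[], query: number[], topK = 5): EmbeddedEntry[] {
  return [...entries]
    .sort((x, y) => cosineSimilarity(y.embedding, query) - cosineSimilarity(x.embedding, query))
    .slice(0, topK);
}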

API Caching

  • Large prompt caching - Efficiently reuse large system prompts and instructions
  • Cost optimization - Reduce token usage costs for frequently used contexts
  • TTL management - Control cache expiration times (see the sketch after this list)
  • Automatic cleanup - Expired caches are removed automatically
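
To make the TTL and cleanup behavior concrete, here is a minimal in-memory sketch; the server's real cache may differ (database persistence is listed under future improvements):

// Minimal TTL cache sketch with lazy expiration and a periodic sweep (illustrative only).
interface CachedPrompt {
  content: string;
  expiresAt: number; // epoch ms
}

class PromptCache {
  private entries = new Map<string, CachedPrompt>();

  set(name: string, content: string, ttlSeconds: number): void {
    this.entries.set(name, { content, expiresAt: Date.now() + ttlSeconds * 1000 });
  }

  get(name: string): string | undefined {
    const entry = this.entries.get(name);
    if (!entry) return undefined;
    if (Date.now() > entry.expiresAt) {
      this.entries.delete(name); // lazy expiration on read
      return undefined;
    }
    return entry.content;
  }

  // Periodic sweep, mirroring the automatic cleanup described above.
  cleanup(): void {
    const now = Date.now();
    for (const [name, entry] of this.entries) {
      if (now > entry.expiresAt) this.entries.delete(name);
    }
  }
}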

🏁 Quick Start

Prerequisites

  • Node.js 18+
  • A Gemini API key

Installation

# Clone the repository
git clone https://github.com/yourusername/gemini-context-mcp.git
cd gemini-context-mcp

# Install dependencies
npm install

# Copy environment variables example
cp .env.example .env

# Add your Gemini API key to .env file
# GEMINI_API_KEY=your_api_key_here

Basic Usage

# Build the server
npm run build

# Start the server
node dist/mcp-server.js

MCP Client Integration

This MCP server can be integrated with various MCP-compatible clients:

  • Claude Desktop - Add as an MCP server in Claude settings
  • Cursor - Configure in Cursor's AI/MCP settings
  • VS Code - Use with MCP-compatible extensions

For detailed integration instructions with each client, see the MCP Client Configuration Guide in the MCP documentation.

Quick Client Setup

Use our simplified client installation commands:

# Install and configure for Claude Desktop
npm run install:claude

# Install and configure for Cursor
npm run install:cursor

# Install and configure for VS Code
npm run install:vscode

Each command sets up the appropriate configuration files and provides instructions for completing the integration.

💻 Usage Examples

For Beginners

Directly using the server:

  1. Start the server:

    node dist/mcp-server.js

  2. Interact using the provided test scripts:

    # Test basic context management
    node test-gemini-context.js

    # Test caching features
    node test-gemini-api-cache.js

Using in your Node.js application:

import { GeminiContextServer } from './src/gemini-context-server.js';

async function main() {
  // Create server instance
  const server = new GeminiContextServer();
  
  // Generate a response in a session
  const sessionId = "user-123";
  const response = await server.processMessage(sessionId, "What is machine learning?");
  console.log("Response:", response);
  
  // Ask a follow-up in the same session (maintains context)
  const followUp = await server.processMessage(sessionId, "What are popular algorithms?");
  console.log("Follow-up:", followUp);
}

main();

For Power Users

Using custom configurations:

// Custom configuration
const config = {
  gemini: {
    apiKey: process.env.GEMINI_API_KEY,
    model: 'gemini-2.0-pro',
    temperature: 0.2,
    maxOutputTokens: 1024,
  },
  server: {
    sessionTimeoutMinutes: 30,
    maxTokensPerSession: 1000000
  }
};

const server = new GeminiContextServer(config);

Using the caching system for cost optimization:

// Create a cache for large system instructions
const cacheName = await server.createCache(
  'Technical Support System',
  'You are a technical support assistant for a software company...',
  7200 // 2 hour TTL
);

// Generate content using the cache
const response = await server.generateWithCache(
  cacheName,
  'How do I reset my password?'
);

// Clean up when done
await server.deleteCache(cacheName);

🔌 Using with MCP Tools (like Cursor)

This server implements the Model Context Protocol (MCP), making it compatible with tools like Cursor or other AI-enhanced development environments.

Available MCP Tools

  1. Context Management Tools:
     • `generate_text` - Generate text with context
     • `get_context` - Get current context for a session
     • `clear_context` - Clear session context
     • `add_context` - Add specific context entries
     • `search_context` - Find relevant context semantically
  2. Caching Tools:
     • `mcp_gemini_context_create_cache` - Create a cache for large contexts
     • `mcp_gemini_context_generate_with_cache` - Generate with cached context
     • `mcp_gemini_context_list_caches` - List all available caches
     • `mcp_gemini_context_update_cache_ttl` - Update cache TTL
     • `mcp_gemini_context_delete_cache` - Delete a cache
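
As an illustration of how a client could invoke the tools listed above programmatically, here is a sketch using the official MCP TypeScript SDK (@modelcontextprotocol/sdk). The tool argument names (session_id, message) are assumptions; check mcp-manifest.json for the actual schemas.

// Sketch: call the generate_text tool from an MCP client over stdio.
import { Client } from '@modelcontextprotocol/sdk/client/index.js';
import { StdioClientTransport } from '@modelcontextprotocol/sdk/client/stdio.js';

const transport = new StdioClientTransport({
  command: 'node',
  args: ['dist/mcp-server.js'],
});

const client = new Client({ name: 'example-client', version: '1.0.0' });
await client.connect(transport);

// Argument names here are assumptions; consult mcp-manifest.json for the real schema.
const result = await client.callTool({
  name: 'generate_text',
  arguments: { session_id: 'user-123', message: 'What is machine learning?' },
});
console.log(result);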

Connecting with Cursor

When used with Cursor, you can connect via the MCP configuration:

{
  "name": "gemini-context",
  "version": "1.0.0",
  "description": "Gemini context management and caching MCP server",
  "entrypoint": "dist/mcp-server.js",
  "capabilities": {
    "tools": true
  },
  "manifestPath": "mcp-manifest.json",
  "documentation": "README-MCP.md"
}

For detailed usage instructions for MCP tools, see README-MCP.md.

⚙️ Configuration Options

Environment Variables

Create a .env file with these options:

# Required
GEMINI_API_KEY=your_api_key_here
GEMINI_MODEL=gemini-2.0-flash

# Optional - Model Settings
GEMINI_TEMPERATURE=0.7
GEMINI_TOP_K=40
GEMINI_TOP_P=0.9
GEMINI_MAX_OUTPUT_TOKENS=2097152

# Optional - Server Settings
MAX_SESSIONS=50
SESSION_TIMEOUT_MINUTES=120
MAX_MESSAGE_LENGTH=1000000
MAX_TOKENS_PER_SESSION=2097152
DEBUG=false
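
A sketch of how these variables might be consumed at startup; the dotenv package and the fallback defaults shown are assumptions, and the server's actual config loader may differ:

// Sketch: load and validate environment configuration at startup.
import 'dotenv/config';

const apiKey = process.env.GEMINI_API_KEY;
if (!apiKey) {
  throw new Error('GEMINI_API_KEY is required');
}

const config = {
  gemini: {
    apiKey,
    model: process.env.GEMINI_MODEL ?? 'gemini-2.0-flash',
    temperature: Number(process.env.GEMINI_TEMPERATURE ?? 0.7),
    maxOutputTokens: Number(process.env.GEMINI_MAX_OUTPUT_TOKENS ?? 2_097_152),
  },
  server: {
    maxSessions: Number(process.env.MAX_SESSIONS ?? 50),
    sessionTimeoutMinutes: Number(process.env.SESSION_TIMEOUT_MINUTES ?? 120),
    maxTokensPerSession: Number(process.env.MAX_TOKENS_PER_SESSION ?? 2_097_152),
  },
};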

🧪 Development

# Build TypeScript files
npm run build

# Run in development mode with auto-reload
npm run dev

# Run tests
npm test

📚 Further Reading

  • For MCP-specific usage, see README-MCP.md
  • Explore the manifest in mcp-manifest.json to understand available tools
  • Check example scripts in the repository for usage patterns

📋 Future Improvements

  • Database persistence for context and caches
  • Cache size management and eviction policies
  • Vector-based semantic search
  • Analytics and metrics tracking
  • Integration with vector stores
  • Batch operations for context management
  • Hybrid caching strategies
  • Automatic prompt optimization

📄 License

MIT
