Reasoning Agent Plugin for LM Studio

Advanced AI reasoning plugin that enables LM Studio to invoke Google Gemini models with automatic API key rotation, intelligent error handling, and quota management.

🚀 Features

🔄 Automatic Key Rotation: Seamlessly switches between multiple API keys when one reaches quota or encounters errors
🎯 Gemini Model Support: Direct integration with Google Gemini AI (gemini-3-flash-preview, gemini-1.5-pro, etc.)
⏱️ Extended Thinking Mode: Utilize Gemini's extended thinking for complex multi-step reasoning
🔐 Easy Key Management: Configure multiple API keys directly in LM Studio's UI (comma-separated)
📊 Configurable Parameters: Adjust temperature, max tokens, timeouts, and retry attempts
🐛 Detailed Logging: Optional debug logging for troubleshooting key rotation and API calls
⚡ Robust Error Handling: Distinguishes between quota errors, auth errors, and transient failures
⏱️ Request Timeout Protection: Configurable timeouts (1-120 seconds) to prevent hanging requests

📋 Installation & Setup

1. Prerequisites

LM Studio v0.2.28 or later
Node.js 18+
Google AI API keys from https://ai.google.dev

2. Get Your API Keys

Visit Google AI Studio
Create a new API key
Copy the key (keep it secure!)
Create multiple keys for backup/rotation

3. Install the Plugin

4. Configure in LM Studio

Open LM Studio Settings → Plugins → Reasoning Agent
In the API Keys field, paste your keys (comma-separated):
Set your preferred model (default: gemini-3-flash-preview)
Adjust other parameters as needed
Save and reload plugin

🛠️ Configuration Options

Setting	Type	Default	Description
API Keys	string	(required)	Comma-separated Google AI API keys
Model Name	string	`gemini-3-flash-preview`	Gemini model to use
Max Tokens	number	2048	Maximum response length (100-10000)
Temperature	number	0.7	Response creativity (0-2)
Retry Attempts	number	3	Attempts per key before rotation
Request Timeout	number	30000 ms	Timeout per request (1000-120000 ms)
Detailed Logging	boolean	false	Enable debug logging

📖 Usage Examples

Basic Reasoning Task

With Key Rotation

Override Defaults

🔄 How Key Rotation Works

🔍 Error Handling Strategy

The plugin distinguishes between three error categories:

Error Type	Trigger	Action
Quota Error	429, rate_limit, quota exceeded	Immediately rotate to next key
Auth Error	401, 403, invalid API key	Immediately rotate to next key
Transient Error	Network timeout, service unavailable	Retry (up to `retryAttempts`)

📊 Logging & Debugging

Enable Detailed Logging in config to see:

Check LM Studio's console/logs for these messages while running in dev mode.

🧪 Testing Checklist

Before deploying to production, verify:

🐛 Troubleshooting

"No API keys configured"

Check plugin settings in LM Studio
Ensure API keys field is not empty
Keys must be comma-separated

"Invalid API Key"

Verify key is copied correctly (no extra spaces)
Check key hasn't been revoked in AI Studio
Ensure using correct Google AI API key (not ChatGPT key)

Slow Responses

Check network connection
Increase request timeout in config
Test with smaller prompts first
Extended thinking is slower but better for complex tasks

All Keys Exhausted

Check with curl -X POST https://generativelanguage.googleapis.com/v1beta/models/gemini-3-flash-preview:generateContent?key=YOUR_KEY
Review quota in Google AI Studio
Create new API keys and add to config
Check logs with detailed logging enabled

Plugin Not Loading

Run npm install in plugin directory
Verify TypeScript compiles: npm run build
Check console output in LM Studio for errors
Restart LM Studio after changes

📦 Project Structure

🔐 Security Notes

Never commit API keys to version control
Use .gitignore to exclude .env files
Rotate keys regularly in Google AI Studio
Monitor usage in Google Cloud Console
Never share keys via chat or email
Use separate keys for development/production if possible

🚀 Development

Build

Development Mode (Hot Reload)

Deploy to LM Studio

📝 API Reference

Tool: `reasoning_invoke`

Parameters:

Returns:

📄 License

MIT License - Free for personal and commercial use

🤝 Contributing

Found a bug? Want to improve the plugin?

Enable detailed logging to diagnose
Share the logs and your configuration
Create an issue with step-to-reproduce

📞 Support

For issues related to:

Plugin functionality: Check troubleshooting section above
Google Gemini API: Visit https://ai.google.dev/docs
LM Studio: Visit https://lmstudio.ai

🎯 Roadmap

Future enhancements:

Support for vision capabilities
Streaming responses
Caching for repeated prompts
Analytics dashboard for key usage
Integration with other AI providers
Custom safety settings panel

Version: 1.0.0
Last Updated: April 2026
LM Studio Compatibility: v0.2.28+

reasoning-agent

reasoning-agent

Reasoning Agent Plugin for LM Studio

🚀 Features

📋 Installation & Setup

1. Prerequisites

2. Get Your API Keys

3. Install the Plugin

4. Configure in LM Studio

🛠️ Configuration Options

📖 Usage Examples

Basic Reasoning Task

With Key Rotation

Override Defaults

🔄 How Key Rotation Works

🔍 Error Handling Strategy

📊 Logging & Debugging

🧪 Testing Checklist

🐛 Troubleshooting

"No API keys configured"

"Invalid API Key"

Slow Responses

All Keys Exhausted

Plugin Not Loading

📦 Project Structure

🔐 Security Notes

🚀 Development

Build

Development Mode (Hot Reload)

Deploy to LM Studio

📝 API Reference

Tool: `reasoning_invoke`

📄 License

🤝 Contributing

📞 Support

🎯 Roadmap

reasoning-agent

reasoning-agent

Reasoning Agent Plugin for LM Studio

🚀 Features

📋 Installation & Setup

1. Prerequisites

2. Get Your API Keys

3. Install the Plugin

4. Configure in LM Studio

🛠️ Configuration Options

📖 Usage Examples

Basic Reasoning Task

With Key Rotation

Override Defaults

🔄 How Key Rotation Works

🔍 Error Handling Strategy

📊 Logging & Debugging

🧪 Testing Checklist

🐛 Troubleshooting

"No API keys configured"

"Invalid API Key"

Slow Responses

All Keys Exhausted

Plugin Not Loading

📦 Project Structure

🔐 Security Notes

🚀 Development

Build

Development Mode (Hot Reload)

Deploy to LM Studio

📝 API Reference

Tool: reasoning_invoke

📄 License

🤝 Contributing

📞 Support

🎯 Roadmap

Tool: `reasoning_invoke`