Question 1

How do Gemini API REST calls authenticate?

Accepted Answer

A common REST pattern is the x-goog-api-key header; official SDKs can pass api_key at client initialization. Use environment variables in production and confirm the key belongs to the right Google Cloud project.

Question 2

How do I debug 403 PERMISSION_DENIED?

Accepted Answer

Confirm the key belongs to the right Google Cloud project, then check IAM, API restrictions, Generative Language API enablement, billing status and whether you are accessing a tuned model.

Question 3

What is 429 RESOURCE_EXHAUSTED?

Accepted Answer

It means a rate-limit dimension such as RPM, TPM or RPD was exceeded. Lower concurrency, reduce context length, retry with backoff or inspect your current tier limits.

Question 4

Why did Gemini return no text?

Accepted Answer

The response may have a safety, MAX_TOKENS, RECITATION or candidate-structure issue. Inspect finishReason, safety ratings, candidates and max output tokens; do not rely only on response.text.

Question 5

What should I do if a key is blocked?

Accepted Answer

If a key is suspected of leakage or is marked blocked, generate a replacement key in Google AI Studio or Cloud Console, update deployments, disable the old key and inspect recent usage/billing.

Question 6

Can I put a Gemini API key directly in frontend code?

Accepted Answer

Not recommended. Even with API restrictions, browser, mobile and public-repository exposure can create billing, quota and abuse risk. Production apps should use a backend proxy or server-side authorization layer.

Question 7

Why is there still risk after HTTP referrer restrictions?

Accepted Answer

Referrer, package-name or IP restrictions reduce misuse but do not replace server-side authorization. Recent community incidents cluster around frontend/mobile keys being reused, quotas drained and unexpected bills.

Question 8

Why do I get model not found or 404?

Accepted Answer

Common causes are stale model names, API-version mismatch, unsupported region/project access, or mixing Vertex AI and Gemini Developer API model paths.

Question 9

Why are free-tier and paid-tier limits different?

Accepted Answer

Gemini limits vary by project, model and tier. Do not treat example-code models or limits as production guarantees; verify the current rate-limits page and console before launch.

Question 10

How do I tell prompt issues from safety blocks?

Accepted Answer

Inspect finishReason and safety ratings first. If the finish reason is safety- or recitation-related, increasing max tokens will not fix it; adjust input, system instructions, output requirements or fallback handling.

Item	Value	Debug check
Base URL	`https://generativelanguage.googleapis.com`	Gemini Developer API REST calls use the Google Generative Language API host.
Generate content	`POST /v1beta/models/{model}:generateContent`	Common entry for text, image and multi-turn generation.
Model name	`gemini-3.5-flash etc.`	Model names change quickly; confirm from official model pages or SDK examples.
Authentication	`x-goog-api-key or SDK api_key`	For 403/permission issues, check key, project, permissions and API restrictions.
Google Cloud project	`project-bound key`	Each Gemini API key is associated with a Google Cloud project for billing and permissions.
Environment variable	`GEMINI_API_KEY`	Use env vars locally and in deployment; do not ship keys in frontend code.
Security restrictions	`API restrictions`	Keys used only for Gemini should be restricted to the Gemini API.
Output result	`finishReason / safety`	If no text is returned, inspect finishReason, safety blocks and candidates.

HTTP	Status / scenario	Meaning	Check first
`400`	`INVALID_ARGUMENT`	Invalid request parameter, model name or content shape.	Check model, contents, parts, role, JSON shape and generation config.
`401`	`UNAUTHENTICATED`	Missing or invalid authentication.	Check x-goog-api-key, environment variable and whether the service is enabled.
`403`	`PERMISSION_DENIED`	Key lacks permission, project is wrong, or tuned model auth is wrong.	Check project permissions, API restrictions, IAM and tuned model access path.
`404`	`NOT_FOUND`	Model, resource or endpoint not found.	Check model name, API version, region and resource ID.
`429`	`RESOURCE_EXHAUSTED`	RPM, TPM or RPD limit exceeded.	Lower concurrency, retry with backoff and inspect current tier limits.
`500`	`INTERNAL`	Server-side error.	Log request context and retry with backoff.
`503`	`UNAVAILABLE`	Temporary service unavailability or model overload.	Retry with backoff; optionally switch model or delay work.
`finishReason`	`SAFETY / MAX_TOKENS`	Response did not produce normal text.	Inspect safety ratings, max output tokens and prompt content.

Dimension	Form	Notes
RPM	`requests / minute`	Request-count dimension; exceeding any dimension triggers rate limit errors.
TPM	`input tokens / minute`	Input-token dimension; long context hits it faster.
RPD	`requests / day`	Daily request dimension, common on free or low-tier projects.
Project	`Google Cloud project`	Key, billing, permissions and API restrictions are project-bound.
Blocked key	`leaked key`	Keys reported as leaked may be blocked; generate a new key and update deployments.

Gemini API Setup and Error Reference

FAQ