Jina AI models

State-of-the-art models for each stage of the retrieval pipeline

Purpose-built for retrieval, Jina models deliver accuracy and speed that outperform models 5× their size. Multilingual and multimodal — text, images, audio, and video — and now native on Elasticsearch.

Meet the Jina AI models

Our frontier models form the search foundation for high-quality enterprise search and retrieval augmented generation (RAG) systems.

  • Reader

    Convert complex documents, web pages, and PDFs into clean, structured input for search and large language models (LLMs).

  • Embeddings

    Improve search and RAG systems with multimodal and multilingual embeddings for text, images, audio, video, and code.

  • Reranker

    Maximize relevance with a world-class reranker that delivers precision for critical applications like RAG, AI assistants, and agents.
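The three model families above correspond to three HTTP endpoints. The sketch below builds illustrative request payloads following Jina's public API conventions (`r.jina.ai` for Reader, `api.jina.ai/v1/embeddings`, `api.jina.ai/v1/rerank`); the model ids are assumptions taken from this page and may differ per release, and real calls require an API key.

```python
# Illustrative request payloads for the three model families.
# Endpoint URLs follow Jina's public API conventions; model ids and
# field values are assumptions — check the current API docs before use.

reader_request = {
    # Reader: prefix any URL with r.jina.ai to get clean, LLM-ready text.
    "method": "GET",
    "url": "https://r.jina.ai/https://example.com/annual-report.pdf",
}

embeddings_request = {
    # Embeddings: turn passages into vectors for semantic search.
    "method": "POST",
    "url": "https://api.jina.ai/v1/embeddings",
    "json": {
        "model": "jina-embeddings-v5-text",  # model id as named on this page
        "input": ["first passage", "second passage"],
    },
}

rerank_request = {
    # Reranker: rescore candidate documents against the original query.
    "method": "POST",
    "url": "https://api.jina.ai/v1/rerank",
    "json": {
        "model": "jina-reranker",            # illustrative model id
        "query": "how do I enable semantic search?",
        "documents": ["candidate one ...", "candidate two ..."],
        "top_n": 3,
    },
}

for request in (reader_request, embeddings_request, rerank_request):
    print(request["method"], request["url"])
```

Each payload would be sent with any HTTP client, adding an `Authorization: Bearer <key>` header.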

Compact by design, precise by results

Go from raw data to high-precision results in one API.

  • Multimodal search, 100+ languages

    Jina's models work across text, images, audio, and video. With v5-omni, a single embedding model handles all four modalities in one shared space. Over 100 languages are supported natively, and cross-language search works out of the box.

  • Best results, not just nearest

    Jina's reranking models are proven leaders. Get extra precision with rerankers that rescore every candidate against the original query, using deep analysis to put the most relevant answers on top.

  • Smart training, smaller models

    Jina's models are trained on tasks that matter for retrieval: finding the right document and best answer from messy sources. That's why they match or outperform larger models at a fraction of the cost.

  • Zero-config on Elasticsearch

    Map any field as semantic_text and Elasticsearch generates embeddings automatically. On EIS, Jina models are the default, delivering out-of-the-box multilingual and multimodal semantic search with zero config.

  • One API call, that's all

    Combine traditional keyword search with Jina's semantic matching in a single query. Use one API call with reciprocal rank fusion to merge the best of each approach.

  • Lean at any scale

    Combine Jina variable-sized embeddings with Elastic's vector quantization (BBQ) to reduce storage by up to 95% with minimal accuracy loss. Turn precision all the way up when accuracy matters the most.
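Put together, the features above amount to one mapping and one query. The sketch below shows the request bodies: a `semantic_text` field bound to a hypothetical inference endpoint id, a BBQ-quantized `dense_vector` field, and a hybrid query that fuses keyword and semantic ranks with reciprocal rank fusion via Elasticsearch's retriever API. Index, field, and endpoint names are illustrative.

```python
import json

# Index mapping: `body_semantic` gets embeddings generated automatically
# by the configured inference endpoint; `body_vector` stores explicit
# embeddings with BBQ (Better Binary Quantization) to cut storage.
mapping = {
    "mappings": {
        "properties": {
            "body": {"type": "text"},
            "body_semantic": {
                "type": "semantic_text",
                "inference_id": "jina-embeddings",  # hypothetical endpoint id
            },
            "body_vector": {
                "type": "dense_vector",
                "dims": 1024,                       # e.g. a Jina embedding size
                "index": True,
                "similarity": "cosine",
                "index_options": {"type": "bbq_hnsw"},
            },
        }
    }
}

# One query: BM25 and semantic retrieval merged with reciprocal rank fusion.
hybrid_query = {
    "retriever": {
        "rrf": {
            "retrievers": [
                {"standard": {"query": {"match": {"body": "vector quantization"}}}},
                {"standard": {"query": {"semantic": {
                    "field": "body_semantic",
                    "query": "vector quantization",
                }}}},
            ]
        }
    }
}

print(json.dumps(hybrid_query, indent=2))
```

The mapping would be sent as the body of an index-creation request, and `hybrid_query` as the body of a single search request — one API call for the fused result list.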

Use Jina models wherever you build

From fully managed to self-hosted, Jina models meet you where your data lives. Pick the access path that fits.

Our research

Jina's models are built on research presented at top machine learning (ML) conferences, including CVPR, NeurIPS, and EMNLP. Explore how our frontier search models were trained from scratch in our latest publications.
  • Jina-embeddings-v5-text: Task-Targeted Embedding Distillation

    We introduce a novel training regimen that combines model distillation techniques with task-specific contrastive loss to produce compact, high-performance embedding models.

  • Embedding Inversion via Conditional Masked Diffusion Language Models

    We frame embedding inversion as conditional masked diffusion, recovering all tokens in parallel through iterative denoising rather than sequential autoregressive generation.

  • Embedding Compression via Spherical Coordinates

    We present a compression method for unit-norm embeddings that achieves 1.5× compression, 25% better than the best prior lossless method.

  • jina-embeddings-v5-omni

    We extend jina-embeddings-v5-text to images, audio, and video by composing frozen pretrained encoders through lightweight trained adapters — without retraining the text model or reindexing existing data.
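The spherical-coordinate compression above rests on a simple observation: a unit-norm embedding in d dimensions has only d-1 degrees of freedom, so it can be re-expressed as d-1 angles before any quantization. The round trip below illustrates that parameterization only; it is a sketch of the underlying idea, not the paper's actual coding scheme.

```python
import math

def to_spherical(x):
    """Unit vector in R^d -> d-1 hyperspherical angles."""
    angles = []
    r = 1.0
    for i in range(len(x) - 1):
        # Clamp guards against tiny floating-point drift outside [-1, 1].
        c = max(-1.0, min(1.0, x[i] / r)) if r > 1e-12 else 1.0
        angles.append(math.acos(c))
        r *= math.sin(angles[-1])
    if x[-1] < 0:
        # The sign of the last component fixes the final angle's branch.
        angles[-1] = 2 * math.pi - angles[-1]
    return angles

def from_spherical(angles):
    """d-1 angles -> unit vector in R^d."""
    x, r = [], 1.0
    for theta in angles:
        x.append(r * math.cos(theta))
        r *= math.sin(theta)
    x.append(r)
    return x

# Round trip on a small example; a 1024-dim embedding would become 1023 angles.
v = [1.0, 2.0, 3.0, -4.0]
norm = math.sqrt(sum(t * t for t in v))
u = [t / norm for t in v]
assert all(abs(a - b) < 1e-9 for a, b in zip(u, from_spherical(to_spherical(u))))
```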

Join our open source community

Jina's models are open-weight and freely available on Hugging Face, with millions of monthly downloads. The codebase is public on GitHub. The community has direct access to our developers.

Frequently asked questions

What are Jina search models?

Jina models are open-weight, frontier AI models for retrieval. They include embedding models for vectors, rerankers for precision, and readers for extracting and structuring content from URLs and documents.