Reka | Multimodal AI Models for Video, Image & Text

We are an AI model builder focused on multimodality and efficiency. We regularly open-source our technology and offer enterprise deployments of our agentic platforms.

For business:

Playground

For creators:

Reka Clip

Try in Playground

[CAPABILITIES]

Our platform,
Infinite possibilities

Vision

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

EXPLORE VISION

Research

State-of-the-art foundation models pushing the boundaries of multimodal AI. Access cutting-edge capabilities for complex reasoning tasks.

VIEW RESEARCH

Speech

Advanced audio understanding that goes beyond transcription. Extract meaning, context, and insights from any audio source.

TRY SPEECH

[WHY REKA]

Built for the physical world

While generic LLMs excel at text, Reka is purpose-built for multimodal perception and reasoning—understanding the visual, auditory, and contextual signals that define the real world.

True Multimodal

Native video, image, audio, and text understanding—not bolted-on features.

Deploy Anywhere

Cloud, on-prem, VPC, or fully air-gapped environments.

Domain Fine-Tuning

Customize models for your specific use case and data.

Enterprise Ready

Built for reliability, security, and compliance at scale.

[SOLUTIONS]

Customized Intelligence, Enterprise-Ready.

Media and entertainment

Advertising

Security & Defense

Fleet Management

Content Creators

Content Research & Creation

Research engagement and find out which topics resonate with your target audience across social media and beyond. Edit raw footage and long videos into highlights and teaser clips with just prompts.

Archive Repurposing

Search decades of videos, interviews, and promos by scene, dialog, or object. Repurpose and auto-tag old assets for licensing, publishing, or remixing.

Game Highlights & Reels

Automatically create highlight reels and summaries from sports footage. Capture key plays with timestamps and generate broadcast-ready content.

Performance Analysis & Intel

Search game footage for plays, movements, and tactical patterns. Generate reports that track player progress and team performance over time.

Explore Media

[MODELS]

Models Built to Perform

Whether you're crafting a lightweight assistant or an enterprise-grade agent, Reka gives you the tools to build boldly.

Spark [1B]
Ultra-compact and versatile, perfect for embedding AI into the smallest devices without sacrificing essential capabilities.
Edge [7B]
The fastest 7B-8B vision-language model that delivers real-time intelligence with lower latency and compute.
Flash [21B]
Reliable and cost-efficient, this model offers fast, robust reasoning for everyday applications.
Core [67B]
A high-performance multimodal model built to handle text, image, audio, and video with precision and fluidity for the most complex tasks.

[FOR DEVELOPERS]

Start building in minutes

Integrate multimodal AI into your applications with our developer-friendly APIs, comprehensive documentation, and ready-to-use examples.

View documentation

Powerful APIs
RESTful APIs with SDKs for Python, JavaScript, and more.
Comprehensive Docs
Detailed documentation, tutorials, and quickstart guides.
Sample Apps
Ready-to-use examples for common use cases.

[COMMUNITY]

Open Source, by Design

We open-source key components of our research, from our reasoning model to our eval tools, code model, and quantization stack. Built in-house, shared intentionally.

Hugging Face

Download our reasoning model and more—built from scratch for transparency and performance.

Download It Now

GitHub

Browse our eval tools, code model, and quantization stack. Fully documented, freely available.

View on GitHub

Discord

Have questions, ideas, or feature requests? Connect directly with our team and open source users.

Join our Discord
