Multimodal AI You Can Deploy Anywhere

Multimodal AI You Can Deploy Anywhere

Multimodal AI You Can Deploy Anywhere

We are an AI model builder, with a focus on multimodal and efficiency. We regularly open source our technology and offer enterprise deployments of our agentic platforms.

We are an AI model builder, with a focus on multimodal and efficiency. We regularly open source our technology and offer enterprise deployments of our agentic platforms.

[CAPABILITIES]

Our platform,
Infinite possibilities

[CAPABILITIES]

Our platform,
Infinite possibilities

Vision

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Vision

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Research

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Research

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Speech

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Speech

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

Complete multimodal perception and reasoning for video, images, and beyond. Captioning, detection, embeddings, Q&A, and search in one unified platform.

[CAPABILITIES]

Our platform,
Infinite possibilities

[WHY REKA]

Built for the physical world

While generic LLMs excel at text, Reka is purpose-built for multimodal perception and reasoning—understanding the visual, auditory, and contextual signals that define the real world.

While generic LLMs excel at text, Reka is purpose-built for multimodal perception and reasoning—understanding the visual, auditory, and contextual signals that define the real world.

True Multimodal

Native video, image, audio, and text understanding—not bolted-on features.

Native video, image, audio, and text understanding—not bolted-on features.

Deploy Anywhere

Cloud, on-prem, VPC, or fully air-gapped environments.

Cloud, on-prem, VPC, or fully air-gapped environments.

Domain Fine-Tuning

Customize models for your specific use case and data.

Customize models for your specific use case and data.

Enterprise Ready

Built for reliability, security, and compliance at scale.

Built for reliability, security, and compliance at scale.

[SOLUTIONS]

Customized Intelligence, Enterprise-Ready.

Media and entertainment

Advertising

Security & Defense

content creators

Illustration of the Content Research & Creation workflow: a search bar for trending videos connects to a button to "GENERATE A VIDEO," which produces video thumbnails.

Content Research & Creation

Research engagement and which topics are resonating within your target audience across social media and more. Edit raw footage and long videos into highlights and teaser clips with just prompts.

Illustration for Archive Repurposing: a search for "a silver bag" resulting in a 'Detected' label over an image of a person wearing a silver outfit within stacked archive photos.

Archive Repurposing

Search decades of videos, interviews, and promos by scene, dialog, or object. Repurpose and auto-tag old assets for licensing, publishing, or remixing.

Illustration for Game Highlights & Reels: a search for "CREATE HIGHLIGHT REELS" results in a video frame showing football players in action with a timestamp.

Game Highlights & Reels

Automatically create highlight reels and summaries from sports footage. Capture key plays with timestamps and generate broadcast-ready content.

Illustration for Performance Analysis & Intel: a search for "Search for ball reception" identifies a 'Detected' football play, followed by a report being generated below.

Performance Analysis & Intel

Search game footage for plays, movements, and tactical patterns. Generate reports that track player progress and team performance over time.

[SOLUTIONS]

Customized Intelligence, Enterprise-Ready.

Media and entertainment

Advertising

Security & Defense

content creators

[SOLUTIONS]

Customized Intelligence, Enterprise-Ready.

Media and entertainment

Advertising

Security & Defense

content creators

Illustration of the Content Research & Creation workflow: a search bar for trending videos connects to a button to "GENERATE A VIDEO," which produces video thumbnails.

Content Research & Creation

Research engagement and which topics are resonating within your target audience across social media and more. Edit raw footage and long videos into highlights and teaser clips with just prompts.

Illustration for Archive Repurposing: a search for "a silver bag" resulting in a 'Detected' label over an image of a person wearing a silver outfit within stacked archive photos.

Archive Repurposing

Search decades of videos, interviews, and promos by scene, dialog, or object. Repurpose and auto-tag old assets for licensing, publishing, or remixing.

Illustration for Game Highlights & Reels: a search for "CREATE HIGHLIGHT REELS" results in a video frame showing football players in action with a timestamp.

Game Highlights & Reels

Automatically create highlight reels and summaries from sports footage. Capture key plays with timestamps and generate broadcast-ready content.

Illustration for Performance Analysis & Intel: a search for "Search for ball reception" identifies a 'Detected' football play, followed by a report being generated below.

Performance Analysis & Intel

Search game footage for plays, movements, and tactical patterns. Generate reports that track player progress and team performance over time.

[MODELS]

Models Built to Perform

Whether you're crafting a lightweight assistant or an enterprise-grade agent, Reka gives you the tools to build boldly.

  • [S]
    [S]
    [S]
    [S]
    [S]

    Spark [1B]

    Ultra-compact and versatile, perfect for embedding AI into the smallest devices without sacrificing essential capabilities.

  • [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]

    Flash [21B]

    Reliable and cost-efficient, this model offers fast, robust reasoning for everyday applications.

  • [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]

    Core [67B]

    A high-performance multimodal model built to handle text, image, audio, and video with precision and fluidity for the most complex tasks.

  • [S]
    [S]
    [S]
    [S]
    [S]

    Spark [1B]

    Ultra-compact and versatile, perfect for embedding AI into the smallest devices without sacrificing essential capabilities.

  • [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]
    [F]

    Flash [21B]

    Reliable and cost-efficient, this model offers fast, robust reasoning for everyday applications.

  • [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]
    [C]

    Core [67B]

    A high-performance multimodal model built to handle text, image, audio, and video with precision and fluidity for the most complex tasks.

Slide ->

[FOR DEVELOPERS]

Start building in minutes

Integrate multimodal AI into your applications with our developer-friendly APIs, comprehensive documentation, and ready-to-use examples.

  • Powerful APIs

    RESTful APIs with SDKs for Python, JavaScript, and more.

    RESTful APIs with SDKs for Python, JavaScript, and more.

    RESTful APIs with SDKs for Python, JavaScript, and more.

  • Comprehensive Docs

    Detailed documentation, tutorials, and quickstart guides.

    Detailed documentation, tutorials, and quickstart guides.

    Detailed documentation, tutorials, and quickstart guides.

  • Sample Apps

    Ready-to-use examples for common use cases.

    Ready-to-use examples for common use cases.

    Ready-to-use examples for common use cases.

[COMMUNITY]

Open Source, by Design

We open source key components of our research. From our reasoning model to our eval tools, code model, and quantization stack. Built in-house, shared intentionally.

  • SVG Preview

    Hugging Face

    Download our reasoning model and more—built from scratch for transparency and performance.

  • SVG Preview

    Hugging Face

    Download our reasoning model and more—built from scratch for transparency and performance.

  • SVG Preview

    Hugging Face

    Download our reasoning model and more—built from scratch for transparency and performance.

  • SVG Preview

    GitHub

    Browse our eval tools, code model, and quantization stack. Fully documented, freely available.

  • SVG Preview

    GitHub

    Browse our eval tools, code model, and quantization stack. Fully documented, freely available.

  • SVG Preview

    GitHub

    Browse our eval tools, code model, and quantization stack. Fully documented, freely available.

  • SVG Preview

    Discord

    Have questions, ideas, or feature requests? Connect directly with our team and open source users.

  • SVG Preview

    Discord

    Have questions, ideas, or feature requests? Connect directly with our team and open source users.

  • SVG Preview

    Discord

    Have questions, ideas, or feature requests? Connect directly with our team and open source users.

Abstract 3D composition of black and pink cubes of various sizes, many featuring the Reka "[R]" logo, representing modularity and the building blocks of multimodal AI.