Close Menu
    Facebook X (Twitter) Instagram
    • Privacy Policy
    • Terms Of Service
    • Social Media Disclaimer
    • DMCA Compliance
    • Anti-Spam Policy
    Facebook X (Twitter) Instagram
    Fintech Fetch
    • Home
    • Crypto News
      • Bitcoin
      • Ethereum
      • Altcoins
      • Blockchain
      • DeFi
    • AI News
    • Stock News
    • Learn
      • AI for Beginners
      • AI Tips
      • Make Money with AI
    • Reviews
    • Tools
      • Best AI Tools
      • Crypto Market Cap List
      • Stock Market Overview
      • Market Heatmap
    • Contact
    Fintech Fetch
    Home»AI News»OceanBase Releases seekdb: An Open Source AI Native Hybrid Search Database for Multi-model RAG and AI Agents
    OceanBase Releases seekdb: An Open Source AI Native Hybrid Search Database for Multi-model RAG and AI Agents
    AI News

    OceanBase Releases seekdb: An Open Source AI Native Hybrid Search Database for Multi-model RAG and AI Agents

    November 27, 20255 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email
    kraken

    AI applications rarely deal with one clean table. They mix user profiles, chat logs, JSON metadata, embeddings, and sometimes spatial data. Most teams answer this with a patchwork of an OLTP database, a vector store, and a search engine. OceanBase released seekdb, an open source AI focused database (under the Apache 2.0 license). seekdb is described as an AI native search database that unifies relational data, vector data, text, JSON, and GIS in one engine and exposes hybrid search and in database AI workflows.

    What is seekdb?

    seekdb is positioned as the lightweight, embedded version of the OceanBase engine, aimed at AI applications rather than general purpose distributed deployments. It runs as a single node database, supports embedded mode and client or server mode, and remains compatible with MySQL drivers and SQL syntax.

    In the capability matrix, seekdb is marked as:

    • Embedded database supported
    • Standalone database supported
    • Distributed database not supported

    while the full OceanBase product covers the distributed case.

    From a data model perspective, seekdb supports:

    livechat
    • Relational data with standard SQL
    • Vector search
    • Full text search
    • JSON data
    • Spatial GIS data

    all inside one storage and indexing layer.

    Hybrid search as the core feature

    The main feature OceanBase pushes is hybrid search. This is search that combines vector based semantic retrieval, full text keyword retrieval, and scalar filters in a single query and a single ranking step.

    seekdb implements hybrid search through a system package named DBMS_HYBRID_SEARCH with two entry points:

    • DBMS_HYBRID_SEARCH.SEARCH which returns results as JSON, sorted by relevance
    • DBMS_HYBRID_SEARCH.GET_SQL which returns the concrete SQL string used for execution

    The hybrid search path can run:

    • pure vector search
    • pure full text search
    • combined hybrid search

    and can push relational filters and joins down into storage. It also supports query reranking strategies like weighted scores and reciprocal rank fusion and can plug in large language model based re-rankers.

    For retrieval augmented generation (RAG) and agent memory, this means you can write a single SQL query that does semantic matching on embeddings, exact matching on product codes or proper nouns, and relational filtering on user or tenant scopes.

    Vector and full text engine details

    At its core, seekdb exposes a modern vector and full text stack.

    For vectors, seekdb:

    • supports dense vectors and sparse vectors
    • supports Manhattan, Euclidean, inner product, and cosine distance metrics
    • provides in memory index types such as HNSW, HNSW SQ, HNSW BQ
    • provides disk based index types including IVF and IVF PQ

    Hybrid vector index show how you can store raw text, let seekdb call an embedding model automatically, and have the system maintain the corresponding vector index without a separate preprocessing pipeline.

    For text, seekdb offers full text search with:

    • keyword, phrase, and Boolean queries
    • BM25 ranking for relevance
    • multiple tokenizer modes

    The key point is that full text and vector indexes are first class and are integrated in the same query planner as scalar indexes and GIS indexes, so hybrid search does not need external orchestration.

    AI functions inside the database

    seekdb includes built in AI function expressions that let you call models directly from SQL, without a separate application service mediating every call. The main functions are:

    • AI_EMBED to convert text into embeddings
    • AI_COMPLETE for text generation using a chat or completion model
    • AI_RERANK to rerank a list of candidates
    • AI_PROMPT to assemble prompt templates and dynamic values into a JSON object for AI_COMPLETE

    Model metadata and endpoints are managed by the DBMS_AI_SERVICE package, which lets you register external providers, set URLs, and configure keys, all on the database side.

    Multimodal data and workloads

    seekdb is built to handle multiple data modalities in one node. It has a multimodal data and indexing layer that covers vectors, text, JSON, and GIS, and a multi-model compute layer for hybrid workloads across vector, full text, and scalar conditions.

    It also provides JSON indexes for metadata queries and GIS indexes for spatial conditions. This allows queries like:

    • find semantically similar documents
    • filter by JSON metadata like tenant, region, or category
    • constrain by spatial range or polygon

    without leaving the same engine.

    Because seekdb is derived from the OceanBase engine, it inherits ACID transactions, row and column hybrid storage, and vectorized execution, although high scale distributed deployments remain a job for the full OceanBase database.

    Comparison Table

    Key Takeaways

  • AI native hybrid search: seekdb unifies vector search, full text search and relational filtering in a single SQL and DBMS_HYBRID_SEARCH interface, so RAG and agent workloads can run multi signal retrieval in one query instead of stitching together multiple engines.
  • Multimodal data in one engine: seekdb stores and indexes relational data, vectors, text, JSON and GIS in the same engine, which lets AI applications keep documents, embeddings and metadata consistent without maintaining separate databases.
  • In database AI functions for RAG: With AI_EMBED, AI_COMPLETE, AI_RERANK and AI_PROMPT, seekdb can call embedding models, LLMs and rerankers directly from SQL, which simplifies RAG pipelines and moves more orchestration logic into the database layer.
  • Single node, embedded friendly design: seekdb is a single node, MySQL compatible engine that supports embedded and standalone modes, while distributed, large scale deployments remain the role of full OceanBase, which makes seekdb suitable for local, edge and service embedded AI workloads.
  • Open source and tool ecosystem: seekdb is open sourced under Apache 2.0 and integrates with a growing ecosystem of AI tools and frameworks, with Python support via pyseekdb and MCP based integration for code assistants and agents, so it can act as a unified data plane for AI applications.
  • frase
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Fintech Fetch Editorial Team
    • Website

    Related Posts

    DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

    DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

    June 26, 2026
    Exploring the societal impacts of AI | MIT News

    Exploring the societal impacts of AI | MIT News

    June 25, 2026
    Enterprise-grade AI image generation in 2 seconds is here: Krea 2 Raw and Turbo available as open weights under custom license

    Enterprise-grade AI image generation in 2 seconds is here: Krea 2 Raw and Turbo available as open weights under custom license

    June 24, 2026
    Mitigating vendor lock-in with Sakana AI Fugu multi-agent models

    Mitigating vendor lock-in with Sakana AI Fugu multi-agent models

    June 23, 2026
    Add A Comment

    Comments are closed.

    Join our email newsletter and get news & updates into your inbox for free.


    Privacy Policy

    Thanks! We sent confirmation message to your inbox.

    frase
    Latest Posts
    Forget Amazon Prime Days: Here's the Real Reason to Buy the Stock

    rewrite this title in other words: Forget Amazon Prime Days: Here’s the Real Reason to Buy the Stock

    June 26, 2026
    Cointelegraph

    Does Botanix’s Failure Prove Bitcoiners Don’t Care About DeFi?

    June 26, 2026
    DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

    DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

    June 26, 2026
    Give me 17 Minutes, I’ll Make you 1.5L/Month with AI (Passive)

    Give me 17 Minutes, I’ll Make you 1.5L/Month with AI (Passive)

    June 26, 2026
    AI for Beginners in 2026: Start With One Useful Workflow

    AI for Beginners in 2026: Start With One Useful Workflow

    June 26, 2026
    Customgpt
    LEGAL INFORMATION
    • Privacy Policy
    • Terms Of Service
    • Social Media Disclaimer
    • DMCA Compliance
    • Anti-Spam Policy
    Top Insights
    Cointelegraph

    Polymarket Third-Party Vendor Compromise Drains $2.9M from Users

    June 26, 2026
    Crypto M&A Surges to $7.23 Billion Despite Lowest Investor Count Since 2020

    rewrite this title in other words: Crypto M&A Surges to $7.23 Billion Despite Lowest Investor Count Since 2020

    June 26, 2026
    murf
    Facebook X (Twitter) Instagram Pinterest
    © 2026 FintechFetch.com - All rights reserved.

    Type above and press Enter to search. Press Esc to cancel.