Hybrid Search
Overview
Hybrid Search combines semantic (vector-based) and lexical (keyword-based) search. Because a query is matched on both its exact terms and its meaning, results are more accurate and relevant than with either method alone.
Hybrid Search can be configured and enabled in DataDios Hybrid Search Configuration.
Key Characteristics
- Semantic Understanding: Returns results based on the meaning of a query, not just keywords.
- Search Modes: Offers Full Text, Semantic, and Hybrid search types.
- Module & User-Level Control: Admins can specify which modules and users can access semantic search.
- Relevance Ranking: Results are scored and ranked by contextual relevance.
Benefits
Best of Both Worlds: Precision + Semantics
- Lexical search (e.g., BM25) excels at exact keyword matches.
- Semantic search (e.g., vector embeddings) captures meaning and context.
- Hybrid search combines both, providing relevant results even when:
  - Users use different terminology (e.g., "laptop" vs. "notebook")
  - Queries are vague, long, or in natural language
  - Exact keywords are missing but the intent is clear
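The combination described above can be sketched in a few lines. This page does not say how DataDios merges lexical and semantic results, so the example below uses reciprocal rank fusion (RRF), one common fusion technique, purely as an illustration. The function name, the document IDs, and the constant `k=60` are assumptions made for the example, not DataDios settings.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a commonly used default (assumption).
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest score first; ties are broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results for the query "laptop".
# The lexical ranker only sees documents containing the word "laptop".
lexical = ["doc_laptop_specs", "doc_laptop_review", "doc_warranty"]
# The semantic ranker also surfaces a document that says "notebook".
semantic = ["doc_notebook_guide", "doc_laptop_specs", "doc_laptop_review"]

fused = reciprocal_rank_fusion([lexical, semantic])
print(fused)
```

Documents that both rankers agree on rise to the top, while `doc_notebook_guide`, which the lexical ranker missed entirely, still appears in the fused list.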
Improved User Experience
- Helps users find relevant content faster
- Reduces frustration from irrelevant or missing results
- Enhances discovery of related information
- Improves overall satisfaction and productivity
Improved Recall Without Losing Precision
- Lexical-only search may miss results due to vocabulary mismatch
- Semantic-only search may return overly broad or imprecise results
- Hybrid search increases recall (more of the relevant results are found) while maintaining precision (the results returned stay relevant to the query)
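To make the recall and precision trade-off concrete, here is a small worked example. The document IDs and result sets are invented for illustration and are not measurements from DataDios; the helper name `precision_recall` is likewise hypothetical.

```python
def precision_recall(returned, relevant):
    """Return (precision, recall) for a result list against the relevant set.

    precision = relevant results returned / all results returned
    recall    = relevant results returned / all relevant results
    """
    returned, relevant = set(returned), set(relevant)
    hits = len(returned & relevant)
    precision = hits / len(returned) if returned else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Toy data: four documents are actually relevant to the query.
relevant = {"d1", "d2", "d3", "d4"}

# Lexical-only: precise, but misses documents that use different vocabulary.
lexical_hits = ["d1", "d2"]
# Semantic-only: finds more relevant documents, plus some loosely related ones.
semantic_hits = ["d1", "d3", "d4", "d7", "d8", "d9"]
# Hybrid: merged and re-ranked, keeping the top five.
hybrid_hits = ["d1", "d2", "d3", "d4", "d7"]

for name, hits in [("lexical", lexical_hits),
                   ("semantic", semantic_hits),
                   ("hybrid", hybrid_hits)]:
    precision, recall = precision_recall(hits, relevant)
    print(f"{name}: precision={precision:.2f} recall={recall:.2f}")
```

In this toy case the hybrid list finds every relevant document while keeping most of what it returns relevant. Real results depend on the data and on how the two rankings are merged.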
Flexibility and Customization
- Admins can configure which modules and users access semantic search
- Tailor the search experience to specific user needs and data types
- Supports different search types (Full Text, Semantic, Hybrid) based on context
- Enables gradual adoption of semantic search alongside keyword search
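The module-level and user-level control described above can be pictured as a simple check that runs before a search. The sketch below is hypothetical: the names `SearchType`, `SemanticAccess`, and `resolve_search_type` are invented here, and it assumes that both the module and the user must be allowed, with a fallback to Full Text otherwise. The actual DataDios behavior may differ.

```python
from dataclasses import dataclass
from enum import Enum


class SearchType(Enum):
    FULL_TEXT = "full_text"
    SEMANTIC = "semantic"
    HYBRID = "hybrid"


@dataclass(frozen=True)
class SemanticAccess:
    """Hypothetical admin configuration: who may use semantic features."""
    modules: frozenset
    users: frozenset


def resolve_search_type(requested, module, user, access):
    """Return the search type to run for this request.

    Full Text is always allowed. Semantic and Hybrid are allowed only when
    both the module and the user are enabled (assumption); otherwise the
    request falls back to Full Text.
    """
    if requested is SearchType.FULL_TEXT:
        return requested
    if module in access.modules and user in access.users:
        return requested
    return SearchType.FULL_TEXT


# Example: semantic search is enabled for one module and one user.
access = SemanticAccess(modules=frozenset({"catalog"}),
                        users=frozenset({"alice"}))

print(resolve_search_type(SearchType.HYBRID, "catalog", "alice", access))
print(resolve_search_type(SearchType.HYBRID, "catalog", "bob", access))
```

Falling back to Full Text rather than rejecting the query is one way to support gradual adoption: users without semantic access still get keyword results.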
Best Practices
- Choose search type based on user needs and data characteristics
- Apply semantic search only to modules where it adds meaningful value
- Ensure underlying data is clean, consistent, and semantically rich:
  - Use clear object naming
  - Maintain well-labeled fields
  - Provide meaningful descriptions
- Train users on semantic behavior:
  - Educate users on how to phrase queries for optimal results
  - Clarify that ranking is based on meaning, not just word matches
- Communicate submission finality:
  - Inform admins that once submitted, settings cannot be changed
  - Encourage careful review before submission
For additional support or advanced features, contact the DataDios support team.