Domain-tunable concept extraction
Entity, concept, and sentiment models tune to customer vocabulary and subject matter. The result is relevance that generic search engines cannot match.
OpenText • Content services
Knowledge Discovery is the IDOL-based cognitive search and text analytics product for extracting meaning from unstructured enterprise content. Merito designs the analytics pipelines.
Merito sells OpenText Knowledge Discovery (IDOL lineage) and delivers cognitive search, text analytics, concept extraction, and privacy-preserving classification over controlled content.
What it is
OpenText Knowledge Discovery is the product most customers know as IDOL. It is a cognitive search and text analytics engine designed to extract meaning from unstructured content and documents, emails, transcripts, multimedia, and social feeds. Knowledge Discovery is not a chat AI; it is a deep analytics surface that powers investigation, discovery, and classification workloads at enterprise scale.
The product's defining property is domain-specific modeling. Concept extraction, entity recognition, sentiment analysis, and classification tune to the customer's vocabulary and subject matter. For regulated customers (legal, financial, pharma, government) that specificity matters; generic search engines cannot replace Knowledge Discovery's trained models.
Knowledge Discovery complements Content Aviator rather than replacing it. Aviator is a conversational grounded-AI surface. Knowledge Discovery is the deep analytics surface underneath. Programs running both use Aviator for user-facing retrieval and Knowledge Discovery for analyst-driven investigation and classification.
Ideal use cases
What it is best at
Entity, concept, and sentiment models tune to customer vocabulary and subject matter. The result is relevance that generic search engines cannot match.
Ingestion spans documents, email, multimedia, social, and database content. The analytics layer works across sources, not just within a single repository.
Legal, defense, intelligence, and financial services programs have used IDOL for decades in investigation and discovery workloads where accuracy matters.
Core capabilities
Ranked retrieval using concept and entity models rather than keyword matching alone.
Concept-based search
Queries ranked by semantic proximity in addition to keyword match.
Entity recognition
Named entity extraction (people, places, organizations, custom entities) for facet-driven search.
Multi-language support
Processing across multiple languages with consistent ranking.
Deep text analysis for classification, sentiment, and concept extraction.
Classification
Automated content classification against customer-trained taxonomies.
Sentiment analysis
Tone and sentiment scoring across content sources.
Summarization
Extractive summarization for long-form content.
Broad content ingestion across document stores, databases, and specialized sources.
File and document ingestion
Broad file-format support through KeyView (File Content Extraction).
Database and API ingestion
Connectors for databases, message queues, and enterprise APIs.
Multimedia analytics
Speech-to-text, image recognition, and video analytics for multi-modal investigation.
Where it fits in the stack
Deployment and implementation
Licensing and packaging
Knowledge Discovery standard
Cognitive search, concept extraction, and classification for enterprise content.
Best for: Programs running analytics workloads over unstructured enterprise content.
Knowledge Discovery with multimedia
Standard plus speech-to-text, image, and video analytics for multi-modal investigation.
Best for: Legal, intelligence, and compliance programs with multimedia evidence.
Merito services
Merito sells licenses and the delivery work around them. Pick the service that matches where you are in the lifecycle.
Ingestion design, taxonomy tuning, concept model configuration, and analyst workbench integration.
Explore service02Analytics pipelines integrated with customer-specific data sources and workbenches.
Explore service03Named engineer, priority SLAs, and release-time coverage for Knowledge Discovery in production.
Explore service04Long-term run support including ongoing model tuning and corpus management.
Explore service05Role-based training for administrators, model engineers, and analysts.
Explore service06Merito-placed analytics engineers and content classification specialists.
Explore serviceKnowledge Discovery licensing
Merito sells OpenText Knowledge Discovery and delivers ingestion design, taxonomy tuning, analytics pipelines, and analyst workbench integration that turn IDOL output into real investigation value.
Merito point of view
Merito has seen customers try to use Knowledge Discovery as a consumer-facing search experience and find it overkill, and try to use Content Aviator for deep analyst investigation and find it underkill. The products target different audiences. Merito's guidance and Aviator for user-facing grounded AI, Knowledge Discovery for analyst-driven investigation. Programs often run both.
Taxonomy tuning is the work. Generic Knowledge Discovery output produces generic results; the customer's domain-specific taxonomy and concept model are what deliver the lift. Expect taxonomy work to be a meaningful portion of the rollout timeline.
For legal and regulated investigation workloads specifically, the combination of Knowledge Discovery (analytics) with eDiscovery and Legal Solutions (workflow) could be the right stack. The two products are designed to work together.
What buyers usually underestimate
Related from Merito
Related solutions
Related services
Related products
Frequently Asked Questions
Consultation request
Share the analytics use case and the content sources in scope. A Merito analytics specialist follows up within one business day.
Analyst-first
Knowledge Discovery targets analyst-driven workloads where accuracy and taxonomy depth matter more than conversational surface.
Taxonomy tuning
Merito tunes concept and classification models to the customer's vocabulary before go-live.
Next step
A Merito Knowledge Discovery engagement starts with the analytics use case and the customer's domain-specific taxonomy.