True snippet-level detection (not only file-level)
Fingerprint matching extends below the file boundary into individual code snippets. Catches small fragments that file-level scanners miss.
SCANOSS • Application security
SCANOSS Snippet Detection performs file-level and snippet-level fingerprint matching against the OSSKB knowledge base of 100M+ open-source files to identify origin, license, version, and known issues for vendored, copy-pasted, and AI-generated code that manifest-based SCA cannot detect.
Merito sells SCANOSS Snippet Detection and operates the threshold calibration, AI code-assistant adoption discipline, license-review workflow, and remediation operating model that turn snippet detection into a working compliance and AppSec layer.
What it is
SCANOSS Snippet Detection extends the SCANOSS open-source-first SCA platform into the snippet-level surface. Fingerprint matching at file and snippet level against the OSSKB knowledge base of 100M+ open-source files surfaces vendored code, copy-pasted snippets, and AI-generated code that resembles licensed open-source. Manifest-based SCA misses these categories entirely. Snippet Detection is the layer that catches them.
AI code-assistant adoption makes Snippet Detection more important. A 2025 SCANOSS research study scanned 10,000 LLM-generated code samples against the OSSKB and found approximately 30% matched at a 10% similarity threshold. Over 1% still matched at a 30% threshold. Programs adopting Copilot, Cursor, or Claude Code at scale ship code that resembles licensed open-source unless detection runs at PR-time.
Snippet Detection is a detection layer. Programs that adopt it without an OSS approval workflow generate volume that does not turn into action. Merito's standard rollout pairs Snippet Detection with the customer's license-review workflow. Snippet matches surface for review by legal and AppSec teams against the customer's license tolerance policy. Confirmed matches drive remediation (rewrite, attribution, or removal). Unconfirmed matches inform the customer's open-source policy.
Ideal use cases
What it is best at
Fingerprint matching extends below the file boundary into individual code snippets. Catches small fragments that file-level scanners miss.
The Open Source Software Knowledge Base catalogs fingerprints from 100M+ open-source files. Matches AI-generated and copy-pasted code against the actual open-source ecosystem rather than a smaller commercial dataset.
2025 SCANOSS research study scanned 10,000 LLM-generated code samples and found approximately 30% matched OSSKB at 10% similarity. Evidence that the architecture catches AI-generated open-source slipping into production.
Customers configure similarity thresholds based on tolerance for false positives. Higher thresholds reduce volume; lower thresholds catch more matches.
Core capabilities
Where Snippet Detection actually finds undeclared open-source code.
File-level fingerprint matching
Whole-file fingerprints matched against the OSSKB. Catches vendored files copied from open-source projects.
Snippet-level fingerprint matching
Code-fragment fingerprints matched at sub-file granularity. Catches small snippets, function bodies, and inline code.
AI-generated code detection
Catches code generated by Copilot, Cursor, Claude Code, and other AI code assistants that resembles licensed open-source.
Vendored code detection
Catches files copied from open-source projects into the customer's repository without dependency-manager declarations.
How Snippet Detection consumes the SCANOSS knowledge base.
100M+ file fingerprint catalog
Continuous matching against the open-source ecosystem at scale.
Configurable similarity thresholds
Customer-tuned similarity thresholds match the customer's tolerance for false positives.
Match metadata
Each match returns origin project, license, version, and known issue references.
Where Snippet Detection findings reach legal, AppSec, and developer teams.
PR-time scanning
Scans run on PR creation/update with findings posted as PR comments on GitHub, GitLab, Azure DevOps, and Bitbucket.
License-review workflow
Findings route into legal-team review queues for license compliance assessment.
SCANOSS SCA pairing
Snippet Detection findings consolidate with SCA findings under the same console for unified open-source governance.
How findings turn into audit-ready evidence.
SBOM augmentation
Snippet matches augment the SCA SBOM with declaration-status metadata for previously-undeclared components.
License-evidence packages
Customer-specific evidence packages for license-compliance audits.
AI code-assistant adoption tracking
Match volume against AI-generated code provides metrics on AI code-assistant compliance posture.
Where it fits in the stack
Deployment and implementation
Licensing and packaging
SCANOSS Snippet Detection (Commercial)
Commercial enterprise edition with managed updates, support, and scale.
Best for: Programs running production AppSec with predictable SLAs and managed operations.
SCANOSS Snippet Detection (Open-Source)
Community open-source engine with basic snippet detection capability.
Best for: Pilot programs or organizations with internal capacity to operate open-source AppSec tools.
Merito services
Merito sells licenses and the delivery work around them. Pick the service that matches where you are in the lifecycle.
Engine deployment, similarity threshold calibration, license-review workflow setup, and remediation operating model design.
Explore service02Supply-chain program scoping for Snippet Detection adoption alongside Black Duck SCA snippet matching, FossID, and other snippet-detection options.
Explore service03PR-time snippet scanning gates and remediation routing across GitHub, GitLab, Azure DevOps, and Bitbucket.
Explore service04Developer-facing AI code-assistant adoption discipline and license-compliance training.
Explore service05Named engineer, priority SLAs, and release-window coverage for Snippet Detection in production.
Explore service06Long-term run support including ongoing threshold calibration, license-review queue operations, and AI code-assistant compliance reporting.
Explore service07Role-based training for AppSec engineers, legal teams, and developers using Snippet Detection findings.
Explore serviceSnippet Detection licensing
AI-generated open-source needs detection at PR-time and a workflow that turns matches into action. Buy Snippet Detection through Merito and get the calibration, the workflow, and the remediation discipline together.
Merito point of view
AI code assistants like Copilot, Cursor, and Claude Code generate substantial portions of new code. The 2025 SCANOSS research study showing 30% of LLM-generated code matched the OSSKB at a 10% similarity threshold demonstrates the magnitude of the AI-generated-OSS problem. Programs running pure manifest-based SCA miss this category entirely. Snippet Detection is the layer that catches it.
Vendored and copy-pasted code is the older problem Snippet Detection still addresses. Programs with significant historical OSS adoption have files copied from open-source projects without dependency-manager declarations. Manifest-only SCA cannot see those files. Snippet Detection identifies them and surfaces license posture for legal review.
Snippet Detection is a detection layer. Programs that adopt it without a license-review workflow generate volume that does not turn into action. Merito's rollout pairs Snippet Detection with the customer's license-review workflow so confirmed matches drive remediation (rewrite, attribution, or removal) and the customer's open-source policy reflects the actual posture.
What buyers usually underestimate
Related from Merito
Related solutions
Related services
Related products
Frequently Asked Questions
Consultation request
Share your AI code-assistant adoption posture, repository inventory, and license-review workflow. A Merito SCANOSS specialist follows up within one business day.
True snippet detection
Match granularity extends below file boundary into individual code snippets. Catches small fragments file-level scanners miss.
AI code-assistant coverage
2025 SCANOSS research study evidence. Programs adopting Copilot, Cursor, or Claude Code at scale need detection at PR-time.
Next step
A Snippet Detection engagement with Merito starts with the threshold calibration and license-review workflow setup. AI code-assistant adoption ships undeclared open-source unless PR-time detection runs.