The engine that
sees through obfuscation.
Zeus uses machine learning to understand code semantically—not just textually. Rename variables, shuffle functions, translate between languages. It doesn't matter. Zeus sees the logic underneath.
# Student renamed everything
# Zeus still finds the match
def calculate(x, y):
result = x * y + 42
return result
# → 94% match to GitHub repo
Not pattern matching. Understanding.
ML-Based Tokenization
Zeus converts code into semantic tokens that represent logic, not syntax. Variable names become irrelevant. The algorithm matches meaning.
Cross-Language Detection
Python to Java. C++ to Rust. Zeus understands the underlying algorithm regardless of which language it's written in.
Parallel Processing
Multi-threaded Node.js engine processes thousands of files simultaneously. A 500-student class takes seconds, not hours.
How Zeus processes code
Four stages transform raw source into comparable semantic fingerprints.
Parse
Language-specific parsers extract the AST. Comments and whitespace stripped. Pure structure remains.
Normalize
Variables become placeholders. Function names become generic tokens. The code is reduced to its logical skeleton.
Fingerprint
ML models generate semantic hashes. Similar logic produces similar fingerprints—regardless of surface changes.
Compare
Fingerprints are matched against peers, web sources, and known AI patterns. Matches ranked by confidence.
What Zeus catches
Every evasion technique students try. Zeus has seen it before.
Obfuscation Detection
Students try to hide. Zeus finds them anyway.
Variable renaming, function reordering, dead code insertion, loop restructuring—these tricks fool legacy tools. Zeus tokenizes at the semantic level, so surface changes don't affect detection.
- Variable/function renaming
- Code block reordering
- Dead code insertion
- Loop/conditional restructuring
ChatGPT & Copilot
Trained on millions of AI-generated samples
LLMs have telltale patterns—specific comment styles, naming conventions, code structures. Our ML models recognize these fingerprints with 95%+ accuracy.
- GPT-4 / GPT-4o detection
- Claude pattern matching
- GitHub Copilot signatures
- Confidence scoring
Zeus also includes access to MOSS, JPlag, and Dolos for institutions that want comparison baselines.
Zeus vs. legacy tools
"MOSS would take 3-4 days for our 400-student intro class. Zeus does it in under a minute. But the real win was catching students copying from GitHub and using ChatGPT—MOSS had no answer for that."
For the curious
What ML models does Zeus use?
Zeus uses a combination of transformer-based embeddings for semantic similarity and custom-trained classifiers for AI detection. The tokenization layer is proprietary but inspired by code2vec research.
How does cross-language detection work?
Code is normalized to cross-family tokens representing operations (assignment, loop, conditional, function call, etc.). Two implementations of the same algorithm—in Python and Java, for example—produce nearly identical token sequences.
What's the false positive rate?
Under 2% in our benchmarks. We tune for high precision because false accusations are worse than missed catches. Every match includes confidence scores so you can set your own thresholds.
Can students beat Zeus?
Not with known techniques. We've tested against every obfuscation method in academic literature plus novel approaches from red-team exercises. The semantic approach is fundamentally harder to fool than text matching.
How is my code stored?
Encrypted at rest (AES-256), encrypted in transit (TLS 1.3), auto-deleted after 90 days (configurable). FERPA compliant. We never use your code to train models.
Can I access Zeus via API?
Yes. Full REST API with batch processing support. See the docs →
See Zeus in action
Run your first check in under 2 minutes.