Assessment Conversion Studio

Send us one AI-vulnerable assignment.
We'll turn it into a defensible performance task.

Any essay, problem set, lab report, or case study that AI can pass — QLM converts it into a simulation that captures how students think, not just what they produce.

Why this matters now: A 2026 study in Science by researchers at Berkeley, UTS, and Cornell found broad GenAI use among students and an estimated 9% rate of AI-assisted cheating. The authors call for discipline-specific assessment reform rather than blanket bans or universal detection. QLM is that reform — converting AI-vulnerable assignments into process-based simulations that make cheating irrelevant.

Chirikov, I., Smirnov, I., & Kizilcec, R. F. (2026). Generative AI use and misuse call for assessment reform in higher education. Science, 392(6800), 818–820. DOI: 10.1126/science.aec5115


What QLM converts

AI-vulnerable
Take-home essay
AI-defensible
Scenario with evidence selection, reasoning defense, and oral probe
AI-vulnerable
Problem set
AI-defensible
Interactive decision path with prediction, strategy feedback, and misconception capture
AI-vulnerable
Lab report
AI-defensible
Virtual experiment with variable manipulation, observation, and CER evidence
AI-vulnerable
Case study analysis
AI-defensible
Dynamic case simulation with changing facts, tradeoffs, and decision rationale
AI-vulnerable
Coding assignment
AI-defensible
Debug and code-review simulation with AI-output verification and design defense
AI-vulnerable
Clinical worksheet
AI-defensible
Patient scenario with real-time vitals, escalation decisions, and reasoning trace
AI-vulnerable
Discussion board post
AI-defensible
Prediction → consequence → explanation cycle with peer counter-argument

The research is clear: detection doesn't work. Reform does.

What doesn't work

AI detection tools produce false positives on ESL students and false negatives on sophisticated users. The Berkeley study found detection is unreliable at scale.

Blanket bans are unenforceable and counterproductive. Students use AI anyway; bans just make the use invisible.

Honor codes alone didn't prevent the 9% assisted-cheating rate the researchers found.

What the researchers recommend

Discipline-specific reform — redesign assessments so AI cannot substitute for the reasoning they measure.

Process-based evidence — evaluate how students think, not just what they submit.

Responsible AI integration — measure whether students used AI well, not just whether they used it.

"Rather than treating GenAI use as a binary (cheating or not), institutions should redesign assessments to capture the reasoning process and make AI substitution irrelevant."

— Adapted from Chirikov, Smirnov, & Kizilcec (2026), Science 392(6800)

QLM's Assessment Conversion Studio is the direct implementation of this recommendation. We don't detect AI. We don't ban AI. We redesign the assessment so that the simulation is the assessment and the reasoning process is the evidence.


What the system captures

Six dimensions of process evidence — not just whether the answer is right.

Prediction Quality
85%
Decision Quality
78%
Explanation Depth
72%
Revision Quality
68%
AI-Use Quality
90%
Transfer Ability
65%

Every score is backed by behavioral evidence from the simulation — predictions made, decisions taken, explanations written, revisions after feedback.


Five capabilities no other platform offers

Interconnected Problem Chains

Every step depends on the previous one. Students can't skip ahead or delegate to AI — because step 3 requires the actual output of step 2.

Empirically validated: interconnected assessments correlate r=0.954 with true competency (arXiv:2512.10758)

Learning Visibility Timeline

See every prediction, decision, revision, and strategy shift as it happened. The timeline IS the evidence — not the final submission.

Based on the Learning Visibility Framework (Davalos & Zhang, 2026)

Automated Oral Defense

AI-generated follow-up questions probe each student's understanding. Three independent evaluators score responses. Higher concordance than human graders.

Validated by Barba & Stegner (2026) and Ipeirotis & Rizakos (2026)

Invisible Assessment

Students don't know they're being tested. The simulation IS the assessment. Behavioral patterns predict competency without explicit test questions.

Built on stealth assessment research (Shute, 2011–2025; JRTE 2025)

Universal Design for Learning

Every assessment offers multiple paths to demonstrate the same competency. Visual, written, interactive, or oral — students choose their strongest modality. Same rubric. Same rigor.

Implements UDL framework for simulation-based assessment (Advances in Simulation, 2025)


How it works

Paste your assignment

Upload or describe any existing assessment — essay prompt, problem set, lab procedure, case study, coding spec.

QLM extracts objectives and flags AI risk

The system identifies learning objectives, maps to Bloom's taxonomy, and shows exactly which parts AI can substitute.

A simulation blueprint is generated

Your assignment becomes a scenario with decision points, branching consequences, and process-evidence capture — matched to the best simulation template.

You review, customize, and deploy

Preview the simulation yourself. Edit decision points, adjust rubric weights, set AI-use policy. Approve when ready.

Students take it. Evidence flows.

Every prediction, decision, explanation, and revision is captured. You see misconception clusters, mastery confidence, and oral defense probes — not just grades.


What you get

Objective Extraction

QLM identifies the skills and competencies your assignment was designed to measure — tagged by Bloom's taxonomy level.

AI Risk Analysis

See exactly which parts of your assignment AI can pass without understanding. Severity-rated by component.

Simulation Blueprint

A runnable scenario with decision points, branching consequences, and variables that differ between students.

Process-Evidence Rubric

Your old rubric mapped to 6 process dimensions: prediction, decision, explanation, revision, AI-use, and transfer.

Oral Defense Prompts

Follow-up questions that verify authenticity and probe reasoning depth. Targeted to each student's evidence gaps.

Evidence Dashboard

See process evidence, misconception clusters, mastery confidence, and AI transparency for every student.


Try it now — no sign-up required.

Paste one assignment. See the simulation before your students do. 2 minutes.

Convert Free →

Know a professor or dean struggling with AI and assessment?

Share this tool with your department. It's free to try.

Email a Colleague Share on LinkedIn

"I found a tool that converts AI-vulnerable assignments into simulation-based assessments. Free to try: quantumlearningmachines.com/conversion-studio"