Assessment Conversion Studio

Send us one AI-vulnerable assignment.
We'll turn it into a defensible performance task.

Any essay, problem set, lab report, or case study that AI can pass — QLM converts it into a simulation that captures how students think, not just what they produce.

Why this matters now: A 2026 study in Science by researchers at Berkeley, UTS, and Cornell found broad GenAI use among students and an estimated 9% rate of AI-assisted cheating. The authors call for discipline-specific assessment reform rather than blanket bans or universal detection. QLM is that reform — converting AI-vulnerable assignments into process-based simulations that make cheating irrelevant.

Chirikov, I., Smirnov, I., & Kizilcec, R. F. (2026). Generative AI use and misuse call for assessment reform in higher education. Science, 392(6800), 818–820. DOI: 10.1126/science.aec5115

Convert an Assignment → How It Works

What QLM converts

AI-vulnerable

Take-home essay

→

AI-defensible

Scenario with evidence selection, reasoning defense, and oral probe

AI-vulnerable

Problem set

→

AI-defensible

Interactive decision path with prediction, strategy feedback, and misconception capture

AI-vulnerable

Lab report

→

AI-defensible

Virtual experiment with variable manipulation, observation, and CER evidence

AI-vulnerable

Case study analysis

→

AI-defensible

Dynamic case simulation with changing facts, tradeoffs, and decision rationale

AI-vulnerable

Coding assignment

→

AI-defensible

Debug and code-review simulation with AI-output verification and design defense

AI-vulnerable

Clinical worksheet

→

AI-defensible

Patient scenario with real-time vitals, escalation decisions, and reasoning trace

AI-vulnerable

Discussion board post

→

AI-defensible

Prediction → consequence → explanation cycle with peer counter-argument

The research is clear: detection doesn't work. Reform does.

What doesn't work

AI detection tools produce false positives on ESL students and false negatives on sophisticated users. The Berkeley study found detection is unreliable at scale.

Blanket bans are unenforceable and counterproductive. Students use AI anyway; bans just make the use invisible.

Honor codes alone didn't prevent the 9% assisted-cheating rate the researchers found.

What the researchers recommend

Discipline-specific reform — redesign assessments so AI cannot substitute for the reasoning they measure.

Process-based evidence — evaluate how students think, not just what they submit.

Responsible AI integration — measure whether students used AI well, not just whether they used it.

"Rather than treating GenAI use as a binary (cheating or not), institutions should redesign assessments to capture the reasoning process and make AI substitution irrelevant."

— Adapted from Chirikov, Smirnov, & Kizilcec (2026), Science 392(6800)

QLM's Assessment Conversion Studio is the direct implementation of this recommendation. We don't detect AI. We don't ban AI. We redesign the assessment so that the simulation is the assessment and the reasoning process is the evidence.

What the system captures

Six dimensions of process evidence — not just whether the answer is right.

Prediction Quality

85%

Decision Quality

78%

Explanation Depth

72%

Revision Quality

68%

AI-Use Quality

90%

Transfer Ability

65%

Every score is backed by behavioral evidence from the simulation — predictions made, decisions taken, explanations written, revisions after feedback.

Five capabilities no other platform offers

Interconnected Problem Chains

Every step depends on the previous one. Students can't skip ahead or delegate to AI — because step 3 requires the actual output of step 2.

Empirically validated: interconnected assessments correlate r=0.954 with true competency (arXiv:2512.10758)

Learning Visibility Timeline

See every prediction, decision, revision, and strategy shift as it happened. The timeline IS the evidence — not the final submission.

Based on the Learning Visibility Framework (Davalos & Zhang, 2026)

Automated Oral Defense

AI-generated follow-up questions probe each student's understanding. Three independent evaluators score responses. Higher concordance than human graders.

Validated by Barba & Stegner (2026) and Ipeirotis & Rizakos (2026)

Invisible Assessment

Students don't know they're being tested. The simulation IS the assessment. Behavioral patterns predict competency without explicit test questions.

Built on stealth assessment research (Shute, 2011–2025; JRTE 2025)

Universal Design for Learning

Every assessment offers multiple paths to demonstrate the same competency. Visual, written, interactive, or oral — students choose their strongest modality. Same rubric. Same rigor.

Implements UDL framework for simulation-based assessment (Advances in Simulation, 2025)

How it works

Paste your assignment

Upload or describe any existing assessment — essay prompt, problem set, lab procedure, case study, coding spec.

QLM extracts objectives and flags AI risk

The system identifies learning objectives, maps to Bloom's taxonomy, and shows exactly which parts AI can substitute.

A simulation blueprint is generated

Your assignment becomes a scenario with decision points, branching consequences, and process-evidence capture — matched to the best simulation template.

You review, customize, and deploy

Preview the simulation yourself. Edit decision points, adjust rubric weights, set AI-use policy. Approve when ready.

Students take it. Evidence flows.

Every prediction, decision, explanation, and revision is captured. You see misconception clusters, mastery confidence, and oral defense probes — not just grades.

What you get

Objective Extraction

QLM identifies the skills and competencies your assignment was designed to measure — tagged by Bloom's taxonomy level.

AI Risk Analysis

See exactly which parts of your assignment AI can pass without understanding. Severity-rated by component.

Simulation Blueprint

A runnable scenario with decision points, branching consequences, and variables that differ between students.

Process-Evidence Rubric

Your old rubric mapped to 6 process dimensions: prediction, decision, explanation, revision, AI-use, and transfer.

Oral Defense Prompts

Follow-up questions that verify authenticity and probe reasoning depth. Targeted to each student's evidence gaps.

Evidence Dashboard

See process evidence, misconception clusters, mastery confidence, and AI transparency for every student.

Try it now — no sign-up required.

Paste one assignment. See the simulation before your students do. 2 minutes.

Convert Free →

Know a professor or dean struggling with AI and assessment?

Share this tool with your department. It's free to try.

Email a Colleague Share on LinkedIn

"I found a tool that converts AI-vulnerable assignments into simulation-based assessments. Free to try: quantumlearningmachines.com/conversion-studio"

Send us one AI-vulnerable assignment.We'll turn it into a defensible performance task.

What QLM converts

The research is clear: detection doesn't work. Reform does.

What the system captures

Five capabilities no other platform offers

Interconnected Problem Chains

Learning Visibility Timeline

Automated Oral Defense

Invisible Assessment

Universal Design for Learning

How it works

Paste your assignment

QLM extracts objectives and flags AI risk

A simulation blueprint is generated

You review, customize, and deploy

Students take it. Evidence flows.

What you get

Objective Extraction

AI Risk Analysis

Simulation Blueprint

Process-Evidence Rubric

Oral Defense Prompts

Evidence Dashboard

Try it now — no sign-up required.

Know a professor or dean struggling with AI and assessment?

Send us one AI-vulnerable assignment.
We'll turn it into a defensible performance task.