EssayHero is a free, volunteer-run project that helps students improve their English writing. Using AI and official marking criteria (HKDSE Level Descriptors, IELTS Band Descriptors), it provides detailed, paragraph-by-paragraph feedback to help students understand exactly where they can improve.

Why is EssayHero free?

This project is entirely privately funded with no commercial aims or affiliations.

Who is EssayHero designed for?

EssayHero is designed for three groups: students preparing for HKDSE or IELTS who want detailed feedback on their writing, English teachers exploring AI's potential in education, and EdTech researchers interested in AI-assisted learning. If you have suggestions or feedback, email hello@essayhero.app.

How does EssayHero analyse my essay?

Your submission is sent to an AI language model with a system prompt that incorporates the publicly available official assessment criteria for your chosen exam. The AI analyses your work against these standards, providing a level/band estimate and specific feedback for each section.

Is my essay stored or shared?

For individual use, your submission is processed and immediately discarded — we do not store it unless you are signed in (in which case it is saved to your dashboard). If you use the sharing feature, your work and feedback are stored to generate a shareable link, then automatically deleted after 30 days. For Batch Essay Marking (teachers only), student names and scores extracted from scanned pages are stored for 30 days for report delivery, then auto-deleted. Teachers are responsible for obtaining appropriate consent before uploading student work. In all cases, your data is never shared with third parties and is used solely to provide analysis. We comply with Hong Kong's Personal Data (Privacy) Ordinance.

What happens to student data in Batch Essay Marking?

When a teacher uploads a class PDF for batch marking, student names, class information, and candidate numbers are extracted from the scanned pages to match reports to the correct student. This data is stored for 30 days (the same retention period as shared reports) and then permanently auto-deleted. Only the uploading teacher can view, export, or share results. Teachers are responsible for informing students and obtaining consent — particularly for students under 18, where parental or guardian consent may be appropriate. Students who wish to have their data removed should contact their teacher, who can delete individual reports at any time.

How accurate are the scores?

EssayHero provides estimates based on official marking criteria, but these are indicative, not definitive. Exam boards have the final say on actual scores. If any feedback seems off, trust your teacher's judgement — they know your work better than any AI. Use EssayHero as a learning tool, not a crystal ball.

What are the different marking standards?

Lenient gives the benefit of the doubt and focuses on strengths — useful for building confidence. Baseline applies standard HKDSE marking criteria. Harsh is deliberately strict, reserving high scores only for exceptional work — useful for students aiming for top scores who want rigorous feedback.

What do the Writing Statistics mean?

Writing Statistics measure aspects of your writing that research links to quality. Vocabulary (MTLD) measures word variety — aim for 80-100 (below 60 needs improvement). Sentences (MLT) measures average sentence length — aim for 14-18 words (under 11 is too basic, over 22 may indicate run-ons). Readability shows complexity level — target Form 4-5 (FK 10-12). The shaded zone on each bar shows your target range. These metrics are based on peer-reviewed linguistic research.

What are the Writing Tools?

Writing Tools are free, AI-powered practice tools to help you improve specific aspects of your writing. Vocabulary Enhancement analyses your word choices and suggests improvements. Text Type Checker verifies you're following the correct format conventions. Rewrite Assistant helps you improve weak sentences step-by-step. Prompt Generator creates exam-style questions for practice. Sentence Drills provides exercises to improve sentence variety. Access them from the "Writing Tools" link in the header.

How EssayHero Marks IELTS Essays (And What It Can't Do)

Trust & Methodology

Key Takeaways

EssayHero assesses IELTS Writing Task 1 and Task 2 using the four official criteria, each scored 0-9, with the overall band calculated as the average rounded to the nearest 0.5
The band descriptors used are aligned with the publicly available IELTS band descriptors (Updated May 2023)
Task 1 assesses data description (Task Achievement); Task 2 assesses argumentation (Task Response) — the AI handles both with separate prompt configurations
AI scoring is indicative for practice purposes and is not equivalent to scores from an official IELTS test centre

If You Teach IELTS, You Already Know the Problem

Your students need practice. They need to write essays, get feedback, revise, and write again. The revision cycle is where improvement happens — not in the first draft, but in the gap between the first and second. Between the second and third.

The problem is that feedback is scarce. You cannot mark thirty Task 2 essays every week. Your students cannot wait five days between drafts for your comments.

And the generic "grammar checker" tools they find online do not understand the difference between Band 5 cohesion and Band 7 cohesion. They do not know what an overview is or why it matters for Task 1. They certainly cannot tell the difference between a position that is "discernible" (Band 4) and one that is "clear and well-developed" (Band 8).

EssayHero attempts to fill that gap. Not replace you — fill the gap between your feedback sessions. This post explains exactly how it works, so you can decide whether it is worth recommending to your students.

The Four Criteria

IELTS Writing is assessed across four criteria, each weighted equally at 25% of the total.

Task 2 Criteria

Task Response — Does the essay address the prompt? Is there a clear position? Are ideas relevant, extended, and supported?
Coherence & Cohesion — Is the essay logically organised? Are cohesive devices used effectively? Is paragraphing appropriate?
Lexical Resource — Is the vocabulary range sufficient? Are there less common or idiomatic items? Are word choices accurate?
Grammatical Range & Accuracy — Is there variety in sentence structures? Are complex structures accurate? Are errors frequent or rare?

Task 1 Criteria

For Task 1, the first criterion changes:

Task Achievement — Does the response cover key features of the visual data? Is there a clear overview? Is data reported accurately?

The other three criteria (Coherence & Cohesion, Lexical Resource, Grammatical Range & Accuracy) are shared across both tasks, though the descriptors differ in detail.

How Scores Are Calculated

EssayHero scores each criterion independently on the 0-9 scale, then calculates the overall band as the average of all four, rounded to the nearest 0.5. This is the same method used in the official IELTS exam.

How the AI Applies the Criteria

The AI receives a detailed system prompt containing the full band descriptors for each criterion at every band level from 0 to 9. These are aligned with the publicly available IELTS band descriptors. The prompt instructs the AI to match the candidate's writing against these descriptors and assign a score that fits.

Not Keyword-Counting

This is not a keyword-counting exercise. The AI reads the full essay, assesses each criterion independently, and provides paragraph-by-paragraph feedback that names the specific criterion being discussed.

When it says "your Coherence & Cohesion benefits from effective use of referencing here," it is drawing on the Band 7 descriptor for that criterion. When it says "to improve your Task Response, develop this supporting idea with a specific example," it is identifying a gap against the Band 6 or 7 descriptor.

Adaptive Feedback by Level

The feedback adapts to the candidate's level:

Band 3 response — Full worked examples and before-and-after rewrites. The candidate needs scaffolding, not a critique.
Band 7 response — Genuine praise for what works well, with perhaps one optional suggestion for reaching Band 8. The AI is instructed not to nitpick strong writing.

Task 1 vs Task 2: What Changes

EssayHero uses separate prompt configurations for Task 1 and Task 2. This matters because the two tasks test different skills.

Task 2: Argumentative or Discursive Essays

Task 2 is an argumentative or discursive essay. The AI looks for:

A clear position
Well-developed body paragraphs with supporting evidence
Logical progression
A conclusion that synthesises the argument
Minimum word count: 250

The first criterion is Task Response: does the essay address all parts of the prompt?

Task 1: Data Description

Task 1 is a data description task. The AI looks for:

An overview of the main trends or features (critical for Band 7+ in Task Achievement)
Accurate data reporting
Meaningful comparisons
Appropriate use of Task 1-specific vocabulary (trend language, comparison language, approximation)
Minimum word count: 150

Why the Distinction Matters

The distinction matters. A student who writes a beautifully argued opinion piece in response to a Task 1 prompt will score poorly on Task Achievement, even if the writing quality is high. The AI is configured to catch this.

Similarly, a Task 1 response that lists every data point without identifying the key trends will be flagged — the Band 5 descriptor specifically mentions "mechanical" recounting without highlighting.

The Overview Requirement

For Task 1, the AI assesses whether an overview is present. A missing overview caps the Task Achievement score at Band 6. This is one of the most common mistakes IELTS candidates make, and the AI flags it prominently.

What the Band Descriptors Actually Say

Each band level is defined by a set of positive features (what the writing demonstrates) and negative features (what limits the score).

Examples of Band Descriptors

At Band 9, you see phrases like:

"the message can be followed effortlessly"
"minor errors are extremely rare"

At Band 5, you see:

"the range of structures is limited and rather repetitive"
"grammatical errors may be frequent"

How the AI Matches Writing to Descriptors

The AI matches writing against these descriptors:

A response where "complex sentences are attempted, but they tend to be faulty, and the greatest accuracy is achieved on simple sentences" fits the Band 5 descriptor for Grammatical Range & Accuracy
A response where "a variety of complex structures is used with some flexibility and accuracy" fits Band 7

The Key Principle: Full Fit Required

The key principle is that a script must fully fit the positive features of a descriptor to receive that band. Negative features limit the rating.

The AI is calibrated to follow this principle: it does not award Band 7 for Coherence & Cohesion simply because the essay is "reasonably well organised." It checks whether information and ideas are "logically organised" with "a clear progression throughout" and whether "paragraphing is generally used effectively."

These are specific requirements, not vague impressions.

The Revision Cycle

The real value is not in the score itself but in what happens after.

A candidate who receives a Band 5.5 and reads that their Task Response lacks supporting examples can revise the essay, add specific examples, and resubmit. If the revised version scores Band 6 or 6.5, they can see precisely which criterion improved and why.

Where Improvement Actually Happens

This iteration is where IELTS preparation actually works. You know this from your teaching:

The student who writes one essay per week improves more slowly
The student who writes a draft, gets feedback, revises, and writes again improves faster

EssayHero makes that cycle faster by providing structured feedback at any time, not just during your office hours.

Strictness Modes

The strictness modes help calibrate expectations:

Mode	Approach	Best For
Lenient	Gives benefit of the doubt, focuses on strengths	Building confidence early in preparation
Baseline	Applies standard marking criteria	Typical practice and self-assessment
Harsh	Rigorous standards where Band 7+ is genuinely earned	Test readiness and final preparation

Harsh Mode as a Benchmark

A student who consistently scores Band 6.5 on harsh mode is probably well-prepared.

What We Cannot Do

This is the section that matters most.

We Cannot Replicate IELTS Examiner Training

Official IELTS examiners go through rigorous certification by the British Council, IDP, or Cambridge. They are regularly re-certified and their marking is monitored for consistency.

The AI applies the published band descriptors but has not been through this standardisation process. It is applying criteria, not replicating the full examiner experience.

Scores Are Indicative, Not Official

Practice Tool, Not Prediction Engine

If EssayHero gives a Band 6.5 and the candidate receives a 6.0 in their official test, EssayHero is not "wrong" — it is a practice tool, not a prediction engine.

Use the scores to track improvement and identify weaknesses, not as a forecast of your official result.

We Cannot See Task 1 Visuals

For Task 1, the AI assesses the quality of the writing, the structure, and the language. But it cannot see the original chart, graph, or diagram.

It cannot verify whether the candidate has read the data correctly. If a student describes an upward trend that was actually downward, the AI may not catch it. Data accuracy checking relies on what the candidate writes, not on the original source.

We Cannot Assess Beyond Writing

IELTS has four components: Listening, Reading, Writing, and Speaking. EssayHero only covers Writing. An overall IELTS band score depends on all four.

AI Scoring Has Inherent Limitations

Large language models are pattern matchers, not examiners. They handle common essay types well but may be less reliable with unusual topics, unconventional structures, or highly idiomatic writing.

For borderline cases where a human examiner would deliberate, the AI applies the closest matching descriptor without that deliberation.

Full Transparency

The complete band descriptors, the detailed prompt instructions, the feedback rules, and the scoring calibration guidelines are all published on the IELTS methodology page. Everything the AI sees when it assesses an essay is visible there.

The source code is open under AGPL-3.0.

Privacy Commitment

Essays are processed and discarded. They are not stored unless the candidate opts in by creating an account. They are not used for AI training.

Privacy matters, especially when students are sharing their writing.

Try It Yourself

EssayHero is free. No account required. If you want to see what the feedback looks like on an IELTS essay, go to essayhero.app/?exam=ielts-task2, paste a practice essay, and read the output. The feedback names each criterion, explains why the score was given, and suggests specific improvements.

If the feedback helps your students iterate faster between drafts, share it with them. If you think the scoring does not align with your experience, or the feedback misses something important, I would genuinely like to hear about it. Email hello@essayhero.app.

I built this for students who need more feedback than they are currently getting. If it helps your students practise more effectively, I am glad. If not, I understand.

EssayHero is free, has no commercial aims, and is built by a Hong Kong teacher for students worldwide. Questions? Email hello@essayhero.app.

How EssayHero Marks Your Essays

A Hong Kong teacher explains the thinking behind EssayHero's marking system and when to trust it

How EssayHero Marks Business Essays (And What It Can't Do)

A transparent look at how EssayHero assesses university business and management essays, what the criteria actually measure, and where AI falls short.

How EssayHero Marks HKDSE Paper 2 Essays (And Why You Should Know)

A transparent explanation for teachers and tutors of how EssayHero assesses HKDSE English Paper 2 writing, how scoring works, and where AI falls short.

Trust & Methodology

Key Takeaways

EssayHero assesses IELTS Writing Task 1 and Task 2 using the four official criteria, each scored 0-9, with the overall band calculated as the average rounded to the nearest 0.5
The band descriptors used are aligned with the publicly available IELTS band descriptors (Updated May 2023)
Task 1 assesses data description (Task Achievement); Task 2 assesses argumentation (Task Response) — the AI handles both with separate prompt configurations
AI scoring is indicative for practice purposes and is not equivalent to scores from an official IELTS test centre

If You Teach IELTS, You Already Know the Problem

The problem is that feedback is scarce. You cannot mark thirty Task 2 essays every week. Your students cannot wait five days between drafts for your comments.

The Four Criteria

IELTS Writing is assessed across four criteria, each weighted equally at 25% of the total.

Task 2 Criteria

Task Response — Does the essay address the prompt? Is there a clear position? Are ideas relevant, extended, and supported?
Coherence & Cohesion — Is the essay logically organised? Are cohesive devices used effectively? Is paragraphing appropriate?
Lexical Resource — Is the vocabulary range sufficient? Are there less common or idiomatic items? Are word choices accurate?
Grammatical Range & Accuracy — Is there variety in sentence structures? Are complex structures accurate? Are errors frequent or rare?

Task 1 Criteria

For Task 1, the first criterion changes:

Task Achievement — Does the response cover key features of the visual data? Is there a clear overview? Is data reported accurately?

The other three criteria (Coherence & Cohesion, Lexical Resource, Grammatical Range & Accuracy) are shared across both tasks, though the descriptors differ in detail.

How Scores Are Calculated

How the AI Applies the Criteria

Not Keyword-Counting

Adaptive Feedback by Level

The feedback adapts to the candidate's level:

Band 3 response — Full worked examples and before-and-after rewrites. The candidate needs scaffolding, not a critique.
Band 7 response — Genuine praise for what works well, with perhaps one optional suggestion for reaching Band 8. The AI is instructed not to nitpick strong writing.

Task 1 vs Task 2: What Changes

EssayHero uses separate prompt configurations for Task 1 and Task 2. This matters because the two tasks test different skills.

Task 2: Argumentative or Discursive Essays

Task 2 is an argumentative or discursive essay. The AI looks for:

A clear position
Well-developed body paragraphs with supporting evidence
Logical progression
A conclusion that synthesises the argument
Minimum word count: 250

The first criterion is Task Response: does the essay address all parts of the prompt?

Task 1: Data Description

Task 1 is a data description task. The AI looks for:

An overview of the main trends or features (critical for Band 7+ in Task Achievement)
Accurate data reporting
Meaningful comparisons
Appropriate use of Task 1-specific vocabulary (trend language, comparison language, approximation)
Minimum word count: 150

Why the Distinction Matters

The Overview Requirement

What the Band Descriptors Actually Say

Each band level is defined by a set of positive features (what the writing demonstrates) and negative features (what limits the score).

Examples of Band Descriptors

At Band 9, you see phrases like:

"the message can be followed effortlessly"
"minor errors are extremely rare"

At Band 5, you see:

"the range of structures is limited and rather repetitive"
"grammatical errors may be frequent"

How the AI Matches Writing to Descriptors

The AI matches writing against these descriptors:

A response where "complex sentences are attempted, but they tend to be faulty, and the greatest accuracy is achieved on simple sentences" fits the Band 5 descriptor for Grammatical Range & Accuracy
A response where "a variety of complex structures is used with some flexibility and accuracy" fits Band 7

The Key Principle: Full Fit Required

The key principle is that a script must fully fit the positive features of a descriptor to receive that band. Negative features limit the rating.

These are specific requirements, not vague impressions.

The Revision Cycle

The real value is not in the score itself but in what happens after.

Where Improvement Actually Happens

This iteration is where IELTS preparation actually works. You know this from your teaching:

The student who writes one essay per week improves more slowly
The student who writes a draft, gets feedback, revises, and writes again improves faster

EssayHero makes that cycle faster by providing structured feedback at any time, not just during your office hours.

Strictness Modes

The strictness modes help calibrate expectations:

Mode	Approach	Best For
Lenient	Gives benefit of the doubt, focuses on strengths	Building confidence early in preparation
Baseline	Applies standard marking criteria	Typical practice and self-assessment
Harsh	Rigorous standards where Band 7+ is genuinely earned	Test readiness and final preparation

Harsh Mode as a Benchmark

A student who consistently scores Band 6.5 on harsh mode is probably well-prepared.

What We Cannot Do

This is the section that matters most.

We Cannot Replicate IELTS Examiner Training

Official IELTS examiners go through rigorous certification by the British Council, IDP, or Cambridge. They are regularly re-certified and their marking is monitored for consistency.

The AI applies the published band descriptors but has not been through this standardisation process. It is applying criteria, not replicating the full examiner experience.

Scores Are Indicative, Not Official

Practice Tool, Not Prediction Engine

If EssayHero gives a Band 6.5 and the candidate receives a 6.0 in their official test, EssayHero is not "wrong" — it is a practice tool, not a prediction engine.

Use the scores to track improvement and identify weaknesses, not as a forecast of your official result.

We Cannot See Task 1 Visuals

For Task 1, the AI assesses the quality of the writing, the structure, and the language. But it cannot see the original chart, graph, or diagram.

We Cannot Assess Beyond Writing

IELTS has four components: Listening, Reading, Writing, and Speaking. EssayHero only covers Writing. An overall IELTS band score depends on all four.

AI Scoring Has Inherent Limitations

Large language models are pattern matchers, not examiners. They handle common essay types well but may be less reliable with unusual topics, unconventional structures, or highly idiomatic writing.

For borderline cases where a human examiner would deliberate, the AI applies the closest matching descriptor without that deliberation.

Full Transparency

The source code is open under AGPL-3.0.

Privacy Commitment

Essays are processed and discarded. They are not stored unless the candidate opts in by creating an account. They are not used for AI training.

Privacy matters, especially when students are sharing their writing.

Try It Yourself

I built this for students who need more feedback than they are currently getting. If it helps your students practise more effectively, I am glad. If not, I understand.

EssayHero is free, has no commercial aims, and is built by a Hong Kong teacher for students worldwide. Questions? Email hello@essayhero.app.

Key Takeaways

If You Teach IELTS, You Already Know the Problem

The Four Criteria

Task 2 Criteria

Task 1 Criteria

How Scores Are Calculated

How the AI Applies the Criteria

Not Keyword-Counting

Adaptive Feedback by Level

Task 1 vs Task 2: What Changes

Task 2: Argumentative or Discursive Essays

Task 1: Data Description

Why the Distinction Matters

What the Band Descriptors Actually Say

Examples of Band Descriptors

How the AI Matches Writing to Descriptors

The Key Principle: Full Fit Required

The Revision Cycle

Where Improvement Actually Happens

Strictness Modes

What We Cannot Do

We Cannot Replicate IELTS Examiner Training

Scores Are Indicative, Not Official

We Cannot See Task 1 Visuals

We Cannot Assess Beyond Writing

AI Scoring Has Inherent Limitations

Full Transparency

Privacy Commitment

Try It Yourself

Related Articles

How EssayHero Marks Your Essays

How EssayHero Marks Business Essays (And What It Can't Do)

How EssayHero Marks HKDSE Paper 2 Essays (And Why You Should Know)

Key Takeaways

If You Teach IELTS, You Already Know the Problem

The Four Criteria

Task 2 Criteria

Task 1 Criteria

How Scores Are Calculated

How the AI Applies the Criteria

Not Keyword-Counting

Adaptive Feedback by Level

Task 1 vs Task 2: What Changes

Task 2: Argumentative or Discursive Essays

Task 1: Data Description

Why the Distinction Matters

What the Band Descriptors Actually Say

Examples of Band Descriptors

How the AI Matches Writing to Descriptors

The Key Principle: Full Fit Required

The Revision Cycle

Where Improvement Actually Happens

Strictness Modes

What We Cannot Do

We Cannot Replicate IELTS Examiner Training

Scores Are Indicative, Not Official

We Cannot See Task 1 Visuals

We Cannot Assess Beyond Writing

AI Scoring Has Inherent Limitations

Full Transparency

Privacy Commitment

Try It Yourself

Related Articles

How EssayHero Marks Your Essays

How EssayHero Marks Business Essays (And What It Can't Do)

How EssayHero Marks HKDSE Paper 2 Essays (And Why You Should Know)