PrepScholar

Choose Your Test

Sat / act prep online guides and tips, the best toefl writing practice: 300+ topics to study with.

author image

Writing is the last section you’ll complete on the TOEFL. You’re so close to finishing, yet you still have two essays to write before you can celebrate completing the exam. In order to finish the test on a high note, you’ll need to be prepared for this section.

In this guide, we explain the ins and outs of the Writing section and the materials you need to do well. We then go over all the best TOEFL Writing practice resources available , including free and official practice Writing topics. We'll end with final tips to keep in mind in order to ace the TOEFL Writing section.

Overview of TOEFL Writing

The TOEFL Writing section is 50 minutes long (broken into two parts) and contains two tasks: Integrated Writing and Independent Writing. You’ll type both essays on the computer.

The Integrated Writing task requires you to use listening, reading, and writing skills. For this task, you will have three minutes to read a short passage, then you will listen to a short (approximately two-minute long) audio clip of a speaker discussing the same topic the written passage covers. You will have 20 minutes to plan and write a response that references both of these sources. You won’t discuss your own opinion.

For the Independent Writing task, you’ll receive a question on a particular topic or issue. You’ll have 30 minutes to plan and write a response to that topic that explains your opinion on it. You’ll need to give reasons that support your decision.

Each essay will receive a score from 0-5. The sum will then be scaled to a score from 0-30, which is your official Writing score. The Writing section makes up 25% of your total TOEFL score (from 0-120).

What You’ll Need to Be Prepared for the TOEFL Writing Section

As you likely expect, you’ll spend most of your time on the TOEFL Writing section, well, writing. However, you’ll also need to have solid reading and listening skills for the Integrated task. Since the Writing section requires multiple skills, you’ll need multiple study tools in order to be completely prepared. Some of the most important things you’ll need to prepare for TOEFL Writing include:

  • Complete practice Writing sections
  • Individual practice questions or TOEFL Writing topics
  • Opportunities to practice your writing skills
  • Opportunities to practice your listening skills

In the next section, we'll go over the best TOEFL Writing practice tests and questions.

The Best TOEFL Writing Practice Materials

This section contains links to the top practice materials to use while preparing for TOEFL Writing. What makes a practice material the best?

  • First, the practice questions must be similar in content and format to the real TOEFL Writing section to give you the best preparation for the real exam.
  • Second, it’s a major plus if the practice questions come with answer explanations that help you understand how to answer an essay prompt well.
  • Finally, prep materials that include useful tips and strategies for answering Writing questions are useful because they give you advice on how to raise your score on this section.

Official Prep Materials

Official resources are the best to use since you can be confident they’ll be very similar to the real TOEFL Writing section. The topics will be much more realistic in format and content.

Below are all the official TOEFL Writing practice materials available, both free and paid resources. ETS doesn’t provide just Writing questions, so each of these resources also have practice resources for the other sections of the TOEFL. Be sure to include at least some of these materials in your studying. The next section has more tips on how to make the most of official practice resources.

body_laptop-Apr-01-2024-08-20-53-1764-PM

TOEFL iBT Sampler

The TOEFL iBT Sampler is a program you can download with official practice questions, and it’s a great free and official resource to use. In addition to other TOEFL sections, it includes a complete TOEFL iBT Writing practice section (two tasks). Sample answers for both tasks are included so you can get an idea of what a good essay looks like. Unfortunately, the Sampler only works with Windows; you can’t download it with a Mac.

TOEFL iBT Sample Questions

This PDF is another free and official resource. In addition to other question types, it includes two Writing tasks: one TOEFL Integrated Writing practice question and one Independent Writing practice question. Each sample TOEFL Writing topic is followed by a sample essay as well as an in-depth score explanation, which is a great tool for studiers.

TOEFL iBT Quick Prep

The Quick Prep contains four different volumes, each of which contains one or two Writing prompts. The first volume is the best for TOEFL iBT Writing practice, since it contains two tasks (the others each only contain one) and also has an in-depth explanation of what your essays should include. The other three volumes only contain the essay rubrics without any advice on how to answer the specific essay prompt given.

TOEFL Practice Online (TPO) Tests

TPO tests are retired TOEFL exams now offered for test prep. They give the closest experience to the real TOEFL, and, because of that, they aren’t cheap. You’ll have to pay $45 for each complete TOEFL you buy (you can’t just buy individual TOEFL iBT Writing practice sections).

Your exam will be automatically graded after you finish it, although I was not particularly impressed with how the Writing section was graded when I took it. For the actual TOEFL, two human graders and a computer program review your essays and assign grades to each one. For this exam, a computer grades your Writing section within less than a minute of you completing the exam, and there is no explanation of how that grade was determined.

This is a useful resource, but if you don’t want to spend that much money on a practice test, it’s completely possible to do well just using the above practice resources.

Official TOEFL Prep Books

There are several official TOEFL prep books for sale by ETS. The Official Guide to the TOEFL Test , in addition to explaining the types of questions on the test, contains numerous practice questions and three full-length exams.  This is also the only official prep book that includes sample essays of varying scores along with scoring explanations, which can be a big help if you’re trying to guess what score your essays would get.

There’s also the Official TOEFL iBT Tests Volumes 1 and 2. Each of these books contains five unique practice tests, available on paper and the computer. However, no sample responses are given for Writing questions, which make them a less useful resource compared to The Official Guide to the TOEFL Test.

body-pencil-notebook-writing

Unofficial Prep Materials

You have to be more careful when using unofficial prep resources since not all of them are reliable. Some are high-quality and very similar to the TOEFL, others are not. For the Writing section in particular, because it’s so easy to make up essay prompts, there are many sites that claim to offer TOEFL Writing practice when their practice questions are actually low-quality. Below are some of the top unofficial prep resources out there. All of them (except the prep books, if you buy them) are free.

English Club

This site contains one TOEFL Integrated Writing practice task and five Independent Writing practice tasks. All six TOEFL Writing topics are similar to the real test, and the Integrated task as well as one of the Independent tasks have sample responses you can compare your answers to. At the bottom of the web page is a checklist of things your essays should include to help give you a guideline of what you should be aiming for when you write.

Magoosh offers one TOEFL Integrated Writing practice task. That’s not a lot of practice, but it is a high-quality question that includes a sample response. Magoosh also offers three complete TOEFLs, so you can use those Writing questions as well.

Good Luck TOEFL

This site has a huge number (several hundred) of Independent Writing tasks, separated into five different categories depending on question type. Some of the questions are more simplistic than you’ll see on the real TOEFL , and there’s no scoring information or sample responses, but there are a good source if you need more TOEFL Writing topics to write about.

Beat the Test

This site contains 155 Independent Writing tasks. Like Good Luck TOEFL, some of these TOEFL Writing topics are easier than you’ll find on the TOEFL, and there are no sample responses included, but they do give you the opportunity to practice writing.

Unofficial Prep Books

Prep books, even unofficial ones, often are a great resource for practice questions. Most books include sample Writing questions, along with scoring explanations , and then contain one or more complete practice TOEFLs at the end of the book. You can learn all about the best TOEFL prep books by reading our guide .

body_bookstack-1

Other TOEFL Writing Practice Materials

There are other ways to practice besides just answering sample Writing questions. TOEFL Writing is designed to measure how strong your English writing skills are, so, any practice you get writing English will help you with this section, even if you’re not directly answering practice exam questions. There are many ways to get writing practice; several of them are described below.

Duolingo is a popular free language-learning site. Users answer different types of questions, including writing questions. You can’t choose to only answer writing questions, so this isn’t the best resource for targeted writing practice, bu t it’s a good way to strengthen your overall English skills.

For advanced English learners, many of Duolingo’s beginning problem sets will likely be too easy, but you can take a quiz to figure out where in the program you should start.

Free Writing

Finally, you can also practice writing about topics that have nothing to do with the TOEFL. As long as you’re writing in English, you’re getting good practice. Writing about something that interests you can also encourage you to write more. Some ideas for free writing include:

  • Keeping a journal
  • Getting an English-speaking pen pal
  • Starting a blog about a topic that interests you

How to Get the Most Out of Your TOEFL Writing Practice

Now that you’re an expert on the best practice resources for TOEFL Writing, the next step is to put those materials to use in the most effective way in order to see results on test day. Follow these four tips in order to get the most out of your practice.

Practice Writing in English Regularly

The most important thing you can do to practice for the Writing section of the TOEFL is to practice writing English regularly. If you can practice every day, that would be ideal, but at the very least you should aim to practice writing 1,000 words in English a week.

Remember, this writing practice doesn’t only have to consist of answering TOEFL Speaking questions; any free writing, even just jotting down what you did that day in your diary, counts as writing practice.

body_typing

Make Use of Official Materials

As mentioned above, official TOEFL resources have the best practice questions out there, so you want to make the most of them. Spread these questions throughout your TOEFL studying; don’t use them all up at the beginning or save them all for the end. You want to be regularly seeing these questions as you prepare.

Also, when you answer official practice questions, make sure you set enough time aside to devote your full attention to them. Practice them in a quiet room with no distractions, and carefully compare your responses to the sample responses. These aren’t the questions to practice when you have a few minutes to spare and need some quick practice while scrolling through your phone.

Time Yourself When Writing Practice Essays

When you are writing practice essays, you should also time yourself. Give yourself 20 minutes to plan and write each Integrated Writing task and 30 minutes for each Independent Writing task.

Timing yourself when you write will help you be better prepared for test day because you’ll have practice planning and writing essays within a limited time frame. When you first begin writing practice essays, it can be easy to spend too much time preparing and run out of time before you finish writing. Taking timed practice essays will help you avoid this. You should also count how many words each of your essays contain after you’ve finished writing them. Integrated tasks should be 150-225 words and Independent tasks should be at least 300 words.

Review Your Practice Essays

After you write each TOEFL practice essay, you should also review it and think about how well it answered the question. This is easier to do if the practice question comes with sample answers that you can compare your answer to, but you should do this step for all practice essays you write, even if they don’t come with any answer explanation. You can also assign your essays a score or have a tutor or friend who’s also studying for the TOEFL score your essay.

It’s tempting to take a break from TOEFL studying as soon as you’ve finished your essays, but it’s important to do this step because it will get you thinking about what great essays look like and how yours can be improved. The ETS provides the rubric it uses to grade TOEFL writing tasks which you can use to evaluate your essays.

Conclusion: Getting the Most Out of Your TOEFL Writing Practice

In order to write two awesome essays for the TOEFL Writing section, you’ll likely have to put in some practice. Once you know what to expect from this section and how you’ll be graded, use a variety of official and unofficial practice resources during your studying.

As you’re preparing for the Writing section, you should also practice writing in English regularly, use official resources wisely, time yourself when writing practice essays, and review your essays after you write them.

What's Next?

Looking for more information on the TOEFL Writing section? Learn all the tips you need to know in order to ace TOEFL Writing !

What score should you be aiming to get on the TOEFL? Learn what a good TOEFL score is based on the schools you're interested in attending.

Looking for a great TOEFL prep book? A good prep book can be the most important study tool you use, and we have information on all the best TOEFL prep books you should consider .

Need to boost your TOEFL score?   We have the industry's leading TOEFL prep program. Built by Harvard grads and SAT full scorers, our program offers individual review, interactive lessons, and realistic online practice, at an affordable price. It's the fastest way to get your target TOEFL score.   And the best part? You can try it out for 5 days absolutely free of charge!

Christine graduated from Michigan State University with degrees in Environmental Biology and Geography and received her Master's from Duke University. In high school she scored in the 99th percentile on the SAT and was named a National Merit Finalist. She has taught English and biology in several countries.

Ask a Question Below

Have any questions about this article or other topics? Ask below and we'll reply!

Improve With Our Famous Guides

  • For All Students

The 5 Strategies You Must Be Using to Improve 160+ SAT Points

How to Get a Perfect 1600, by a Perfect Scorer

Series: How to Get 800 on Each SAT Section:

Score 800 on SAT Math

Score 800 on SAT Reading

Score 800 on SAT Writing

Series: How to Get to 600 on Each SAT Section:

Score 600 on SAT Math

Score 600 on SAT Reading

Score 600 on SAT Writing

Free Complete Official SAT Practice Tests

What SAT Target Score Should You Be Aiming For?

15 Strategies to Improve Your SAT Essay

The 5 Strategies You Must Be Using to Improve 4+ ACT Points

How to Get a Perfect 36 ACT, by a Perfect Scorer

Series: How to Get 36 on Each ACT Section:

36 on ACT English

36 on ACT Math

36 on ACT Reading

36 on ACT Science

Series: How to Get to 24 on Each ACT Section:

24 on ACT English

24 on ACT Math

24 on ACT Reading

24 on ACT Science

What ACT target score should you be aiming for?

ACT Vocabulary You Must Know

ACT Writing: 15 Tips to Raise Your Essay Score

How to Get Into Harvard and the Ivy League

How to Get a Perfect 4.0 GPA

How to Write an Amazing College Essay

What Exactly Are Colleges Looking For?

Is the ACT easier than the SAT? A Comprehensive Guide

Should you retake your SAT or ACT?

When should you take the SAT or ACT?

Stay Informed

Follow us on Facebook (icon)

Get the latest articles and test prep tips!

Looking for Graduate School Test Prep?

Check out our top-rated graduate blogs here:

GRE Online Prep Blog

GMAT Online Prep Blog

TOEFL Online Prep Blog

Holly R. "I am absolutely overjoyed and cannot thank you enough for helping me!”

Shopping Cart

No products in the cart.

Ten Awesome Tips for the Writing Section of the TOEFL® Test

' src=

Ten Awesome Tips for the Writing Section of the TOEFL Test

We asked our team of TOEFL experts for their best TOEFL Writing tips. Here they are. 

We know it can feel impossible to write two complete essays in 30 minutes!

But today, we have some awesome TOEFL writing tips that will help you feel cool and confident on test day.

The first five tips are for the TOEFL integrated writing question, task number one. The last five tips are for the TOEFL Writing for an Academic Discussion question, task number two.

Here's a quick summary of our top 10 TOEFL Writing Tips

  • Tip #1 – Pay careful attention to the reading passage because the listening passage will argue against each point made in the reading.
  • Tip #2 – Practice taking short, but precise notes during the listening passage. You will only get to listen to it once.
  • Tip #3 – Paraphrase the passages. One of the grading criteria on the TOEFL is vocabulary, and you will earn zero points if you simply quote the material.
  • Tip #4 – Use lots of transitional words and phrases. They will help you develop your essay and connect your ideas.
  • Tip #5 – Manage your time wisely. You only have 20 minutes to construct an essay of at least 250 words.
  • Tip #6 – Be sure to add to the discussion. Do not repeat points and ideas from the discussion in your response.
  • Tip #7 – Stick to a few simple verb tenses.
  • Tip #8 – Try to include multiple sentences with more than one clause.
  • Tip #9 – Plan to edit your essay for at least one minute at the end. There is no spellcheck on the TOEFL.
  • Tip #10 – Practice writing at home and ask a teacher or trained TOEFL expert to give you a grade and point out your weakness.

TASK 1 (The Integrated Writing Task)

Tip #1: read, then listen.

Task one provides you with two passages, a reading passage, and a listening passage. The reading passage will appear first. You will have three minutes to read the passage and take notes. After that, the reading passage will disappear, and then the listening passage will play for two or three minutes. Then you have 20 minutes to write your essay.

Here is a breakdown of task one:

toefl writing essay

For writing task #2, you have ten minutes to read the discussion and then write a response of 100 words or more.

In the example above, the professor poses a question about whether or not globalization has a positive or negative effect on society. Bryan believes globalization is positive because it brings people and cultures closer together, while Alex thinks globalization leads to the exploitation of developing countries. When it is time to write, be sure to ADD to the discussion. Do not just repeat the same points made by the other students.

Look at the sample response below. The points highlighted are additional ideas that contribute to the discussion.

In my opinion, globalization is a positive force in the world today.  My personal belief is that when nations interact together, it leads to less suffering. For example, people who suffer from natural catastrophes can import goods and services to help mitigate their losses and recover quickly. Citizens of nations who may be going through war or famine can flee to neighboring countries for shelter . I understand Alex’s concern over cultural homogenization, but I believe she misses another critical point. Much of the food found in grocery stores around the world is imported. Our very diet is due to the international trade partnerships and alliances forged over the last half of the 20th century. International trade and travel are not possible in a world without globalization, which is why I believe it is crucial to maintain our current global policies for a safer world with a higher standard of living .

On test day, it is fine to agree or acknowledge the points made in the discussion; just be sure that the majority of your response to the Writing for an Academic Discussion question further develops the online conversation.

TIP #7: Stick to a few simple verb tenses

Test-takers tend to think they have to use complicated grammar in the TOEFL Writing section to earn a high test score.

This isn’t true.

In fact, for the new Writing for an Academic Discussion task, you only need three verb tenses:

  • Simple present
  • Simple past
  • Present perfect

I have highlighted all of the verb tenses used in my response to the question about globalization, and almost all of them are in the simple present.

In my opinion, globalization is a positive force in the world today.  My personal belief is that when nations interact together, it leads to less suffering. For example, people who suffer from natural catastrophes can import goods and services to help mitigate their losses and recover quickly. Citizens of nations who may be going through war or famine can flee to neighboring countries for shelter. I understand Alex’s concern over cultural homogenization, but I believe she misses another critical point. Much of the food found in grocery stores around the world is imported. Our very diet is due to the international trade partnerships and alliances forged over the last half of the 20th century. International trade and travel are not possible in a world without globalization, which is why I believe it is crucial to maintain our current global policies for a safer world with a higher standard of living.

But hold on a second.

Your grammar in the TOEFL Writing section can’t be too simple, and there is one more tip to keep in mind.

TIP #8: Try to include multiple sentences with more than one clause

According to Michael Goodine at Test Resources , the TOEFL Writing e-rater likes writing samples that include sentences with subordinating and/or coordinating conjunctions.

A subordinating conjunction is a word or phrase that links a dependent clause to an independent clause, and a coordinating conjunction joins two elements of equal importance.

Check out the table below for a handy list of both.

Subordonating Conjuctions

  • by the time
  • even though
  • in order that
  • in the event that
  • provided that
  • whether or not

Coordonating Conjuctions

  • nevertheless

It is not required to include these conjunctions in your response, but it will help you craft sentences with two or more clauses, which will have a positive impact on your score.

I have highlighted the sentences from the globalization example response that include multiple clauses.

In my opinion, globalization is a positive force in the world today.  My personal belief is that when nations interact together, it leads to less suffering. For example, people who suffer from natural catastrophes can import goods and services to help mitigate their losses and recover quickly . Citizens of nations who may be going through war or famine can flee to neighboring countries for shelter. I understand Alex’s concern over cultural homogenization, but I believe she misses another critical point . Much of the food found in grocery stores around the world is imported. Our very diet is due to the international trade partnerships and alliances forged over the last half of the 20th century. International trade and travel are not possible in a world without globalization, which is why I believe it is crucial to maintain our current global policies for a safer world with a higher standard of living .

TIP #9: Take the Time to Edit!

One of the best TOEFL Writing tips I could possibly share here is to leave yourself time to edit. Plan on finishing your essay around the 9-minute mark so you have at least a minute to edit your essay.

It’s annoying, I know.

Especially since, at this point, you will have taken the test for almost two hours.

Remember, there is no spellcheck on the exam. That’s right, you will be responsible for finding spelling and grammar mistakes yourself. Also, be sure to pay special attention to spacing and punctuation.

There is no automatic spellcheck on the TOEFL exam. You will be responsible for typing and correcting spelling and grammatical mistakes.

So, always give yourself a minute or more to proofread your essays and practice writing with spellcheck turned off !

Here are some common mistakes that are easy to spot:

  • Spelling errors
  • Capitalization and spacing mistakes
  • Problematic introduction sentence
  • Personal areas of weakness (article usage, subject-verb agreement, etc.)
  • Proper verb tense

Read your response aloud quietly. Listen to hear if your sentences sound correct. If part of your response sounds strange, you may have some awkward grammar that needs to be fixed.

TIP #10: Get Feedback

While preparing for the TOEFL writing section, it is crucial for you to get your hands on some practice questions.

And to get valuable feedback from a qualified TOEFL instructor who knows exactly what you need to write to earn a high score in the writing section. Relying on yourself or a teacher you know will certainly help, but you will not be able to reach your full potential without advice from an experienced instructor.

Therefore, the last tip while preparing for the TOEFL is to get help from a trained professional at TST Prep.

If you haven’t already, be sure to download TST Prep’s Complete Test #13  where you can see these sample essays and more.

Was there anything we missed?

If you have more TOEFL Writing tips, please don’t hesitate to let us know in the comments section below.

And, as always, we are here to help if you have any questions or concerns, [email protected]

Other articles

Topics and Sample Essays for the Writing Section of the TOEFL Test

100 Reading Questions for the TOEFL Test

Check other articles on TOEFL

69 Comments

Parisa

Thanks for this invaluable article🙏🏻.

Lin

Excellent samples

Hossein

Hi Josh. Thank you a lot. This post was very useful for me.

Gopal

Hi Josh, The sample for Task 1 provided on the ETS website is a bit different from the one you wrote. It solely focuses on the lecture while only saying, “That refutes that point from reading.” Could you have a look and recommend me some tips?

Josh

Hi, thanks for your question. To add length to your essay, we suggest including a bit more from the reading section. It can also help you in making sure each point from the listening matches a point referenced in the reading.

Iman

Hi Josh, First, thanks a million for this intelligent and helpful content; you are a great tutor. I found that the time management section has a small mistake with summing the minutes suggested for each step. 2 minutes outline + 3 minutes introduction + 11 minutes body + 2 minutes conclusion + 3 minutes editing = 21 minutes!

Hi Inman, thanks for pointing that out. We will have a look. I would maybe say 2 minutes for an introduction would be better. 🙂

Great! Thanks again.

Mohammad tareq

Hey Josh Thanks a lot for your time and effort. You have got a heart of gold. Lots of love from Afghanistan♥️🇦🇫

Ruth

Point 7 isn’t true (but perhaps you needed just one more to make 10 tips instead of 9?) I took both sides all the time when I took the TOEFL, and got a 119 (and the reason I missed one point was the speaking, where I forgot to mention two specific examples). Cheers!

Hi Ruth and thank you for taking the time to leave a comment. Congrats on your score, that’s incredible. In regards to your opinion on tip #7 ” HAVE A CLEAR OPINION AND STICK TO IT”, it is not put there arbitrarily to inflate the number of tips. We have found that students who fail to have a concrete opinion tend to have difficulty developing the topic and writing with clarity. This is not always the case and you can write a perfect essay by arguing both sides of the argument, generally speaking, however, it is easier to write a clear and well-developed essay when taking one clear side of an issue.

Rob

Hello Josh. What do “int” and “ind” mean? You used these in the TOEFL Writing Transitional Phrases table without explaining them. Thanks.

Hi Rob, and my apologies. I should clarify that. The “int” stands for “integrated writing (task 1),” and the “ind” stands for independent writing (task #2).

Thomas

Hello, I’m preparing for TOEFL, and I found a weird subject for the independent subject. “The average global life expectancy went from 29 years old in 1800 to 71 years old in 2015. According to you, what is the main cause for such a change?”. Do you know if it is possible to have a question like this? Thank you for your help.

Hi there and thank you for your question. This does indeed sound like a reasonable topic since anyone could have an opinion on it. It is a bit strange but definitely possible. (I’ve seen weirder). This would also be difficult because you would have to use past unreal unconditional forms and sentences like “It could have been” “might have been because”, etc. Hope that helps!

Rogerio

Hello Josh! How are you? Please where can I find the TOEFL Writing 24+ Guide to download?

Here is the link. Hope that helps you find what you are looking for. https://drive.google.com/file/d/1j-fGt9rvjklnU4v-ENUA2cE6d1fp2mt1/view?usp=sharing

Nikitha Crasta

Hi Josh, I’m taking my test in a few minutes. Thank you so much for the free material on the website. It has definitely helped me make my own template!

Thanks for leaving a comment Nikitha. How did the test go? I’m always interested to hear if anything was surprising/interesting/etc.

The test was alright! I haven’t received my scores yet, except for Reading (26) and Listening (29). The questions asked were congruent to the examination pattern. I couldn’t really relate to the personal opinion question asked in the speaking section. I found the reading section quite tough compared to everything else. In the end, it wasn’t so bad considering I started prep 24 hours before the exam. If I had put in the effort, I would’ve been able to do well.

Oya

Hi Josh. I’m preparing to write the TOEFL exam as a deaf test taker. I have practiced a lot, but the problem is the reading section. I can’t figure out how I answer to the reading section questions.

Have you considered seeing if perhaps you can be allowed extra time? I know in certain situations it is possible, but I’m not sure it would pertain to the Reading section. It might be worth you checking with ETS. Outside of that, have you thought about maybe taking the group reading course? I think it might really help you.

James Jhan

Thanks a lot for sharing the tips and the analysis, I really appreciate it!

Erfan

Hello Josh, How can I have this post in PDF format? Is it possible to send me the PDF file of this post ??

Hi Erfan and thank you for asking. Unfortunately, we do not have this post in PDF format but we have our TOEFL Writing Guide which includes tips like this, along with a lot of other useful information. I think you will find this document even more helpful. Here is the link – https://drive.google.com/file/d/12JWgyMLAAUM2z4GDDpTDbMXCXelkYS0h/view?usp=sharing

Submit a Comment

Your email address will not be published. Required fields are marked *

Cancel reply Submit Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed .

Related Articles

toefl writing essay

TOEFL iBT® Paper Edition: Your Top 10 Questions Answered by ETS®

Registration has already begun for the brand new TOEFL iBT Paper Edition test, with the first tests scheduled to be administered in Colombia, Mexico, the United States, and India in December 2021.

Wait, wait, wait… what?

A TOEFL iBT Paper Edition? I thought you had to take the TOEFL iBT on a computer.

toefl writing essay

Complete Practice Test for the TOEFL® Test

What you need is a free and complete TOEFL practice test with an answer key that explains the answer to each question. Not only that, but this test should include speaking responses and essay samples so you know how to speak and write on test day.

This is the free practice test you’ve been looking for.

And not only do you get access to a free TOEFL practice test, but we will also break down each section of the test and provide five study tips to help you improve your TOEFL score.

toefl writing essay

Duolingo English Test® vs. the TOEFL iBT®

Your dream is to study abroad, but the entire process is a massive headache. You have to fill out applications, research visa requirements, and prepare for exams.

One common requirement is an English language proficiency exam. You have probably heard about the TOEFL iBT® before, but there is a chance you might be able to take a new test: the Duolingo English Test®

toefl writing essay

Ten Awesome Tips for the Reading Section of the TOEFL® Test

The TOEFL Reading contains 10 different types of questions, each with their own point value and strategy. Some questions are straightforward, while others have answers designed to trick you. This short article will give you the skills to identify the question type and find the answer by systematically eliminating the wrong choices. It’s the perfect guide to help you study smarter and read better.

toefl writing essay

Take the TOEFL® Test at Home During the Coronavirus (updated August 2021)

How can you take the TOEFL IBT with all the test centers closed?

The Special Home Edition of the TOEFL iBT.

Due to the spread of COVID-19 and the subsequent closing of many in-person testing facilities, ETS has created a home version of the TOEFL iBT. This article will take an in-depth look at the special home edition of the TOEFL iBT by delving deeper into the registration process, technical requirements, and overall differences between this test and the original TOEFL iBT.

TST Prep

  • Privacy Overview
  • Strictly Necessary Cookies

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.

TOEFL sample essays: Tips and techniques for a high score

TOEFL sample essays: Tips and techniques for a high score

The TOEFL exam is a standardised test that measures a non-native English speaker's ability to use and understand English at the university level. One of the components of the TOEFL exam is the writing section, which requires test-takers to write an essay in response to a prompt.

To score well on the writing section, it is essential to understand the requirements of the exam and the characteristics of a high-scoring essay.

In this article, we will explore tips and techniques for writing TOEFL sample essays that will help test-takers achieve a high score.

Table of Contents

Toefl writing section, integrated writing task, independent essay sample, 1. understand the task requirements, 2. plan your essay, 3. use effective paragraph structure, 4. use specific examples, 5. use varied sentence structure, 6. use academic vocabulary, 7. use correct grammar and punctuation.

  • TOEFL essay #1
  • TOEFL essay #2
  • TOEFL essay #3

Frequently asked questions

The TOEFL writing section is a crucial component of the TOEFL (Test of English as a Foreign Language), which measures the English language proficiency of non-native speakers of English who wish to study or work in English-speaking countries. The writing section evaluates a test-taker's ability to write effectively and clearly in academic English.

The TOEFL writing section comprises two tasks: the Independent Writing Task and the Integrated Writing Task. Both tasks assess different aspects of the test-taker's writing abilities, including their ability to express ideas clearly, organise thoughts, use appropriate vocabulary and grammar, and demonstrate critical thinking and analytical skills.

Also read: TOEFL Writing Topics

The Integrated Writing Task assesses your ability to comprehend and synthesise information from both a reading passage and a lecture. This section evaluates your ability to use your reading and listening comprehension skills to comprehend the given content and then integrate and organize the information effectively in your writing.

The Integrated Writing Task is composed of two parts: a reading passage and a recorded lecture. The reading passage presents a topic, and the lecture provides additional information on that topic. Both the reading passage and the lecture are related and complement each other. Test-takers must read and listen carefully, as they will be asked to summarise the information presented in both in a written response.

The reading passage is approximately 230-300 words long and usually includes a definition of the topic, background information, and supporting examples. The lecture is approximately 3-5 minutes long and adds additional information and examples to the topic presented in the reading passage.

Independent Writing Task

The Independent Writing Task assesses your ability to write in English on a given topic. The task requires you to express your thoughts, ideas, and opinions coherently and effectively in written form. The task aims to evaluate your ability to organise your thoughts, use appropriate vocabulary and grammar, and develop your ideas with sufficient details and examples.

The Independent Writing Task is a timed exercise, and you will be given 30 minutes to complete it. You will be presented with a prompt or a question, which you will be required to address in your response. The prompt may ask you to express your opinion on a particular issue or to describe a situation, problem or challenge and propose a solution. It is important to read the prompt carefully and understand what it requires you to do before starting to write.

Also read: TOEFL Exam Pattern

Prompt: Some people believe that it is better to live in a small town, while others think that life in a big city is more preferable. Discuss the advantages and disadvantages of each perspective and give your own opinion.

Living in a small town versus a big city has been a topic of debate for a long time. Some people prefer the peace and quiet of small towns, while others thrive in the hustle and bustle of big cities. In this essay, I will discuss the advantages and disadvantages of each perspective and provide my own opinion.

Living in a small town has its advantages. First of all, it is generally quieter and more peaceful than living in a big city. There is less noise pollution, less traffic, and fewer crowds. Additionally, small towns are often safer than big cities, with lower crime rates and a stronger sense of community. People in small towns tend to know each other better and are more likely to look out for one another.

However, small towns also have their disadvantages. They can be isolated and lacking in opportunities, particularly when it comes to education and employment. Small towns may not have access to the same resources as big cities, which can make it harder to pursue certain careers or receive a top-notch education. Additionally, small towns can sometimes feel claustrophobic, with everyone knowing everyone else's business and limited options for socialising or entertainment.

Living in a big city, on the other hand, has its own set of advantages. Big cities are often cultural hubs, with access to museums, theatres, concerts, and other forms of entertainment. They also offer more diverse dining options and a wider range of shopping opportunities. In terms of education and employment, big cities often have more options available, with top-notch universities and a greater number of job openings.

However, big cities also have their disadvantages. They can be noisy, polluted, and stressful, with crowds and traffic adding to the chaos. Crime rates are often higher in big cities, and there is often a lack of community and a sense of isolation. Additionally, the cost of living in a big city is often much higher than in a small town, which can be a significant barrier for many people.

In my opinion, both small towns and big cities have their advantages and disadvantages, and which one is preferable depends on individual preferences and priorities. While I appreciate the peace and quiet of small towns, I also value the opportunities and diversity of big cities. Ultimately, the choice between living in a small town versus a big city comes down to personal preference and what one prioritises in life.

Also read: TOEFL study material and resources

Tips for writing high-scoring TOEFL essay

Writing a high-scoring TOEFL essay requires effective preparation, practice, and the use of a clear writing structure. Here are some tips to help you write a high-scoring TOEFL essay:

The first step in writing a high-scoring TOEFL essay is to understand the task requirements. Make sure you read the prompt carefully and understand the topic you are being asked to write about. Additionally, make sure you understand the type of essay you need to write. The TOEFL independent writing task requires you to write an essay expressing your opinion or providing your perspective on a given topic.

Once you understand the prompt and the requirements of the task, take a few minutes to plan your essay. Jot down some notes, create an outline, or even brainstorm on paper to organise your thoughts. Make sure you have a clear idea of the main points you want to make and the evidence you will use to support them. Organising your thoughts and creating an outline will help you stay on track and make your essay more coherent.

Paragraphs are the building blocks of your essay. Each paragraph should have a clear topic sentence that relates to the main idea of the essay. Use supporting evidence and examples to back up your main point in each paragraph. Additionally, make sure each paragraph flows smoothly into the next. Using transition words and phrases can help link ideas and make your essay more cohesive.

One of the keys to writing a high-scoring TOEFL essay is to use specific examples to support your ideas. Use real-life examples, statistics, and facts to back up your argument. This will demonstrate your knowledge of the topic and show that you can think critically. Additionally, using specific examples will make your essay more interesting to read and help you stand out from other test-takers.

Varying your sentence structure can make your essay more interesting and engaging to read. Use short and long sentences, questions, and statements to add variety to your writing. This will help you avoid repetition and make your writing more engaging. However, make sure your sentences are grammatically correct and make sense in the context of your essay.

Using academic vocabulary can make your essay sound more professional and knowledgeable. Use vocabulary that is appropriate for the academic level of the test. Avoid slang, colloquialisms, and overly complex language. Additionally, make sure you use words correctly and avoid using words that you do not fully understand.

Using correct grammar and punctuation is essential to getting a high score on the TOEFL writing section. Make sure you use correct verb tenses, subject-verb agreement, and sentence structure. Use appropriate punctuation to ensure your writing is clear and easy to read. If you are unsure about a grammar or punctuation rule, look it up or ask a tutor for clarification.

Also Read TOEFL Exam Syllabus

TOEFL writing samples

TOEFL writing samples provide valuable insights into the structure and content expected in the writing section of the exam. These samples showcase both the integrated and independent writing tasks, allowing test-takers to familiarise themselves with the types of prompts and the expected responses. 

Below we have provided three TOEFL writing samples with answers. By analysing and studying these TOEFL sample essays, students can understand the effective use of transitions, supporting evidence, and coherent arguments.

TOEFL essay sample #1

The importance of learning a second language

Learning a second language has become increasingly important in today's globalised world. It offers numerous benefits such as improved cognitive abilities, enhanced cultural understanding, and better job prospects. In this essay, we will explore the advantages of learning a second language and how it can positively impact individuals and society.

Learning a second language has been shown to improve cognitive abilities. It requires mental effort and helps develop critical thinking, problem-solving, and multitasking skills. Research has shown that bilingual individuals exhibit higher levels of creativity and have improved memory and attention spans.

Furthermore, learning a second language enhances cultural understanding. Language is deeply intertwined with culture, and by learning a second language, individuals gain insights into different cultures, traditions, and perspectives. This fosters empathy, tolerance, and a broader worldview, enabling individuals to engage and connect with people from diverse backgrounds.

In addition, knowing a second language opens up better job prospects. In today's global job market, multilingualism is highly valued. It allows individuals to communicate effectively with clients, colleagues, and partners from different countries. Employers often seek candidates with language skills, as it demonstrate adaptability and a willingness to engage in cross-cultural interactions. Thus, learning a second language offers advantages in terms of career opportunities and professional growth.

TOEFL essay sample #2

The impact of technology on education

Technology has revolutionised the field of education, transforming the way students learn and teachers teach. In this essay, we will explore the positive impact of technology on education, including increased access to resources, improved engagement, and enhanced learning outcomes.

One significant impact of technology in education is the increased access to resources. Online platforms, digital libraries, and educational apps provide students with vast amounts of educational materials, regardless of their geographical location. Students can access textbooks, research materials, and interactive learning tools, empowering them to explore and learn at their own pace.

Moreover, technology has improved student engagement in the learning process. Multimedia elements such as videos, simulations, and gamification techniques capture students' attention and promote active participation. This not only enhances their understanding of the subject matter but also fosters creativity, critical thinking, and problem-solving skills.

Additionally, technology facilitates personalised learning experiences. Adaptive learning platforms analyze student performance and provide personalised feedback, allowing for targeted instruction and remediation. Online collaboration tools enable students to work together, fostering teamwork and communication skills. These personalised and collaborative learning experiences contribute to enhanced learning outcomes and student success.

TOEFL essay sample #3

The benefits of exercise for physical and mental health

Regular exercise is essential for maintaining good physical and mental health. It offers numerous benefits, including improved cardiovascular health, increased strength and flexibility, stress reduction, and enhanced mood. In this essay, we will explore the importance of exercise for overall well-being.

Engaging in regular exercise improves physical health. It helps prevent chronic diseases such as obesity, heart disease, and diabetes. Exercise strengthens the cardiovascular system, boosts metabolism, and enhances overall physical fitness. Activities like running, swimming, or weightlifting promote muscle strength, endurance, and flexibility.

Exercise also plays a vital role in stress reduction and mental well-being. Physical activity stimulates the release of endorphins, which are natural mood boosters. These endorphins promote feelings of happiness and well-being, helping to reduce stress, anxiety, and depression. Regular exercise can also improve sleep quality, increase energy levels, and enhance cognitive function.

Furthermore, exercise provides an opportunity for individuals to unwind, clear their minds, and focus on their physical well-being. It can serve as a form of self-care, allowing individuals to take time for themselves and prioritize their health. Incorporating exercise into one's routine can have a positive impact on their overall mental and emotional state.

In conclusion, regular exercise offers a multitude of benefits for both physical and mental health. It improves cardiovascular health, increases strength and flexibility, reduces stress, and enhances mood. By incorporating exercise into our daily lives, we can experience improved well-being and lead healthier, more fulfilling lives.

It is important to understand the requirements of the exam and the characteristics of a high-scoring essay to achieve a high score on the TOEFL writing section. By organizing your thoughts, using specific examples, using clear and concise language, focusing on coherence and cohesion, and proofreading and editing carefully, you can improve your essay writing skills and increase your chances of success on the exam.

What is the format of the TOEFL essay?

The TOEFL essay is a 30-minute written task where you are asked to express and support your opinion on a specific topic. The essay is typically four to five paragraphs long and should include an introduction, body paragraphs, and a conclusion.

Where can I find sample TOEFL essays to practice with?

You can find sample TOEFL essays in TOEFL preparation books or online. The official TOEFL website also provides sample essays and scoring guides.

How can I prepare for the TOEFL writing section?

To prepare for the TOEFL writing section, you should practice writing essays using sample prompts, develop your grammar and vocabulary skills, and learn how to organize and develop your ideas effectively. You may also want to work.

How to write a TOEFL essay?

To write a TOEFL essay, it is important to understand the prompt, brainstorm ideas, create an outline, and then write a well-structured essay with an introduction, body paragraphs, and a conclusion. Practice organising your thoughts and expressing them clearly within the time limit provided.

Is 27 a good writing score in TOEFL?

Yes, a score of 27 is considered a good writing score in TOEFL. It demonstrates a high level of proficiency in writing and indicates strong language skills, coherence, and clarity in expressing ideas.

What is the TOEFL essay format?

The TOEFL essay format consists of an introduction, two or three body paragraphs, and a conclusion. The introduction presents the main idea or argument, the body paragraphs provide supporting evidence or examples, and the conclusion summarises the main points and restates the thesis statement.

Is 24 a good TOEFL writing score?

A score of 24 in TOEFL writing is considered a good score. It reflects a strong command of English writing skills, including the ability to organise ideas coherently, use appropriate vocabulary and grammar, and develop well-structured paragraphs.

Is TOEFL writing difficult?

The difficulty level of TOEFL writing can vary from person to person. It requires strong language skills, critical thinking, and the ability to express ideas clearly within a time constraint.

author avatar

Study Abroad Expert

Disclaimer: The views and opinions shared in this site solely belong to the individual authors and do not necessarily represent t ...Read More

Best books for TOEFL exam preparation

Best books for TOEFL exam preparation

10 tips to help prepare for the TOEFL exam

10 tips to help prepare for the TOEFL exam

TOEFL vs IELTS: Which is best for applying to top universities?

TOEFL vs IELTS: Which is best for applying to top universities?

What is the TOEFL exam eligibility criteria?

What is the TOEFL exam eligibility criteria?

What you need to know about TOEFL iBT: A beginner's guide

What you need to know about TOEFL iBT: A beginner's guide

TOEFL study material: The best resources to help you succeed

TOEFL study material: The best resources to help you succeed

  • TOEFL Writing Topics

The lists of TOEFL Independent Writing topics below can help you gain a better understanding of the type of essay you could be asked to write on your exam. Rather than give you one long confusing list of topics, the essay topics have been divided into five categories. This is not an exhaustive list, but only a representative sample of essay topics. When practicing for your exam, make sure to write essays from each of the five categories so you will be ready for any kind of essay on exam day.

Types of TOEFL Essay Topics

  • Agree or Disagree
  • Description / Explanation
  • If / Imaginary
  • Compare & Contrast

1. TOEFL BASICS

Toefl overview.

  • TOEFL Exam Details
  • Registration
  • Administration

TOEFL iBT Format

2. toefl skills, primary skills.

  • Reading Skills
  • Listening Skills
  • Speaking Skills
  • Speaking Tips
  • Free TOEFL Speaking Samples
  • Writing Skills
  • Free Sample TOEFL Essays

Secondary Skills

  • Pronunciation

Support Skills

  • Summarizing
  • Paraphrasing
  • Mind Mapping
  • Synthesizing

3. TOEFL RESOURCES

Toefl books, in-class training.

  • TOEFL Prep Centres
  • ESL Schools

Online Training

Studying toefl abroad, 4. attitude, test anxiety.

  • Types of Anxiety
  • Causes of Anxiety
  • Symptoms of Anxiety
  • Reducing Anxiety

5. TOEFL Videos

6. toefl blog.

toefl writing essay

TOEFL Prep Online Guides and Tips

The best toefl writing templates for any prompt.

toefl writing essay

Even if English composition isn’t your forte, you can still earn a high score on the TOEFL Writing section by following a template. But what exactly is a TOEFL Writing template? Simply put, TOEFL essay templates teach you how to organize your thoughts, select strong pieces of evidence, and get the Writing score you want .

In this article, we’re providing you with  two high-quality TOEFL essay templates for the Integrated and Independent Writing tasks. Before that, we’ll go over the differences between the two TOEFL Writing tasks and then discuss how our TOEFL essay templates can benefit you. After, we’ll teach you how to customize and use our templates.

What Essays Will You Write on the TOEFL?

The Writing section is the final section you’ll face on the TOEFL, and it consists of two separate tasks: an Integrated Writing task and an Independent Writing task . The Integrated task requires you to write a response comparing a lecture and an article, whereas the Independent task requires you to write an opinionated essay in which you agree or disagree with an idea.

Below, we examine the two TOEFL Writing tasks in detail.

Integrated Writing Task

For the Integrated task, you must read a passage on an academic topic for three minutes and then listen to a short lecture on the same topic. This lecture will either support or challenge what’s written in the passage. Your response must  summarize the main points discussed in the lecture and explain how these points relate to the reading .

You’ll have a total of  20 minutes to write your response. Your response should be around  150-225 words . During this time, you may reread the passage; however, you may not listen to the lecture again.

By the way: we have built the world's best online TOEFL course . Get online practice (TPO-sytle!) and individual grading and feedback on Speaking and Writing.

Learn how you can improve your TOEFL score by 15 points today .

The score you receive for this task will be on a scale of 0-5 . According to the official rubric , a level-5 essay (i.e., a perfect essay) selects the most crucial information from the lecture and presents it in a coherent, accurate, and well-organized manner. A level-3 essay is satisfactory but overall vague, with fewer key points and several grammatical errors. Finally, a level-1 essay offers minimal coherency and fails to address any of the main points in the lecture. ETS (the creators of the TOEFL) offers  several samples of scored Integrated essays  online.

Below is an example of an Integrated Writing task borrowed from  ETS :

Reading Passage

Critics say that current voting systems used in the United States are inefficient and often lead to the inaccurate counting of votes. Miscounts can be especially damaging if an election is closely contested. Those critics would like the traditional systems to be replaced with far more efficient and trustworthy computerized voting systems.

In traditional voting, one major source of inaccuracy is that people accidentally vote for the wrong candidate. Voters usually have to find the name of their candidate on a large sheet of paper containing many names—the ballot—and make a small mark next to that name. People with poor eyesight can easily mark the wrong name. The computerized voting machines have an easy-to-use touch-screen technology: to cast a vote, a voter needs only to touch the candidate’s name on the screen to record a vote for that candidate; voters can even have the computer magnify the name for easier viewing.

Another major problem with old voting systems is that they rely heavily on people to count the votes. Officials must often count up the votes one by one, going through every ballot and recording the vote. Since they have to deal with thousands of ballots, it is almost inevitable that they will make mistakes. If an error is detected, a long and expensive recount has to take place. In contrast, computerized systems remove the possibility of human error, since all the vote counting is done quickly and automatically by the computers.

Finally some people say it is too risky to implement complicated voting technology nationwide. But without giving it a thought, governments and individuals alike trust other complex computer technology every day to be perfectly accurate in banking transactions as well as in the communication of highly sensitive information.

Lecture Transcript

(Narrator) Now listen to part of a lecture on the topic you just read about.

(Female professor) While traditional voting systems have some problems, it’s doubtful that computerized voting will make the situation any better. Computerized voting may seem easy for people who are used to computers. But what about people who aren’t? People who can’t afford computers, people who don’t use them on a regular basis—these people will have trouble using computerized voting machines. These voters can easily cast the wrong vote or be discouraged from voting altogether because of fear of technology. Furthermore, it’s true that humans make mistakes when they count up ballots by hand. But are we sure that computers will do a better job? After all, computers are programmed by humans, so “human error” can show up in mistakes in their programs. And the errors caused by these defective programs may be far more serious. The worst a human official can do is miss a few ballots. But an error in a computer program can result in thousands of votes being miscounted or even permanently removed from the record. And in many voting systems, there is no physical record of the votes, so a computer recount in the case of a suspected error is impossible! As for our trust of computer technology for banking and communications, remember one thing: these systems are used daily and they are used heavily. They didn’t work flawlessly when they were first introduced. They had to be improved on and improved on until they got as reliable as they are today. But voting happens only once every two years nationally in the United States and not much more than twice a year in many local areas. This is hardly sufficient for us to develop confidence that computerized voting can be fully trusted.

Question: Summarize the points made in the lecture, being sure to explain how they oppose specific points made in the reading passage.

body_thought_bubble

Independent Writing Task

For the Independent task, you will write about your opinion on a certain topic . You must provide clear reasons and specific examples for why you agree or disagree with the issue or statement. (This doesn’t have to be your real opinion, though!) You’ll have 30 minutes to write your response. A typical high-scoring essay is at least 300 words .

You’ll receive a score on a scale of 0-5 . According to the official rubric , a level-5 essay effectively addresses the topic, provides clear and ample details, and contains at most only minor issues with grammar and word choice. A level-3 essay offers a generally coherent response with occasional slips in clarity. Finally, a level-1 essay offers little to no detail and contains multiple technical errors. You can look at samples of Independent essays  on the ETS website.

Here is an example of an Independent Writing task taken from ETS :

Do you agree or disagree with the following statement?

Always telling the truth is the most important consideration in any relationship.

Use specific reasons and examples to support your answer.

Integrated Task vs. Independent Task

So far we’ve covered all of the basic components of the TOEFL Writing tasks. To briefly recap, here is an overview of the Integrated and Independent Writing tasks:

body_gray_question_marks

What Is a TOEFL Writing Template?

The purpose of a TOEFL Writing template is to help you compose a detailed and eloquent essay — and of course get you a high Writing score! But what exactly is a template?

It’s easiest to think of an essay template as a skeleton . While a skeleton is the framework for a body, a template is the framework for an essay . Thus, a TOEFL Writing template highlights basic structural patterns and phrases you can integrate into your own writing. What a template doesn’t do is write your essay for you. Ultimately, it’s up to you to decide how you’ll add the skin and muscles (i.e., the details and supporting evidence) to your “skeleton.”

Furthermore, writing templates are typically created to suit different types of essays . For example, a template for a compare/contrast essay will differ from a template for an editorial piece. This is why it’s so important you use a writing template specifically geared toward the TOEFL!

What Are the Benefits of a TOEFL Writing Template?

There are three major benefits to using a TOEFL Writing template.

It Organizes Your Thoughts

Using a TOEFL Writing template ensures you’ll have a focused and well-organized response . A high-quality template teaches you how to structure your response so that your introductory, body, and concluding paragraphs are all clearly defined. This way, even if your grammar and spelling aren’t perfect, your response will still have an overall strong framework that’s easy to follow.

TOEFL essay templates also show you  what kinds of transitions you can use and where . Transitions are a key component of essays that allow you to connect your thoughts and progress to new ideas smoothly. Truth be told, you probably won’t score super highly on the TOEFL Writing section if you don’t use any transitions!

It Saves You Time

Another big benefit of using a TOEFL Writing template is that it saves you time on the Writing section. Since you’ll already have a sense of how you’re going to structure your essay,  you’ll spend less time planning it out and   coming up with transitions, openings, and closings. As a result, you’ll get more time to actually write out your response!

It Gives You Confidence

Lastly, a TOEFL Writing template allows you to  feel more prepared on test day. Many test takers find it difficult to write out responses to English-language prompts in only 20 or 30 minutes. But TOEFL essay templates equip you with the skills you’ll need to feel more confident in your writing skills — and confidence is the first step toward getting a great TOEFL score!

body_colored_paper_sheets

2 TOEFL Essay Templates for You to Use

In this section, we offer you   a TOEFL Integrated Writing template and an Independent Writing TOEFL template . We also provide you with a handy list of key transitional words and phrases  you can incorporate into your essays.

Please note that you do not need to follow these templates directly. In fact, we strongly encourage you to replace our sample words and phrases with your own . We will talk more about how to customize our TOEFL essay templates in the following section. But for now, let’s take a look at the templates!

Template 1: Integrated Writing Task

Before we dive into our TOEFL Integrated Writing template, let’s get a quick reminder of what the Integrated task entails.

For this task, you’ll read a passage and then listen to a lecture. Your task is to summarize the lecture and explain whether the lecture challenges or supports what’s written in the passage. You’ll have 20 minutes to compose a response of around 150-225 words .

We’ll divide our template into four paragraphs:  an introduction and three body paragraphs . You shouldn’t need a separate concluding paragraph for this task, as you can still score a 5 without one (plus, you likely won’t have enough time to write one!). But if you want to write a conclusion, just be sure you keep it short —  two to three sentences  at most.

Now, on to the TOEFL Integrated Writing template!

Paragraph 1 (Introduction)

Your first paragraph will introduce the lecture and how it relates to the reading. This paragraph doesn’t need to be long; a simple  two or three sentences should suffice.

1. Begin with a topic sentence that summarizes the main point of the lecture.

  • According to the lecture, …
  • Based on the lecture, …
  • The lecturer states/believes that …
  • The lecturer talks about …
  • The lecturer discusses …

2. Next, explain whether the passage refutes or supports the main point of the lecture.

Examples (Contrast):

  • By contrast, the author of the passage explains/states/posits that …
  • The author of the passage, however, disagrees with this idea/belief. Instead, he/she believes that …
  • The author of the passage, however, doubts this idea/belief and thinks that …
  • The author challenges this point, however, by explaining/suggesting/positing that …

Examples (Agreement):

  • This line of thinking agrees with that of the author, who states that …
  • The author of the passage agrees with this notion/idea/belief, stating that …
  • Likewise, the author of the passage explains/states/posits that …
  • The author of the passage supports this idea/belief, explaining that …

Paragraph 2 (Body)

In this paragraph, you’ll want to focus on one of the key points in the lecture and explain whether the passage refutes or supports this idea.

1. Introduce the main point you’ll be discussing.

  • First, the lecturer asserts/claims/suggests that …
  • For one, the lecturer thinks/believes that …
  • The first point the lecturer makes is that …

2. Next, describe this particular point in more detail and then discuss how the passage either refutes or supports it. I suggest using one to two sentences here.

  • On the other hand, the author asserts/claims/suggests that …
  • Likewise, the author thinks/believes that …
  • This concept is refuted/supported by the passage, which asserts/claims/suggests that …

3. Finally, conclude your paragraph by explaining what this difference or similarity means .

Want to improve your TOEFL score by 15 points?

Registration is now open for our best TOEFL course . We guarantee your money back if you don't improve your TOEFL score by 15 points or more.

PrepScholar TOEFL is online and it features thousands of practice questions and 1-on-1 Speaking and Writing review and feedback.

  • This difference/similarity ultimately means/indicates that …
  • Basically, the lecturer/author is saying that …
  • The lecturer and author are essentially in agreement with each other. Both think/believe that …
  • Clearly, the lecturer and author maintain different views on …

body_leaf_water_droplets

Paragraph 3 (Body)

In this paragraph, you’ll focus on another key point in the lecture and again discuss how the passage either agrees or disagrees with this idea. The structure here is essentially identical to that of paragraph 2, so make sure you are using new transitions and varying your word choice.

1. Start by introducing the second point  in the lecture you’ll be discussing. Because this is your second body paragraph, do not use transitions such as “first” or “first of all.”

  • Second, the lecturer argues/contends/asserts that …
  • Next, the lecturer suggests/proposes that …
  • Another key point the lecturer makes is that …

2. Next, explain this key point in more detail  and elaborate on how it is either refuted or supported by the passage. As with paragraph 2, I recommend using one to two sentences here.

  • In other words, the lecturer is suggesting/proposing that …
  • The author, however, agrees/disagrees with this idea/belief, suggesting/proposing that …
  • Similarly, the author argues/contends/asserts that …

3. Finally, explain what this difference or similarity means .

  • What this difference/similarity means is that …
  • This difference/similarity in thinking tells us that …
  • Evidently, the lecturer thinks/believes that … , whereas the author thinks/believes that …

Paragraph 4 (Body + Conclusion)

This final body paragraph will offer  your last key point in addition to a brief conclusion . Once again, try to vary your transitions and words here so that your body paragraphs do not sound redundant.

1. Introduce and summarize the third point in the lecture . This is your third and final body paragraph, so do not use transitions such as “first” or “next.” Instead, use transitions such as “third,” “finally,” “last,” and “lastly.”

  • Third, the lecturer explains/states that …
  • Finally, the lecturer introduces the idea that …
  • Lastly, the lecturer goes on to say that …

2. Then, explain this point in more detail and discuss whether the passage matches or challenges it. Use one or two sentences here.

  • This point is also made in the passage, which argues/contends/asserts that …
  • The passage supports this idea, suggesting/proposing that …
  • On the contrary, the author writes that …

3. Finally, conclude your paragraph by explaining the significance of this similarity or difference . You should also look at this sentence as the conclusion to your entire response . Remember, you do not need to write a separate concluding paragraph for this task; it’s perfectly OK to combine it with your last body paragraph!

  • In conclusion, the lecturer and author appear to be in agreement/disagreement about …
  • All in all, the lecturer argues/contends/asserts that … , whereas the author argues/contends/asserts that …

body_speech_bubble

Template 2: Independent Writing Task

For this task, you will read a short prompt and then write a response explaining whether you agree or disagree with the issue or idea. Unlike the Integrated task, here you’ll be writing an opinionated response  (though you do not have to write about your actual opinion).

You’ll have 30 minutes to write an essay of at least 300 words . Because this essay should be longer than the Integrated task, it’s best to use the basic five-paragraph structure, with an introduction, three body paragraphs, and a conclusion.

Note that many of the example sentences below are based on the sample Independent Writing prompt used above . This means that these sentences will not apply exactly to other prompts! So don’t simply copy the sentences onto your test; instead, use them as a general guide to help you develop a better sense of style and flow in your writing.

Now, let’s take a look at our Independent Writing TOEFL template.

In this paragraph, you’ll introduce the main issue or idea and rephrase the prompt in your own words. Then, you’ll state whether you agree or disagree with the statement and why.

1. For your first sentence, you’ll want to come up with a hook  that introduces the topic of your essay in a unique and creative way. Most people  start broad and then get specific . This sentence is also a great opportunity to insert a hypothetical question.

  • There is great debate about .. .
  • It is said that .. . But is this always true?
  • Many people wonder  whether telling the truth at all times is necessary in order to maintain a healthy relationship with someone .

2. Next, introduce your opinion on the topic . You may list your specific reasons for your opinion here or in the following sentence(s). Remember, this doesn’t need to be your real opinion!

  • In my opinion, …
  • I believe that … is important because …
  • I agree/disagree with this idea/suggestion because …
  • For me, although telling the truth is important , there are many cases in which it’s OK to lie .

3. Here, state how many points (three is ideal) you’ll be discussing in your essay and then  briefly summarize what these reasons for agreeing or disagreeing are . Your reasons may come from facts, predictions, personal beliefs, experiences, etc.

  • In this essay, I will address three potential problems with lying in relationships .
  • Lying is never a smart idea because it undermines trust in relationships, causes feelings of betrayal, and often leads to more lying.

body_hearts

In this paragraph, you’ll introduce  your first key point about the issue and offer specific examples illustrating why you agree or disagree.

1. For the first sentence, introduce the key point you want to talk about.

  • First of all, lying can make it difficult to trust someone in a relationship.

2. Next, provide an example to explain why you believe this is the case . I suggest using a hypothetical situation somewhere in your essay to help illustrate one of your points.

  • For example, if a father lied to his daughter by telling her she was good at drawing, the daughter might begin to wonder whether her father has lied to her about other things, too.

3. Continue your paragraph by elaborating on your example . Explain any effect or consequence of the example and discuss how this outcome supports your viewpoint.

  • The next time the father praises his daughter for something, she might believe he is lying, even if he isn’t. In other words, the daughter might find it difficult to trust her father, thereby damaging their relationship.

This paragraph will follow a similar structure to that of paragraph 2, only this time you’ll discuss your second key point . To prevent this paragraph from sounding too similar to the one before it, vary your word choice and choose a different type of example on which to focus.

1. In the first sentence, introduce your second key point . Because this is your second body paragraph (and second point), make sure you employ appropriate transitions, such as “second,” “secondly,” and “next.”

  • Second, lying can cause feelings of betrayal in relationships.

2. As you did in paragraph 2, use these next few sentences to elaborate on your point and offer specific examples . Always clarify how your examples support your stance (agree or disagree) and how they relate back to the issue stated in the prompt.

  • When I was young, I lied to my best friend about being able to attend her birthday party. She later told me I’d betrayed her and that we couldn’t be friends anymore. Essentially, because I’d lied to her, I destroyed our relationship.

body_swimming_finish

Paragraph 4 (Body)

This is the last body paragraph in which you will introduce and explain your third and final key point .

1. First, introduce your third point . As this is your final body paragraph, make sure you’re using appropriate transitions, such as “finally,” “last,” “lastly,” and “third,” to introduce your point.

  • Lastly, lying often begets more lying. This means that once you tell a lie, you will become more likely to tell other lies.

2. As you did in paragraphs 2 and 3, elaborate on this point by providing new evidence, details, and/or examples. Explain why this point is important and how it supports your stance on the issue.

  • If you lie but nobody knows you’ve lied, you might begin to believe it’s OK to lie. As a result, you start to lie more and more. However, once people realize how often you lie, they’ll likely lose all trust in you and might even decide to end their relationships with you.

Paragraph 5 (Conclusion)

Like the Integrated Writing task, a separate concluding paragraph isn’t always necessary here . That said, if you have time, I recommend quickly wrapping up your points in a concise yet effective conclusion. One to three sentences should suffice.

1. In the first sentence or two, summarize your stance and explain why you feel this way . You may re-introduce your three main points here. You may also paraphrase the prompt and explain once more whether you agree or disagree with the issue and why.

  • In the end, I believe telling the truth is the most important consideration in a relationship.
  • Ultimately, lying can easily destroy relationships because it weakens trust, results in feelings of betrayal, and often causes more lying.

2. Finally, if you have time, bring together your concluding paragraph with a single general statement about your viewpoint .

  • This is why it is imperative we always refrain from lying.
  • Thus, the only way to ensure your relationships with others are healthy is to always tell the truth.

body_key_lock

54 Key Transitions & Phrases

In addition to TOEFL essay templates, we offer you an assortment of  common transitions and phrases you can use in your TOEFL responses. These words will allow your writing to flow more smoothly and connect your thoughts in a clearer, more logical manner.

To add information to a point, or to introduce a new (related) point:

  • In addition, …
  • Additionally, …
  • Furthermore, …
  • Moreover, …
  • Besides, …
  • Also, …

To introduce an example:

  • For example, …
  • For instance, …

To introduce an opinion:

  • I believe that …
  • I think that …

To start a new body paragraph:

  • First off, …
  • First of all, …
  • To begin/start, …
  • Secondly, …
  • Last, …

To add similar or related information:

  • Likewise, …
  • Similarly, …
  • On a related note, …

To restate information in a shorter or clearer way:

  • In other words, …
  • In short, …
  • Simply put, …
  • Essentially, …
  • In essence, …
  • Basically, …

To contrast information:

  • That (being) said, …
  • Nevertheless, …
  • Nonetheless, …
  • By/in contrast, …
  • On the contrary, …
  • On the other hand, …

To show cause and effect:

  • As a result, …
  • Because of this, …
  • As such, …
  • Consequently, …
  • Therefore, …

To emphasize information:

  • Evidently, …
  • Clearly, …
  • Certainly, …

To conclude information, a paragraph, or your overall response:

  • In conclusion, …
  • All in all, …
  • As you can see, …
  • In the end, …
  • Ultimately, …

body_editing_paper

How to Customize a TOEFL Writing Template

Below, we offer you a few pieces of advice to help you customize your TOEFL Writing template.

Integrated Writing Template

Here are two ways to customize our TOEFL Integrated Writing template.

#1: Tweak Sample Sentences & Openers

The sample sentence and openers we’ve provided above don’t contain any specific information, so you are welcome to incorporate these examples into your own Integrated Writing response however you like.

That said, you don’t need to use these examples exactly as they are . If you’d like to tweak them, try replacing the verbs and transitions with other (synonymous) words. I also suggest adding adverbs to emphasize certain words. So for example, instead of writing, “The author, however, disagrees with this belief,” you could write, “The author, on the other hand, strongly challenges this concept.”

Just be sure that the examples you’re using in your essay fit with the type of lecture-passage relationship you’re given. Remember, this relationship can be either supportive (in which the lecture agrees with the passage) or contrastive (in which the lecture disagrees with the passage).

#2: Use Different Words to Emphasize Contrast

Most Integrated Writing tasks will give you a lecture that challenges  what’s written in the passage. In these cases, it’s important you use a variety of transitions, conjunctions, and prepositions to emphasize this important contrast throughout your essay.

There is one caveat, though:  do not use a single transition more than once in your response . For instance, if you were to consistently start your supporting sentences with “however,” your essay would lack balance and begin to sound redundant. So be sure to vary your word choice!

Here is a list of strong contrastive words you can use in your Integrated Writing response:

  • Nonetheless …
  • Even though
  • In spite of

body_colored_pencils

Independent Writing Template

Here are three ways you can customize our Independent Writing TOEFL template.

#1: Choose the Side That’s Easier to Support

Although the Independent Writing task asks specifically whether you agree or disagree with an idea, you do not need to write about your real opinion . What you should really do is choose the side that’s easier to argue — regardless of whether you actually agree or disagree with it!

In other words, it’s perfectly OK to make things up for this response and pretend you support something you don’t actually believe. Remember, you are not being judged on which stance you choose but on  how effectively you support your stance.

#2: Do Not Copy Sample Sentences Word for Word

For this task’s examples, I’ve primarily supplied you with whole sentences, all of which were based on the sample Independent Writing prompt used above. Because our examples refer to a specific prompt, you cannot  copy these sample sentences word for word and use them in your own essay . Failing to change these sentences means your essay won’t make a whole lot of sense, if any at all!

Therefore, what you should actually be doing is using these example sentences to learn what types of details to include in your Independent Writing response. You may also use our examples to get a feel for how you can change up your sentences and word choice.

#3: Vary Your Supporting Details

For your response, you must come up with specific details and examples to support your viewpoint. These examples can range from facts and statistics to experiences and hypothetical situations.

In order to produce a truly compelling response (and get a high essay score), you must incorporate a variety of effective examples into your essay. This means you’ll want to  avoid using the same types of examples for each point you make . So if you were to discuss a personal experience for your first body paragraph, focus on a different type of detail, such as a universally accepted fact or a hypothetical scenario, for your second body paragraph.

body_die_4

How to Use a TOEFL Writing Template: 4 Tips

Finally, here are four general tips for using a TOEFL Writing template effectively.

#1: Employ a Variety of Transitions

The sample sentences and openers we’ve given you for each template above contain different transitions to help link ideas together; thus, we encourage you to use a variety of transitions in your own essays . For ideas, refer to our transition list above. This list can help you swap out some of our sample transitions for other ones you believe fit better with your writing.

Ultimately, varying your word choice is critical to ensuring your essay is well written . This means that a high-scoring TOEFL essay will not simply use transitions but use them well . So if you’ve already inserted “therefore” in one area, try using “as a result” or “thus” in another area. You’ll still get the same meaning but with a stronger, more versatile effect.

#2: Add Details Without Rambling

Don’t feel limited by the number of sentences we’ve included in each body paragraph in our templates. If you need to use two or three sentences to explain a point, that’s OK! You should always give a lot of detail for your points — so long as you are staying concise and focused.

For example, if you spend a paragraph discussing a hypothetical scenario but fail to offer any analysis, you probably won’t receive a high TOEFL score. Likewise, if you write a verbose introduction or conclusion, your score may end up being lower than you wanted it to be.

Basically, you must find the perfect balance between detail and analysis. A good way to monitor this balance is to  always try to hit the word minimum for each task . Doing this means you’ll likely have an essay that’s sufficiently detailed. Just try not to exceed the maximum or you’ll risk rambling!

#3: 2 Strong Points Is Better Than 3 Weak Ones

Each of our TOEFL essay templates above has three body paragraphs and three main points. Although three points is a solid number of points, if you can only think of two points, just stick with those and try to   support them as clearly and effectively as possible.

As you can see in the sample TOEFL essays here , many essays with scores of 2 and 3 actually address three points — but their points are altogether weak, with few supporting details and insufficient analysis. What this means is that just having three points in your essay isn’t enough to guarantee you a high essay score . Thus, it’s far better to write an essay with two strong points than three weak ones.

#4: Do Not Copy Entire Sentences From the Prompt

If you struggle with English, you might be tempted to borrow some sentences and phrases from the prompt. While it’s OK to paraphrase sentences, you should never copy exact sentences from your prompt .

Copying the prompt shows evaluators that you lack the English level necessary for writing creatively and analytically. On top of this, copying is also a form of plagiarism, which means you are essentially “stealing” another’s work.

So what does this mean for your score? If you copy entire sentences and phrases from the reading or topic, you will receive a score of 0 .

body_laptop_hands

Additional Resources for TOEFL Essay Templates

In addition to our templates above, here are a couple of online resources offering high-quality TOEFL essay templates you can use for free.

  • TOEFL Resources : This resource offers detailed templates for the Integrated and Independent Writing tasks along with useful transitional phrases and sentence openers.
  • Magoosh : This handy PDF contains a lengthy list of transitions as well as various sample TOEFL essays.

Recap: How Can a TOEFL Writing Template Help You?

TOEFL essay templates are an especially useful component of TOEFL prep because  they raise your chance of achieving a high TOEFL Writing score .

With TOEFL essay templates, you can organize your thoughts more clearly, spend less time outlining your responses, and prepare for the TOEFL with increased confidence. Our templates are also flexible , allowing you to customize certain elements to your liking and cater the templates to many different prompts.

When using a TOEFL template, always remember the following four tips:

  • Employ a variety of transitions throughout your essay
  • Add details but try not to ramble
  • Two strong point are better than three weak ones
  • Do not copy exact sentences from the prompt

Now that you’re equipped with our TOEFL essay templates and the knowledge of how to use them, you should have no trouble getting a great TOEFL Writing score!

What’s Next?

Seeking additional help with the TOEFL Writing section?  Start practicing with our 13 Writing topics  and get information on what resources you can use for high-quality Writing topics .

What does a high-scoring TOEFL essay look like? Take a look at our analyses of two perfect-scoring TOEFL essays  to learn what you can do to guarantee yourself a high essay score on test day.

Want more TOEFL templates?  Check out our TOEFL Speaking templates to improve your English-speaking skills and confidence.

Ready to improve your TOEFL score by 15 points?

toefl writing essay

Author: Hannah Muniz

Hannah graduated summa cum laude from the University of Southern California with a bachelor’s degree in English and East Asian languages and cultures. After graduation, she taught English in Japan for two years via the JET Program. She is passionate about education, writing, and travel. View all posts by Hannah Muniz

  • Toefl Exam 2024
  • Registration
  • Types of Toefl Exam
  • Eligibility
  • Exam Pattern
  • Documents Required
  • Slot Booking
  • Toefl Exam Date
  • List of Important Essay Transition Words You Should Know
  • 500+ Words Essay on Importance of Education in English
  • SAT Exam Maths Syllabus 2024: Important Topics & Preparation Tips
  • 500+ Words Essay on Newspaper in English For Students
  • IIT JAM Exam Pattern 2024: Question Types, Marking Scheme, Sections
  • NCERT Solutions for Class 9 English Moments Chapter 3 Iswaran The Storyteller
  • English Essay Writing Tips, Examples, Format
  • Class 8 History Solutions for Chapter 8 – The Making of the National Movement: 1870s–1947
  • Know The Uses Of Articles For The Preparation Of Govt. Exams
  • A Guide to Writing an Essay for Job Interviews
  • NCERT Indian Polity Notes for UPSC Exam 2024
  • How to write other experiences?
  • Why Writing Skills Are Important For Students?
  • Study Materials and Notes for UPSC 2024 for FREE!
  • Infiflex Technologies Interview Experience for Junior Solutions Associate
  • Essay on My Favourite Teacher (10 Lines, 100 Words, 200 Words)
  • How has Quantitative Ability in CAT changed over the years?
  • Thinklayer Minds Work Experience as Content Writer
  • Types of Writs in Indian Constitution with Questions and Answers and MCQs

Types of Essay in TOEFL: Check Samples and Important Questions

The TOEFL exam contains various essay types that students should familiarize themselves with. Within the essay section, there are primarily two types of tasks: independent writing and integrated writing. The integrated writing task involves reading a passage and listening to an audio, which may aid in elucidating or supplementing the text.

Conversely, the independent writing task entails expressing one’s opinions, highlighting specific facts, or taking a particular stance. Continue reading to delve deeper into the types of essays in TOEFL and access sample essays. Let us learn more!

Types-of-Essays-in-TOEFL

Types of Essay in TOEFL

Table of Content

Types of Essays in TOEFL

Types of essays in toefl: agree/disagree essays, types of essays in toefl: multiple choice essays, types of essays in toefl: paired choice essays, types of essays in toefl: good idea essays, sample essay.

Various online resources offer samples of essays along with their solutions, catering to the four primary essay types found in the TOEFL exam.

These categories include essays for building arguments, disagreeing/agreeing on something, providing an explanation, or stating a preference. The selection of essay type depends on the specific question given to the test-taker. Typically, the independent writing section mandates a minimum word count of 300-350 words. Here, we outline some essay styles along with questions.

Question : Do you agree or disagree with the following statement? The widespread usage of the internet has a positive impact on modern life.

Question : Do you agree or disagree with the following statement? When conducting research, it is always preferable to utilize articles, books, and other printed materials over internet resources. Provide examples and reasons to support your answer.

Question 1: What is the most ideal type of neighbor for an individual? Options provided:

  • A neighbor who is amicable and provides assistance.
  • A neighbor who is easygoing.
  • A neighbor who shares similar preferences, habits, and lifestyle.

Question 2: What is the optimal strategy for addressing a problem? Options offered:

  • Conducting thorough research and gathering information about the problem.
  • Seeking guidance from someone experienced.
  • Taking a break to rejuvenate, reflect, and contemplate on the issue.

Question: Some students prefer traditional offline classes with face-to-face interactions between instructors and peers, while others advocate for online classes with limited discussions. Which option do you prefer? Support your answer with specific examples and reasoning.

Question : Some individuals favor solo travel for the freedom it provides, while others opt to travel with companions. Which do you prefer? Provide examples and instances to justify your choice.

Question : The administration of an institution is reallocating its financial resources, opting to prioritize the enhancement of sports facilities while reducing investments in campus libraries. Do you consider this decision to be advantageous? State your stance with examples and reasoning.

Question : Many residents welcome the establishment of shopping malls near their homes for convenient access to amenities. However, some oppose such developments, citing concerns about noise, traffic congestion, and environmental impact. Would you support or oppose the opening of a shopping mall in your locality? Justify your position with specific reasons and examples.

Question: “Some people believe that it is better to live in a small town, while others think that life in a big city is more advantageous. Discuss the advantages and disadvantages of living in a small town versus living in a big city. Use specific examples and reasons to support your answer.”

Living in a Small Town vs. Living in a Big City: Exploring the Pros and Cons

Choosing between living in a small town and residing in a big city is a decision that greatly influences one’s lifestyle and overall experience. While some may argue that the charm of a small town offers a more peaceful and close-knit community, others may advocate for the opportunities and excitement that come with urban living. In this essay, we will delve into the advantages and disadvantages of both settings to understand their implications on daily life.

Living in a small town presents several notable advantages. Firstly, small towns often offer a strong sense of community. With fewer residents, people tend to know each other well, fostering close relationships and a supportive network. This tight-knit community can lead to a greater sense of belonging and security. Additionally, small towns are typically characterized by a slower pace of life, providing residents with a peaceful environment away from the hustle and bustle of city living. The tranquil surroundings of nature and open spaces in small towns can also promote a healthier lifestyle, encouraging outdoor activities and relaxation.

However, small towns also have their drawbacks. One of the main disadvantages is limited access to amenities and services. Small towns may lack diverse shopping options, entertainment venues, and healthcare facilities, requiring residents to travel to nearby cities for certain necessities. Moreover, job opportunities in small towns may be scarce and limited in scope, leading to economic challenges for residents seeking career advancement or specialized employment.

On the other hand, living in a big city offers a myriad of opportunities and conveniences. Cities are hubs of economic activity, providing abundant job opportunities across various industries. The diversity of businesses and organizations in cities also fosters innovation and creativity, attracting ambitious individuals seeking career growth and professional development. Furthermore, big cities boast a rich cultural scene with numerous museums, theaters, restaurants, and nightlife options, catering to a wide range of interests and tastes.

Despite these advantages, big city living comes with its own set of challenges. The fast-paced lifestyle and crowded environment can lead to increased stress levels and feelings of isolation among residents. Additionally, the high cost of living in cities, including housing expenses, transportation costs, and daily expenditures, can place a significant financial burden on individuals and families. Moreover, the anonymity of city life may hinder the development of close interpersonal relationships, leading to a sense of social detachment and loneliness for some residents.

In conclusion, the choice between living in a small town and residing in a big city ultimately depends on individual preferences, priorities, and lifestyle preferences. While small towns offer a sense of community and tranquility, big cities provide abundant opportunities and cultural diversity. By weighing the pros and cons of each setting, individuals can make informed decisions that align with their personal values and goals.

Also Check: TOEFL Preparation Strategy 2024 – TOEFL Preparation Plan & Materials TOEFL Exam Pattern 2024: Check Question Types & Scores TOEFL Exam 2024: Fees, Syllabus, Dates, Registration & More Best TOEFL Exam Preparation Books & Resources 2024: List of Top TOEFL Books

Types of Essay in TOEFL- FAQs

What are the key differences between the integrated writing task and the independent writing task in the toefl exam.

The integrated writing task involves reading a passage and listening to an audio recording, which can assist in understanding or expanding upon the text. Participants have 20 minutes to compose an essay based on this material, with a word limit of 150-225 words. Conversely, the independent writing task requires candidates to express their opinions, highlight specific facts, or take a particular stance without any accompanying material. This section typically necessitates a minimum word count of 300-350 words.

What are the primary categories of essays in the TOEFL exam, and how are they selected for test-takers?

The four main categories of essays in the TOEFL exam include building arguments, disagreeing/agreeing on a statement, providing explanations, and stating preferences. The selection of essay type depends on the specific question presented to the test-taker.

How should one approach multiple choice essays in the TOEFL exam?

Multiple choice essays in the TOEFL exam present questions where one must choose from several options to express their opinions or preferences. For instance, one might be asked to identify the ideal type of neighbor or the most effective approach to resolving an issue. Students should carefully consider each option and select the one that best aligns with their views.

What are some of the most common topics in TOEFL writing?

The most common topics in TOEFL writing include education, social media, mobile devices, and technology, among various others.

Please Login to comment...

Similar reads.

  • Study Abroad

Improve your Coding Skills with Practice

 alt=

What kind of Experience do you want to share?

Enjoy this post? Rate it!

TOEFL integrated writing

TOEFL integrated writing task 2023 | Examples and sample essay.

Want to excel in the toefl writing task elevate your skills with the most up-to-date examples, carefully crafted sample essays, and insights in 2023. maximize your potential to succeed in the integrated writing task., table of contents, introduction, toefl integrated writing task structure and format , toefl writing task topics , toefl writing task sample , toefl writing task pdf and other resources , strategies for toefl integrated writing task , scoring criteria for toefl writing task , key tips for success , example 1: environmental conservation , example 2: technological advancements in medicine , introduction: , body: , key takeaways .

TOEFL (Test of English as a Foreign Language) is a critical stepping stone for students who aspire to study in English-speaking universities. The TOEFL writing task is one part of this examination that can often become a cause for concern. It not only tests your ability to understand English but also your ability to express thoughts, ideas, and opinions in a clear and precise manner. 

In this comprehensive guide, we will explore various aspects of the TOEFL writing task, including topics, samples, format, and specific answers to common questions. The information provided is tailored to the 2023 edition of the test, ensuring relevance and applicability to your preparation. So, if you’re planning to take on this challenge in 2023, let’s begin by understanding the structure and requirements of the TOEFL writing task. 

The TOEFL integrated writing task is the first of the two writing tasks in the TOEFL exam. It’s designed to assess your ability to combine listening and reading skills to write a coherent and well-structured essay. Here’s a detailed breakdown: 

  • Reading passage : A passage around 200-250 words long is provided. You’ll have three minutes to read it. 
  • Listening clip : A short lecture related to the reading passage is played. You’ll be given time to take notes to remember the content better. 
  • Writing task : You will have 20 minutes to write a response of about 150-225 words, summarizing the points made in the lecture and explaining how they relate to specific points in the reading passage. 

The topics for the TOEFL integrated writing task are generally academic and range from subjects like history, science, art, and social sciences. Here’s an example of how the topics will be given: 

  • Reading passage : An excerpt will be provided about Renaissance art and its influence. 
  • Listening clip : A lecture discussing a specific Renaissance painting. 
  • Writing task : Compare and contrast the information in the reading passage and the lecture.

Practicing in advance by working on such topics will help you write and format your writing tasks better. Here’s a sample for you to practice: 

TOEFL integrated writing

  • Reading passage: Brief description of climate change and its effects. 
  • Listening clip : A lecture discussing various solutions to combat climate change.
  • Writing task : Summarize the solutions from the lecture and relate them to the problems mentioned in the reading passage. 

To support your preparation, TOEFL writing task PDF materials, containing practice questions and samples are available online. These resources often include: 

  • Guides on TOEFL writing format 
  • Collection of TOEFL writing samples with answers PDF 
  • TOEFL writing task 1 sample answers and TOEFL writing task 2 sample answers 
  • Practice tests for TOEFL writing task 2 

These materials are instrumental in understanding the pattern and honing your skills to succeed in the TOEFL writing task. 

  • Understanding the structure : Familiarize yourself with the TOEFL writing format, including reading, listening, and writing phases. 
  • Time management: Allocate time for reading, note-taking, and writing, keeping in mind the 20-minute time limit for the writing task. 
  • Note-taking skills : Practice jotting down crucial points from both the reading passage and the listening clip. Focus on the main ideas, supporting details, and the relationship between the reading and listening parts. 
  • Creating an outline : Before diving into writing, create a rough outline to organize your thoughts. This helps in maintaining coherence and connection between various sections of the essay. 
  • Practicing with samples : Utilize the TOEFL writing task sample, TOEFL writing task 1 sample answers, and TOEFL writing task 2 sample answers for regular practice. 

Understanding the scoring can guide you in preparing effectively. The TOEFL integrated writing task is scored on a scale of 0-5 based on the following criteria: 

  • Content : Accuracy, completeness, and connection between reading and listening. 
  • Organization : Logical progression, clear introduction, body, and conclusion. 
  • Language use : Grammar, vocabulary, sentence structure, and overall fluency.
  • Use transitional phrases : Transitional words like “however,” “in addition,” and “therefore” can help in maintaining the flow. 
  • Avoid repetition : While using keywords is vital, avoid unnecessary repetition. Make sure your content is varied and engaging. 
  • Proofread : Reserve some time at the end for revising and correcting errors.
  • Access Quality Resources : Consider TOEFL writing samples with answers PDF, TOEFL writing task PDF guides, and practice tests for well-rounded preparation.
  • Tables, quizzes, and other interactive elements: Incorporating tables and quizzes can be an excellent way to enhance the learning experience. For instance, a table comparing different TOEFL writing task topics or a quiz testing understanding of TOEFL writing format can be included in online learning platforms or books. 

TOEFL integrated writing

Examples of TOEFL integrated writing task 

The TOEFL integrated writing task presents a unique challenge to assess your ability to analyze information from both a reading passage and a lecture. In this task, you must demonstrate your comprehension of the material and your capacity to connect ideas between the text and the spoken content. To give you a clearer picture, let’s dive into a few illustrative examples of TOEFL-integrated writing tasks:

Reading passage : Description of deforestation and its impact on biodiversity.

Listening Clip : A lecture discussing various conservation methods employed globally. 

Writing task: 

  • Reading: The passage highlights the critical loss of forests, leading to a decline in biodiversity and environmental balance. 
  • Listening: The speaker introduces multiple conservation techniques such as reforestation, wildlife corridors, and legal enforcement. 
  • Essay: The essay must summarise the conservation methods mentioned in the lecture and relate them to the problems of deforestation and biodiversity loss detailed in the reading passage. 

Reading passage: An overview of the traditional medical practices and their limitations. 

Listening clip: A lecture elaborating on recent technological advancements in medical diagnostics and treatments. 

Writing task : 

  • Reading: The passage outlines traditional medical practices, emphasizing their limitations in accuracy and efficiency. 
  • Listening : The lecturer elaborates on cutting-edge technologies like AI-powered diagnostics, robotic surgeries, and personalized medicine.
  • Essay : The essay should connect the advancements discussed in the lecture with the limitations outlined in the reading passage, showcasing how technology is revolutionizing medical practice. 

In-depth Analysis of a Sample Essay | Environmental conservation 

The essay must have a good flow and cohesiveness. This makes it easier to understand and leave a good impression. Here is the in-depth analysis of an essay on environmental conservation.

“The loss of forests and biodiversity has long been a global concern. However, modern conservation methods, as described in the lecture, offer promising solutions to the challenges outlined in the reading passage.” 

  • Paragraph 1: Discuss reforestation, its importance, and how it directly addresses deforestation. 
  • Paragraph 2: Explore wildlife corridors and their role in preserving biodiversity. 
  • Paragraph 3 : Explain legal enforcement, international agreements, and their impact on conservation efforts. 
  • The TOEFL-integrated writing task involves reading, listening, and writing. Understand the flow and practice each part. 
  • Make use of TOEFL writing samples with answers PDF, TOEFL writing task PDF guides, and various other materials for practice. 
  • Your essay should logically connect the reading and listening parts, maintaining a clear and concise structure. 

The innovative conservation techniques described in the lecture provide a comprehensive approach to combating the grave issues of deforestation and biodiversity loss mentioned in the reading passage. These methods signify hope and progress in environmental preservation. 

We hope you found this article insightful. If you have any more queries please reach out to us and get them solved quickly!

Liked this blog? Read: TOEFL requirements 2023 | Documents and minimum requirements guide. 

1. How much time do I have for the TOEFL Integrated Writing Task? 

Ans. You have 3 minutes to read the passage, a listening time for the clip, and 20 minutes to write the essay. 

2. Can I take notes during the listening part of the TOEFL Writing Task? 

Ans. Yes, taking notes is allowed and advisable during the listening portion.

3. What types of topics are covered in the TOEFL Writing Task Topics? 

Ans. Topics are typically academic, ranging from history, science, art to social sciences. 

4. Where can I find TOEFL Writing Task 1 sample answers and TOEFL Writing Task 2 sample answers? 

Ans. Various online platforms, prep books, and official TOEFL guides provide these samples.

How useful was this post?

Click on a star to rate it!

Average rating 3.3 / 5. Vote count: 3

No votes so far! Be the first to rate this post.

toefl writing essay

People also liked

IELTS success

IELTS success| Tips to master each section

IELTS Grammar

Importance of IELTS grammar | Tips and tricks

TOEFL and IELTS

TOEFL and IELTS | Role of contextual & academic words

simulated tests

Simulated tests | Opportunity to overcome challenges

GRE exam structure

TOEFL, IELTS, & GRE exam structure | Key differences 2024!

GMAT exam

Ace the GMAT exam | Navigate quant & verbal difficulties!

Leave a reply cancel reply.

Your email address will not be published. Required fields are marked *

Please enter an answer in digits: four − four =

Start your journey with iSchoolPrep

Need help with your Test Preparations? Contact Us for more details

toefl writing essay

Inquire Now

Get e-books, expert guidance, live classes and more....

Design Framework for the TOEFL Primary Writing Test TOEFL ELP AD AES YL

The TOEFL Primary tests, designed as international English language proficiency assessments, have been widely used in over 120 countries since 2013. The tests are primarily intended to support the teaching and learning of young language learners aged 8 and older. Until recently, the TOEFL Primary tests measured only reading, listening, and speaking skills. In 2023, with the introduction of the TOEFL Primary Writing test, it became possible to assess all four language skills—reading, listening, speaking, and writing—through the TOEFL Primary tests. This research memorandum documents the underlying design principles behind the development of the TOEFL Primary Writing test and provides detailed information about the test. The directions for validity research concerning the TOEFL Primary Writing test are also discussed.

  • Request Copy  (specify title and report number, if any)
  • https://www.ets.org/Media/Research/pdf/RM-24-02.pdf

Essays for IELTS, PTE, TOEFL 4+

Sandyarani lagishetti, diseñada para ipad.

  • Ofrece compras dentro de la app

Capturas de pantalla

Descripción.

Introducing ‘Essays for IELTS, PTE, TOEFL' – the quintessential writing companion designed to elevate your performance in various exams. Tailored for excellence, this app is your go-to resource for mastering the art of essay writing. - Exam-Centric Mastery: 'Essays for IELTS, PTE, TOEFL' caters to a range of exams, offering a specialized approach to help you conquer the essay sections of IELTS, PTE, TOEFL, and more. - Precision in Writing Models: Immerse yourself in a curated collection of exemplary essays, providing precise guidance on structure, language proficiency, and content mastery. - Targeted Success: From IELTS to PTE and TOEFL, our app delivers tailored materials, ensuring you are well-prepared for the unique demands of each exam's essay writing component. - User-Friendly Interface: Navigate effortlessly through the app's user-friendly interface, focusing your energy on honing your writing skills without any distractions. - Your Personal Essay Companion: 'Essays for IELTS, PTE, TOEFL' is not just an app; it's your dedicated companion on the journey to achieving top scores in essay writing across various exams. Call to Action: Ready to ascend to top scores? Download 'Essays for IELTS, PTE, TOEFL' now and transform your essay writing prowess for success in IELTS, PTE, TOEFL, and beyond. Terms of Use: https://www.apple.com/legal/internet-services/itunes/dev/stdeula/

Privacidad de la app

El desarrollador ( Sandyarani Lagishetti ) indicó que, entre las prácticas de privacidad de la app, pueden incluirse el manejo de datos que se describe a continuación. Para obtener detalles, consulta la política de privacidad del desarrollador .

No se recopilan datos

El desarrollador no recopila ningún dato en esta app.

Las prácticas de privacidad pueden variar; por ejemplo, según tu edad o las funciones que uses. Obtén detalles

Información

  • Entire App - Six Months $2.49
  • Entire App - Yearly $3.49
  • Sitio web del desarrollador
  • Soporte para apps
  • Política de privacidad

Más sobre este desarrollador

Matrix Lottery Predictor

IELTS Writing Triumph

Quizás te interese

English Verbs Pro

2400 English Essays

IDIY - Essay Writing Lessons

Gurully Institute

IELTS Speaking Forecast

ProVo IELTS Synonym Dictionary

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • My Account Login
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Open access
  • Published: 03 June 2024

Applying large language models for automated essay scoring for non-native Japanese

  • Wenchao Li 1 &
  • Haitao Liu 2  

Humanities and Social Sciences Communications volume  11 , Article number:  723 ( 2024 ) Cite this article

129 Accesses

1 Altmetric

Metrics details

  • Language and linguistics

Recent advancements in artificial intelligence (AI) have led to an increased use of large language models (LLMs) for language assessment tasks such as automated essay scoring (AES), automated listening tests, and automated oral proficiency assessments. The application of LLMs for AES in the context of non-native Japanese, however, remains limited. This study explores the potential of LLM-based AES by comparing the efficiency of different models, i.e. two conventional machine training technology-based methods (Jess and JWriter), two LLMs (GPT and BERT), and one Japanese local LLM (Open-Calm large model). To conduct the evaluation, a dataset consisting of 1400 story-writing scripts authored by learners with 12 different first languages was used. Statistical analysis revealed that GPT-4 outperforms Jess and JWriter, BERT, and the Japanese language-specific trained Open-Calm large model in terms of annotation accuracy and predicting learning levels. Furthermore, by comparing 18 different models that utilize various prompts, the study emphasized the significance of prompts in achieving accurate and reliable evaluations using LLMs.

Similar content being viewed by others

toefl writing essay

Accurate structure prediction of biomolecular interactions with AlphaFold 3

toefl writing essay

Testing theory of mind in large language models and humans

toefl writing essay

Highly accurate protein structure prediction with AlphaFold

Conventional machine learning technology in aes.

AES has experienced significant growth with the advancement of machine learning technologies in recent decades. In the earlier stages of AES development, conventional machine learning-based approaches were commonly used. These approaches involved the following procedures: a) feeding the machine with a dataset. In this step, a dataset of essays is provided to the machine learning system. The dataset serves as the basis for training the model and establishing patterns and correlations between linguistic features and human ratings. b) the machine learning model is trained using linguistic features that best represent human ratings and can effectively discriminate learners’ writing proficiency. These features include lexical richness (Lu, 2012 ; Kyle and Crossley, 2015 ; Kyle et al. 2021 ), syntactic complexity (Lu, 2010 ; Liu, 2008 ), text cohesion (Crossley and McNamara, 2016 ), and among others. Conventional machine learning approaches in AES require human intervention, such as manual correction and annotation of essays. This human involvement was necessary to create a labeled dataset for training the model. Several AES systems have been developed using conventional machine learning technologies. These include the Intelligent Essay Assessor (Landauer et al. 2003 ), the e-rater engine by Educational Testing Service (Attali and Burstein, 2006 ; Burstein, 2003 ), MyAccess with the InterlliMetric scoring engine by Vantage Learning (Elliot, 2003 ), and the Bayesian Essay Test Scoring system (Rudner and Liang, 2002 ). These systems have played a significant role in automating the essay scoring process and providing quick and consistent feedback to learners. However, as touched upon earlier, conventional machine learning approaches rely on predetermined linguistic features and often require manual intervention, making them less flexible and potentially limiting their generalizability to different contexts.

In the context of the Japanese language, conventional machine learning-incorporated AES tools include Jess (Ishioka and Kameda, 2006 ) and JWriter (Lee and Hasebe, 2017 ). Jess assesses essays by deducting points from the perfect score, utilizing the Mainichi Daily News newspaper as a database. The evaluation criteria employed by Jess encompass various aspects, such as rhetorical elements (e.g., reading comprehension, vocabulary diversity, percentage of complex words, and percentage of passive sentences), organizational structures (e.g., forward and reverse connection structures), and content analysis (e.g., latent semantic indexing). JWriter employs linear regression analysis to assign weights to various measurement indices, such as average sentence length and total number of characters. These weights are then combined to derive the overall score. A pilot study involving the Jess model was conducted on 1320 essays at different proficiency levels, including primary, intermediate, and advanced. However, the results indicated that the Jess model failed to significantly distinguish between these essay levels. Out of the 16 measures used, four measures, namely median sentence length, median clause length, median number of phrases, and maximum number of phrases, did not show statistically significant differences between the levels. Additionally, two measures exhibited between-level differences but lacked linear progression: the number of attributives declined words and the Kanji/kana ratio. On the other hand, the remaining measures, including maximum sentence length, maximum clause length, number of attributive conjugated words, maximum number of consecutive infinitive forms, maximum number of conjunctive-particle clauses, k characteristic value, percentage of big words, and percentage of passive sentences, demonstrated statistically significant between-level differences and displayed linear progression.

Both Jess and JWriter exhibit notable limitations, including the manual selection of feature parameters and weights, which can introduce biases into the scoring process. The reliance on human annotators to label non-native language essays also introduces potential noise and variability in the scoring. Furthermore, an important concern is the possibility of system manipulation and cheating by learners who are aware of the regression equation utilized by the models (Hirao et al. 2020 ). These limitations emphasize the need for further advancements in AES systems to address these challenges.

Deep learning technology in AES

Deep learning has emerged as one of the approaches for improving the accuracy and effectiveness of AES. Deep learning-based AES methods utilize artificial neural networks that mimic the human brain’s functioning through layered algorithms and computational units. Unlike conventional machine learning, deep learning autonomously learns from the environment and past errors without human intervention. This enables deep learning models to establish nonlinear correlations, resulting in higher accuracy. Recent advancements in deep learning have led to the development of transformers, which are particularly effective in learning text representations. Noteworthy examples include bidirectional encoder representations from transformers (BERT) (Devlin et al. 2019 ) and the generative pretrained transformer (GPT) (OpenAI).

BERT is a linguistic representation model that utilizes a transformer architecture and is trained on two tasks: masked linguistic modeling and next-sentence prediction (Hirao et al. 2020 ; Vaswani et al. 2017 ). In the context of AES, BERT follows specific procedures, as illustrated in Fig. 1 : (a) the tokenized prompts and essays are taken as input; (b) special tokens, such as [CLS] and [SEP], are added to mark the beginning and separation of prompts and essays; (c) the transformer encoder processes the prompt and essay sequences, resulting in hidden layer sequences; (d) the hidden layers corresponding to the [CLS] tokens (T[CLS]) represent distributed representations of the prompts and essays; and (e) a multilayer perceptron uses these distributed representations as input to obtain the final score (Hirao et al. 2020 ).

figure 1

AES system with BERT (Hirao et al. 2020 ).

The training of BERT using a substantial amount of sentence data through the Masked Language Model (MLM) allows it to capture contextual information within the hidden layers. Consequently, BERT is expected to be capable of identifying artificial essays as invalid and assigning them lower scores (Mizumoto and Eguchi, 2023 ). In the context of AES for nonnative Japanese learners, Hirao et al. ( 2020 ) combined the long short-term memory (LSTM) model proposed by Hochreiter and Schmidhuber ( 1997 ) with BERT to develop a tailored automated Essay Scoring System. The findings of their study revealed that the BERT model outperformed both the conventional machine learning approach utilizing character-type features such as “kanji” and “hiragana”, as well as the standalone LSTM model. Takeuchi et al. ( 2021 ) presented an approach to Japanese AES that eliminates the requirement for pre-scored essays by relying solely on reference texts or a model answer for the essay task. They investigated multiple similarity evaluation methods, including frequency of morphemes, idf values calculated on Wikipedia, LSI, LDA, word-embedding vectors, and document vectors produced by BERT. The experimental findings revealed that the method utilizing the frequency of morphemes with idf values exhibited the strongest correlation with human-annotated scores across different essay tasks. The utilization of BERT in AES encounters several limitations. Firstly, essays often exceed the model’s maximum length limit. Second, only score labels are available for training, which restricts access to additional information.

Mizumoto and Eguchi ( 2023 ) were pioneers in employing the GPT model for AES in non-native English writing. Their study focused on evaluating the accuracy and reliability of AES using the GPT-3 text-davinci-003 model, analyzing a dataset of 12,100 essays from the corpus of nonnative written English (TOEFL11). The findings indicated that AES utilizing the GPT-3 model exhibited a certain degree of accuracy and reliability. They suggest that GPT-3-based AES systems hold the potential to provide support for human ratings. However, applying GPT model to AES presents a unique natural language processing (NLP) task that involves considerations such as nonnative language proficiency, the influence of the learner’s first language on the output in the target language, and identifying linguistic features that best indicate writing quality in a specific language. These linguistic features may differ morphologically or syntactically from those present in the learners’ first language, as observed in (1)–(3).

我-送了-他-一本-书

Wǒ-sòngle-tā-yī běn-shū

1 sg .-give. past- him-one .cl- book

“I gave him a book.”

Agglutinative

彼-に-本-を-あげ-まし-た

Kare-ni-hon-o-age-mashi-ta

3 sg .- dat -hon- acc- give.honorification. past

Inflectional

give, give-s, gave, given, giving

Additionally, the morphological agglutination and subject-object-verb (SOV) order in Japanese, along with its idiomatic expressions, pose additional challenges for applying language models in AES tasks (4).

足-が 棒-に なり-ました

Ashi-ga bo-ni nar-mashita

leg- nom stick- dat become- past

“My leg became like a stick (I am extremely tired).”

The example sentence provided demonstrates the morpho-syntactic structure of Japanese and the presence of an idiomatic expression. In this sentence, the verb “なる” (naru), meaning “to become”, appears at the end of the sentence. The verb stem “なり” (nari) is attached with morphemes indicating honorification (“ます” - mashu) and tense (“た” - ta), showcasing agglutination. While the sentence can be literally translated as “my leg became like a stick”, it carries an idiomatic interpretation that implies “I am extremely tired”.

To overcome this issue, CyberAgent Inc. ( 2023 ) has developed the Open-Calm series of language models specifically designed for Japanese. Open-Calm consists of pre-trained models available in various sizes, such as Small, Medium, Large, and 7b. Figure 2 depicts the fundamental structure of the Open-Calm model. A key feature of this architecture is the incorporation of the Lora Adapter and GPT-NeoX frameworks, which can enhance its language processing capabilities.

figure 2

GPT-NeoX Model Architecture (Okgetheng and Takeuchi 2024 ).

In a recent study conducted by Okgetheng and Takeuchi ( 2024 ), they assessed the efficacy of Open-Calm language models in grading Japanese essays. The research utilized a dataset of approximately 300 essays, which were annotated by native Japanese educators. The findings of the study demonstrate the considerable potential of Open-Calm language models in automated Japanese essay scoring. Specifically, among the Open-Calm family, the Open-Calm Large model (referred to as OCLL) exhibited the highest performance. However, it is important to note that, as of the current date, the Open-Calm Large model does not offer public access to its server. Consequently, users are required to independently deploy and operate the environment for OCLL. In order to utilize OCLL, users must have a PC equipped with an NVIDIA GeForce RTX 3060 (8 or 12 GB VRAM).

In summary, while the potential of LLMs in automated scoring of nonnative Japanese essays has been demonstrated in two studies—BERT-driven AES (Hirao et al. 2020 ) and OCLL-based AES (Okgetheng and Takeuchi, 2024 )—the number of research efforts in this area remains limited.

Another significant challenge in applying LLMs to AES lies in prompt engineering and ensuring its reliability and effectiveness (Brown et al. 2020 ; Rae et al. 2021 ; Zhang et al. 2021 ). Various prompting strategies have been proposed, such as the zero-shot chain of thought (CoT) approach (Kojima et al. 2022 ), which involves manually crafting diverse and effective examples. However, manual efforts can lead to mistakes. To address this, Zhang et al. ( 2021 ) introduced an automatic CoT prompting method called Auto-CoT, which demonstrates matching or superior performance compared to the CoT paradigm. Another prompt framework is trees of thoughts, enabling a model to self-evaluate its progress at intermediate stages of problem-solving through deliberate reasoning (Yao et al. 2023 ).

Beyond linguistic studies, there has been a noticeable increase in the number of foreign workers in Japan and Japanese learners worldwide (Ministry of Health, Labor, and Welfare of Japan, 2022 ; Japan Foundation, 2021 ). However, existing assessment methods, such as the Japanese Language Proficiency Test (JLPT), J-CAT, and TTBJ Footnote 1 , primarily focus on reading, listening, vocabulary, and grammar skills, neglecting the evaluation of writing proficiency. As the number of workers and language learners continues to grow, there is a rising demand for an efficient AES system that can reduce costs and time for raters and be utilized for employment, examinations, and self-study purposes.

This study aims to explore the potential of LLM-based AES by comparing the effectiveness of five models: two LLMs (GPT Footnote 2 and BERT), one Japanese local LLM (OCLL), and two conventional machine learning-based methods (linguistic feature-based scoring tools - Jess and JWriter).

The research questions addressed in this study are as follows:

To what extent do the LLM-driven AES and linguistic feature-based AES, when used as automated tools to support human rating, accurately reflect test takers’ actual performance?

What influence does the prompt have on the accuracy and performance of LLM-based AES methods?

The subsequent sections of the manuscript cover the methodology, including the assessment measures for nonnative Japanese writing proficiency, criteria for prompts, and the dataset. The evaluation section focuses on the analysis of annotations and rating scores generated by LLM-driven and linguistic feature-based AES methods.

Methodology

The dataset utilized in this study was obtained from the International Corpus of Japanese as a Second Language (I-JAS) Footnote 3 . This corpus consisted of 1000 participants who represented 12 different first languages. For the study, the participants were given a story-writing task on a personal computer. They were required to write two stories based on the 4-panel illustrations titled “Picnic” and “The key” (see Appendix A). Background information for the participants was provided by the corpus, including their Japanese language proficiency levels assessed through two online tests: J-CAT and SPOT. These tests evaluated their reading, listening, vocabulary, and grammar abilities. The learners’ proficiency levels were categorized into six levels aligned with the Common European Framework of Reference for Languages (CEFR) and the Reference Framework for Japanese Language Education (RFJLE): A1, A2, B1, B2, C1, and C2. According to Lee et al. ( 2015 ), there is a high level of agreement (r = 0.86) between the J-CAT and SPOT assessments, indicating that the proficiency certifications provided by J-CAT are consistent with those of SPOT. However, it is important to note that the scores of J-CAT and SPOT do not have a one-to-one correspondence. In this study, the J-CAT scores were used as a benchmark to differentiate learners of different proficiency levels. A total of 1400 essays were utilized, representing the beginner (aligned with A1), A2, B1, B2, C1, and C2 levels based on the J-CAT scores. Table 1 provides information about the learners’ proficiency levels and their corresponding J-CAT and SPOT scores.

A dataset comprising a total of 1400 essays from the story writing tasks was collected. Among these, 714 essays were utilized to evaluate the reliability of the LLM-based AES method, while the remaining 686 essays were designated as development data to assess the LLM-based AES’s capability to distinguish participants with varying proficiency levels. The GPT 4 API was used in this study. A detailed explanation of the prompt-assessment criteria is provided in Section Prompt . All essays were sent to the model for measurement and scoring.

Measures of writing proficiency for nonnative Japanese

Japanese exhibits a morphologically agglutinative structure where morphemes are attached to the word stem to convey grammatical functions such as tense, aspect, voice, and honorifics, e.g. (5).

食べ-させ-られ-まし-た-か

tabe-sase-rare-mashi-ta-ka

[eat (stem)-causative-passive voice-honorification-tense. past-question marker]

Japanese employs nine case particles to indicate grammatical functions: the nominative case particle が (ga), the accusative case particle を (o), the genitive case particle の (no), the dative case particle に (ni), the locative/instrumental case particle で (de), the ablative case particle から (kara), the directional case particle へ (e), and the comitative case particle と (to). The agglutinative nature of the language, combined with the case particle system, provides an efficient means of distinguishing between active and passive voice, either through morphemes or case particles, e.g. 食べる taberu “eat concusive . ” (active voice); 食べられる taberareru “eat concusive . ” (passive voice). In the active voice, “パン を 食べる” (pan o taberu) translates to “to eat bread”. On the other hand, in the passive voice, it becomes “パン が 食べられた” (pan ga taberareta), which means “(the) bread was eaten”. Additionally, it is important to note that different conjugations of the same lemma are considered as one type in order to ensure a comprehensive assessment of the language features. For example, e.g., 食べる taberu “eat concusive . ”; 食べている tabeteiru “eat progress .”; 食べた tabeta “eat past . ” as one type.

To incorporate these features, previous research (Suzuki, 1999 ; Watanabe et al. 1988 ; Ishioka, 2001 ; Ishioka and Kameda, 2006 ; Hirao et al. 2020 ) has identified complexity, fluency, and accuracy as crucial factors for evaluating writing quality. These criteria are assessed through various aspects, including lexical richness (lexical density, diversity, and sophistication), syntactic complexity, and cohesion (Kyle et al. 2021 ; Mizumoto and Eguchi, 2023 ; Ure, 1971 ; Halliday, 1985 ; Barkaoui and Hadidi, 2020 ; Zenker and Kyle, 2021 ; Kim et al. 2018 ; Lu, 2017 ; Ortega, 2015 ). Therefore, this study proposes five scoring categories: lexical richness, syntactic complexity, cohesion, content elaboration, and grammatical accuracy. A total of 16 measures were employed to capture these categories. The calculation process and specific details of these measures can be found in Table 2 .

T-unit, first introduced by Hunt ( 1966 ), is a measure used for evaluating speech and composition. It serves as an indicator of syntactic development and represents the shortest units into which a piece of discourse can be divided without leaving any sentence fragments. In the context of Japanese language assessment, Sakoda and Hosoi ( 2020 ) utilized T-unit as the basic unit to assess the accuracy and complexity of Japanese learners’ speaking and storytelling. The calculation of T-units in Japanese follows the following principles:

A single main clause constitutes 1 T-unit, regardless of the presence or absence of dependent clauses, e.g. (6).

ケンとマリはピクニックに行きました (main clause): 1 T-unit.

If a sentence contains a main clause along with subclauses, each subclause is considered part of the same T-unit, e.g. (7).

天気が良かった の で (subclause)、ケンとマリはピクニックに行きました (main clause): 1 T-unit.

In the case of coordinate clauses, where multiple clauses are connected, each coordinated clause is counted separately. Thus, a sentence with coordinate clauses may have 2 T-units or more, e.g. (8).

ケンは地図で場所を探して (coordinate clause)、マリはサンドイッチを作りました (coordinate clause): 2 T-units.

Lexical diversity refers to the range of words used within a text (Engber, 1995 ; Kyle et al. 2021 ) and is considered a useful measure of the breadth of vocabulary in L n production (Jarvis, 2013a , 2013b ).

The type/token ratio (TTR) is widely recognized as a straightforward measure for calculating lexical diversity and has been employed in numerous studies. These studies have demonstrated a strong correlation between TTR and other methods of measuring lexical diversity (e.g., Bentz et al. 2016 ; Čech and Miroslav, 2018 ; Çöltekin and Taraka, 2018 ). TTR is computed by considering both the number of unique words (types) and the total number of words (tokens) in a given text. Given that the length of learners’ writing texts can vary, this study employs the moving average type-token ratio (MATTR) to mitigate the influence of text length. MATTR is calculated using a 50-word moving window. Initially, a TTR is determined for words 1–50 in an essay, followed by words 2–51, 3–52, and so on until the end of the essay is reached (Díez-Ortega and Kyle, 2023 ). The final MATTR scores were obtained by averaging the TTR scores for all 50-word windows. The following formula was employed to derive MATTR:

\({\rm{MATTR}}({\rm{W}})=\frac{{\sum }_{{\rm{i}}=1}^{{\rm{N}}-{\rm{W}}+1}{{\rm{F}}}_{{\rm{i}}}}{{\rm{W}}({\rm{N}}-{\rm{W}}+1)}\)

Here, N refers to the number of tokens in the corpus. W is the randomly selected token size (W < N). \({F}_{i}\) is the number of types in each window. The \({\rm{MATTR}}({\rm{W}})\) is the mean of a series of type-token ratios (TTRs) based on the word form for all windows. It is expected that individuals with higher language proficiency will produce texts with greater lexical diversity, as indicated by higher MATTR scores.

Lexical density was captured by the ratio of the number of lexical words to the total number of words (Lu, 2012 ). Lexical sophistication refers to the utilization of advanced vocabulary, often evaluated through word frequency indices (Crossley et al. 2013 ; Haberman, 2008 ; Kyle and Crossley, 2015 ; Laufer and Nation, 1995 ; Lu, 2012 ; Read, 2000 ). In line of writing, lexical sophistication can be interpreted as vocabulary breadth, which entails the appropriate usage of vocabulary items across various lexicon-grammatical contexts and registers (Garner et al. 2019 ; Kim et al. 2018 ; Kyle et al. 2018 ). In Japanese specifically, words are considered lexically sophisticated if they are not included in the “Japanese Education Vocabulary List Ver 1.0”. Footnote 4 Consequently, lexical sophistication was calculated by determining the number of sophisticated word types relative to the total number of words per essay. Furthermore, it has been suggested that, in Japanese writing, sentences should ideally have a length of no more than 40 to 50 characters, as this promotes readability. Therefore, the median and maximum sentence length can be considered as useful indices for assessment (Ishioka and Kameda, 2006 ).

Syntactic complexity was assessed based on several measures, including the mean length of clauses, verb phrases per T-unit, clauses per T-unit, dependent clauses per T-unit, complex nominals per clause, adverbial clauses per clause, coordinate phrases per clause, and mean dependency distance (MDD). The MDD reflects the distance between the governor and dependent positions in a sentence. A larger dependency distance indicates a higher cognitive load and greater complexity in syntactic processing (Liu, 2008 ; Liu et al. 2017 ). The MDD has been established as an efficient metric for measuring syntactic complexity (Jiang, Quyang, and Liu, 2019 ; Li and Yan, 2021 ). To calculate the MDD, the position numbers of the governor and dependent are subtracted, assuming that words in a sentence are assigned in a linear order, such as W1 … Wi … Wn. In any dependency relationship between words Wa and Wb, Wa is the governor and Wb is the dependent. The MDD of the entire sentence was obtained by taking the absolute value of governor – dependent:

MDD = \(\frac{1}{n}{\sum }_{i=1}^{n}|{\rm{D}}{{\rm{D}}}_{i}|\)

In this formula, \(n\) represents the number of words in the sentence, and \({DD}i\) is the dependency distance of the \({i}^{{th}}\) dependency relationship of a sentence. Building on this, the annotation of sentence ‘Mary-ga-John-ni-keshigomu-o-watashita was [Mary- top -John- dat -eraser- acc -give- past] ’. The sentence’s MDD would be 2. Table 3 provides the CSV file as a prompt for GPT 4.

Cohesion (semantic similarity) and content elaboration aim to capture the ideas presented in test taker’s essays. Cohesion was assessed using three measures: Synonym overlap/paragraph (topic), Synonym overlap/paragraph (keywords), and word2vec cosine similarity. Content elaboration and development were measured as the number of metadiscourse markers (type)/number of words. To capture content closely, this study proposed a novel-distance based representation, by encoding the cosine distance between the essay (by learner) and essay task’s (topic and keyword) i -vectors. The learner’s essay is decoded into a word sequence, and aligned to the essay task’ topic and keyword for log-likelihood measurement. The cosine distance reveals the content elaboration score in the leaners’ essay. The mathematical equation of cosine similarity between target-reference vectors is shown in (11), assuming there are i essays and ( L i , …. L n ) and ( N i , …. N n ) are the vectors representing the learner and task’s topic and keyword respectively. The content elaboration distance between L i and N i was calculated as follows:

\(\cos \left(\theta \right)=\frac{{\rm{L}}\,\cdot\, {\rm{N}}}{\left|{\rm{L}}\right|{\rm{|N|}}}=\frac{\mathop{\sum }\nolimits_{i=1}^{n}{L}_{i}{N}_{i}}{\sqrt{\mathop{\sum }\nolimits_{i=1}^{n}{L}_{i}^{2}}\sqrt{\mathop{\sum }\nolimits_{i=1}^{n}{N}_{i}^{2}}}\)

A high similarity value indicates a low difference between the two recognition outcomes, which in turn suggests a high level of proficiency in content elaboration.

To evaluate the effectiveness of the proposed measures in distinguishing different proficiency levels among nonnative Japanese speakers’ writing, we conducted a multi-faceted Rasch measurement analysis (Linacre, 1994 ). This approach applies measurement models to thoroughly analyze various factors that can influence test outcomes, including test takers’ proficiency, item difficulty, and rater severity, among others. The underlying principles and functionality of multi-faceted Rasch measurement are illustrated in (12).

\(\log \left(\frac{{P}_{{nijk}}}{{P}_{{nij}(k-1)}}\right)={B}_{n}-{D}_{i}-{C}_{j}-{F}_{k}\)

(12) defines the logarithmic transformation of the probability ratio ( P nijk /P nij(k-1) )) as a function of multiple parameters. Here, n represents the test taker, i denotes a writing proficiency measure, j corresponds to the human rater, and k represents the proficiency score. The parameter B n signifies the proficiency level of test taker n (where n ranges from 1 to N). D j represents the difficulty parameter of test item i (where i ranges from 1 to L), while C j represents the severity of rater j (where j ranges from 1 to J). Additionally, F k represents the step difficulty for a test taker to move from score ‘k-1’ to k . P nijk refers to the probability of rater j assigning score k to test taker n for test item i . P nij(k-1) represents the likelihood of test taker n being assigned score ‘k-1’ by rater j for test item i . Each facet within the test is treated as an independent parameter and estimated within the same reference framework. To evaluate the consistency of scores obtained through both human and computer analysis, we utilized the Infit mean-square statistic. This statistic is a chi-square measure divided by the degrees of freedom and is weighted with information. It demonstrates higher sensitivity to unexpected patterns in responses to items near a person’s proficiency level (Linacre, 2002 ). Fit statistics are assessed based on predefined thresholds for acceptable fit. For the Infit MNSQ, which has a mean of 1.00, different thresholds have been suggested. Some propose stricter thresholds ranging from 0.7 to 1.3 (Bond et al. 2021 ), while others suggest more lenient thresholds ranging from 0.5 to 1.5 (Eckes, 2009 ). In this study, we adopted the criterion of 0.70–1.30 for the Infit MNSQ.

Moving forward, we can now proceed to assess the effectiveness of the 16 proposed measures based on five criteria for accurately distinguishing various levels of writing proficiency among non-native Japanese speakers. To conduct this evaluation, we utilized the development dataset from the I-JAS corpus, as described in Section Dataset . Table 4 provides a measurement report that presents the performance details of the 14 metrics under consideration. The measure separation was found to be 4.02, indicating a clear differentiation among the measures. The reliability index for the measure separation was 0.891, suggesting consistency in the measurement. Similarly, the person separation reliability index was 0.802, indicating the accuracy of the assessment in distinguishing between individuals. All 16 measures demonstrated Infit mean squares within a reasonable range, ranging from 0.76 to 1.28. The Synonym overlap/paragraph (topic) measure exhibited a relatively high outfit mean square of 1.46, although the Infit mean square falls within an acceptable range. The standard error for the measures ranged from 0.13 to 0.28, indicating the precision of the estimates.

Table 5 further illustrated the weights assigned to different linguistic measures for score prediction, with higher weights indicating stronger correlations between those measures and higher scores. Specifically, the following measures exhibited higher weights compared to others: moving average type token ratio per essay has a weight of 0.0391. Mean dependency distance had a weight of 0.0388. Mean length of clause, calculated by dividing the number of words by the number of clauses, had a weight of 0.0374. Complex nominals per T-unit, calculated by dividing the number of complex nominals by the number of T-units, had a weight of 0.0379. Coordinate phrases rate, calculated by dividing the number of coordinate phrases by the number of clauses, had a weight of 0.0325. Grammatical error rate, representing the number of errors per essay, had a weight of 0.0322.

Criteria (output indicator)

The criteria used to evaluate the writing ability in this study were based on CEFR, which follows a six-point scale ranging from A1 to C2. To assess the quality of Japanese writing, the scoring criteria from Table 6 were utilized. These criteria were derived from the IELTS writing standards and served as assessment guidelines and prompts for the written output.

A prompt is a question or detailed instruction that is provided to the model to obtain a proper response. After several pilot experiments, we decided to provide the measures (Section Measures of writing proficiency for nonnative Japanese ) as the input prompt and use the criteria (Section Criteria (output indicator) ) as the output indicator. Regarding the prompt language, considering that the LLM was tasked with rating Japanese essays, would prompt in Japanese works better Footnote 5 ? We conducted experiments comparing the performance of GPT-4 using both English and Japanese prompts. Additionally, we utilized the Japanese local model OCLL with Japanese prompts. Multiple trials were conducted using the same sample. Regardless of the prompt language used, we consistently obtained the same grading results with GPT-4, which assigned a grade of B1 to the writing sample. This suggested that GPT-4 is reliable and capable of producing consistent ratings regardless of the prompt language. On the other hand, when we used Japanese prompts with the Japanese local model “OCLL”, we encountered inconsistent grading results. Out of 10 attempts with OCLL, only 6 yielded consistent grading results (B1), while the remaining 4 showed different outcomes, including A1 and B2 grades. These findings indicated that the language of the prompt was not the determining factor for reliable AES. Instead, the size of the training data and the model parameters played crucial roles in achieving consistent and reliable AES results for the language model.

The following is the utilized prompt, which details all measures and requires the LLM to score the essays using holistic and trait scores.

Please evaluate Japanese essays written by Japanese learners and assign a score to each essay on a six-point scale, ranging from A1, A2, B1, B2, C1 to C2. Additionally, please provide trait scores and display the calculation process for each trait score. The scoring should be based on the following criteria:

Moving average type-token ratio.

Number of lexical words (token) divided by the total number of words per essay.

Number of sophisticated word types divided by the total number of words per essay.

Mean length of clause.

Verb phrases per T-unit.

Clauses per T-unit.

Dependent clauses per T-unit.

Complex nominals per clause.

Adverbial clauses per clause.

Coordinate phrases per clause.

Mean dependency distance.

Synonym overlap paragraph (topic and keywords).

Word2vec cosine similarity.

Connectives per essay.

Conjunctions per essay.

Number of metadiscourse markers (types) divided by the total number of words.

Number of errors per essay.

Japanese essay text

出かける前に二人が地図を見ている間に、サンドイッチを入れたバスケットに犬が入ってしまいました。それに気づかずに二人は楽しそうに出かけて行きました。やがて突然犬がバスケットから飛び出し、二人は驚きました。バスケット の 中を見ると、食べ物はすべて犬に食べられていて、二人は困ってしまいました。(ID_JJJ01_SW1)

The score of the example above was B1. Figure 3 provides an example of holistic and trait scores provided by GPT-4 (with a prompt indicating all measures) via Bing Footnote 6 .

figure 3

Example of GPT-4 AES and feedback (with a prompt indicating all measures).

Statistical analysis

The aim of this study is to investigate the potential use of LLM for nonnative Japanese AES. It seeks to compare the scoring outcomes obtained from feature-based AES tools, which rely on conventional machine learning technology (i.e. Jess, JWriter), with those generated by AI-driven AES tools utilizing deep learning technology (BERT, GPT, OCLL). To assess the reliability of a computer-assisted annotation tool, the study initially established human-human agreement as the benchmark measure. Subsequently, the performance of the LLM-based method was evaluated by comparing it to human-human agreement.

To assess annotation agreement, the study employed standard measures such as precision, recall, and F-score (Brants 2000 ; Lu 2010 ), along with the quadratically weighted kappa (QWK) to evaluate the consistency and agreement in the annotation process. Assume A and B represent human annotators. When comparing the annotations of the two annotators, the following results are obtained. The evaluation of precision, recall, and F-score metrics was illustrated in equations (13) to (15).

\({\rm{Recall}}(A,B)=\frac{{\rm{Number}}\,{\rm{of}}\,{\rm{identical}}\,{\rm{nodes}}\,{\rm{in}}\,A\,{\rm{and}}\,B}{{\rm{Number}}\,{\rm{of}}\,{\rm{nodes}}\,{\rm{in}}\,A}\)

\({\rm{Precision}}(A,\,B)=\frac{{\rm{Number}}\,{\rm{of}}\,{\rm{identical}}\,{\rm{nodes}}\,{\rm{in}}\,A\,{\rm{and}}\,B}{{\rm{Number}}\,{\rm{of}}\,{\rm{nodes}}\,{\rm{in}}\,B}\)

The F-score is the harmonic mean of recall and precision:

\({\rm{F}}-{\rm{score}}=\frac{2* ({\rm{Precision}}* {\rm{Recall}})}{{\rm{Precision}}+{\rm{Recall}}}\)

The highest possible value of an F-score is 1.0, indicating perfect precision and recall, and the lowest possible value is 0, if either precision or recall are zero.

In accordance with Taghipour and Ng ( 2016 ), the calculation of QWK involves two steps:

Step 1: Construct a weight matrix W as follows:

\({W}_{{ij}}=\frac{{(i-j)}^{2}}{{(N-1)}^{2}}\)

i represents the annotation made by the tool, while j represents the annotation made by a human rater. N denotes the total number of possible annotations. Matrix O is subsequently computed, where O_( i, j ) represents the count of data annotated by the tool ( i ) and the human annotator ( j ). On the other hand, E refers to the expected count matrix, which undergoes normalization to ensure that the sum of elements in E matches the sum of elements in O.

Step 2: With matrices O and E, the QWK is obtained as follows:

K = 1- \(\frac{\sum i,j{W}_{i,j}\,{O}_{i,j}}{\sum i,j{W}_{i,j}\,{E}_{i,j}}\)

The value of the quadratic weighted kappa increases as the level of agreement improves. Further, to assess the accuracy of LLM scoring, the proportional reductive mean square error (PRMSE) was employed. The PRMSE approach takes into account the variability observed in human ratings to estimate the rater error, which is then subtracted from the variance of the human labels. This calculation provides an overall measure of agreement between the automated scores and true scores (Haberman et al. 2015 ; Loukina et al. 2020 ; Taghipour and Ng, 2016 ). The computation of PRMSE involves the following steps:

Step 1: Calculate the mean squared errors (MSEs) for the scoring outcomes of the computer-assisted tool (MSE tool) and the human scoring outcomes (MSE human).

Step 2: Determine the PRMSE by comparing the MSE of the computer-assisted tool (MSE tool) with the MSE from human raters (MSE human), using the following formula:

\({\rm{PRMSE}}=1-\frac{({\rm{MSE}}\,{\rm{tool}})\,}{({\rm{MSE}}\,{\rm{human}})\,}=1-\,\frac{{\sum }_{i}^{n}=1{({{\rm{y}}}_{i}-{\hat{{\rm{y}}}}_{{\rm{i}}})}^{2}}{{\sum }_{i}^{n}=1{({{\rm{y}}}_{i}-\hat{{\rm{y}}})}^{2}}\)

In the numerator, ŷi represents the scoring outcome predicted by a specific LLM-driven AES system for a given sample. The term y i − ŷ i represents the difference between this predicted outcome and the mean value of all LLM-driven AES systems’ scoring outcomes. It quantifies the deviation of the specific LLM-driven AES system’s prediction from the average prediction of all LLM-driven AES systems. In the denominator, y i − ŷ represents the difference between the scoring outcome provided by a specific human rater for a given sample and the mean value of all human raters’ scoring outcomes. It measures the discrepancy between the specific human rater’s score and the average score given by all human raters. The PRMSE is then calculated by subtracting the ratio of the MSE tool to the MSE human from 1. PRMSE falls within the range of 0 to 1, with larger values indicating reduced errors in LLM’s scoring compared to those of human raters. In other words, a higher PRMSE implies that LLM’s scoring demonstrates greater accuracy in predicting the true scores (Loukina et al. 2020 ). The interpretation of kappa values, ranging from 0 to 1, is based on the work of Landis and Koch ( 1977 ). Specifically, the following categories are assigned to different ranges of kappa values: −1 indicates complete inconsistency, 0 indicates random agreement, 0.0 ~ 0.20 indicates extremely low level of agreement (slight), 0.21 ~ 0.40 indicates moderate level of agreement (fair), 0.41 ~ 0.60 indicates medium level of agreement (moderate), 0.61 ~ 0.80 indicates high level of agreement (substantial), 0.81 ~ 1 indicates almost perfect level of agreement. All statistical analyses were executed using Python script.

Results and discussion

Annotation reliability of the llm.

This section focuses on assessing the reliability of the LLM’s annotation and scoring capabilities. To evaluate the reliability, several tests were conducted simultaneously, aiming to achieve the following objectives:

Assess the LLM’s ability to differentiate between test takers with varying levels of oral proficiency.

Determine the level of agreement between the annotations and scoring performed by the LLM and those done by human raters.

The evaluation of the results encompassed several metrics, including: precision, recall, F-Score, quadratically-weighted kappa, proportional reduction of mean squared error, Pearson correlation, and multi-faceted Rasch measurement.

Inter-annotator agreement (human–human annotator agreement)

We started with an agreement test of the two human annotators. Two trained annotators were recruited to determine the writing task data measures. A total of 714 scripts, as the test data, was utilized. Each analysis lasted 300–360 min. Inter-annotator agreement was evaluated using the standard measures of precision, recall, and F-score and QWK. Table 7 presents the inter-annotator agreement for the various indicators. As shown, the inter-annotator agreement was fairly high, with F-scores ranging from 1.0 for sentence and word number to 0.666 for grammatical errors.

The findings from the QWK analysis provided further confirmation of the inter-annotator agreement. The QWK values covered a range from 0.950 ( p  = 0.000) for sentence and word number to 0.695 for synonym overlap number (keyword) and grammatical errors ( p  = 0.001).

Agreement of annotation outcomes between human and LLM

To evaluate the consistency between human annotators and LLM annotators (BERT, GPT, OCLL) across the indices, the same test was conducted. The results of the inter-annotator agreement (F-score) between LLM and human annotation are provided in Appendix B-D. The F-scores ranged from 0.706 for Grammatical error # for OCLL-human to a perfect 1.000 for GPT-human, for sentences, clauses, T-units, and words. These findings were further supported by the QWK analysis, which showed agreement levels ranging from 0.807 ( p  = 0.001) for metadiscourse markers for OCLL-human to 0.962 for words ( p  = 0.000) for GPT-human. The findings demonstrated that the LLM annotation achieved a significant level of accuracy in identifying measurement units and counts.

Reliability of LLM-driven AES’s scoring and discriminating proficiency levels

This section examines the reliability of the LLM-driven AES scoring through a comparison of the scoring outcomes produced by human raters and the LLM ( Reliability of LLM-driven AES scoring ). It also assesses the effectiveness of the LLM-based AES system in differentiating participants with varying proficiency levels ( Reliability of LLM-driven AES discriminating proficiency levels ).

Reliability of LLM-driven AES scoring

Table 8 summarizes the QWK coefficient analysis between the scores computed by the human raters and the GPT-4 for the individual essays from I-JAS Footnote 7 . As shown, the QWK of all measures ranged from k  = 0.819 for lexical density (number of lexical words (tokens)/number of words per essay) to k  = 0.644 for word2vec cosine similarity. Table 9 further presents the Pearson correlations between the 16 writing proficiency measures scored by human raters and GPT 4 for the individual essays. The correlations ranged from 0.672 for syntactic complexity to 0.734 for grammatical accuracy. The correlations between the writing proficiency scores assigned by human raters and the BERT-based AES system were found to range from 0.661 for syntactic complexity to 0.713 for grammatical accuracy. The correlations between the writing proficiency scores given by human raters and the OCLL-based AES system ranged from 0.654 for cohesion to 0.721 for grammatical accuracy. These findings indicated an alignment between the assessments made by human raters and both the BERT-based and OCLL-based AES systems in terms of various aspects of writing proficiency.

Reliability of LLM-driven AES discriminating proficiency levels

After validating the reliability of the LLM’s annotation and scoring, the subsequent objective was to evaluate its ability to distinguish between various proficiency levels. For this analysis, a dataset of 686 individual essays was utilized. Table 10 presents a sample of the results, summarizing the means, standard deviations, and the outcomes of the one-way ANOVAs based on the measures assessed by the GPT-4 model. A post hoc multiple comparison test, specifically the Bonferroni test, was conducted to identify any potential differences between pairs of levels.

As the results reveal, seven measures presented linear upward or downward progress across the three proficiency levels. These were marked in bold in Table 10 and comprise one measure of lexical richness, i.e. MATTR (lexical diversity); four measures of syntactic complexity, i.e. MDD (mean dependency distance), MLC (mean length of clause), CNT (complex nominals per T-unit), CPC (coordinate phrases rate); one cohesion measure, i.e. word2vec cosine similarity and GER (grammatical error rate). Regarding the ability of the sixteen measures to distinguish adjacent proficiency levels, the Bonferroni tests indicated that statistically significant differences exist between the primary level and the intermediate level for MLC and GER. One measure of lexical richness, namely LD, along with three measures of syntactic complexity (VPT, CT, DCT, ACC), two measures of cohesion (SOPT, SOPK), and one measure of content elaboration (IMM), exhibited statistically significant differences between proficiency levels. However, these differences did not demonstrate a linear progression between adjacent proficiency levels. No significant difference was observed in lexical sophistication between proficiency levels.

To summarize, our study aimed to evaluate the reliability and differentiation capabilities of the LLM-driven AES method. For the first objective, we assessed the LLM’s ability to differentiate between test takers with varying levels of oral proficiency using precision, recall, F-Score, and quadratically-weighted kappa. Regarding the second objective, we compared the scoring outcomes generated by human raters and the LLM to determine the level of agreement. We employed quadratically-weighted kappa and Pearson correlations to compare the 16 writing proficiency measures for the individual essays. The results confirmed the feasibility of using the LLM for annotation and scoring in AES for nonnative Japanese. As a result, Research Question 1 has been addressed.

Comparison of BERT-, GPT-, OCLL-based AES, and linguistic-feature-based computation methods

This section aims to compare the effectiveness of five AES methods for nonnative Japanese writing, i.e. LLM-driven approaches utilizing BERT, GPT, and OCLL, linguistic feature-based approaches using Jess and JWriter. The comparison was conducted by comparing the ratings obtained from each approach with human ratings. All ratings were derived from the dataset introduced in Dataset . To facilitate the comparison, the agreement between the automated methods and human ratings was assessed using QWK and PRMSE. The performance of each approach was summarized in Table 11 .

The QWK coefficient values indicate that LLMs (GPT, BERT, OCLL) and human rating outcomes demonstrated higher agreement compared to feature-based AES methods (Jess and JWriter) in assessing writing proficiency criteria, including lexical richness, syntactic complexity, content, and grammatical accuracy. Among the LLMs, the GPT-4 driven AES and human rating outcomes showed the highest agreement in all criteria, except for syntactic complexity. The PRMSE values suggest that the GPT-based method outperformed linguistic feature-based methods and other LLM-based approaches. Moreover, an interesting finding emerged during the study: the agreement coefficient between GPT-4 and human scoring was even higher than the agreement between different human raters themselves. This discovery highlights the advantage of GPT-based AES over human rating. Ratings involve a series of processes, including reading the learners’ writing, evaluating the content and language, and assigning scores. Within this chain of processes, various biases can be introduced, stemming from factors such as rater biases, test design, and rating scales. These biases can impact the consistency and objectivity of human ratings. GPT-based AES may benefit from its ability to apply consistent and objective evaluation criteria. By prompting the GPT model with detailed writing scoring rubrics and linguistic features, potential biases in human ratings can be mitigated. The model follows a predefined set of guidelines and does not possess the same subjective biases that human raters may exhibit. This standardization in the evaluation process contributes to the higher agreement observed between GPT-4 and human scoring. Section Prompt strategy of the study delves further into the role of prompts in the application of LLMs to AES. It explores how the choice and implementation of prompts can impact the performance and reliability of LLM-based AES methods. Furthermore, it is important to acknowledge the strengths of the local model, i.e. the Japanese local model OCLL, which excels in processing certain idiomatic expressions. Nevertheless, our analysis indicated that GPT-4 surpasses local models in AES. This superior performance can be attributed to the larger parameter size of GPT-4, estimated to be between 500 billion and 1 trillion, which exceeds the sizes of both BERT and the local model OCLL.

Prompt strategy

In the context of prompt strategy, Mizumoto and Eguchi ( 2023 ) conducted a study where they applied the GPT-3 model to automatically score English essays in the TOEFL test. They found that the accuracy of the GPT model alone was moderate to fair. However, when they incorporated linguistic measures such as cohesion, syntactic complexity, and lexical features alongside the GPT model, the accuracy significantly improved. This highlights the importance of prompt engineering and providing the model with specific instructions to enhance its performance. In this study, a similar approach was taken to optimize the performance of LLMs. GPT-4, which outperformed BERT and OCLL, was selected as the candidate model. Model 1 was used as the baseline, representing GPT-4 without any additional prompting. Model 2, on the other hand, involved GPT-4 prompted with 16 measures that included scoring criteria, efficient linguistic features for writing assessment, and detailed measurement units and calculation formulas. The remaining models (Models 3 to 18) utilized GPT-4 prompted with individual measures. The performance of these 18 different models was assessed using the output indicators described in Section Criteria (output indicator) . By comparing the performances of these models, the study aimed to understand the impact of prompt engineering on the accuracy and effectiveness of GPT-4 in AES tasks.

Based on the PRMSE scores presented in Fig. 4 , it was observed that Model 1, representing GPT-4 without any additional prompting, achieved a fair level of performance. However, Model 2, which utilized GPT-4 prompted with all measures, outperformed all other models in terms of PRMSE score, achieving a score of 0.681. These results indicate that the inclusion of specific measures and prompts significantly enhanced the performance of GPT-4 in AES. Among the measures, syntactic complexity was found to play a particularly significant role in improving the accuracy of GPT-4 in assessing writing quality. Following that, lexical diversity emerged as another important factor contributing to the model’s effectiveness. The study suggests that a well-prompted GPT-4 can serve as a valuable tool to support human assessors in evaluating writing quality. By utilizing GPT-4 as an automated scoring tool, the evaluation biases associated with human raters can be minimized. This has the potential to empower teachers by allowing them to focus on designing writing tasks and guiding writing strategies, while leveraging the capabilities of GPT-4 for efficient and reliable scoring.

figure 4

PRMSE scores of the 18 AES models.

This study aimed to investigate two main research questions: the feasibility of utilizing LLMs for AES and the impact of prompt engineering on the application of LLMs in AES.

To address the first objective, the study compared the effectiveness of five different models: GPT, BERT, the Japanese local LLM (OCLL), and two conventional machine learning-based AES tools (Jess and JWriter). The PRMSE values indicated that the GPT-4-based method outperformed other LLMs (BERT, OCLL) and linguistic feature-based computational methods (Jess and JWriter) across various writing proficiency criteria. Furthermore, the agreement coefficient between GPT-4 and human scoring surpassed the agreement among human raters themselves, highlighting the potential of using the GPT-4 tool to enhance AES by reducing biases and subjectivity, saving time, labor, and cost, and providing valuable feedback for self-study. Regarding the second goal, the role of prompt design was investigated by comparing 18 models, including a baseline model, a model prompted with all measures, and 16 models prompted with one measure at a time. GPT-4, which outperformed BERT and OCLL, was selected as the candidate model. The PRMSE scores of the models showed that GPT-4 prompted with all measures achieved the best performance, surpassing the baseline and other models.

In conclusion, this study has demonstrated the potential of LLMs in supporting human rating in assessments. By incorporating automation, we can save time and resources while reducing biases and subjectivity inherent in human rating processes. Automated language assessments offer the advantage of accessibility, providing equal opportunities and economic feasibility for individuals who lack access to traditional assessment centers or necessary resources. LLM-based language assessments provide valuable feedback and support to learners, aiding in the enhancement of their language proficiency and the achievement of their goals. This personalized feedback can cater to individual learner needs, facilitating a more tailored and effective language-learning experience.

There are three important areas that merit further exploration. First, prompt engineering requires attention to ensure optimal performance of LLM-based AES across different language types. This study revealed that GPT-4, when prompted with all measures, outperformed models prompted with fewer measures. Therefore, investigating and refining prompt strategies can enhance the effectiveness of LLMs in automated language assessments. Second, it is crucial to explore the application of LLMs in second-language assessment and learning for oral proficiency, as well as their potential in under-resourced languages. Recent advancements in self-supervised machine learning techniques have significantly improved automatic speech recognition (ASR) systems, opening up new possibilities for creating reliable ASR systems, particularly for under-resourced languages with limited data. However, challenges persist in the field of ASR. First, ASR assumes correct word pronunciation for automatic pronunciation evaluation, which proves challenging for learners in the early stages of language acquisition due to diverse accents influenced by their native languages. Accurately segmenting short words becomes problematic in such cases. Second, developing precise audio-text transcriptions for languages with non-native accented speech poses a formidable task. Last, assessing oral proficiency levels involves capturing various linguistic features, including fluency, pronunciation, accuracy, and complexity, which are not easily captured by current NLP technology.

Data availability

The dataset utilized was obtained from the International Corpus of Japanese as a Second Language (I-JAS). The data URLs: [ https://www2.ninjal.ac.jp/jll/lsaj/ihome2.html ].

J-CAT and TTBJ are two computerized adaptive tests used to assess Japanese language proficiency.

SPOT is a specific component of the TTBJ test.

J-CAT: https://www.j-cat2.org/html/ja/pages/interpret.html

SPOT: https://ttbj.cegloc.tsukuba.ac.jp/p1.html#SPOT .

The study utilized a prompt-based GPT-4 model, developed by OpenAI, which has an impressive architecture with 1.8 trillion parameters across 120 layers. GPT-4 was trained on a vast dataset of 13 trillion tokens, using two stages: initial training on internet text datasets to predict the next token, and subsequent fine-tuning through reinforcement learning from human feedback.

https://www2.ninjal.ac.jp/jll/lsaj/ihome2-en.html .

http://jhlee.sakura.ne.jp/JEV/ by Japanese Learning Dictionary Support Group 2015.

We express our sincere gratitude to the reviewer for bringing this matter to our attention.

On February 7, 2023, Microsoft began rolling out a major overhaul to Bing that included a new chatbot feature based on OpenAI’s GPT-4 (Bing.com).

Appendix E-F present the analysis results of the QWK coefficient between the scores computed by the human raters and the BERT, OCLL models.

Attali Y, Burstein J (2006) Automated essay scoring with e-rater® V.2. J. Technol., Learn. Assess., 4

Barkaoui K, Hadidi A (2020) Assessing Change in English Second Language Writing Performance (1st ed.). Routledge, New York. https://doi.org/10.4324/9781003092346

Bentz C, Tatyana R, Koplenig A, Tanja S (2016) A comparison between morphological complexity. measures: Typological data vs. language corpora. In Proceedings of the workshop on computational linguistics for linguistic complexity (CL4LC), 142–153. Osaka, Japan: The COLING 2016 Organizing Committee

Bond TG, Yan Z, Heene M (2021) Applying the Rasch model: Fundamental measurement in the human sciences (4th ed). Routledge

Brants T (2000) Inter-annotator agreement for a German newspaper corpus. Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00), Athens, Greece, 31 May-2 June, European Language Resources Association

Brown TB, Mann B, Ryder N, et al. (2020) Language models are few-shot learners. Advances in Neural Information Processing Systems, Online, 6–12 December, Curran Associates, Inc., Red Hook, NY

Burstein J (2003) The E-rater scoring engine: Automated essay scoring with natural language processing. In Shermis MD and Burstein JC (ed) Automated Essay Scoring: A Cross-Disciplinary Perspective. Lawrence Erlbaum Associates, Mahwah, NJ

Čech R, Miroslav K (2018) Morphological richness of text. In Masako F, Václav C (ed) Taming the corpus: From inflection and lexis to interpretation, 63–77. Cham, Switzerland: Springer Nature

Çöltekin Ç, Taraka, R (2018) Exploiting Universal Dependencies treebanks for measuring morphosyntactic complexity. In Aleksandrs B, Christian B (ed), Proceedings of first workshop on measuring language complexity, 1–7. Torun, Poland

Crossley SA, Cobb T, McNamara DS (2013) Comparing count-based and band-based indices of word frequency: Implications for active vocabulary research and pedagogical applications. System 41:965–981. https://doi.org/10.1016/j.system.2013.08.002

Article   Google Scholar  

Crossley SA, McNamara DS (2016) Say more and be more coherent: How text elaboration and cohesion can increase writing quality. J. Writ. Res. 7:351–370

CyberAgent Inc (2023) Open-Calm series of Japanese language models. Retrieved from: https://www.cyberagent.co.jp/news/detail/id=28817

Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, Minneapolis, Minnesota, 2–7 June, pp. 4171–4186. Association for Computational Linguistics

Diez-Ortega M, Kyle K (2023) Measuring the development of lexical richness of L2 Spanish: a longitudinal learner corpus study. Studies in Second Language Acquisition 1-31

Eckes T (2009) On common ground? How raters perceive scoring criteria in oral proficiency testing. In Brown A, Hill K (ed) Language testing and evaluation 13: Tasks and criteria in performance assessment (pp. 43–73). Peter Lang Publishing

Elliot S (2003) IntelliMetric: from here to validity. In: Shermis MD, Burstein JC (ed) Automated Essay Scoring: A Cross-Disciplinary Perspective. Lawrence Erlbaum Associates, Mahwah, NJ

Google Scholar  

Engber CA (1995) The relationship of lexical proficiency to the quality of ESL compositions. J. Second Lang. Writ. 4:139–155

Garner J, Crossley SA, Kyle K (2019) N-gram measures and L2 writing proficiency. System 80:176–187. https://doi.org/10.1016/j.system.2018.12.001

Haberman SJ (2008) When can subscores have value? J. Educat. Behav. Stat., 33:204–229

Haberman SJ, Yao L, Sinharay S (2015) Prediction of true test scores from observed item scores and ancillary data. Brit. J. Math. Stat. Psychol. 68:363–385

Halliday MAK (1985) Spoken and Written Language. Deakin University Press, Melbourne, Australia

Hirao R, Arai M, Shimanaka H et al. (2020) Automated essay scoring system for nonnative Japanese learners. Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pp. 1250–1257. European Language Resources Association

Hunt KW (1966) Recent Measures in Syntactic Development. Elementary English, 43(7), 732–739. http://www.jstor.org/stable/41386067

Ishioka T (2001) About e-rater, a computer-based automatic scoring system for essays [Konpyūta ni yoru essei no jidō saiten shisutemu e − rater ni tsuite]. University Entrance Examination. Forum [Daigaku nyūshi fōramu] 24:71–76

Hochreiter S, Schmidhuber J (1997) Long short- term memory. Neural Comput. 9(8):1735–1780

Article   CAS   PubMed   Google Scholar  

Ishioka T, Kameda M (2006) Automated Japanese essay scoring system based on articles written by experts. Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, Australia, 17–18 July 2006, pp. 233-240. Association for Computational Linguistics, USA

Japan Foundation (2021) Retrieved from: https://www.jpf.gp.jp/j/project/japanese/survey/result/dl/survey2021/all.pdf

Jarvis S (2013a) Defining and measuring lexical diversity. In Jarvis S, Daller M (ed) Vocabulary knowledge: Human ratings and automated measures (Vol. 47, pp. 13–44). John Benjamins. https://doi.org/10.1075/sibil.47.03ch1

Jarvis S (2013b) Capturing the diversity in lexical diversity. Lang. Learn. 63:87–106. https://doi.org/10.1111/j.1467-9922.2012.00739.x

Jiang J, Quyang J, Liu H (2019) Interlanguage: A perspective of quantitative linguistic typology. Lang. Sci. 74:85–97

Kim M, Crossley SA, Kyle K (2018) Lexical sophistication as a multidimensional phenomenon: Relations to second language lexical proficiency, development, and writing quality. Mod. Lang. J. 102(1):120–141. https://doi.org/10.1111/modl.12447

Kojima T, Gu S, Reid M et al. (2022) Large language models are zero-shot reasoners. Advances in Neural Information Processing Systems, New Orleans, LA, 29 November-1 December, Curran Associates, Inc., Red Hook, NY

Kyle K, Crossley SA (2015) Automatically assessing lexical sophistication: Indices, tools, findings, and application. TESOL Q 49:757–786

Kyle K, Crossley SA, Berger CM (2018) The tool for the automatic analysis of lexical sophistication (TAALES): Version 2.0. Behav. Res. Methods 50:1030–1046. https://doi.org/10.3758/s13428-017-0924-4

Article   PubMed   Google Scholar  

Kyle K, Crossley SA, Jarvis S (2021) Assessing the validity of lexical diversity using direct judgements. Lang. Assess. Q. 18:154–170. https://doi.org/10.1080/15434303.2020.1844205

Landauer TK, Laham D, Foltz PW (2003) Automated essay scoring and annotation of essays with the Intelligent Essay Assessor. In Shermis MD, Burstein JC (ed), Automated Essay Scoring: A Cross-Disciplinary Perspective. Lawrence Erlbaum Associates, Mahwah, NJ

Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 159–174

Laufer B, Nation P (1995) Vocabulary size and use: Lexical richness in L2 written production. Appl. Linguist. 16:307–322. https://doi.org/10.1093/applin/16.3.307

Lee J, Hasebe Y (2017) jWriter Learner Text Evaluator, URL: https://jreadability.net/jwriter/

Lee J, Kobayashi N, Sakai T, Sakota K (2015) A Comparison of SPOT and J-CAT Based on Test Analysis [Tesuto bunseki ni motozuku ‘SPOT’ to ‘J-CAT’ no hikaku]. Research on the Acquisition of Second Language Japanese [Dainigengo to shite no nihongo no shūtoku kenkyū] (18) 53–69

Li W, Yan J (2021) Probability distribution of dependency distance based on a Treebank of. Japanese EFL Learners’ Interlanguage. J. Quant. Linguist. 28(2):172–186. https://doi.org/10.1080/09296174.2020.1754611

Article   MathSciNet   Google Scholar  

Linacre JM (2002) Optimizing rating scale category effectiveness. J. Appl. Meas. 3(1):85–106

PubMed   Google Scholar  

Linacre JM (1994) Constructing measurement with a Many-Facet Rasch Model. In Wilson M (ed) Objective measurement: Theory into practice, Volume 2 (pp. 129–144). Norwood, NJ: Ablex

Liu H (2008) Dependency distance as a metric of language comprehension difficulty. J. Cognitive Sci. 9:159–191

Liu H, Xu C, Liang J (2017) Dependency distance: A new perspective on syntactic patterns in natural languages. Phys. Life Rev. 21. https://doi.org/10.1016/j.plrev.2017.03.002

Loukina A, Madnani N, Cahill A, et al. (2020) Using PRMSE to evaluate automated scoring systems in the presence of label noise. Proceedings of the Fifteenth Workshop on Innovative Use of NLP for Building Educational Applications, Seattle, WA, USA → Online, 10 July, pp. 18–29. Association for Computational Linguistics

Lu X (2010) Automatic analysis of syntactic complexity in second language writing. Int. J. Corpus Linguist. 15:474–496

Lu X (2012) The relationship of lexical richness to the quality of ESL learners’ oral narratives. Mod. Lang. J. 96:190–208

Lu X (2017) Automated measurement of syntactic complexity in corpus-based L2 writing research and implications for writing assessment. Lang. Test. 34:493–511

Lu X, Hu R (2022) Sense-aware lexical sophistication indices and their relationship to second language writing quality. Behav. Res. Method. 54:1444–1460. https://doi.org/10.3758/s13428-021-01675-6

Ministry of Health, Labor, and Welfare of Japan (2022) Retrieved from: https://www.mhlw.go.jp/stf/newpage_30367.html

Mizumoto A, Eguchi M (2023) Exploring the potential of using an AI language model for automated essay scoring. Res. Methods Appl. Linguist. 3:100050

Okgetheng B, Takeuchi K (2024) Estimating Japanese Essay Grading Scores with Large Language Models. Proceedings of 30th Annual Conference of the Language Processing Society in Japan, March 2024

Ortega L (2015) Second language learning explained? SLA across 10 contemporary theories. In VanPatten B, Williams J (ed) Theories in Second Language Acquisition: An Introduction

Rae JW, Borgeaud S, Cai T, et al. (2021) Scaling Language Models: Methods, Analysis & Insights from Training Gopher. ArXiv, abs/2112.11446

Read J (2000) Assessing vocabulary. Cambridge University Press. https://doi.org/10.1017/CBO9780511732942

Rudner LM, Liang T (2002) Automated Essay Scoring Using Bayes’ Theorem. J. Technol., Learning and Assessment, 1 (2)

Sakoda K, Hosoi Y (2020) Accuracy and complexity of Japanese Language usage by SLA learners in different learning environments based on the analysis of I-JAS, a learners’ corpus of Japanese as L2. Math. Linguist. 32(7):403–418. https://doi.org/10.24701/mathling.32.7_403

Suzuki N (1999) Summary of survey results regarding comprehensive essay questions. Final report of “Joint Research on Comprehensive Examinations for the Aim of Evaluating Applicability to Each Specialized Field of Universities” for 1996-2000 [shōronbun sōgō mondai ni kansuru chōsa kekka no gaiyō. Heisei 8 - Heisei 12-nendo daigaku no kaku senmon bun’ya e no tekisei no hyōka o mokuteki to suru sōgō shiken no arikata ni kansuru kyōdō kenkyū’ saishū hōkoku-sho]. University Entrance Examination Section Center Research and Development Department [Daigaku nyūshi sentā kenkyū kaihatsubu], 21–32

Taghipour K, Ng HT (2016) A neural approach to automated essay scoring. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, Texas, 1–5 November, pp. 1882–1891. Association for Computational Linguistics

Takeuchi K, Ohno M, Motojin K, Taguchi M, Inada Y, Iizuka M, Abo T, Ueda H (2021) Development of essay scoring methods based on reference texts with construction of research-available Japanese essay data. In IPSJ J 62(9):1586–1604

Ure J (1971) Lexical density: A computational technique and some findings. In Coultard M (ed) Talking about Text. English Language Research, University of Birmingham, Birmingham, England

Vaswani A, Shazeer N, Parmar N, et al. (2017) Attention is all you need. In Advances in Neural Information Processing Systems, Long Beach, CA, 4–7 December, pp. 5998–6008, Curran Associates, Inc., Red Hook, NY

Watanabe H, Taira Y, Inoue Y (1988) Analysis of essay evaluation data [Shōronbun hyōka dēta no kaiseki]. Bulletin of the Faculty of Education, University of Tokyo [Tōkyōdaigaku kyōiku gakubu kiyō], Vol. 28, 143–164

Yao S, Yu D, Zhao J, et al. (2023) Tree of thoughts: Deliberate problem solving with large language models. Advances in Neural Information Processing Systems, 36

Zenker F, Kyle K (2021) Investigating minimum text lengths for lexical diversity indices. Assess. Writ. 47:100505. https://doi.org/10.1016/j.asw.2020.100505

Zhang Y, Warstadt A, Li X, et al. (2021) When do you need billions of words of pretraining data? Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Online, pp. 1112-1125. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.acl-long.90

Download references

This research was funded by National Foundation of Social Sciences (22BYY186) to Wenchao Li.

Author information

Authors and affiliations.

Department of Japanese Studies, Zhejiang University, Hangzhou, China

Department of Linguistics and Applied Linguistics, Zhejiang University, Hangzhou, China

You can also search for this author in PubMed   Google Scholar

Contributions

Wenchao Li is in charge of conceptualization, validation, formal analysis, investigation, data curation, visualization and writing the draft. Haitao Liu is in charge of supervision.

Corresponding author

Correspondence to Wenchao Li .

Ethics declarations

Competing interests.

The authors declare no competing interests.

Ethical approval

Ethical approval was not required as the study did not involve human participants.

Informed consent

This article does not contain any studies with human participants performed by any of the authors.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplemental material file #1, rights and permissions.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Li, W., Liu, H. Applying large language models for automated essay scoring for non-native Japanese. Humanit Soc Sci Commun 11 , 723 (2024). https://doi.org/10.1057/s41599-024-03209-9

Download citation

Received : 02 February 2024

Accepted : 16 May 2024

Published : 03 June 2024

DOI : https://doi.org/10.1057/s41599-024-03209-9

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

toefl writing essay

Test Resources

TOEFL® Resources by Michael Goodine

Toefl writing test evaluation and scoring service,  writing should be the easiest part of the toefl. ® .

Many students take the TOEFL again and again to get the writing score they need. This makes me sad because writing should be the easiest part of the test. Students just need proper feedback and guidance to overcome this problem.

 Don’t Worry.  I Can Help You.

Sign up for my TOEFL Writing evaluation service for:

  • Line-by-line corrections of your practice responses.
  • A ccurate score estimates.
  • Personalized advice on  how to get a higher score .
  • Complete  templates for both tasks.
  • Feedback within  48 hours .
  • No expiration dates – send your responses any time.
  • Submit both integrated essays and “writing for an academic discussion” responses .

Click the “learn more” links below for more information or scroll down a bit to read about my qualifications. Note that I also offer  speaking evaluations  and proofreading services for university applications.

toefl writing essay

“Thanks for your for your amazing feedback, Michael! I’m happy to let you know that I scored 113 (28 in writing). Yes, I got the required scores for a pharmacy license!”

– M.S.

One TOEFL Writing Test

Get Started!

  •   Writing Score Estimate
  •   Line-by-Line Corrections
  •   Personal Advice
  •   Feedback in 48 Hours

Two TOEFL Writing Tests

$20 per test!

Six TOEFL Writing Tests

$18 per test!

12 TOEFL Writing Tests

$17 per test!

Twenty TOEFL Writing Tests

$16 per test!  Wow!

Personal Lesson & Consultation

Meet the Teacher!

  •   2 Test Evaluations
  •   Corrections & Score
  •   60 minute Zoom Session
  •   Ask me Anything!

TOEFL Speaking Evaluations

  •   A Human Score
  •   A SpeechRater Score
  •   Detailed Advice

Who Will Check Your Work?

Me! My name is  Michael Goodine , and I am the founder of TOEFL Resources.  I personally check every essay.  Here’s some information about me:

  • I’ve been helping students prepare for the TOEFL for about a decade, and in that time I have read more than  twenty-five thousand  TOEFL essays from students.  I know what works, and I know what doesn’t work.  My students have entered Ivy League business schools, have qualified for pharmacy and dental licenses, and have gained admission to undergraduate programs all over the world.  I work with all kinds of people!
  • I hold a degree in English Literature from beautiful  Dalhousie University in Canada. 
  • I have received  TOEFL teacher training from ETS , the creators of the TOEFL.  I update this training every year.  You can see my ETS certificates from April 2019 ,   December 2019 ,  November 2020 , January 2021 , and June 2022 . 
  • I am obsessed with TOEFL writing, and have written several research articles about how the test is scored.  Check out my  detailed study of the e-rater software   and an  update from 2020 .
  • I worked as a researcher and textbook writer (TOEFL & TOEIC) for  Hackers Education Group , one of the largest publishers of TOEFL books in Asia.
  • My YouTube Channel, Test Resources , has more than 73,000 subscribers. 

I like to think I’m an expert when it comes to the TOEFL writing section.  I want to use that expertise to help you reach your goals.

What are People Saying about the Service?

I would like to share my happiness with you as I finally passed the TOEFL exam with a total score of 94 and you were helpful in an amazing way. I appreciate your comments which have transformed my writing to achieve 22 . Thank you so much , I will definitely recommend you to my friends. -B.M
I just received my scores (28/29/29/29). Thank you so much for you help! Your tips gave me enough confidence to push through this difficult test! -F.B
Thank you for your help. I finally got 26 in writing with an overall score of 107. Your help was great. -S.K, Dentist
I just want to share some good news with you, I got R29 / L26 / S23 / W25 on my last TOEFL test, and that’s enough for me. Many thanks for your help! You really offer effective training. An expert like you will never be replaced by AI models! -G.S, China

( more testimonials )

Note that TOEFL is a registered trademark of ETS. This service is not endorsed or approved by ETS .

IMAGES

  1. TOEFL

    toefl writing essay

  2. essay samples for toefl

    toefl writing essay

  3. Free Sample TOEFL Essay

    toefl writing essay

  4. How to Write a TOEFL Integrated Essay

    toefl writing essay

  5. Writing Toefl Template

    toefl writing essay

  6. TOEFL Essay 011_020

    toefl writing essay

VIDEO

  1. TOEFL Writing: Five things you need to know about the NEW writing question

  2. TOEFL Review Wednesdays: Structure and expand like a pro

  3. TOEFL Writing: Integrated Essay

  4. TOEFL Writing practice test 2 with answers, New version (2023)

  5. TOEFL Writing practice test 10, New version (2023)

  6. TOEFL Writing practice test 20 with answers, New version (2023)

COMMENTS

  1. TOEFL® Writing Practice Questions (2024 Update)

    About these TOEFL ® Writing Samples Practice TOEFL Integrated Essays. On test day, you'll first get a TOEFL Integrated Writing Question. You will first read an article, then listen to a lecture, and finally write an essay using details from both. Below are some practice questions you can use to get ready for the test.

  2. TOEFL Writing Sample Essays

    TOEFL Writing Introduction. The writing section is the fourth section of the TOEFL iBT test and takes about 50 minutes to complete. It consists of two separate tasks: an Integrated Writing task and an Academic Discussion Writing task.The Independent task requires you to write an opinionated essay based on your own knowledge and experience, whereas the Integrated task requires you to write a ...

  3. TOEFL iBT Test Writing Section

    The TOEFL iBT test Writing section measures your ability to write in English in an academic setting, and to present your ideas in a clear, well-organized way. There are two writing tasks. Integrated writing task (20 minutes) — read a short passage and listen to a short lecture, then write in response to what you read and listened to.

  4. 2 Perfect-Scoring TOEFL Writing Samples, Analyzed

    Below is an official TOEFL Integrated Writing sample question and as well as an essay response that received a score of 5. It includes a written passage, the transcript of a conversation (which would be an audio recording on the actual TOEFL, and the essay prompt. After the prompt is an example of a top-scoring essay.

  5. Sample Essays for the Writing Section of the TOEFL Test ( )

    Here's an expert TOEFL teacher's sample essay to this particular TOEFL Writing topic. The article introduces the topic of corn-based ethanol. More specifically, the writer discusses the advantages of switching from fossil fuels to this alternative energy source. The lecturer in the listening passage disagrees.

  6. Complete TOEFL Essay Templates (2024 Update)

    TOEFL essay templates can help you answer both of the TOEFL writing questions. To write a strong TOEFL essay just fill in the blanks with the required information from your notes (in the integrated essay) or from your ideas (the writing for an academic discussion task). Note that I also have a set of TOEFL speaking templates.

  7. How to Ace the TOEFL Writing Section: 7 Expert Tips

    After you complete the exam, your essays will be graded by several (typically four) graders. Each essay will receive a score from 0-5. The sum of those two scores will then be scaled to a score from 0-30, which is your official Writing score. The Writing section makes of 25% of your total TOEFL score (from 0-120).

  8. Master the TOEFL Writing Section in 2024

    The writing section is the final part of the TOEFL ® test. You'll have about 30 minutes to answer two writing questions. They are known as the TOEFL Integrated essay, and the TOEFL Writing for an Academic Discussion Task.You'll be graded based on your content, organization, grammar and language use. Below are links to my best stuff, or read on for descriptions of each task.

  9. PDF TOEFL iBT® Writing Practice Questions

    Writing Practice Set 2 (Independent): Question. Directions: Read the question below. Give yourself 30 minutes to plan, write, and revise your essay. Typically, an effective response will contain a minimum of 300 words.

  10. 13 TOEFL Writing Topics to Help You Practice for the Exam

    For the Independent Writing task, you'll receive a question on a particular topic or issue. You'll need to write a response to that topic that explains your opinion, and you'll also need to give reasons and examples that support your opinion. Writing Time: 30 minutes. Suggested Essay Length: At least 300 words.

  11. The Best TOEFL Writing Practice: 300+ Topics to Study With

    Overview of TOEFL Writing. The TOEFL Writing section is 50 minutes long (broken into two parts) and contains two tasks: Integrated Writing and Independent Writing. You'll type both essays on the computer. The Integrated Writing task requires you to use listening, reading, and writing skills.

  12. Ten Awesome Tips for the Writing Section of the TOEFL® Test

    Here's a quick summary of our top 10 TOEFL Writing Tips. Tip #1 - Pay careful attention to the reading passage because the listening passage will argue against each point made in the reading. Tip #2 - Practice taking short, but precise notes during the listening passage. You will only get to listen to it once.

  13. TOEFL sample essays: Tips and techniques for a high score

    The TOEFL exam is a standardised test that measures a non-native English speaker's ability to use and understand English at the university level. One of the components of the TOEFL exam is the writing section, which requires test-takers to write an essay in response to a prompt. To score well on the writing section, it is essential to understand the requirements of the exam and the ...

  14. Good Luck TOEFL

    end with a memorable conclusion. Try to write an essay each day. The added practice will enable you to complete a strong essay, with ease, confidence, and speed, on the day of your TOEFL exam and throughout your life. Free Sample TOEFL Essay #1. "Doing Unenjoyable Things". Free Sample TOEFL Essay #2. "Money & Success". Free Sample TOEFL Essay ...

  15. TOEFL Sample Essays: Writing Topics & Samples

    Among the English proficiency tests TOEFL is a widely accepted one. It aims to assess candidates on four parameters, i.e. Reading, Writing, Speaking and Listening. Essays are a part of the Writing section of the exam. Also, according to the recent changes in the TOEFL exam pattern, the most fundamental change has happened in the writing section.

  16. Good Luck TOEFL

    The lists of TOEFL Independent Writing topics below can help you gain a better understanding of the type of essay you could be asked to write on your exam. Rather than give you one long confusing list of topics, the essay topics have been divided into five categories. This is not an exhaustive list, but only a representative sample of essay topics.

  17. How to Write a Fantastic TOEFL Integrated Essay (2023)

    It is the first writing task on the TOEFL test. First, you will have three minutes to read an article (four paragraphs, 250 to 300 words) about an academic topic. Next, you will listen to a lecture (about 2 minutes) about the same topic. Finally, you will have 20 minutes to write an essay about the relationship between the two sources.

  18. How to Score 26+ in TOEFL Writing (Independent Essay)

    A quick guide to scoring 26 points or more in the TOEFL independent writing section. Includes an easy template, sample question and complete sample essay.My...

  19. The Best TOEFL Writing Templates for Any Prompt

    TOEFL essay templates are an especially useful component of TOEFL prep because they raise your chance of achieving a high TOEFL Writing score. With TOEFL essay templates, you can organize your thoughts more clearly, spend less time outlining your responses, and prepare for the TOEFL with increased confidence.

  20. Types of Essay in TOEFL: Check Samples and Important Questions

    The TOEFL exam contains various essay types that students should familiarize themselves with. Within the essay section, there are primarily two types of tasks: independent writing and integrated writing. The integrated writing task involves reading a passage and listening to an audio, which may aid in elucidating or supplementing the text.

  21. TOEFL Integrated Writing Task 2023

    The TOEFL integrated writing task is the first of the two writing tasks in the TOEFL exam. It's designed to assess your ability to combine listening and reading skills to write a coherent and well-structured essay. Here's a detailed breakdown: Reading passage: A passage around 200-250 words long is provided.

  22. Master the TOEFL Independent Essay (2023)

    TOEFL Independent Writing Master Guide (2023 Update) Here's how the TOEFL Independent writing question works: It is the second writing question on the TOEFL Test. You will be asked a question about your personal opinion and be given 30 minutes to write about it. Questions can be about any topics, but they are most often about education, work ...

  23. Design Framework for the TOEFL Primary Writing Test TOEFL ELP AD AES YL

    Abstract. The TOEFL Primary tests, designed as international English language proficiency assessments, have been widely used in over 120 countries since 2013. The tests are primarily intended to support the teaching and learning of young language learners aged 8 and older. Until recently, the TOEFL Primary tests measured only reading, listening ...

  24. Essays for IELTS, PTE, TOEFL 4+

    ‎Introducing 'Essays for IELTS, PTE, TOEFL' - the quintessential writing companion designed to elevate your performance in various exams. Tailored for excellence, this app is your go-to resource for mastering the art of essay writing. - Exam-Centric Mastery: 'Essays for IELTS, PTE, TOEFL' caters to…

  25. Essay on My Hobbies For School Students

    Essay on My Hobbies in 500 Words. Hobbies play a crucial role in shaping our lives by providing a necessary balance between work and leisure. They enrich our existence, offer avenues for personal growth, and foster a sense of accomplishment. Among various hobbies, my favourite is table tennis, a sport that I am deeply passionate about.

  26. Sample TOEFL Agree or Disagree Essay

    This essay follows our TOEFL Writing templates for independent essays. Note that we have many more sample essays for you to read. Special Offer: TOEFL Essay Evaluation and Scoring. You can now sign up to have your practice essays evaluated and scored by me! I'll check your work line-by-line, estimate your score and tell you how to do better ...

  27. Applying large language models for automated essay scoring for non

    A dataset comprising a total of 1400 essays from the story writing tasks was collected. ... conducted a study where they applied the GPT-3 model to automatically score English essays in the TOEFL ...

  28. TOEFL Writing Test Evaluation and Scoring Service

    Don't Worry. I Can Help You. Sign up for my TOEFL Writing evaluation service for: Line-by-line corrections of your practice responses. Accurate score estimates. Personalized advice on how to get a higher score. Complete templates for both tasks. Feedback within 48 hours. No expiration dates - send your responses any time.