spoken_monologue
  • Released in 2015
  • 1,100 subjects
  • 4,400 samples
  • Approx. 500,000 tokens

spoken_dialogue
  • Released in 2020
  • 425 subjects
  • 4,250 samples
  • Approx. 1,600,000 tokens

edited_essays
  • Released in 2017
  • 320 subjects
  • 640 samples (edited)
  • Approx. 150,000 tokens (edited)

written_essays
  • Released in 2013
  • 2,800 subjects
  • 5,600 samples
  • Approx. 1,300,000 tokens