We've been doing some validation across the dataset as a whole, and one thing that has jumped out is that the various free-form text fields are truncated to 255 (or sometimes 256) characters at the schema level. Some of the work that our researchers have done on the edX dataset in the past has involved analyzing forum posts (running tone analysis, for example, and correlating that to outcomes or retention). Obviously that requires the full text of a post, message, etc. Would it be possible to add the full body of any free-text fields to the dataset, rather than just the first 255 characters?
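For anyone who wants to run a similar check, here is a minimal sketch of how the truncation might be spotted in an exported column. It assumes the values are already loaded as Python strings; `truncation_report` and `SUSPECT_LENGTHS` are hypothetical names for illustration, not part of any Canvas Data tooling. Note that `len()` counts characters, while Redshift declares VARCHAR lengths in bytes, so multi-byte text may be cut off at fewer characters than the limit.

```python
from collections import Counter

# Lengths at which the text fields appear to be cut off (assumption taken
# from the 255/256 figure mentioned in the post above).
SUSPECT_LENGTHS = (255, 256)


def truncation_report(values, suspect_lengths=SUSPECT_LENGTHS):
    """Summarize how many values sit exactly at a suspect length.

    Returns a tuple (at_limit, over_limit, total), where:
      at_limit   = values whose length equals one of suspect_lengths
      over_limit = values longer than the largest suspect length
      total      = all non-empty values examined
    """
    # Empty strings and None are skipped; they tell us nothing about truncation.
    lengths = Counter(len(v) for v in values if v)
    at_limit = sum(lengths[n] for n in suspect_lengths)
    over_limit = sum(c for n, c in lengths.items() if n > max(suspect_lengths))
    total = sum(lengths.values())
    return at_limit, over_limit, total


if __name__ == "__main__":
    sample = ["short post", "x" * 255, "y" * 256, "", None]
    at_limit, over_limit, total = truncation_report(sample)
    print(f"{at_limit} of {total} values at the limit, {over_limit} above it")
```

A column where many values land exactly on the limit and none exceed it is a strong hint that the text was cut off at the schema level rather than simply being short.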
We were told that is a Redshift limitation - that we would need to go to the files to get the full text over 256 characters. I'm not sure if that has changed at all...
I believe this is still correct, @rubyn
Looking into it further: Character Types - Amazon Redshift
It looks like VARCHAR can hold more than 256 characters. Is this something that Instructure might change in the future so that entire comment fields will be stored in the database?
This discussion post is outdated and has been archived. Please use the Community question forums and official documentation for the most current and accurate information.