How to Count Student Discussion Posts

James
Community Champion
58
54873

Updated November 29, 2016, to reflect Version 3

The spreadsheet now processes group discussions. They take a lot longer to process since you have to make calls for each group.

If you were a user of the older version and want to upgrade to support discussions, you'll need to make a new copy of the spreadsheet. New downloads will automatically get the new version. If you are a user of version 2 and don't use group discussions, then you do not have to upgrade to version 3.

Updated December 31, 2015, to reflect Version 2

Introduction

There are several places in the community where people have asked about counting student posts within discussions:

Basically, they want some form of a cross-tabulation of student and discussion, with the values containing the number of posts made by a particular student for a particular discussion.

149458_pastedImage_2.png

The discussion counts are real, the names have been anonymized using random suggestions from NameThingy.

Background

This can be done through the API. You get a list of students, you get a list of discussions, you then iterate through each discussion topic, getting the full view for the discussion topic, and count how many times each combination of student and discussion occurred.

There has also been a lot of hype discussion recently about using Google Sheets to obtain information. APIs are difficult for the non-programmer and so this leaves out most teachers who would like the information like this report provides. As I answer questions in the community, it seems a lot of them explain how to do something but then I feel I've lost people because it's too complicated for the masses. Putting it into a Google Spreadsheet that someone could copy, specify a few parameters, and then run themselves is a great way to make it available to more people.

I've written my own Canvas API libraries, first in PERL and then in PHP. They're not complete, they have the functionality I needed to accomplish the task, and they're heavily tailored to the needs and setup of our institution. In other words, they're not easy to share.

Google Sheets uses a form of Javascript, and I'm not a Javascript programmer by any stretch of the mind. I can figure stuff out, but a lot of it is knowing what I want to accomplish and then looking it up in the documentation to figure out how to code it in Javascript. For experienced programmers who can write this stuff, you'll take a look at it and go "What school did he graduate from? Let's revoke their accreditation!" With that disclaimer out of the way, I didn't see any professional programmers contributing these things to the community, so I thought I would go ahead and put it out there.

Instructions

So I am making available a Google Spreadsheet that people can use until something better comes along.

  1. Make sure you're logged into Google Drive and then open the spreadsheet Discussion Counts
  2. Go to File and choose Make a Copy. Name it whatever you like.
  3. There will be a new menu item after Help called Canvas. This is where you need to do everything.
  4. A one-time setup is necessary to authorize Google to access your Canvas information. Go to Canvas > Configure API Settings. These will get stored in the Google userProperties(), which is specific to this spreadsheet and your user account and so you can share this file with others without sharing your credentials.
    1. Authorize Google to access your Canvas information.
    2. Enter your Canvas hostname. The easiest way is probably just to copy/paste a URL from your course into the box. Alternatively, you may enter just the hostname like richland.instructure.com or if your site ends in instructure.com, you may just enter the first part like richland.
    3. Enter your Access Token. If you don't have an access token, then see How do I obtain an API access token for an account?
  5. Choose Canvas > Specify Course and enter the Canvas Course ID for your course. This is the number that follows the /course/ portion in your course homepage URL. Alternatively, you may just copy/paste the URL from a page in your course.
  6. Choose Canvas > Refresh All Discussions and wait. For my course with 42 students, 14 discussions, and about 2000 posts, it took about 45 seconds.

You can also watch the video I made that demonstrates this.

Changes since Version 1

Version 1 of the spreadsheet downloaded and counted the information itself and you had to edit the code if you wanted to make changes like including instructors. Version 2 creates a pivot table and you can just change the filters to specify what roles or status to include.

This is made possible by actually storing the data in the spreadsheet rather than counting it internally within memory. You'll see a page called Data that looks like this after you run the script.

149810_pastedImage_15.png

You'll notice that the UserID and DiscussionID are stored as numbers as well. This is to help eliminate ambiguity if you have to items with the same name. There is also a CourseID and CourseName field. This would allow you to gather information for more than one course. The normal process would be to just make multiple copies of the spreadsheet. One possible use would be if you're documenting student learning outcomes and want a single file. You could also add a pivot table filter for the course if you like.

There is a menu item called Append New Discussions that would need to be used if you have multiple courses in the same file. It's also intended to be a quick way to add new discussions to the count without going through and refreshing the entire discussion. You may find this helpful if you grade throughout the course rather than waiting until the end. Note that this option will not check any previously-scanned discussions for new posts, so it's not appropriate if you have on-going discussions throughout the course.

There is a menu item called Forget API Settings that will erase all your personal information from the User Properties within Google Sheets. This should not be necessary with the new interface, and User Properties are unique to each user and each spreadsheet, so sharing the spreadsheet with someone should keep your credentials safe. This option was primarily for my testing, but I left it in there for people who want to make sure their credentials are not shared.

Version 1 also failed to properly count nested discussions. It would count top level posts and replies to top level posts, but not replies to replies. That has been corrected. When I wrote the original code, I was just learning how to use Google Sheets and program JavaScript and this version incorporates a lot of what I've learned in the 5 months since then. If you never saw version 1, you won't really appreciate how much easier version 2 is.

Google is deprecating support for their default (native) sandbox mode on June 16, 2016. Version 2 uses the new mode (iframe) so it should continue to work after that date.

Original Disclaimers from July 31, 2015

I've only tested this with two courses. Both were using threaded discussions. Your mileage may vary, but it works for me.

I was intending to use this as a framework for other projects because it seems a lot of my responses in the Community are "Yes, you can do that through the API and here's how" and then I write a really technical explanation that may never get implemented. But if there was an easy way to share it, like a Google Spreadsheet, it might encourage people more.

This is, however, my first time sharing code like this and I really have no idea if I did it right or not. I'm also not a Javascript programmer, so I basically spent a lot of time looking at the Google Apps Scripts codes and then the Mozilla Javascript documentation. I originally decided I would look at someone else's code and then found it too complicated for me to follow, so I wrote my own from scratch. I didn't do the sums through the spreadsheet, I found them by hand within my code. It definitely could be improved upon, but it's a first attempt. The problem is that I tend to be a perfectionist and so things never get done because they're not perfect. I'm stepping out of my comfort zone here, so hopefully the code works for you.

Only the GET functionality of the API has been tested. It might work, but until I have a need for a POST, PUT, or DELETE, I won't know for sure.

People can do what they would like with the code, but it would be really awesome if someone who did know Javascript or Google Apps wanted to collaborate and improve it. I would like for there to be a single copy of the canvasAPI.gs file that is reasonably up to date and then modify the other script to do the actual processing. The only thing I ask is that you don't sue me if it doesn't work.

58 Comments
DeletedUser
Not applicable

Hi @DR_EDR Do you need any more help and maybe if I know the situation better I can understand. Thanks for posting it for us to help you!

James
Community Champion
Author

@DR_EDR 

I've found the most reliable thing is to copy/paste a Canvas URL from your browser when it asks for the hostname. I've had issues with some of my early scripts not working if I type richland, but they seem to with the full address.  Make sure you're using your Canvas instance and not richland.instructure.edu (mine) -- pasting the URL in does that.

If you do not get the confirmation screen after the API configuration, it did not work and nothing else will work.

There are some institutions using a proxy that keeps the script from working. Indiana University was the first one I found out about, but I imagine there are others.

DR_EDR
Community Explorer

Thank you so much, @James - firstly, for designing this script, and secondly for trying to help me troubleshoot it.

I have been using Discussion Counts for years, and it's exactly what I need for my courses.

Unfortunately, I have tried all variations on this Canvas URL/instance/hostname theme, and none are working. And yes, I'm using my university's name, not yours 😂

Just to be fully clear about what's going on:

After I make a new copy of the Discussion Counts spreadsheet, I go to Canvas>Configure API & at that point I get a window saying "authorization required". I hit continue. Then google asks me for permissions, which I allow.

It's after that, when I go back to Canvas>Configure API, and enter the relevant API info, that it bricks.

It's possible that my IT department has set up a proxy. I will contact them to ask for help, as well.

James
Community Champion
Author

@DR_EDR 

I just went through the whole process again and made a new video of the entire process. I discovered a mistake in my previous statement -- when you specify the hostname, do NOT paste a URL from the course, but from your dashboard. That is, do not include the /courses/xxx at the end.

One other thing I thought of is to ask whether you're using a school Google account or a personal one? Some institutions like to put in security restrictions that prevent outside material from executing. When people use their personal accounts, then it works. That is more likely than setting up a proxy.

 

DR_EDR
Community Explorer

Success!

Thank you so much, James.

I used my personal g-mail account instead of my work one & it worked. I also used Chrome, not Firefox, as someone else suggested.

I guess my institution had put in security restrictions that prevent outside material from executing?

Anyway, this will help tremendously in grading my courses.

I had almost given up & started to count manually. Sort of like my poor old mum who once 'artisanally'  counted the words in a MS Office document because she didn't know that there was a word count feature 😂

DrAJP
Community Participant

These steps are so helpful! This feature (and so many other things!) should be core, built-in feature of Canvas. 

hmacdermott
Community Participant

I've used this fabulous tool/hack for several years, but when I duplicate the Discussion Counts spreadsheet, I don't see the Canvas tab. I'm using Chrome and I'm logged in with a person Google account. 

What am I doing wrong?

hmacdermott
Community Participant

I can't delete my previous post. I don't know why I can't successfully duplicate a version of Discussion Counts to work, but since I've used this in the past, I updated an existing spreadsheet by updating the token and course URL and that works just fine.