Hi @bliszewski -- We have also found that Canvas data is large, difficult to work with, and doesn't really provide much insight into actual student learning behaviors despite many people's fervent hope that it will do so (at least on an enterprise level). That said, we do push our web_logs data to a Redshift table (I think we have a little under three billion records so far for CD2 web_logs). Because we are pulling down the data to an S3 bucket first, we still have all of the downloaded files (CD2 and CD1), so in that sense we have retained all of the data. It's a lot of data, but S3 storage is not a huge expense in the grand scheme of things.
In terms of "learning analytics" using this data, because we have so much variety in terms of how instructors design their course spaces and the extent to which they use various Canvas tools, most of what we report on from web_logs might just be considered "system usage". For example, how many unique users (i.e., unique user_ids) showed activity (i.e., records in web_logs) each day over the course of a semester.
Even with non-web_logs tables, I think that, in many cases, the data tells us more about course requirements than anything else. For example, if an instructor requires their students to respond to at least three posts by other students in the discussion, you will likely see most students dutifully responding to three posts. In other words, the data will provide some insight into student compliance, but will likely not shed light on the kind of questions that educators really care about, such as how students interact with new ideas, co-construct knowledge, etc. (I'm speaking at a general level -- I do think the data become more useful when you are working with a specific instructor who is able to provide context and may have specific expectations of usage, e.g., after a course redesign.)
Also, I don't know anything about specific state policies, but I would hope that something like web_logs would not be considered academic data that needs to be maintained. (But I am not a lawyer or state official.)
Just my two cents.
Best,
Martyn