Hey @mclark19 ! This is an interesting question. This hadn't jumped out at us in any of our web_logs queries. I just did a few queries and found:
- 1.89% of our web_logs records today have our shard id as value.course_id (as a course ID, the number belongs to a course deleted before CD2 web_logs came into production);
- over 99% of the web_logs records with our shard id as value.course_id have value.web_application_controller 'files', value.web_application_action 'show_relative', and value.http_status 302;
- the urls in relevant web_logs I spot checked have the form /courses/[shard_id]~[course_id]/files/[shard_id]~[attachment_id]/[folder_path]/[filename]?download_frd=1&sf_verifier=[verifier];
- the records have null value.user_id.
It is odd that the ETL of these logs puts shard id as course_id even though the course ID is in the URL field. I think these log records are basically noise that follow after requests for attachments that have been copied at least once, hence the HTTP status.