
process all courses' assignments; misc. other cleaning up (iss. #6, #7, #8, #10) #11

Merged · 56 commits · May 4, 2023

Conversation

@lsloan (Member) commented Feb 10, 2023

Resolves #6
Resolves #7
Resolves #8
Resolves #10

Test Plan

  1. Follow the steps in README.md.

  2. When configuring the .env file, set it up with Canvas-test and use the following for course IDs:

    • COURSE_IDS_CSV=545451
      Uses "ECON 101 200 FA 2022", which contains three peer-reviewed assignments.
    • COURSE_IDS_CSV=200,545451,461474
      Adds other courses, which may or may not include peer-reviewed assignments.

Significantly newer Django versions are available, but their support timelines are shorter than that of the release currently in use. It makes more sense to upgrade to the latest patch version of the current release.
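For example, staying on the current release line while picking up patch releases could be expressed with a version-range pin like the following (the version numbers are illustrative, not the project's actual ones):

```
# requirements.txt — stay on the current release line, patch releases only (illustrative)
Django>=3.2,<3.3
```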
@lsloan lsloan self-assigned this Feb 10, 2023
@lsloan lsloan changed the title process all courses' assignments; misc. other cleaning up (iss. #6, #7) process all courses' assignments; misc. other cleaning up (iss. #6, #7, #8, #10) Feb 10, 2023
@lsloan lsloan marked this pull request as draft February 10, 2023 15:53
@lsloan lsloan changed the title process all courses' assignments; misc. other cleaning up (iss. #6, #7, #8, #10) 🚧 DRAFT 🚧 - process all courses' assignments; misc. other cleaning up (iss. #6, #7, #8, #10) Feb 10, 2023
@lsloan (Member, Author) commented Feb 10, 2023

Implementing #6 is more complicated than I expected. Trying to save peer reviews for some of the other assignments in the test course is revealing more abnormalities in the Canvas data than expected.

lsloan added 18 commits March 24, 2023 10:50
Trying to switch from looking ahead for the assessor/submission to handling the exception caused by missing ones when the assessment is saved.
This should eliminate problems with saving submissions/assessments/comments associated with the test_student that cause missing user errors.
Skipping untyped submissions seems to be too aggressive. It turns out that some of those submissions DO receive peer reviews. Just because I don't know what the type is doesn't mean that Canvas can't handle it or that users can't successfully interact with it.
Also continue with next assessment when one is missing its assessor or submission.
@lsloan lsloan changed the title 🚧 DRAFT 🚧 - process all courses' assignments; misc. other cleaning up (iss. #6, #7, #8, #10) process all courses' assignments; misc. other cleaning up (iss. #6, #7, #8, #10) Mar 29, 2023
@lsloan lsloan requested review from ssciolla and zqian March 29, 2023 19:54
@lsloan lsloan requested a review from ssciolla April 21, 2023 20:41
@lsloan (Member, Author) commented Apr 27, 2023

@ssciolla, @zqian: If you get a moment to review this PR again, I'd appreciate it. It's probably good enough for an approval now, as I've created issues for some of the points that were raised earlier. I could merge it tomorrow. (I'm away today.)

@ssciolla (Contributor) commented:

@lsloan, hi, sorry, it's been a busy week. I am off tomorrow. I probably won't be able to finish reviewing this until Monday. I'll get to it as soon as I can.

ssciolla previously approved these changes May 2, 2023

@ssciolla (Contributor) left a comment:


Sorry for the delay, @lsloan. This is a pretty meaty pull request, and I hadn't really seen this code before, so I basically felt like I was reviewing the whole project.

I'm approving, with some comments for further thought or future work. It would at least be good to see the GitHub Action fixed or removed before you merge. I don't know with 100 percent certainty that this is working exactly as intended, but it did collect data, and in the poking around I did, things made sense. Thus I think it's okay to merge and work iteratively on improvements. It looks like some type hints could be tightened up, but I understand that's difficult in some cases, since canvasapi doesn't seem to have them.

A couple more general suggestions:

  1. You may want to add a knob for changing the level of canvasapi log messages, since they can be really noisy, like Django's. Something like this should work in settings.py:

     ```python
     'loggers': {
         'django': {
             'handlers': ['console'],
             'level': os.getenv('LOG_LEVEL', 'INFO').upper(),
             'propagate': False
         },
         'canvasapi': {
             'handlers': ['console'],
             'level': os.getenv('CANVAS_API_LOG_LEVEL', 'WARN').upper()
         }
     }
     ```
  2. Part of the difficulty in reviewing this is not knowing how to test the results. To what do I compare the data? What queries ensure that we can do the analysis we need later? I'm not sure what kind of tests this application needs, but selective unit tests and/or the ability to programmatically run certain queries could be useful. Here are some queries I ended up writing to do some of this work (in case they're helpful):
```sql
-- Courses, assignments, and submissions
select *
from course c
join assignment a
	on a.course_id=c.id
join submission s
	on s.assignment_id=a.id;

-- Assessor gave to submitter these comments about their work in this course
select
	assessor.login_id as `assessor_login_id`,
	submitter.login_id as `submitter_login_id`,
	cm.comments,
	s.id as `submission_id`
from course c
join assignment a
	on a.course_id=c.id
join submission s
	on s.assignment_id=a.id
join `user` submitter
	on s.user_id=submitter.id
join assessment am
	on am.submission_id=s.id
join `user` assessor
	on am.assessor_id=assessor.id
join comment cm
	on cm.assessment_id=am.id
where c.id='XXXXX';

-- Count peer review comments given by each student in a course
select
	assessor.login_id as `assessor_login_id`,
	count(*)
from course c
join assignment a
	on a.course_id=c.id
join submission s
	on s.assignment_id=a.id
join `user` as submitter
	on s.user_id=submitter.id
join assessment am
	on am.submission_id=s.id
join `user` assessor
	on am.assessor_id=assessor.id
join comment cm
	on cm.assessment_id=am.id
where c.id='XXXXX'
group by assessor.id;
```
  3. This process took about two and a half minutes for the large course you suggested we look at. I'm not sure how much we will scale this process, but it's possible we'll reach a point where performance is an issue. Adopting some async/await or doing bulk database actions could improve the time to complete. The async might become less relevant if we switch to Canvas Data 2 queries, but something like Django's bulk_create would drastically reduce the number of database calls currently made.
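The bulk_create idea in point 3 could be sketched as follows. This is only an illustration, not the project's code: the `batched` helper is hypothetical, and `model_cls` stands in for any of this project's Django models (e.g. Submission).

```python
from itertools import islice


def batched(iterable, size):
    """Yield lists of up to `size` items from `iterable`."""
    it = iter(iterable)
    while chunk := list(islice(it, size)):
        yield chunk


def save_in_bulk(model_cls, rows, batch_size=500):
    """Insert rows with one multi-row INSERT per batch via Django's
    Model.objects.bulk_create, instead of one INSERT per row via save()."""
    created = 0
    for chunk in batched(rows, batch_size):
        objects = [model_cls(**row) for row in chunk]
        model_cls.objects.bulk_create(objects, batch_size=batch_size)
        created += len(objects)
    return created
```

With batches of 500, saving 10,000 submissions would make roughly 20 database round trips rather than 10,000.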

Files with review comments (resolved): config.py, peer_review_data/main.py, .github/workflows/pycodestyle.yaml, peer_review_data/models.py
@lsloan (Member, Author) commented May 3, 2023

@ssciolla, thanks for the approval and further comments. I'll review them closely tomorrow morning before merging.

@lsloan (Member, Author) commented May 3, 2023

@ssciolla, thanks again for the thoughtful, thorough code review. I really appreciate it. I understand the delay.

Yes, this PR has a lot going on. The earlier PRs were mostly about getting the base application going. Zhen had me pause progress on this project to work on deploying the Longhorn Open peer grading tool. So when I was able to resume work here, I tried to get as much done as possible, as quickly as possible, before something else came along to interrupt me.

Addressing the general suggestions in your top-of-review message:

  1. Yes, I am dissatisfied with the current logging config. I think what you've suggested for differing general and canvasapi log levels may serve me well for handling issue #9 (enable separate log levels for app, framework, canvasapi). I wasn't sure how to set log levels for just parts of the application. I'll add this info to that issue.
  2. Thanks for the SQL queries. I had it on my to-do list to make some. I should have included them in the test plan for this PR and added them to the documentation. I'll be sure to do the latter.
    This data will be used by the MWrite AI analysis dashboard. That dashboard already runs using the MPR DB. This DB has a similar structure, so I think the dashboard's query, maybe somewhat modified, will be what we should use here. Your queries are similar to that one.
  3. I'll make a new issue about improving performance, including a mention of bulk_create.
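Per-logger levels, as in point 1, can also be set outside Django's LOGGING dict with the standard logging API. A minimal sketch, assuming the environment variable names from the review above; the 'peer_review_data' logger name is an illustrative stand-in for the app's actual logger:

```python
import logging
import os

# Separate verbosity knobs for the app and for the canvasapi library.
app_logger = logging.getLogger('peer_review_data')
app_logger.setLevel(os.getenv('LOG_LEVEL', 'INFO').upper())

canvasapi_logger = logging.getLogger('canvasapi')
canvasapi_logger.setLevel(os.getenv('CANVAS_API_LOG_LEVEL', 'WARNING').upper())
```

Because logger names form a hierarchy, setting the 'canvasapi' logger's level quiets every logger canvasapi creates beneath it.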

Co-authored-by: Sam Sciolla <35741256+ssciolla@users.noreply.github.com>
lsloan added 4 commits May 4, 2023 12:18
Helps with IDE reporting errors about `typing.Self`.
Remove an old and now unneeded function `return`; bring back a `continue` that was removed for testing, but may reduce unnecessary processing; and use `hasattr()` (NOT `getattr()`) as a more readable replacement for `dir()`.
Other code mostly eliminates possibility of the exception being thrown, so instead of catching it and returning `None`, remove the `try…except` block.  This also eliminates the need for importing `Optional`.
While removing the apparently unneeded exception handler for errors saving users, realized the function doesn't really need to return the course object, either.
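The hasattr() change described in the commit messages above can be illustrated with a minimal example; the Submission class here is a hypothetical stand-in for a canvasapi object:

```python
class Submission:
    """Hypothetical stand-in for a canvasapi object."""
    def __init__(self):
        self.submission_type = 'online_text_entry'


s = Submission()

# Less readable: scan the object's entire attribute listing…
print('submission_type' in dir(s))    # True

# …versus asking for the attribute directly.
print(hasattr(s, 'submission_type'))  # True
print(hasattr(s, 'assessor_id'))      # False
```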
@lsloan lsloan merged commit aaafbf4 into tl-its-umich-edu:master May 4, 2023