Run PLSR plot #101

andrewram4287 · 2024-11-01T15:11:52Z

No description provided.

andrewram4287 · 2024-11-01T15:14:01Z

@JacksonLChin This is the code I'm using to run the PLSR plots. Let me know if you find any glaring issues! I based this file mostly on your D8 file.

JacksonLChin · 2024-11-02T01:47:25Z

pf2/figures/figureA9.py

    covid_acc = score(
-        labels.loc[meta_data.loc[:, "patient_category"] == "COVID-19"],
-        probabilities.loc[meta_data.loc[:, "patient_category"] == "COVID-19"],
+        labels.loc[meta_data.loc[:, "patient_category"] == "COVID-19"].to_numpy().astype(int),
+        probabilities.loc[meta_data.loc[:, "patient_category"] == "COVID-19"].to_numpy(),
    )


To avoid typing headaches later, let's use to_numpy() for either all of the probabilities variables or none of them.
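One way to keep the conversion consistent is to route every subset through a single helper, so labels and probabilities always leave it as NumPy arrays. This is only a sketch: `as_arrays` and the toy tables below are hypothetical stand-ins, not code from `figureA9.py`, and the real `score` function is not shown here.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-ins for the real tables used in the figure script.
index = ["p1", "p2", "p3", "p4"]
meta_data = pd.DataFrame(
    {"patient_category": ["COVID-19", "Non-COVID", "COVID-19", "Non-COVID"]},
    index=index,
)
labels = pd.Series([1, 0, 0, 1], index=index)
probabilities = pd.Series([0.9, 0.2, 0.4, 0.7], index=index)


def as_arrays(labels, probabilities, mask):
    """Subset both inputs with a boolean mask, then convert both to arrays."""
    y_true = labels.loc[mask].to_numpy().astype(int)
    y_prob = probabilities.loc[mask].to_numpy().astype(float)
    return y_true, y_prob


# Build the mask once and reuse it for both patient groups.
covid = meta_data.loc[:, "patient_category"] == "COVID-19"
covid_true, covid_prob = as_arrays(labels, probabilities, covid)
non_covid_true, non_covid_prob = as_arrays(labels, probabilities, ~covid)
```

With this shape, every call to the scoring function receives the same types regardless of which patient group is being scored.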

JacksonLChin · 2024-11-02T01:49:11Z

@andrewram4287 Looks good! Just one minor change for type handling.

JacksonLChin · 2024-11-02T01:51:25Z

Other thing to note--not sure if this is the same plot you showed in lab meeting yesterday, but the predict_mortality function splits out patients by COVID-19 and non-COVID status before predicting. The Overall column in this case is the accuracy of the two models after joining their predictions together, so this isn't building a third model that looks at all the patients at once.
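To make the Overall column concrete, here is a minimal sketch with made-up labels and predictions. It does not reproduce `predict_mortality`; it only shows that pooling the predictions of two separate models gives a size-weighted average of their accuracies, not the accuracy of a third model.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up outputs from two separately fitted models (illustrative only).
covid_true = [1, 0, 1, 1]
covid_pred = [1, 0, 0, 1]
non_covid_true = [0, 0, 1, 0, 1, 0]
non_covid_pred = [0, 1, 1, 0, 0, 0]

covid_acc = accuracy(covid_true, covid_pred)
non_covid_acc = accuracy(non_covid_true, non_covid_pred)

# "Overall" joins the two sets of predictions; no third model is fitted.
overall_acc = accuracy(covid_true + non_covid_true, covid_pred + non_covid_pred)

# The pooled accuracy equals the size-weighted mean of the group accuracies.
n_covid, n_non_covid = len(covid_true), len(non_covid_true)
weighted_acc = (n_covid * covid_acc + n_non_covid * non_covid_acc) / (
    n_covid + n_non_covid
)
```

A model trained on all patients at once would have to be fitted separately and could score higher or lower than this pooled number.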

andrewram4287 · 2024-11-02T01:53:17Z

Okay, that makes sense. I can look into what the prediction looks like if we actually combine all the patient samples. Yes, the code creates the same plot I showed in lab meeting... So that means accuracy does decrease with more PLSR components...

JacksonLChin · 2024-11-02T01:57:53Z

Yup! We may want to swap to the 1-component model for each then. As Dr. Meyer suggested yesterday, I think we could move to a single scores plot with the COVID and non-COVID PLSR components as the x and y axes for interpretation.

andrewram4287 · 2024-11-02T02:02:04Z

I find it weird, though, that this was not the case when you ran the PLSR model before...

JacksonLChin · 2024-11-02T02:06:08Z

In the past, we've found the non-COVID model worked best at one component, and the COVID one worked slightly better at two. I think this is mostly in line with what we've seen previously--I'm guessing that the change to how we normalize factors prior to PLSR changed this a little bit.

andrewram4287 · 2024-11-04T17:05:13Z

@JacksonLChin
This is the updated plot based on actually trying to predict all samples at once. It looks like it does worse than splitting them up.

JacksonLChin · 2024-11-04T17:28:36Z

@andrewram4287 Cool! Thanks for looking into this. It's interesting that the 1-component model seems better for accuracy while the 2-component model is better for AUC-ROC--given that they're comparable, though, I think we should continue with the 1-component model.

Run PLSR plot

c250413

JacksonLChin reviewed Nov 2, 2024

Add prediction for all pneumonia samples

0be8a77

andrewram4287 closed this Nov 5, 2024

andrewram4287 deleted the InvestigatingPLSR branch November 5, 2024 21:47
