
add ability to cache detector response to marginalize time model and us… #4806

Merged
5 commits merged on Jul 22, 2024

Conversation

@ahnitz (Member) commented Jun 28, 2024

This implements two features in the marginalized time model.

  1. The model can now take a `sample_rate` argument. If given, the marginalization is done at a different sample rate than the intrinsic one of the data. Note that it should be higher than the data sample rate to be useful.

  2. The detector response factors are now retrieved from precalculated values, which improves the marginalization speed by ~50%.
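The effect of the `sample_rate` option can be sketched with a minimal numpy example (illustrative only, not the PyCBC implementation): zero-padding a one-sided frequency series before the inverse FFT yields a finer-sampled time series over the same duration, without conditioning the data at a higher rate.

```python
import numpy as np

def upsample_by_zero_padding(freq_series, duration, target_sample_rate):
    """Hypothetical helper: zero-pad a one-sided (rfft) frequency series
    so its inverse FFT is sampled at target_sample_rate."""
    n_orig = 2 * (len(freq_series) - 1)   # original (even) time-series length
    tlen = int(round(target_sample_rate * duration))
    flen = tlen // 2 + 1
    padded = np.zeros(flen, dtype=complex)
    padded[:len(freq_series)] = freq_series
    # rescale so amplitudes match the original irfft normalization
    return np.fft.irfft(padded, n=tlen) * (tlen / n_orig)

# Toy example: 1 s of an 8 Hz sine sampled at 64 Hz, interpolated to 256 Hz.
orig_rate, dur = 64, 1.0
t = np.arange(int(orig_rate * dur)) / orig_rate
x = np.sin(2 * np.pi * 8 * t)
x_fine = upsample_by_zero_padding(np.fft.rfft(x), dur, 256)
# x_fine now has 256 samples spanning the same 1 s of data
```

The padded bins carry no new frequency content; the finer grid only improves the timing resolution of the output, which is exactly what the marginalization needs.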

@ahnitz requested a review from cdcapano on Jun 28, 2024

@ahnitz (Member, Author) commented Jul 18, 2024

@cdcapano Can I merge this?

@cdcapano (Contributor) left a comment

@ahnitz Have some questions and concerns before approving. See below.

tlen = int(round(float(sample_rate) *
                 self.whitened_data[det].duration))
flen = tlen // 2 + 1
self._whitened_data[det].resize(flen)
@cdcapano (Contributor) commented:

Is it worth raising an error here if the sample rate is smaller than the data's?

Also, this might lead to a bug: the whitened data is set when the psd is set (see here). In principle you can change the psd after the class is initialized. I don't think that's done anywhere in an inference run currently, but it could be done in a notebook and/or possibly implemented later. I think it might be better to do this by redefining psd.setter so that it calls the parent and then runs this block of code to update the whitened data. You'll need to save sample_rate as an attribute in that case. Maybe call it something else to distinguish it from the data sample rate (likelihood_sample_rate?).
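The setter pattern being suggested can be sketched generically (the class names, `likelihood_sample_rate`, and the resize placeholder are illustrative, not PyCBC's actual classes):

```python
class BaseModel:
    """Stand-in for the parent model that owns the psd property."""
    def __init__(self):
        self._psd = None

    @property
    def psd(self):
        return self._psd

    @psd.setter
    def psd(self, value):
        # the real parent also rebuilds the whitened data here
        self._psd = value


class MarginalizedTimeSketch(BaseModel):
    def __init__(self, likelihood_sample_rate):
        # stored under a distinct name to avoid confusion with the
        # data sample rate
        self.likelihood_sample_rate = likelihood_sample_rate
        self.resized = False
        super().__init__()

    @BaseModel.psd.setter
    def psd(self, value):
        # call the parent setter first, then redo the resize so the
        # whitened data stays consistent if the psd is changed later
        BaseModel.psd.fset(self, value)
        self._resize_whitened_data()

    def _resize_whitened_data(self):
        # placeholder for the tlen/flen resize logic from the diff above
        self.resized = True
```

With this shape, assigning a new psd after initialization automatically re-applies the resize, avoiding the stale-whitened-data bug described in the comment.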

@ahnitz (Member, Author) replied:

Ok, interesting. Really, I just need this done before we do the FFT in the likelihood to calculate the SNR time series. I'm not sure this makes more sense in the psd setter; maybe I should put it directly in the likelihood? Is the psd setter preferable to that?

Yeah, truncating should probably be an error.

@cdcapano (Contributor) commented Jul 18, 2024

If you're modifying the _whitened_data then it would make more sense to do it in the PSD setter, to avoid the error I mentioned. But yeah, that is a roundabout way to just increase the sample rate of the likelihood time series. Don't you also need to zero-pad the waveform in that case too? In which case, there's a bunch of 0*0 multiplications going on in the match function. What if you add this as an option to matched_filter_core instead? That is, it takes an optional sample rate argument, and if provided you zero-pad qtilde just prior to doing the IFFT. In the model you would store the sample rate as an attribute and pass it to matched_filter_core in the likelihood call. Adding that functionality to matched_filter_core seems like it could be more broadly useful too.
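The suggestion can be illustrated with a toy frequency-domain correlation (a sketch under assumed conventions, not PyCBC's actual matched_filter_core signature): an optional sample_rate argument triggers zero-padding of qtilde just before the inverse FFT, and raises an error rather than truncating.

```python
import numpy as np

def matched_filter_sketch(htilde, stilde, delta_f, sample_rate=None):
    """Toy matched filter: correlate a template against data in the
    frequency domain; optionally zero-pad qtilde before the IFFT so
    the SNR time series is sampled at `sample_rate`."""
    qtilde = np.conj(htilde) * stilde        # per-bin correlation
    n = 2 * (len(qtilde) - 1)                # native time-series length
    duration = 1.0 / delta_f
    if sample_rate is not None:
        tlen = int(round(sample_rate * duration))
        if tlen < n:
            raise ValueError("sample_rate below the data's rate would "
                             "truncate the frequency series")
        flen = tlen // 2 + 1
        padded = np.zeros(flen, dtype=complex)
        padded[:len(qtilde)] = qtilde
        qtilde, n = padded, tlen
    # only this final IFFT grows; the extra bins are zeros
    return np.fft.irfft(qtilde, n=n)
```

Filtering a template against itself peaks at zero lag whether or not the optional upsampling is requested; only the sampling of the output changes.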

@ahnitz (Member, Author) replied:

@cdcapano Indeed. Actually, the waveform is already zero-padded to match whatever length the whitened data has. I guess I could just do both there (it's probably not too slow to copy out the data, so the original values aren't replaced).

@ahnitz (Member, Author) replied:

@cdcapano I'll make this change.

@ahnitz (Member, Author) replied:

@cdcapano I think for now I'll do this directly in the likelihood. I thought about adding it to matched_filter_core, but I'm not sure of the best strategy at the moment.

@@ -207,6 +208,7 @@ class MarginalizedTime(DistMarg, BaseGaussianNoise):
def __init__(self, variable_params,
data, low_frequency_cutoff, psds=None,
high_frequency_cutoff=None, normalize=False,
sample_rate=None,
@cdcapano (Contributor) commented:

What's the advantage of doing this instead of just increasing the sample rate of the data? Also, this might need some documentation, or maybe change the name of the argument to make it clearer what it is. People are going to think this is the data sample rate otherwise.

@ahnitz (Member, Author) replied:

Increasing the sample rate of the data means you need to do the data conditioning and parts of the waveform generation at a higher sample rate. That's only useful if you actually care about the higher part of the frequency band. If you just need better timing resolution, that would be extremely slow and memory intensive.

I think the separation is clear, as the data sample rate is set in the data section; this option only concerns the model itself. BTW, we already have this distinction for the heterodyne likelihoods when doing sky marginalization. This just brings the same interface/feature to the more regular model.
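A back-of-the-envelope illustration of this cost argument (all numbers are illustrative, not taken from any particular run):

```python
# Raising the data sample rate grows every frequency-domain array in the
# analysis (data, PSDs, templates) and every FFT done on them, while a
# model-level sample rate only enlarges the final inverse FFT in each
# likelihood call, where the extra bins are zeros.
duration = 256        # seconds of data (illustrative)
data_rate = 2048      # Hz: covers the frequency band that matters
model_rate = 16384    # Hz: wanted only for finer timing resolution

native_flen = (duration * data_rate) // 2 + 1    # 262145 bins
fine_flen = (duration * model_rate) // 2 + 1     # 2097153 bins

# Every conditioned/whitened array would be ~8x larger if the data
# itself were resampled to model_rate:
growth = fine_flen / native_flen
print(round(growth))   # 8
```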

params['dec'],
params['tc'])

if self.precalc_antenna_factors:
@cdcapano (Contributor) commented:

How is precalc_antenna_factors set to True in a run? Is this an argument for the model? I couldn't find where it would be set.

@ahnitz (Member, Author) replied:

It's already handled by the marginalization parent class. The features were implemented first for other cases, and this just extends the use to the marginalized time model.

@cdcapano (Contributor) replied:

Ok, but I was asking how you set that in the config file and how that's passed to the model initialization. I couldn't figure it out from looking at the code, although I didn't dig too deep.

@ahnitz (Member, Author) replied:

Ah, it's automatically populated if you do sky marginalization, so it's the same set of settings really. These factors are calculated anyway when the sky grid is set up.

@ahnitz (Member, Author) commented Jul 22, 2024

@cdcapano Is this now ok?

@@ -3,8 +3,11 @@
name = marginalized_time
low-frequency-cutoff = 30.0

# This is the sample rate used for the model and marginalization
sample_rate = 4096

@ahnitz (Member, Author) commented:

I updated the example to make explicit use of this feature.

@cdcapano (Contributor) left a comment

Ok, this looks good. This seems like it would be a good thing to add to the matched filter core at some point, but this will do for now.

@ahnitz enabled auto-merge (squash) on Jul 22, 2024
@ahnitz merged commit f756e18 into gwastro:master on Jul 22, 2024
32 checks passed