JF: more strixels
As discussed on Redmine and https://git.xfel.eu/calibration/planning/-/issues/195 we're getting more strixels.
Design decision: LUT
I currently feel like prebaking and shipping some simple LUTs is much more convenient than including custom Cython / CUDA kernels for each and every strixel transformation we meet. Therefore, I've used the reference Python implementations to generate flat LUTs with masks (see added tests for comparison) and then strixel transformation is just loading one of these and applying it. Bonus: one less Cython kernel to compile, one less CUDA kernel to maintain. Also it's plenty fast.
Brief test
Quick check in Karabo:
- using arbitrary uncorrected data looking roughly like this:
-
cols_A0123
(HED strixels) makes it look like this: -
rows_A1256
(SCS strixels) makes it look like this:
Should probably check with actual strixel data - preferably burst mode for better performance analysis.
But as the test shows, the mapping should be the same as what the reference implementations yield.
Also, on max-jhub, applying the rows_A1256
to 16 frames with a thread pool (which the CPU runner has anyway) takes less than 5 ms.
In my Karabo test, timing looked like this:
- GPU warmup time (pinned memory? caching?)
- GPU regular (keep in mind: no correction applied and single frame mode)
- GPU applying
cols_A0123
(brief slowness while switching; here, the LUT was loaded) - GPU applying
rows_A1256
(somehow not noticeable hiccup when switching) - CPU version also doing no corrections (though kernels are still run doing nothing interesting)
- CPU applying
cols_A0123
- CPU applying
rows_A1256
Masking non-standard size pixels
Non-standard size pixels (well, ones that are not like the other long ones) are not corrected.
As discussed with Nuno and Marco, the ones on the border are cropped out and the ones between the ASICs are not handled (both of these in the version Nuno wrote).
In the gaps between ASICs, the otherwise unset pixels get corrections.badPixels.maskingValue
- this is done regardless of corrections.subsetToUse.NON_STANDARD_SIZE
, because we don't want to send uninitialized data.
Future work
Nuno and Marco discussed a bit what to do with the very long strixels on the inner ASIC borders and the weird extra column of small strixels on the outermost ASIC borders. They agreed that, for an initial deployment, the former can be left masked rather than corrected and the latter can be cropped. Also, they didn't find it likely that we'll ever bother going back for these pixels.
Merge request reports
Activity
assigned to @hammerd
107 108 def reconfigure(self, config): 109 if (strixel_type := config.get("strixel.type")) is not None: 110 strixel_package = np.load( 111 base_kernel_runner.kernel_dir 112 / f"strixel_{strixel_type}-lut_mask.npz" 113 ) 114 self._strixel_out_shape = tuple(strixel_package["frame_shape"]) 115 self._processed_data_strixel = None 116 # TODO: use bad pixel masking config here 117 self.correction_kernel_strixel = functools.partial( 118 utils.apply_partial_lut, 119 lut=self._xp.asarray(strixel_package["lut"]), 120 mask=self._xp.asarray(strixel_package["mask"]), 121 missing=self._xp.float32("nan"), 122 ) - Comment on lines +108 to +122
This is a bit atypical for correction runners; it looks more like how for example the upcoming
StackingFriend
handles its own configuration. I did this because I hope to refactor the correction runners to behave more like that - it would make interface between the correction devices and their runners nicer (currently,BaseCorrection
needs to know way too much aboutBaseKernelRunner
to work - a consequence of my attempt to make the runner unaware that it's operating based on a schema).
Sure! To generate the ones included here, I just used the following (with
partial=True
):def make_flat_lut(input_shape, fun, output_shape=None, partial=False): num_elems = np.product(input_shape) unmapped = np.arange(np.product(input_shape)).reshape(input_shape) if output_shape is None: mapped = fun(unmapped) output_shape = mapped.shape mapped = np.empty(shape=output_shape, dtype=np.int64) mapped.fill(-1) fun(unmapped, out=mapped) mapped = mapped.ravel() # let's see how big of a dtype we really need max_mapped_index = np.max(mapped) for dtype in (np.uint8, np.uint16, np.uint32, np.uint64): if max_mapped_index < np.iinfo(dtype).max: break mask = (mapped == -1) mapped[mask] = 0 mapped = mapped.astype(dtype) if partial: mapped = mapped[~mask] return mapped, mask
in a notebook with the provided reference functions now present in the test file followed by a
np.savez_compressed
* to store the keyslut
,mask
, andframe_shape
. We can consider whetherpycalibration
should ship with prebaked LUTs, too (leaving code to regenerate them in a notebook), or whether they should be bootstrapped anew for every job.*The resulting file is, I suppose, a detector-specific constant. I'd strongly prefer it not be treated as such, though, with CalCat and all. Especially as it doesn't really have conditions - a constant constant, if you will.
530 557 .key("corrections.strixel.preview") 531 558 .setNewDefaultValue(False) 532 559 .commit(), 560 561 STRING_ELEMENT(expected) 562 .key("corrections.strixel.type") 563 .description( 564 "Which kind of strixel layout is used for this module? cols_A0123 is " 565 "the first strixel layout deployed at HED and rows_A1256 is the one " 566 "later deployed at SCS." 567 ) 568 .assignmentOptional() 569 .defaultValue("cols_A0123") 570 .options("cols_A0123,rows_A1256") changed this line in version 2 of the diff
added 1 commit
- ef6b5ff8 - Think about correction_kernel_strixel being unset initially
mentioned in commit 3b329123
mentioned in merge request pycalibration!919 (merged)